llvm-project

Author	SHA1	Message	Date
Craig Topper	e67ec12640	[RISCV] Remove experimental from Smctr and Ssctr. (#153903 ) These extensions were ratified in November 2024.	2025-08-15 17:18:09 -07:00
Yanzuo Liu	3b27d50cc7	[LLVM][utils] Add script which clears release notes (#153593 ) The script copies `ReleaseNotesTemplate.txt` to corresponding `ReleaseNotes.rst`/`.md` to clear release notes. The suffix of `ReleaseNotesTemplate.txt` must be `.txt`. If it is `.rst`/`.md`, it will be treated as a documentation source file when building documentation.	2025-08-15 19:00:08 +08:00
Luke Lau	1761b80a4a	[RISCV] Add a release note about tail folding being enabled. NFC (#153535 ) It's probably useful for users to know how to get the old scalar epilogue back if they need it.	2025-08-15 03:53:44 +00:00
Alexey Bataev	67af2f6c5c	[SLP]Initial FMAD support (#149102 ) Added initial check for potential fmad conversion in reductions and operands vectorization. Added the check for instruction to fix #152683 Skipped the code for reduction to avoid regressions.	2025-08-11 05:53:55 -07:00
David Green	cfe190979e	Revert "[SLP]Initial FMAD support (#149102 )" This reverts commit 0fffb9f9ed81f4c2084b8fe040c88b60bb6c372a due to major performance regressions.	2025-08-10 15:16:01 +01:00
Alexey Bataev	0fffb9f9ed	[SLP]Initial FMAD support (#149102 ) Added initial check for potential fmad conversion in reductions and operands vectorization. Added the check for instruction to fix #152683	2025-08-08 10:30:23 -07:00
Alexander Richardson	3a4b351ba1	[IR] Introduce the `ptrtoaddr` instruction This introduces a new `ptrtoaddr` instruction which is similar to `ptrtoint` but has two differences: 1) Unlike `ptrtoint`, `ptrtoaddr` does not capture provenance 2) `ptrtoaddr` only extracts (and then extends/truncates) the low index-width bits of the pointer For most architectures, difference 2) does not matter since index (address) width and pointer representation width are the same, but this does make a difference for architectures that have pointers that aren't just plain integer addresses such as AMDGPU fat pointers or CHERI capabilities. This commit introduces textual and bitcode IR support as well as basic code generation, but optimization passes do not handle the new instruction yet so it may result in worse code than using ptrtoint. Follow-up changes will update capture tracking, etc. for the new instruction. RFC: https://discourse.llvm.org/t/clarifiying-the-semantics-of-ptrtoint/83987/54 Reviewed By: nikic Pull Request: https://github.com/llvm/llvm-project/pull/139357	2025-08-08 10:12:39 -07:00
Alexey Bataev	0419b459be	Revert "[SLP]Initial FMAD support (#149102 )" This reverts commit 0bcf45ea3458ba79eb4257afcfd6af954292c9ce to fix the regressions reported in https://github.com/llvm/llvm-project/issues/152683	2025-08-08 09:17:59 -07:00
Sam Elliott	4e11f89904	[RISCV] Basic Objdump Mapping Symbol Support (#151452 ) This implements very basic support for RISC-V mapping symbols in llvm-objdump, sharing the implementation with how Arm/AArch64/CSKY implement this feature. This only supports the `$x` (instruction) and `$d` (data) mapping symbols for RISC-V, and not the version of `$x` which includes an architecture string suffix.	2025-08-07 11:28:07 -07:00
Alexey Bataev	0bcf45ea34	[SLP]Initial FMAD support (#149102 ) Added initial check for potential fmad conversion in reductions and operands vectorization.	2025-08-07 09:51:43 -04:00
Jonas Devlieghere	7abb519e9e	[lldb] Add release note for Wasm debugging (#151445 ) Add a release note for Wasm debugging.	2025-07-31 08:15:23 -07:00
Alexey Bataev	ef98e248c7	[SLP]Initial support for copyable elements (non-schedulable only) Adds initial support for copyable elements. This patch only models adds and models copyable elements as add <element>, 0, i.e. uses identity constants for missing lanes. Only support for elements, which do not require scheduling, is added to reduce size of the patch. Fixed compile time regressions, reported crashes, updated release notes Reviewers: RKSimon, hiraditya Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/140279	2025-07-25 10:55:07 -07:00
Tobias Hieta	3d8db8ef50	Clear release notes on main for LLVM 22	2025-07-24 11:36:11 +02:00
Alexey Bataev	898bba311f	[SLP]Initial support for copyable elements (non-schedulable only) Adds initial support for copyable elements. This patch only models adds and models copyable elements as add <element>, 0, i.e. uses identity constants for missing lanes. Only support for elements, which do not require scheduling, is added to reduce size of the patch. Fixed compile time regressions, updated release notes Reviewers: RKSimon, hiraditya Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/140279	2025-07-23 13:38:34 -07:00
Jay Foad	d449d3dc13	[CodeGen] Remove FinalizeMachineBundles pass (#149806 ) Replace its only use in the AMDGPU R600 backend with a call to finalizeBundles.	2025-07-23 11:36:49 +01:00
gulfemsavrun	e94bc16b8e	[llvm-objdump] Add inlined function display support (#142246 ) This patch adds the support for displaying inlined functions into llvm-objdump. 1) It extends the source variable display support for inlined functions both for ascii and unicode formats. 2) It also introduces a new format called limits-only that only prints a line for the start and end of an inlined function without line-drawing characters.	2025-07-21 12:51:27 +03:00
Trevor Gross	a78a0f8d20	[X86] Align f128 and i128 to 16 bytes when passing on x86-32 (#138092 ) The i386 psABI specifies that `__float128` has 16 byte alignment and must be passed on the stack; however, LLVM currently stores it in a stack slot that has an offset of 4. Add a custom lowering to correct this alignment to 16-byte. i386 does not specify an `__int128`, but it seems reasonable to keep the same behavior as `__float128` so this is changed as well. There also isn't a good way to distinguish whether a set of four registers came from an integer or a float. The main test demonstrating this change is `store_perturbed` in `llvm/test/CodeGen/X86/i128-fp128-abi.ll`. Referenced ABI: https://gitlab.com/x86-psABIs/i386-ABI/-/wikis/uploads/14c05f1b1e156e0e46b61bfa7c1df1e2/intel386-psABI-2020-08-07.pdf Fixes: https://github.com/llvm/llvm-project/issues/77401	2025-07-17 11:30:36 +02:00
cmtice	33396d71e9	[LLDB] Update release note about DIL as requested. (#149117 ) A post-commit review on PR #147887 requested a minor update to the formatting of the LLDB DIL implementation release note.	2025-07-16 08:29:33 -07:00
cmtice	f5c676d6d9	[LLDB] Switch to using DIL as default implementation for 'frame var'. (#147887 )	2025-07-15 08:19:10 -07:00
David Spickett	a9cfe84d88	[llvm][docs] Correct formatting in lldb release note When you use <> in markdown it thinks it's an HTML tag and does not show anything.	2025-07-15 12:31:05 +00:00
Ami-zhang	9ef293ea24	[LoongArch] Add supplemental release notes for LLVM 21 (#148771 )	2025-07-15 15:39:00 +08:00
tedwoodward	eb6da944af	[lldb] Improve disassembly of unknown instructions (#145793 ) LLDB uses the LLVM disassembler to determine the size of instructions and to do the actual disassembly. Currently, if the LLVM disassembler can't disassemble an instruction, LLDB will ignore the instruction size, assume the instruction size is the minimum size for that device, print no useful opcode, and print nothing for the instruction. This patch changes this behavior to separate the instruction size and "can't disassemble". If the LLVM disassembler knows the size, but can't disassemble the instruction, LLDB will use that size. It will print out the opcode, and will print "<unknown>" for the instruction. This is much more useful to both a user and a script. The impetus behind this change is to clean up RISC-V disassembly when the LLVM disassembler doesn't understand all of the instructions. RISC-V supports proprietary extensions, where the TD files don't know about certain instructions, and the disassembler can't disassemble them. Internal users want to be able to disassemble these instructions. With llvm-objdump, the solution is to pipe the output of the disassembly through a filter program. This patch modifies LLDB's disassembly to look more like llvm-objdump's, and includes an example python script that adds a command "fdis" that will disassemble, then pipe the output through a specified filter program. This has been tested with crustfilt, a sample filter located at https://github.com/quic/crustfilt . Changes in this PR: - Decouple "can't disassemble" from "instruction size". DisassemblerLLVMC::MCDisasmInstance::GetMCInst now returns a bool for valid disassembly, and has the size as an out parameter. Use the size even if the disassembly is invalid. Disassemble if disassembly is valid. - Always print out the opcode when -b is specified. Previously it wouldn't print out the opcode if it couldn't disassemble. - Print out RISC-V opcodes the way llvm-objdump does. 
Code for the new Opcode Type eType16_32Tuples by Jason Molenda. - Print <unknown> for instructions that can't be disassembled, matching llvm-objdump, instead of printing nothing. - Update max riscv32 and riscv64 instruction size to 8. - Add example "fdis" command script. - Added disassembly byte test for x86 with known and unknown instructions. - Added disassembly byte test for riscv32 with known and unknown instructions, with and without filtering. - Added test from Jason Molenda to RISC-V disassembly unit tests.	2025-07-14 21:50:22 -05:00
Jim Lin	22707fd4a5	[RISCV] Add Andes XAndesBFHCvt (Andes Scalar BFLOAT16) extension (#148563 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. The extension includes only two instructions: one for converting from f32 to f16, and another for converting from f16 to f32. This patch only implements MC support for XAndesBFHCvt.	2025-07-15 08:59:00 +08:00
Jim Lin	3f33e7ba5b	[RISCV] Add Andes XAndesVSIntLoad (Andes Vector INT4 Load) extension (#147005 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only implements MC support for XAndesVSIntLoad. --------- Co-authored-by: Lino Hsing-Yu Peng <linopeng@andestech.com>	2025-07-07 13:01:22 +08:00
Rahman Lavaee	6b623a6622	[SHT_LLVM_BB_ADDR_MAP] Remove support for versions 1 and 0 (SHT_LLVM_BB_ADDR_MAP_V0). (#146186 ) Version 2 was added more than two years ago (`6015a045d7`). So it should be safe to deprecate older versions.	2025-07-02 10:31:52 -07:00
David Spickett	8b3cc4d1ed	[llvm][docs] Remove LLDB DWARF64 release note As a precaution until the issues reported (by me) in https://github.com/llvm/llvm-project/issues/135208 are investigated.	2025-07-02 14:02:57 +00:00
Hemang Gadhavi	da0828b1e9	[lldb] Enable support for DWARF64 format handling (#145645 ) This PR introduces support for the DWARF64 format, enabling handling of 64-bit DWARF sections as defined by the DWARF specification. The update includes adjustments to header parsing and modification of form values to accommodate 64-bit offsets and values. Also added a test case to verify the DWARF64 format.	2025-07-01 18:35:40 +05:30
Craig Topper	c3b160bcaa	[RISCV] Remove -mattr=+no-rvc-hints (#145138 ) As far as I know binutils does not have a similar option and I don't know of a reason we shouldn't accept the RVC hint instructions. The wording in the spec in the past suggested that maybe these weren't valid instruction names, but that's been modified recently.	2025-06-25 08:24:24 -07:00
Jim Lin	2f9c97c030	[RISCV] Add Andes AX45MPV processor definition (#145267 ) Andes AX45MPV is a 64-bit, in-order, dual-issue, 8-stage pipeline, Linux-capable CPU implementing the RV64IMAFDCV ISA extension. It is developed by Andes Technology https://www.andestech.com, a RISC-V IP provider. The overviews for AX45MPV: https://www.andestech.com/en/products-solutions/andescore-processors/riscv-ax45mpv/ The scheduling model for the RVV extension will be implemented in a follow-up PR.	2025-06-24 08:57:55 +08:00
Benji Smith	8f01edfa11	[C API] Add getter/setter for samesign flag on icmp (#145247 ) This was added to the C++ API in https://github.com/llvm/llvm-project/pull/111419 so this change adds accessors in the C API, along with a couple tests.	2025-06-22 18:05:17 -04:00
Jim Lin	8ddada41df	[RISCV] Add Andes XAndesVBFHCvt (Andes Vector BFLOAT16 Conversion) extension (#144320 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. The instructions are similar to `Zvfbfmin` and the only difference with `Zvfbfmin` is that `XAndesVBFHCvt` doesn't have mask variant.	2025-06-18 09:17:46 +08:00
Ebuka Ezike	89d2d62e46	[symbolizer] Update Release notes. (#142951 ) Also add post-commit changes from commit #71ba852 in PR #135857 --------- Co-authored-by: James Henderson <James.Henderson@sony.com>	2025-06-06 13:25:07 +01:00
Jim Lin	2a8c7d3c69	[RISCV] Add support for -mtune=andes-45-series (#142900 ) Enables the use of `-mtune=andes-45-series` to generate code optimized with the Andes 45 series scheduling model and tuning features.	2025-06-06 11:34:19 +08:00
Min-Yih Hsu	feb21e26fa	[RISCV] Add SiFive X390 processor definition (#142517 ) X390 is an in-order core designed for AI/ML workload, with VLEN=1024. https://www.sifive.com/cores/intelligence-x300-series Scheduling model will be added in a follow-up patch.	2025-06-04 09:25:59 -07:00
Nikita Popov	7547ff5cad	[X86] Consistently use f128 libcalls (#142386 ) On x86, the `l` libcalls are for 80-bit extended precision. `fp128` needs to use the `f128` libcalls instead. Add a few missing ones, esp. for FP min/max. Also use the `f128` libcalls on x86-32. I believe the situation there is the same as on x86-64.	2025-06-03 12:28:31 +02:00
Simon Tatham	56acb06bc6	[ARM,AArch64] Don't put BTI at asm goto branch targets (#141562 ) In 'asm goto' statements ('callbr' in LLVM IR), you can specify one or more labels / basic blocks in the containing function which the assembly code might jump to. If you're also compiling with branch target enforcement via BTI, then previously listing a basic block as a possible jump destination of an asm goto would cause a BTI instruction to be placed at the start of the block, in case the assembly code used an _indirect_ branch instruction (i.e. to a destination address read from a register) to jump to that location. Now it doesn't do that any more: branches to destination labels from the assembly code are assumed to be direct branches (to a relative offset encoded in the instruction), which don't require a BTI at their destination. This change was proposed in https://discourse.llvm.org/t/85845 and there seemed to be no disagreement. The rationale is: 1. it brings clang's handling of asm goto in Arm and AArch64 in line with gcc's, which didn't generate BTIs at the target labels in the first place. 2. it improves performance in the Linux kernel, which uses a lot of 'asm goto' in which the assembly language just contains a NOP, and the label's address is saved elsewhere to let the kernel self-modify at run time to swap between the original NOP and a direct branch to the label. This allows hot code paths to be instrumented for debugging, at only the cost of a NOP when the instrumentation is turned off, instead of the larger cost of an indirect branch. In this situation a BTI is unnecessary (if the branch happens it's direct), and since the code paths are hot, also a noticeable performance hit. Implementation: `SelectionDAGBuilder::visitCallBr` is the place where 'asm goto' target labels are handled. It calls `setIsInlineAsmBrIndirectTarget()` on each target `MachineBasicBlock`. 
Previously it also called `setMachineBlockAddressTaken()`, which made `hasAddressTaken()` return true, which caused a BTI to be added in the Arm backends. Now `visitCallBr` doesn't call `setMachineBlockAddressTaken()` any more on asm goto targets, but `hasAddressTaken()` also checks the flag set by `setIsInlineAsmBrIndirectTarget()`. So call sites that were using `hasAddressTaken()` don't need to be modified. But the Arm backends don't call `hasAddressTaken()` any more: instead they test two more specific query functions that cover all the reasons `hasAddressTaken()` might have returned true _except_ being an asm goto target. Testing: The new test `AArch64/callbr-asm-label-bti.ll` is testing the actual change, where it expects not to see a `bti` instruction after `[[LABEL]]`. The rest of the test changes are all churn, due to the flags on basic blocks changing. Actual output code hasn't changed in any of the existing tests, only comments and diagnostics. Further work: `RISCVIndirectBranchTracking.cpp` and `X86IndirectBranchTracking.cpp` also call `hasAddressTaken()` in a way that might benefit from using the same more specific check I've put in `ARMBranchTargets.cpp` and `AArch64BranchTargets.cpp`. But I'm not sure of that, so in this commit I've only changed the Arm backends, and left those alone.	2025-06-03 08:44:13 +01:00
Jacob Lalonde	1a1927abd3	[lldb] Add release note about ELF `thread siginfo` and negative SI Codes. (#141631 ) Adding a release note about adding `thread siginfo` support to `ThreadELFCore` and expanding Linux signals to understand user space signals.	2025-05-30 09:15:16 -07:00
Ying Chen	5483190216	[RISCV] Add shlcofideleg extension (#141572 ) This is for `shlcofideleg` extension, that supports delegating LCOFI interrupts to VS-mode. Spec: https://github.com/riscv/riscv-isa-manual/blob/main/src/hypervisor.adoc	2025-05-30 16:52:08 +08:00
Jonas Devlieghere	4000113b88	[lldb] Add release note for riscv32 elf core file support in LLDB (#141629 ) Add a release note for riscv32 elf core file support in LLDB.	2025-05-28 08:39:58 -07:00
Weining Lu	581d175a86	[LoongArch] Document the inline asm `q` constraint See #141037.	2025-05-26 09:14:27 +08:00
Jim Lin	569b6f6dad	[RISCV] Add Andes A25/AX25 processor definition (#140681 ) Andes A25/AX25 are 32/64-bit, 5-stage pipeline, Linux-capable CPUs that implement the RV[32|64]IMAFDC_Zba_Zbb_Zbc_Zbs ISA extensions. They are developed by Andes Technology https://www.andestech.com, a RISC-V IP provider. The overviews for A25/AX25: https://www.andestech.com/en/products-solutions/andescore-processors/riscv-a25/ https://www.andestech.com/en/products-solutions/andescore-processors/riscv-ax25/ Scheduling model will be implemented in a later PR.	2025-05-22 09:22:32 +08:00
Craig Topper	a0b6cfd975	[RISCV] Add MC layer support for XSfmm*. (#133031 ) This adds assembler/disassembler support for XSfmmbase 0.6 and related SiFive matrix multiplication extensions based on the spec here https://www.sifive.com/document-file/xsfmm-matrix-extensions-specification Functionality-wise, this is the same as the Zvma extension proposal that SiFive shared with the Attached Matrix Extension Task Group. The extension names and instruction mnemonics have been changed to use vendor prefixes. Note this is a non-conforming extension as the opcodes used here are in the standard opcode space in OP-V or OP-VE. --------- Co-authored-by: Brandon Wu <brandon.wu@sifive.com>	2025-05-21 08:26:35 -07:00
Krzysztof Drewniak	4bdd116b80	[AMDGPU] Add a new amdgcn.load.to.lds intrinsic (#137425 ) This PR adds an amdgcn_load_to_lds intrinsic that abstracts over loads to LDS from global (address space 1) pointers and buffer fat pointers (address space 7), since they use the same API and "gather from a pointer to LDS" is something of an abstract operation. This commit adds the intrinsic and its lowerings for addrspaces 1 and 7, and updates the MLIR wrappers to use it (loosening up the restrictions on loads to LDS along the way to match the ground truth from target features). It also plumbs the intrinsic through to clang.	2025-05-19 07:15:04 -07:00
Iris Shi	1e503d08e1	[RISCV][MC] Add support for Q extension (#139369 ) Closes #130217. https://github.com/riscv/riscv-isa-manual/blob/main/src/q-st-ext.adoc	2025-05-15 10:51:06 +08:00
Jim Lin	2a8960e48b	[RISCV] Add Andes XAndesVDot (Andes Vector Dot Product) extension. (#139849 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. Intrinsics support will be added in a later patch.	2025-05-15 10:14:58 +08:00
Jim Lin	9e27db0a50	[RISCV] Add Andes XAndesVPackFPH (Andes Vector Packed FP16) extension. (#138827 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. Intrinsics support will be added in a later patch.	2025-05-12 16:58:39 +08:00
Matt Arsenault	4d60c6d9b2	Reapply "IR: Remove reference counts from ConstantData (#137314 )" (#138962 ) This reverts commit 0274232b87177779e5c985eca06df22bf140f6cb.	2025-05-08 08:02:54 +02:00
Matt Arsenault	9383fb23e1	Reapply "IR: Remove uselist for constantdata (#137313 )" (#138961 ) Reapply "IR: Remove uselist for constantdata (#137313)" This reverts commit 5936c02c8b9c6d1476f7830517781ce8b6e26e75. Fix checking uselists of constants in assume bundle queries	2025-05-08 08:00:09 +02:00
Kirill Stoimenov	5936c02c8b	Revert "IR: Remove uselist for constantdata (#137313 )" Possibly breaks the build: https://lab.llvm.org/buildbot/#/builders/24/builds/8119 This reverts commit 87f312aad6ede636cd2de5d18f3058bf2caf5651.	2025-05-07 00:07:55 +00:00
Kirill Stoimenov	0274232b87	Revert "IR: Remove reference counts from ConstantData (#137314 )" This reverts commit 51a3bd919d68a8fb1b026377d6e86b1523d37433. Possibly breaks the build: https://lab.llvm.org/buildbot/#/builders/24/builds/8119/steps/9/logs/stdio	2025-05-07 00:07:55 +00:00

191 Commits