llvm-project

Author	SHA1	Message	Date
Steven Wu	deab049b5c	[CAS] Add ActionCache to LLVMCAS Library (#114097 ) ActionCache is used to store a mapping from CASID to CASID. The current implementation of the ActionCache can only be used to associate the key/value from the same hash context. ActionCache has two operations: `put` to store the key/value and `get` to lookup the key/value mapping. ActionCache uses the same TrieRawHashMap data structure to store the mapping, where is CASID of the key is the hash to index the map. While CASIDs for key/value are often associcate with actual CAS ObjectStore, it doesn't provide the guarantee of the existence of such object in any ObjectStore.	2025-08-20 14:42:44 -07:00
Gang Chen	60dbde69cd	[AMDGPU] report named barrier cnt part2 (#154588 )	2025-08-20 12:00:45 -07:00
Rajat Bajpai	fad3272286	[NVPTX] Add support for "blocksareclusters" kernel attr (#152265 ) This change introduces a new kernel attribute that allows thread blocks to be mapped to clusters. In addition, it also adds support of `+ptx90` PTX ISA support.	2025-08-20 11:09:39 +05:30
Kazu Hirata	18123cc91d	[llvm] Proofread Legalizer.rst (#154266 )	2025-08-18 22:40:00 -07:00
Henrik G. Olsson	e1ff432eb6	Reland "[Utils] Add new --update-tests flag to llvm-lit" (#153821 ) This reverts commit `e495231238` to reland the --update-tests feature, originally landed in https://github.com/llvm/llvm-project/pull/108425.	2025-08-18 20:24:27 -07:00
林克	6842cc5562	[RISCV] Add SpacemiT XSMTVDot (SpacemiT Vector Dot Product) extension. (#151706 ) The full spec can be found at spacemit-x60 processor support scope: Section 2.1.2.2 (Features): https://developer.spacemit.com/documentation?token=BWbGwbx7liGW21kq9lucSA6Vnpb#2.1 This patch only supports assembler.	2025-08-18 18:03:17 +08:00
Kazu Hirata	fc6f235101	[llvm] Proofread LangRef.rst (#153932 )	2025-08-16 06:47:33 -07:00
Craig Topper	e67ec12640	[RISCV] Remove experimental from Smctr and Ssctr. (#153903 ) These extensions were ratified in November 2024.	2025-08-15 17:18:09 -07:00
Tim Renouf	f279c47cb3	AMDGPU gfx12: Add _dvgpr$ symbols for dynamic VGPRs (#148251 ) For each function with the AMDGPU_CS_Chain calling convention, with dynamic VGPRs enabled, add a _dvgpr$ symbol, with the value of the function symbol, plus an offset encoding one less than the number of VGPR blocks used by the function (16 VGPRs per block, no more than 128) in bits 5..3 of the symbol value. This is used by a front-end to have functions that are chained rather than called, and a dispatcher that dynamically resizes the VGPR count before dispatching to a function.	2025-08-15 16:33:06 +01:00
Yanzuo Liu	3b27d50cc7	[LLVM][utils] Add script which clears release notes (#153593 ) The script copies `ReleaseNotesTemplate.txt` to corresponding `ReleaseNotes.rst`/`.md` to clear release notes. The suffix of `ReleaseNotesTemplate.txt` must be `.txt`. If it is `.rst`/`.md`, it will be treated as a documentation source file when building documentation.	2025-08-15 19:00:08 +08:00
Luke Lau	1761b80a4a	[RISCV] Add a release note about tail folding being enabled. NFC (#153535 ) It's probably useful for users to know how to get the old scalar epilogue back if they need it.	2025-08-15 03:53:44 +00:00
Stanislav Mekhanoshin	49f2093477	[AMDGPU] Increase LDS to 320K on gfx1250 (#153645 )	2025-08-14 12:52:00 -07:00
Kazu Hirata	4393d16f55	[llvm] Proofread LangRef.rst (#153358 ) This patch quotes code-related terms like ``llvm.memset``.	2025-08-13 10:37:22 -07:00
David Spickett	dc41571cd8	[llvm][docs] Update CMake commands for cross compiling Arm builtins (#151544 ) This does a few things: * LLVM_CONFIG_PATH is deprecated, use LLVM_CMAKE_DIR instead. * Don't use $ before command examples. I would normally, but the key cmake commands didn't use it so I removed it from all commands. * Makes the commands shown full commands, so you don't have to piece them together. * Uses shell variables to cut down on repetition and make this easier to port to other targets. * Adds a few options to disable more compiler-rt things. * Use the built in cmake options for sysroot and toolchains. * Include test options in the first cmake command, so you don't have to re-do the whole thing after you read the testing section. * Removes the section about using BaremetalARM.cmake. The closest I got to getting that cache to work was: ``` SYSROOT=/home/david.spickett/arm-gnu-toolchain-14.3.rel1-x86_64-arm-none-eabi/arm-none-eabi/libc LLVM_TOOLCHAIN=/home/david.spickett/LLVM-20.1.8-Linux-X64/ cmake \ -G Ninja \ -DCMAKE_C_COMPILER=${LLVM_TOOLCHAIN}/bin/clang \ -DBAREMETAL_ARMV6M_SYSROOT=${SYSROOT} \ -DBAREMETAL_ARMV7M_SYSROOT=${SYSROOT} \ -DBAREMETAL_ARMV7EM_SYSROOT=${SYSROOT} \ -DCMAKE_BUILD_TYPE=Release \ -DLLVM_ENABLE_RUNTIMES="compiler-rt" \ -C ../llvm-project/clang/cmake/caches/BaremetalARM.cmake \ -DCOMPILER_RT_BUILD_BUILTINS=ON \ -DCOMPILER_RT_BUILD_LIBFUZZER=OFF \ -DCOMPILER_RT_BUILD_MEMPROF=OFF \ -DCOMPILER_RT_BUILD_PROFILE=OFF \ -DCOMPILER_RT_BUILD_CTX_PROFILE=OFF \ -DCOMPILER_RT_BUILD_SANITIZERS=OFF \ -DCOMPILER_RT_BUILD_XRAY=OFF \ -DCOMPILER_RT_BUILD_ORC=OFF \ -DCOMPILER_RT_BUILD_CRT=OFF \ ../llvm-project/runtimes ``` All this does is build the x86 builtins. I tried forcing the issue with: ``` -DBUILTIN_SUPPORTED_ARCH="armv7m;armv6m;armv7em" \ ``` But again, just x86. It's probably something deep in compiler-rt failing a compiler check for the Arm targets. Even if that's the case, fixing that means adding more options to the cmake command. I can't find evidence of a full command using this cache file since the commit that introduced it and that command no longer works. I think if you ever got this to work again the command would be as long and complex as the ones already shown in the document. I would also argue that some of the other caches, for example Fuschia's, are much better example of multi-target runtimes builds. If what's in this document isn't enough, folks should be learning from those files and about the runtimes build overall before attempting anything complex (though it does not take much to be "complex").	2025-08-13 09:47:43 +01:00
Nick Sarnie	da3182a288	[LLVM][docs] Update full list of options for LLVM_TARGETS_TO_BUILD (#153299 ) We added `SPIRV` as a non-experimental backend in `cda81b1ec9`. Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>	2025-08-12 21:48:29 +00:00
Steven Wu	dda996b875	[CAS] Add LLVMCAS library with InMemoryCAS implementation (#114096 ) Add llvm::cas::ObjectStore abstraction and InMemoryCAS as a in-memory CAS object store implementation. The ObjectStore models its objects as: * Content: An array of bytes for the data to be stored. * Refs: An array of references to other objects in the ObjectStore. And each CAS Object can be idenfied with an unqine ID/Hash. ObjectStore supports following general action: * Expected<ID> store(Content, ArrayRef<Ref>) * Expected<Ref> get(ID) It also introduces following types to interact with a CAS ObjectStore: * CASID: Hash representation for an CAS Objects with its context to help print/compare CASIDs. * ObjectRef: A light-weight ref for an object in the ObjectStore. It is implementation defined so it can be optimized for read/store/references depending on the implementation. * ObjectProxy: A proxy for the users of CAS to interact with the data inside CAS Object. It bundles a ObjectHandle and an ObjectStore instance.	2025-08-12 10:25:43 -07:00
Kazu Hirata	f8a69058cb	[llvm] Proofread CMake.rst (#153129 )	2025-08-11 22:51:56 -07:00
Matt Arsenault	ff53086924	AMDGPU: Add new VA inline asm constraint for AV registers (#152665 ) Add a new constraint corresponding to the AV_* register classes for operands which can allocate AGPRs or VGPRs. This applies to load and stores on gfx90a+, and srcA / srcB for MFMA instructions. The error emitted on unsupported targets isn't ideal, it is produced by the register allocator without a rationale, but it is consistent with the existing errors. I mostly want this for writing allocation tests.	2025-08-12 10:17:28 +09:00
Alexey Bataev	67af2f6c5c	[SLP]Initial FMAD support (#149102 ) Added initial check for potential fmad conversion in reductions and operands vectorization. Added the check for instruction to fix #152683 Skipped the code for reduction to avoid regressions.	2025-08-11 05:53:55 -07:00
parabola94	d218789d3e	[Docs] Fix the description about %clang_cpp (#152842 )	2025-08-11 08:41:36 +09:00
Pedro Lobo	57627bd197	[LangRef] Fix `ptrtoaddr` code block (#152927 ) Fixes formatting of the example code block under the `ptrtoaddr` instruction.	2025-08-10 17:32:12 +01:00
David Green	cfe190979e	Revert "[SLP]Initial FMAD support (#149102 )" This reverts commit 0fffb9f9ed81f4c2084b8fe040c88b60bb6c372a due to major performance regressions.	2025-08-10 15:16:01 +01:00
Kazu Hirata	190ad0b127	[llvm] Proofread SourceLevelDebugging.rst (#152838 )	2025-08-09 06:58:13 -07:00
Alexey Bataev	0fffb9f9ed	[SLP]Initial FMAD support (#149102 ) Added initial check for potential fmad conversion in reductions and operands vectorization. Added the check for instruction to fix #152683	2025-08-08 10:30:23 -07:00
Alexander Richardson	3a4b351ba1	[IR] Introduce the `ptrtoaddr` instruction This introduces a new `ptrtoaddr` instruction which is similar to `ptrtoint` but has two differences: 1) Unlike `ptrtoint`, `ptrtoaddr` does not capture provenance 2) `ptrtoaddr` only extracts (and then extends/truncates) the low index-width bits of the pointer For most architectures, difference 2) does not matter since index (address) width and pointer representation width are the same, but this does make a difference for architectures that have pointers that aren't just plain integer addresses such as AMDGPU fat pointers or CHERI capabilities. This commit introduces textual and bitcode IR support as well as basic code generation, but optimization passes do not handle the new instruction yet so it may result in worse code than using ptrtoint. Follow-up changes will update capture tracking, etc. for the new instruction. RFC: https://discourse.llvm.org/t/clarifiying-the-semantics-of-ptrtoint/83987/54 Reviewed By: nikic Pull Request: https://github.com/llvm/llvm-project/pull/139357	2025-08-08 10:12:39 -07:00
Alexey Bataev	0419b459be	Revert "[SLP]Initial FMAD support (#149102 )" This reverts commit 0bcf45ea3458ba79eb4257afcfd6af954292c9ce to fix the regresions, reported in https://github.com/llvm/llvm-project/issues/152683	2025-08-08 09:17:59 -07:00
Kazu Hirata	8afa70f1c8	[llvm] Proofread SourceLevelDebugging.rst (#152646 ) This patch takes care of the highly mechanical part of proofreading SourceLevelDebugging.rst, namely: - hyphenating "32 bit value" and similar and - hypenating "Objective C"	2025-08-08 07:45:14 -07:00
Nikita Popov	c23b4fbdbb	[IR] Remove size argument from lifetime intrinsics (#150248 ) Now that #149310 has restricted lifetime intrinsics to only work on allocas, we can also drop the explicit size argument. Instead, the size is implied by the alloca. This removes the ability to only mark a prefix of an alloca alive/dead. We never used that capability, so we should remove the need to handle that possibility everywhere (though many key places, including stack coloring, did not actually respect this).	2025-08-08 11:09:34 +02:00
Sam Elliott	4e11f89904	[RISCV] Basic Objdump Mapping Symbol Support (#151452 ) This implements very basic support for RISC-V mapping symbols in llvm-objdump, sharing the implementation with how Arm/AArch64/CSKY implement this feature. This only supports the `$x` (instruction) and `$d` (data) mapping symbols for RISC-V, and not the version of `$x` which includes an architecture string suffix.	2025-08-07 11:28:07 -07:00
Kazu Hirata	e10fdb989b	[llvm] Proofread MergeFunctions.rst (#152444 )	2025-08-07 07:23:06 -07:00
Alexey Bataev	0bcf45ea34	[SLP]Initial FMAD support (#149102 ) Added initial check for potential fmad conversion in reductions and operands vectorization.	2025-08-07 09:51:43 -04:00
Abhilash Majumder	fee6e539d0	[NVPTX] Add prefetch tensormap variant (#146203 ) [NVPTX] Add Prefetch tensormap intrinsics This PR adds prefetch intrinsics with the relevant tensormap_space. * Lit tests are added as part of prefetch.ll * The generated PTX is verified with a 12.3 ptxas executable. * Added docs for these intrinsics in NVPTXUsage.rst. For more information, refer to the PTX ISA for prefetch intrinsic : [Prefetch Tensormap](https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-prefetch-prefetchu) @durga4github @schwarzschild-radius	2025-08-07 16:19:21 +05:30
Kazu Hirata	5342c33f1d	[llvm] Proofread MIRLangRef.rst (#152263 )	2025-08-06 07:11:16 -07:00
David Spickett	3f6f5b9121	[llvm][docs] Tell people how to list runtime names This list was already out of date, so I think it's better we tell them how to get CMake to tell them the list.	2025-08-06 13:39:07 +00:00
Craig Topper	73685583c8	[VP][RISCV] Add a vp.load.ff intrinsic for fault only first load. (#128593 ) There's been some interest in supporting early-exit loops recently. https://discourse.llvm.org/t/rfc-supporting-more-early-exit-loops/84690 This patch was extracted from our downstream where we've been using it in our vectorizer.	2025-08-05 16:12:42 -07:00
Kazu Hirata	302a452bd6	[llvm] Proofread TestingGuide.rst (#152089 ) Co-authored-by: James Henderson <James.Henderson@sony.com>	2025-08-05 09:46:17 -07:00
Kazu Hirata	0398ad41bd	[llvm] Proofread Coroutines.rst (#151906 )	2025-08-04 07:40:54 -07:00
Gergely Futo	1454db130a	[RISCV] Support resumable non-maskable interrupt handlers (#148134 ) The `rnmi` interrupt attribute value has been added for the `Smrnmi` extension. --------- Co-authored-by: Sam Elliott <sam@lenary.co.uk>	2025-08-04 10:54:50 +02:00
Nikita Popov	86727fe9a1	[IR] Allow poison argument to lifetime markers (#151148 ) This slightly relaxes the invariant established in #149310, by also allowing the lifetime argument to be poison. This is to support the typical pattern of RAUWing with poison when removing an instruction. It's worth noting that this does not require any conservative assumptions, lifetimes with poison arguments can simply be skipped. Fixes https://github.com/llvm/llvm-project/issues/151119.	2025-08-04 10:02:04 +02:00
Kazu Hirata	84fa58c5d8	[llvm] Proofread ProgrammersManual.rst (#151853 ) Co-authored-by: Tim Gymnich <tim@gymni.ch>	2025-08-03 09:41:27 -07:00
S. VenkataKeerthy	21f1f9558d	[IR2Vec][llvm-ir2vec] Changing clEnumValN to cl::SubCommand (#151384 ) Refactor llvm-ir2vec to use subcommands instead of a mode flag for better CLI usability. - Converted the `--mode` flag to three distinct subcommands: `triplets`, `entities`, and `embeddings` - Updated documentation, tests, and python script	2025-08-02 13:44:55 -07:00
Kazu Hirata	183b38078a	[llvm] Proofread *.rst (#151087 ) This patch hyphenates words that are used as adjecives, such as: - architecture specific - human readable - implementation defined - language independent - language specific - machine readable - machine specific - target independent - target specific	2025-08-01 07:01:50 -07:00
veera	6da1a0908a	[Docs] Update Opt's Option to Specify Pass Pipeline (NFC) (#148402 ) Since the new pass manager, we use `--passes=<string>` to specify the pass pipeline instead of the `-{passname}` syntax.	2025-08-01 15:18:01 +02:00
Mészáros Gergely	1194353650	[LLVM][Docs][SPIRV] Correct `-spirv-ext` option name, reorder (#150423 ) The option is `-spirv-ext` not `-spirv-extensions`. Also move the examples after the description of the option, instead of after the list of extensions, where its easy to miss when skimming. --------- Co-authored-by: Nathan Gauër <github@keenuts.net>	2025-08-01 09:46:08 +02:00
Joel E. Denny	37e03b56b8	Revert "[PGO] Add `llvm.loop.estimated_trip_count` metadata" (#151585 ) Reverts llvm/llvm-project#148758 [As requested.](https://github.com/llvm/llvm-project/pull/148758#pullrequestreview-3076627201)	2025-07-31 15:56:31 -04:00
Mircea Trofin	b383efc3e3	[lit] Optionally exclude xfail tests (#151191 ) See the related issue. We want to set up a build bot where `opt` runs with `-enable-profcheck`, which inserts `MD_prof` before running the rest of the pipeline requested from `opt`, and then validates resulting profile information (more info in the RFC linked by the aforementioned issue) In that setup, we will also ignore `FileCheck`: while the profile info inserted is, currently, equivalent to the profile info a pass would observe via `BranchProbabilityInfo`/`BlockFrequencyInfo`, (1) we may want to change that, and (2) some tests are quite sensitive to the output IR, and break if, for instance, extra metadata is present (which it would be due to `-enable-profcheck`). Since we're just interested in profile consistency on the upcoming bot, ignoring `FileCheck` is simpler and sufficient. However, this has the effect of passing XFAIL tests. Rather than listing them all, the alternative is to just exclude XFAIL tests. This PR adds support for that by introducing a `--exclude-xfail` option to `llvm-lit`. Issue #147390	2025-07-31 21:50:38 +02:00
Joel E. Denny	f7b65011de	[PGO] Add `llvm.loop.estimated_trip_count` metadata (#148758 ) This patch implements the `llvm.loop.estimated_trip_count` metadata discussed in [[RFC] Fix Loop Transformations to Preserve Block Frequencies](https://discourse.llvm.org/t/rfc-fix-loop-transformations-to-preserve-block-frequencies/85785). As [suggested in the RFC comments](https://discourse.llvm.org/t/rfc-fix-loop-transformations-to-preserve-block-frequencies/85785/4), it adds the new metadata to all loops at the time of profile ingestion and estimates each trip count from the loop's `branch_weights` metadata. As [suggested in the PR #128785 review](https://github.com/llvm/llvm-project/pull/128785#discussion_r2151091036), it does so via a new `PGOEstimateTripCountsPass` pass, which creates the new metadata for each loop but omits the value if it cannot estimate a trip count due to the loop's form. An important observation not previously discussed is that `PGOEstimateTripCountsPass` often cannot estimate a loop's trip count, but later passes can sometimes transform the loop in a way that makes it possible. Currently, such passes do not necessarily update the metadata, but eventually that should be fixed. Until then, if the new metadata has no value, `llvm::getLoopEstimatedTripCount` disregards it and tries again to estimate the trip count from the loop's current `branch_weights` metadata.	2025-07-31 12:28:25 -04:00
Jonas Devlieghere	7abb519e9e	[lldb] Add release note for Wasm debugging (#151445 ) Add a release note for Wasm debugging.	2025-07-31 08:15:23 -07:00
David Spickett	c10736aa49	[llvm][docs] Refresh Arm builtins cross compile guide (#150966 ) Someone asked about this on Discord and it was a bit hard to follow. I found them a config that worked, but the doc was not as much help as it should have been. It probably needs some updates for the runtime build era, but for now, I'm just making it easier to read. I know the basic build can work at least. Some aspects of it may be literally wrong now, but I'll check that later. * Remove contractions. * Remove references to the old separate llvm repo layout. * Remove mentions of cmake versions older than what llvm requires now. * Make a bunch of things plain text. * Make a bunch of things code blocks so they are easier to copy and paste from.	2025-07-31 09:55:38 +01:00
S. VenkataKeerthy	8e9a0fc0f2	[IR2Vec] Add triplet generation utility script for vocabulary training (#149215 ) Added a Python utility script for generating IR2Vec triplets and updated documentation to reference it. The script generates triplets in a form suitable for training the vocabulary. (Tracking issues - #141817, #141834; closes - #141834)	2025-07-30 10:08:15 -07:00

1 2 3 4 5 ...

11851 Commits