llvm-project

Author	SHA1	Message	Date
Steffen Larsen	9501114ca0	[Verifier] Make verifier fail when global variable size exceeds address space size (#179625 ) When a global variable has a size that exceeds the size of the address space it resides in, the verifier should fail as the variable can neither be materialized nor fully accessed. This patch adds a check to the verifier to enforce it. --------- Signed-off-by: Steffen Holst Larsen <HolstLarsen.Steffen@amd.com> Co-authored-by: Steffen Holst Larsen <HolstLarsen.Steffen@amd.com>	2026-02-10 13:27:38 +01:00
Rahul Joshi	b12e3122c8	[NFC][Core][CodeGen] Remove pass initialization from pass constructors (#180153 )	2026-02-06 09:05:47 -08:00
Matt Arsenault	2502e3b7ba	IR: Promote "denormal-fp-math" to a first class attribute (#174293 ) Convert "denormal-fp-math" and "denormal-fp-math-f32" into a first class denormal_fpenv attribute. Previously the query for the effective denormal mode involved two string attribute queries with parsing. I'm introducing more uses of this, so it makes sense to convert this to a more efficient encoding. The old representation was also awkward since it was split across two separate attributes. The new encoding just stores the default and float modes as bitfields, largely avoiding the need to consider if the other mode is set. The syntax in the common cases looks like this: `denormal_fpenv(preservesign,preservesign)` `denormal_fpenv(float: preservesign,preservesign)` `denormal_fpenv(dynamic,dynamic float: preservesign,preservesign)` I wasn't sure about reusing the float type name instead of adding a new keyword. It's parsed as a type but only accepts float. I'm also debating switching the name to subnormal to match the current preferred IEEE terminology (also used by nofpclass and other contexts). This has a behavior change when using the command flag debug options to set the denormal mode. The behavior of the flag ignored functions with an explicit attribute set, per the default and f32 version. Now that these are one attribute, the flag logic can't distinguish which of the two components were explicitly set on the function. Only one test appeared to rely on this behavior, so I just avoided using the flags in it. This also does not perform all the code cleanups this enables. In particular the attributor handling could be cleaned up. I also guessed at how to support this in MLIR. I followed MemoryEffects as a reference; it appears bitfields are expanded into arguments to attributes, so the representation there is a bit uglier with the 2 2-element fields flattened into 4 arguments.	2026-02-05 13:31:26 +00:00
Vladislav Dzhidzhoev	b9cecee3fb	Reland "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" (#165032 ) This is an attempt to merge https://reviews.llvm.org/D144006 with LTO fix. The last merge attempt was https://github.com/llvm/llvm-project/pull/75385. The issue with it was investigated in https://github.com/llvm/llvm-project/pull/75385#issuecomment-2386684121. The problem happens when 1. Several modules are being linked. 2. There are several DISubprograms that initially belong to different modules but represent the same source code function (for example, a function included from the same source code file). 3. Some of such DISubprograms survive IR linking. It may happen if one of them is inlined somewhere or if the functions that have these DISubprograms attached have internal linkage. 4. Each of these DISubprograms has a local type that corresponds to the same source code type. These types are initially from different modules, but have the same ODR identifier. If the same (in the sense of ODR identifier/ODR uniquing rules) local type is present in two modules, and these modules are linked together, the type gets uniqued. A DIType, that happens to be loaded first, survives linking, and the references on other types with the same ODR identifier from the modules loaded later are replaced with the references on the DIType loaded first. Since definition subprograms, in scope of which these types are located, are not deduplicated, the linker output may contain multiple DISubprogram's having the same (uniqued) type in their retainedNodes lists. Further compilation of such modules causes crashes. To tackle that, * previous solution to handle LTO linking with local types in retainedNodes is removed (cloneLocalTypes() function), * for each loaded distinct (definition) DISubprogram, its retainedNodes list is scanned after loading, and DITypes with a scope of another subprogram are removed. If something from a Function corresponding to the DISubprogram references uniqued type, we rely on cross-CU links. Additionally: * a check is added to Verifier to report about local types located in a wrong retainedNodes list, Original commit message follows. --------- RFC https://discourse.llvm.org/t/rfc-dwarfdebug-fix-and-improve-handling-imported-entities-types-and-static-local-in-subprogram-and-lexical-block-scopes/68544 Similar to imported declarations, the patch tracks function-local types in DISubprogram's 'retainedNodes' field. DwarfDebug is adjusted in accordance with the aforementioned metadata change and provided a support of function-local types scoped within a lexical block. The patch assumes that DICompileUnit's 'enums field' no longer tracks local types and DwarfDebug would assert if any locally-scoped types get placed there. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Co-authored-by: Jeremy Morse <jeremy.morse@sony.com>	2026-02-04 00:34:52 +01:00
Diana Picus	9022f47ca4	[AMDGPU] Implement llvm.sponentry (#176357 ) In some of our use cases, the GPU runtime stores some data at the top of the stack. It figures out where it's safe to store it by using the PAL metadata generated by the backend, which includes the total stack size. However, the metadata does not include the space reserved at the bottom of the stack for the trap handler when CWSR is enabled in dynamic VGPR mode. This space is reserved dynamically based on whether or not the code is running on the compute queue. Therefore, the runtime needs a way to take that into account. Add support for `llvm.sponentry`, which should return the base of the stack, skipping over any reserved areas. This allows us to keep this computation in one place rather than duplicate it between the backend and the runtime. The implementation for functions that set up their own stack uses a pseudo that is expanded to the same code sequence as that used in the prolog to set up the stack in the first place. In callable functions, we generate a fixed stack object and use that instead, similar to the Arm/AArch64 approach. This wastes some stack space but that's not a problem for now because we're not planning to use this in callable functions yet.	2026-02-03 15:02:07 +01:00
Matt Arsenault	6934ed51b3	IR: Add !nofpclass metadata (#177140 ) This adds the analogous metadata to the nofpclass attribute to assert values are not a certain set of floating-point classes. This allows the same information to be expressed if a function argument is passed indirectly. This matches the bitmask encoding of nofpclass. I also think this should be allowed for stores to symmetrically handle sret, but leave that for later. Alternatively we could add a more expressive !fprange metadata, but that would be much more complex. It's useful to match the attribute, and more annotations can always be added. Fixes #133560	2026-01-22 20:49:34 +01:00
Nathan Gauër	9247e89706	[IR] Add llvm.structured.gep instruction (#176145 ) This commit adds initial support for `@llvm.structured.gep` instruction in Clang. This intrinsic is supposed to be used as an alternative to ptrdiff/GEP when pointer arithmetic is invalid and only structured access is possible. Link to the RFC: https://discourse.llvm.org/t/rfc-adding-instructions-to-to-carry-gep-type-traversal-information/ Previous discussion around the documentation: https://github.com/llvm/llvm-project/pull/167883	2026-01-21 15:45:08 +00:00
Jameson Nash	2458387ac1	[NFC] replace getValueType with more specific getFunctionType (#177175 ) When trivially valid already, use the more specific method, instead of casting the result of the less specific method.	2026-01-21 10:30:09 -05:00
Luke Lau	cee36b23cc	[IR] Allow non-constant offsets in @llvm.vector.splice.{left,right} (#174693 ) Following on from #170796, this PR implements the second part of https://discourse.llvm.org/t/rfc-allow-non-constant-offsets-in-llvm-vector-splice/88974 by allowing non-constant offsets in the vector splice intrinsics. Previously @llvm.vector.splice had a restriction enforced by the verifier that the offset had to be known to be within the range of the vector at compile time. Because we can't enforce this with non-constant offsets, it's been relaxed so that offsets that would slide the vector out of bounds return a poison value, similar to insertelement/extractelement. @llvm.vector.splice.left also previously only allowed offsets within the range 0 <= Offset < N, but this has been relaxed to 0 <= Offset <= N so that it's consistent with @llvm.vector.splice.right. In lieu of the verifier checks that were removed, InstSimplify has been taught to fold splices to poison when the offset is out of bounds. The cost model isn't implemented in this PR, and just returns invalid for any non-constant offsets for now. I think the correct way to cost these non-constant offsets isn't through getShuffleCost because they can't handle variable masks, but instead just through getIntrinsicInstCost.	2026-01-21 10:58:40 +00:00
nataliakokoromyti	aa995d9634	[IR][Verifier] Reject GEP into vector with non-byte-addressable element type (#176689 ) Add a verifier check to reject GEP instructions that index into vectors with non-byte-addressable element types (e.g., <2 x i1>). Such GEPs cannot have their offset computed in bytes, causing assertions in passes like SROA that try to compute byte offsets. Fixes #176628.	2026-01-20 12:02:26 +01:00
Oxygen	9671aae8d5	[DSE][Verifier] Respect the calling convention of the function specified by "alloc-variant-zeroed" (#175911 ) Require that the calling convention between the zeroed and non-zeroed variants is the same, and set it appropriately in the DSE transform.	2026-01-16 15:45:40 +00:00
Dmitry Sidorov	4e95be7043	[RFC][SPIR-V] Add intrinsics to convert to/from ap.float (#164252 ) The patch adds two intrinsics: llvm.convert.to.arbitrary.fp and llvm.convert.from.arbitrary.fp. The intrinsics perform conversions between values whose interpretation differs from their representation in LLVM IR. The intrinsics are overloaded on both its return type and first argument. Metadata operands describe how the raw bits should be interpreted before and after the conversion. Typical use case is to convert IEEE-754 floating point types to FP8/FP4 and backwards for ML applications. Addresses https://discourse.llvm.org/t/rfc-spir-v-way-to-represent-float8-in-llvm-ir/87758/10	2026-01-14 16:53:53 +01:00
Kerry McLaughlin	04e5bc7dfb	[AArch64] Add support for range prefetch intrinsic (#170490 ) This patch adds support in Clang for the RPRFM instruction, by adding the following intrinsics: ``` void __pldx_range(unsigned int access_kind, unsigned int retention_policy, signed int length, unsigned int count, signed int stride, size_t reuse_distance, void const *addr); void __pld_range(unsigned int access_kind, unsigned int retention_policy, uint64_t metadata, void const *addr); ``` The `__ARM_PREFETCH_RANGE` macro can be used to test whether these intrinsics are implemented. If the RPRFM instruction is not available, this instruction is a NOP. This implements the following ACLE proposal: https://github.com/ARM-software/acle/pull/423	2026-01-12 15:53:17 +00:00
Sean Fertile	b6212a4caf	XCOFF associated metadata (#159096 ) Add a new metadata node `!implicit.ref` to represent an implicit dependency between 2 symbols. The metadata is unique to AIX and gets lowered to a relocation that adds an explicit link between the section the global that the metadata is placed on is allocated in, to the associated symbol. This relocation will cause the associated symbol to remain live if the section is not garbage collected. This is used mainly for compiler features where there is some hidden runtime dependency between the symbols that isn't otherwise obvious to the linker.	2026-01-09 13:49:21 -05:00
Luke Lau	ad4bfac732	[IR] Split vector.splice into vector.splice.left and vector.splice.right (#170796 ) This PR implements the first change outlined in https://discourse.llvm.org/t/rfc-allow-non-constant-offsets-in-llvm-vector-splice/88974?u=lukel In order to allow non-immediate offsets in the llvm.vector.splice intrinsic, we need to separate out the "shift left" and "shift right" modes into two separate intrinsics, which were previously determined by whether or not the offset is positive or negative. The description in the LangRef has also been reworded in terms of sliding elements left or right and extracting either the upper or lower half as opposed to extracting from a certain index, which brings it in line with the definition of `llvm.fshr.*`/`llvm.fshl.*`. This patch teaches AutoUpgrade.cpp to upgrade the old intrinsics into their new equivalent one based on their offset, so existing uses of vector.splice should still work. Uses of llvm.vector.splice in `llvm/test/CodeGen` haven't been replaced in this PR to keep the diff small and kick the tyres on the AutoUpgrader a bit. I planned to do this in a follow up NFC but can include it in this PR if reviewers prefer. Similarly the shuffle costing kind `SK_Splice` has just been kept the same for now, to be split into `SK_SpliceLeft` and `SK_SpliceRight` later.	2026-01-06 15:41:26 +08:00
Stefan Weigl-Bosker	da8497ed08	[IR][Verifier] Verification for `target-features` attribute (#173119 ) Fixes https://github.com/llvm/llvm-project/issues/172647 Currently, MC assumes that all `target-feature` flag attributes are well formed and will crash otherwise. This change handles those cases more gracefully.	2025-12-22 11:13:56 +01:00
Teresa Johnson	37a73d587a	[MemProf] Update metadata verification for a single string tag (#172543 ) The memprof metadata verifier supported multiple string tags, but in reality, the other code (e.g. addCallStack) only supports a single such tag. Update the verifier to reflect that limitation, and the associated tests. Fixes #157217	2025-12-18 22:19:04 -08:00
Alexis Engelke	6813f8f037	[IR] Don't store switch case values as operands SwitchInst case values must be ConstantInt, which have no use list. Therefore it is not necessary to store these as Use, instead store them more efficiently as a simple array of pointers after the uses, similar to how PHINode stores basic blocks. After this change, the successors of all terminators are stored consecutively in the operand list. This is preparatory work for improving the performance of successor access. Add new C API functions so that switch case values remain accessible from bindings for other languages. While this could also be achieved by merely changing the order of operands (i.e., first all successors, then all constants), doing so would increase the asymptotic runtime of addCase from O(1) to O(n) (i.e., adding n cases would be O(n^2)), because it would need to shift all constants by one slot. Having null/invalid operands is also a bad idea and would cause much more breakage. Pull Request: https://github.com/llvm/llvm-project/pull/170984	2025-12-11 18:38:39 +01:00
Nikita Popov	c9648d7acd	[Verifier] Make sure all constexprs in instructions are visited (#171643 ) Previously this only happened for constants of some types and missed incorrect ptrtoaddr.	2025-12-11 08:13:48 +01:00
Diana Picus	578a26ada2	[AMDGPU] Relax restrictions on amdgcn.cs.chain intrinsic (#169785 ) We have a new use-case for chain functions, so slightly relax the restriction on which calling conventions may contain calls to chain functions.	2025-12-10 11:12:46 +01:00
Vitaly Buka	90e3ac6c55	Revert "[IR] Don't store switch case values as operands" (#170962 ) Reverts llvm/llvm-project#166842 Breaks Mips LLVM tests, and LLD on bots. See llvm/llvm-project#166842	2025-12-06 03:09:58 +00:00
Alexis Engelke	f26360f215	[IR] Don't store switch case values as operands (#166842 ) SwitchInst case values must be ConstantInt, which have no use list. Therefore it is not necessary to store these as Use, instead store them more efficiently as a simple array of pointers after the uses, similar to how PHINode stores basic blocks. After this change, the successors of all terminators are stored consecutively in the operand list. This is preparatory work for improving the performance of successor access.	2025-12-05 17:25:23 +01:00
Luke Lau	cc5b07c761	[IR] Fix vector.splice verifier scaling by vscale for fixed length vectors (#170807 ) Currently we multiply the known minimum number of elements by vscale even if the vector in question is fixed, so sometimes we miss some fixed vectors with out of bounds indices.	2025-12-05 16:49:28 +08:00
Robert Imschweiler	e84fdbe1ef	[IR] Add CallBr intrinsics support (#133907 ) This commit adds support for using intrinsics with callbr. The uses of this will most of the time look like this example: ```llvm callbr void @llvm.amdgcn.kill(i1 %c) to label %cont [label %kill] kill: unreachable cont: ... ```	2025-12-04 10:21:00 +01:00
Tom Tromey	efbbca62d1	[llvm][DebugInfo] Allow DIDerivedType as a bound in DISubrangeType (#165880 ) Consider this Ada type: ``` type Array_Type is array (Natural range <>) of Integer; type Record_Type (L1, L2 : Natural) is record I1 : Integer; A1 : Array_Type (1 .. L1); I2 : Integer; A2 : Array_Type (1 .. L2); I3 : Integer; end record; ``` Here, the array fields have lengths that depend on the discriminants of the record type. However, in this case the array lengths cannot be expressed as DWARF location expressions, with the issue being that "A2" has a non-constant offset, but an expression involving DW_OP_push_object_address will push the address of the field -- with no way to find the location of "L2". In a case like this, I believe the correct DWARF is to emit the array ranges using a direct reference to the discriminant, like: ``` <3><1156>: Abbrev Number: 1 (DW_TAG_member) <1157> DW_AT_name : l1 ... <3><1177>: Abbrev Number: 6 (DW_TAG_array_type) <1178> DW_AT_name : (indirect string, offset: 0x1a0b): vla__record_type__T4b <117c> DW_AT_type : <0x1287> <1180> DW_AT_sibling : <0x118e> <4><1184>: Abbrev Number: 7 (DW_TAG_subrange_type) <1185> DW_AT_type : <0x1280> <1189> DW_AT_upper_bound : <0x1156> ``` (FWIW this is what GCC has done for years.) This patch makes this possible in LLVM, by letting a DISubrangeType refer to a DIDerivedType. gnat-llvm can then arrange for the DIE reference to be correct by setting the array type's scope to be the record.	2025-12-04 09:38:14 +09:00
Peter Collingbourne	d2379effe9	Add deactivation symbol operand to ConstantPtrAuth. Deactivation symbol operands are supported in the code generator by building on the previously added support for IRELATIVE relocations. Reviewers: ojhunt, fmayer, ahmedbougacha, nikic, efriedma-quic Reviewed By: fmayer Pull Request: https://github.com/llvm/llvm-project/pull/133537	2025-11-26 12:39:40 -08:00
Shubham Sandeep Rastogi	20ebc7ea82	Add new llvm.dbg.declare_value intrinsic. (#168132 ) For swift async code, we need to use a debug intrinsic that behaves like an llvm.dbg.declare but can take any location type rather than just a pointer or integer. To solve this, a new debug intrinsic called llvm.dbg.declare_value has been created, which behaves exactly like an llvm.dbg.declare but can take non pointer and integer location types. More information here: https://discourse.llvm.org/t/rfc-introduce-new-llvm-dbg-coroframe-entry-intrinsic/88269 This is the first patch as part of a stack of patches, with the one succeeding it being: https://github.com/llvm/llvm-project/pull/168134	2025-11-22 00:49:35 -08:00
Laxman Sole	58b8e6e424	[DebugInfo][IR] Verifier checks for the extraData (#167971 ) LLVM IR verifier checks for `extraData` in debug info metadata. This is a follow-up PR based on discussions in #165023	2025-11-18 14:33:40 -05:00
Daniel Thornburgh	c9ff2df8c3	[IR] "modular-format" attribute for functions using format strings (#147429 ) A new InstCombine transform uses this attribute to rewrite calls to a modular version of the implementation along with llvm.reloc.none relocations against aspects of the implementation needed by the call. This change only adds support for the 'float' aspect, but it also builds the structure needed for others. See issue #146159	2025-11-11 11:52:56 -08:00
Nabeel Omer	7a58b417bc	Add FramePointerKind::NonLeafNoReserve (#163775 ) This patch adds a new `FramePointerKind::NonLeafNoReserve` and makes it the default for `-momit-leaf-frame-pointer`. It also adds a new commandline option `-m[no-]reserve-frame-pointer-reg`. This should fix #154379, the main impact of this patch can be found in `clang/lib/Driver/ToolChains/CommonArgs.cpp`.	2025-11-11 17:25:49 +00:00
Vladislav Dzhidzhoev	e2a2c03eef	[DebugInfo] Add Verifier check for incorrectly-scoped retainedNodes (#166855 ) These checks ensure that retained nodes of a DISubprogram belong to the subprogram. Tests with incorrect IR are fixed. We should not have variables of one subprogram present in retained nodes of other subprograms. Also, interface for accessing DISubprogram's retained nodes is slightly refactored. `DISubprogram::visitRetainedNodes` and `DISubprogram::forEachRetainedNode` are added to avoid repeating checks like ``` if (const auto LV = dyn_cast<DILocalVariable>(N)) ... else if (const auto L = dyn_cast<DILabel>(N)) ... else if (const auto *IE = dyn_cast<DIImportedEntity>(N)) ... ```	2025-11-10 13:13:49 +01:00
Damian Heaton	70f4b596cf	Add `llvm.vector.partial.reduce.fadd` intrinsic (#159776 ) With this intrinsic, and supporting SelectionDAG nodes, we can better make use of instructions such as AArch64's `FDOT`.	2025-11-07 15:36:54 +00:00
Daniel Thornburgh	5f08fb4d72	[IR] llvm.reloc.none intrinsic for no-op symbol references (#147427 ) This intrinsic emits a BFD_RELOC_NONE relocation at the point of call, which allows optimizations and languages to explicitly pull in symbols from static libraries without there being any code or data that has an effectual relocation against such a symbol. See issue #146159 for context.	2025-11-06 08:52:46 -08:00
Rahul Joshi	37fff6e17e	[NFC][LLVM][IR] Cleanup namespace usage in LLVM IR cpp files (#166477 )	2025-11-05 11:06:22 -08:00
jofrn	5c666f559c	IR/Verifier: Allow vector type in atomic load and store (#148893 ) Vector types on atomics are assumed to be invalid by the verifier. However, this type can be valid if it is lowered by codegen.	2025-10-23 01:33:57 -04:00
Nikita Popov	573ca36753	[IR] Replace alignment argument with attribute on masked intrinsics (#163802 ) The `masked.load`, `masked.store`, `masked.gather` and `masked.scatter` intrinsics currently accept a separate alignment immarg. Replace this with an `align` attribute on the pointer / vector of pointers argument. This is the standard representation for alignment information on intrinsics, and is already used by all other memory intrinsics. This means the signatures now match llvm.expandload, llvm.vp.load, etc. (Things like llvm.memcpy used to have a separate alignment argument as well, but were already migrated a long time ago.) It's worth noting that the masked.gather and masked.scatter intrinsics previously accepted a zero alignment to indicate the ABI type alignment of the element type. This special case is gone now: If the align attribute is omitted, the implied alignment is 1, as usual. If ABI alignment is desired, it needs to be explicitly emitted (which the IRBuilder API already requires anyway).	2025-10-20 08:50:09 +00:00
Nathan Corbyn	b00c4ff4b9	[Matrix][IR] Cap stride bitwidth at 64 (#163729 ) a1ef81d added overloads for `llvm.matrix.column.major.store` and `llvm.matrix.column.major.load` that allow strides to occupy an arbitrary bitwidth. This change wasn't reflected in the verifier, causing an assertion to trip when given strides overflowing 64-bit. This patch explicitly caps the bitwidth at 64, repairing the crash and avoiding future complexity dealing with strides that overflow 64 bits. PR: https://github.com/llvm/llvm-project/pull/163729	2025-10-17 12:54:28 +01:00
Juan Manuel Martinez Caamaño	7429a08e73	[NFC][Verifier] Fix typo initalizer->initializer (#163193 )	2025-10-13 14:07:36 +00:00
Marco Elver	6359980f5b	[AllocToken, Clang] Implement TypeHashPointerSplit mode (#156840 ) Implement the TypeHashPointerSplit mode: This mode assigns a token ID based on the hash of the allocated type's name, where the top half ID-space is reserved for types that contain pointers and the bottom half for types that do not contain pointers. This mode with max tokens of 2 (`-falloc-token-max=2`) may also be valuable for heap hardening strategies that simply separate pointer types from non-pointer types. Make it the new default mode. Link: https://discourse.llvm.org/t/rfc-a-framework-for-allocator-partitioning-hints/87434 --- This change is part of the following series: 1. https://github.com/llvm/llvm-project/pull/160131 2. https://github.com/llvm/llvm-project/pull/156838 3. https://github.com/llvm/llvm-project/pull/162098 4. https://github.com/llvm/llvm-project/pull/162099 5. https://github.com/llvm/llvm-project/pull/156839 6. https://github.com/llvm/llvm-project/pull/156840 7. https://github.com/llvm/llvm-project/pull/156841 8. https://github.com/llvm/llvm-project/pull/156842	2025-10-08 21:58:37 +02:00
Marco Elver	224873d7ac	[AllocToken] Introduce sanitize_alloc_token attribute and alloc_token metadata (#160131 ) In preparation of adding the "AllocToken" pass, add the pre-requisite `sanitize_alloc_token` function attribute and `alloc_token` metadata. --- This change is part of the following series: 1. https://github.com/llvm/llvm-project/pull/160131 2. https://github.com/llvm/llvm-project/pull/156838 3. https://github.com/llvm/llvm-project/pull/162098 4. https://github.com/llvm/llvm-project/pull/162099 5. https://github.com/llvm/llvm-project/pull/156839 6. https://github.com/llvm/llvm-project/pull/156840 7. https://github.com/llvm/llvm-project/pull/156841 8. https://github.com/llvm/llvm-project/pull/156842	2025-10-07 12:51:42 +02:00
Nikita Popov	63ca8483d0	[IR] Introduce !captures metadata (#160913 ) This introduces `!captures` metadata on stores, which looks like this: ``` store ptr %x, ptr %y, !captures !{!"address", !"read_provenance"} ``` The semantics are the same as replacing the store with a call like this: ``` call void @llvm.store(ptr captures(address, read_provenance) %x, ptr %y) ``` This metadata is intended for annotation by frontends -- it's not something we can feasibly infer at this point, as it would require analyzing uses of the pointer stored in memory. The motivating use case for this is Rust's `println!()` machinery, which involves storing a reference to the value inside a structure. This means that printing code (including conditional debugging code), can inhibit optimizations because the pointer escapes. With the new metadata we can annotate this as a read-only capture, which has less impact on optimizations.	2025-10-01 08:58:47 +02:00
Nikita Popov	fd8adf3ccf	[IR] Use immarg for preallocated intrinsics (NFC) (#155835 ) Mark the attributes as immarg to indicate that they require a constant integer. This was previously enforced with a manual verifier check.	2025-09-29 09:14:33 +02:00
Antonio Frighetto	8f7cfd4e9e	[Verifier] Modify TBAAVerifier helpers signatures to accept a nullable (NFC) sanitizer-aarch64-linux-bootstrap-ubsan buildbot was previously failing. Resolves: https://lab.llvm.org/buildbot/#/builders/169/builds/15232.	2025-09-24 17:47:59 +02:00
Antonio Frighetto	32c6e16246	[IR] Introduce `llvm.errno.tbaa` metadata for errno alias disambiguation Add a new named module-level frontend-annotated metadata that specifies the TBAA node for an integer access, for which, C/C++ `errno` accesses are guaranteed to use (under strict aliasing). This should allow LLVM to prove the involved memory location/ accesses may not alias `errno`; thus, to perform optimizations around errno-writing libcalls (store-to-load forwarding amongst others). Previous discussion: https://discourse.llvm.org/t/rfc-modelling-errno-memory-effects/82972.	2025-09-24 15:59:32 +02:00
Nikita Popov	5887006510	[IR] Forbid mixing condition and operand bundle assumes (#160460 ) Assumes either have a boolean condition, or a number of attribute based operand bundles. Currently, we also allow mixing both forms, though we don't make use of this in practice. This adds additional complexity for code dealing with assumes. Forbid mixing both forms, by requiring that assumes with operand bundles have an i1 true condition.	2025-09-24 12:42:50 +02:00
Craig Topper	678dcf13d8	[IR] Fix a few implicit conversions from TypeSize to uint64_t. NFC (#159894 )	2025-09-20 14:18:47 -07:00
Wael Yehia	74bea4c1ad	[IR] enable attaching metadata on ifuncs (#158732 ) Teach the IR parser and writer to support metadata on ifuncs, and update documentation. In PR #153049, we have a use case of attaching the `!associated` metadata to an ifunc. Since an ifunc is similar to a function declaration, it seems natural to allow metadata on ifuncs. Currently, the metadata API allows adding Metadata to llvm::GlobalObject, so the in-memory IR allows for metadata on ifuncs, but the IR reader/writer is not aware of that. --------- Co-authored-by: Wael Yehia <wyehia@ca.ibm.com>	2025-09-19 11:41:57 -04:00
Sander de Smalen	17e008db17	[IR] NFC: Remove 'experimental' from partial.reduce.add intrinsic (#158637 ) The partial reduction intrinsics are no longer experimental, because they've been used in production for a while and are unlikely to change.	2025-09-17 11:44:47 +01:00
Joel E. Denny	0e3c5566c0	[PGO] Add llvm.loop.estimated_trip_count metadata (#152775 ) This patch implements the `llvm.loop.estimated_trip_count` metadata discussed in [[RFC] Fix Loop Transformations to Preserve Block Frequencies](https://discourse.llvm.org/t/rfc-fix-loop-transformations-to-preserve-block-frequencies/85785). As the RFC explains, that metadata enables future patches, such as PR #128785, to fix block frequency issues without losing estimated trip counts.	2025-09-11 15:55:18 -04:00
Mircea Trofin	f2d827c444	[profcheck] Require `unknown` metadata have an origin parameter (#157594 ) Rather than passes using `!prof = !{!"unknown"}` for cases where they don’t have enough information to emit profile values, this patch captures the pass (or some other information) that can help diagnostics - i.e. `!{!"unknown", !"some-pass-name"}`. For example, suppose we emitted a `select` with the unknown metadata, and, later, end up needing to lower that to a conditional branch. If we observe (via sample profiling, for example) that the branch is biased and would have benefitted from a valid profile, the extra information can help speed up debugging. We can also (in a subsequent pass) generate optimization remarks about such lowered selects, with a similar aim - identify patterns lowering to `select` that may be worth some extra investment in extracting a more precise profile.	2025-09-10 15:34:35 -07:00

1260 Commits