llvm-project

Author	SHA1	Message	Date
Grigory Pastukhov	f66bd8e81a	[LLVM] Add flatten function attribute to LLVM IR and implement recursive inlining in AlwaysInliner (#174899 ) This adds a new function-level `flatten` LLVM IR attribute and implements support for it in the AlwaysInliner pass, bringing LLVM's behavior in line with GCC. Previously, the `flatten` attribute only existed as a Clang attribute, which was lowered to `alwaysinline` on individual call sites. Per the RFC discussion [1], the consensus was to match GCC semantics: recursively inline the entire call tree into the flattened function, rather than just immediate call sites. This PR: - Adds the `flatten` function attribute to LLVM IR - Implements recursive inlining of all viable callees in AlwaysInliner - Uses inline history tracking to detect and stop at recursive call cycles - Emits optimization remarks when inlining is skipped due to recursion A follow-up patch will update Clang to emit the LLVM `flatten` attribute on functions instead of marking individual call sites with `alwaysinline`. [1] https://discourse.llvm.org/t/rfc-function-level-flatten-depth-attribute-for-depth-limited-inlining	2026-03-19 11:25:46 -07:00
gonzalobg	ea8fb06f24	[atomicrmw] fminimumnum/fmaximumnum support (#187030 ) Adds support for `atomicrmw` `fminimumnum`/`fmaximumnum` operations. These were added to C++ in P3008, and are exposed in libc++ in #186716 . Adding LLVM IR support for these unblocks work in both backends with HW support, and frontends.	2026-03-18 09:35:49 +01:00
Pedro Lobo	57568c288d	[Reland][IR] Add initial support for the byte type (#186888 ) This patch relands https://github.com/llvm/llvm-project/pull/178666. The original version caused CI failures due to the missing target triple in `llvm/test/CodeGen/X86/byte-constants.ll`. CI should be green now.	2026-03-16 23:32:24 +00:00
Pedro Lobo	70cd2acbd3	Revert "[IR] Add initial support for the byte type" (#186713 ) Reverts llvm/llvm-project#178666 to unblock CI. `CodeGen/X86/byte-constants.ll` is at fault. Will look into it and hopefully fix it by tomorrow.	2026-03-15 23:29:21 +00:00
Pedro Lobo	80f2ef70f5	[IR] Add initial support for the byte type (#178666 ) Following the [byte type RFC](https://discourse.llvm.org/t/rfc-add-a-new-byte-type-to-llvm-ir/89522) and the discussions within the [LLVM IR Formal Specification WG](https://discourse.llvm.org/t/rfc-forming-a-working-group-on-formal-specification-for-llvm/89056), this PR introduces initial support for the byte type in LLVM. This PR: - Adds the byte type to LLVM's type system - Extends the `bitcast` instruction to accept the byte operands - Adds parsing tests for all new functionality - Fixes failing regressions tests (IR2Vec and IRNormalizer) --------- Co-authored-by: George Mitenkov <georgemitenk0v@gmail.com>	2026-03-15 21:56:06 +00:00
Alexis Engelke	64fc793dd1	[IR][Core][NFC] Drop some BranchInst uses (#186352 ) Now that CondBrInst and UncondBrInst are explicit subclasses, use them instead. HotColdSplitting was trying to inspect prof metadata also on unconditional branches, fix this. Also introduce C API cast functions and deprecate LLVMIsConditional in favor of LLVMIsACondBrInst. This patch covers all LLVM uses outside of Transforms, Analysis, CodeGen/Target, SandboxIR, Frontend/OpenMP, tools, examples.	2026-03-13 11:37:02 +01:00
Teresa Johnson	cfa039e96e	[MemProf] Skip handling of memprof records for non-prevailing functions (#185963 ) When building the combined summary index during a thin link, we already performed a memory optimization for non-prevailing copies of a function by not recording their allocation and callsite info in the associated function summary. We can save on the thin link time as well by avoiding building the memprof summary structures just to throw them away later in the non-prevailing case. The reason we were eagerly building these structures is that the memprof summaries precede the corresponding function summary record, and we don't know whether this is the prevailing copy of the function until we parse the function summary record. To facilitate the new handling, we emit the memprof summary records after the corresponding function summary record. The bitcode summary version is bumped, and the reader is changed to support both versions, for backwards compatibility. Note that there is already a memprof test that tests an older record type and will also test reading of the legacy version of the ordering: (llvm/test/ThinLTO/X86/memprof-old-alloc-context-summary.ll. To make the new handling even more efficient, the lookup/insertion of stack IDs in the combined summary index and the caching of their corresponding stack index in the StackIdToIndex map is made lazy. This resulted in a 27% reduction in thin link time for a large target (21% without the lazy insertion change).	2026-03-12 11:00:25 -07:00
Alexis Engelke	4fd826d1f9	[IR] Split Br into UncondBr and CondBr (#184027 ) BranchInst currently represents both unconditional and conditional branches. However, these are quite different operations that are often handled separately. Therefore, split them into separate opcodes and classes to allow distinguishing these operations in the type system. Additionally, this also slightly improves compile-time performance.	2026-03-11 12:31:10 +00:00
yonghong-song	3e05ab6322	[ThinLTO] Reduce the number of renaming due to promotions (#183793 ) Currently for thin-lto, the imported static global values (functions, variables, etc) will be promoted/renamed from e.g., foo() to foo.llvm.(). Such a renaming caused difficulties in live patching since function name is changed ([1]). It is possible that some global value names have to be promoted to avoid name collision and linker failure. But in practice, majority of name promotions can be avoided. In [2], the suggestion is that thin-lto pre-link decides whether a particular global value needs name promotion or not. If yes, later on in thinBackend() the name will be promoted. I compiled a particular linux kernel version (latest bpf-next tree) and found 1216 global values with suffix .llvm.. With this patch, the number of promoted functions is 2, 98% reduction from the original kernel build. If some native objects are not participating with LTO, name promotions have to be done to avoid potential linker issues. So the current implementation cannot be on by default. But in certain cases, e.g., linux kernel build, people can enable lld flag --lto-whole-program-visibility to reduce the number of functions like foo.llvm.(). For ThinLTOCodeGenerator.cpp which is used by llvm-lto tool and a few other rare cases, reducing the number of renaming due to promotion, is not implemented as lld flag '-lto-whole-program-visibility' is not supported in ThinLTOCodeGenerator.cpp for now. In summary, this pull request only supports llvm-lto2 style workflow. The feature is off by default. To enable the future, lld flag '-lto-whole-program-visibility' and llvm flag '-always-rename-promoted-locals=false' are needed. The link [3] has more context for the pull request discussions. [1] https://lpc.events/event/19/contributions/2212 [2] https://discourse.llvm.org/t/rfc-avoid-functions-like-foo-llvm-for-kernel-live-patch/89400 [3] https://github.com/llvm/llvm-project/pull/178587	2026-02-28 12:44:25 -08:00
yonghong-song	cd50a3074b	Revert "[ThinLTO] Reduce the number of renaming due to promotions (#178587 )" (#183782 ) There is a conflict with existing code. See https://github.com/llvm/llvm-project/pull/178587 Revert and resolve the conflict and then will submit later.	2026-02-27 10:04:30 -08:00
yonghong-song	975dba2863	[ThinLTO] Reduce the number of renaming due to promotions (#178587 ) Currently for thin-lto, the imported static global values (functions, variables, etc) will be promoted/renamed from e.g., foo() to foo.llvm.<hash>(). Such a renaming caused difficulties in live patching since function name is changed ([1]). It is possible that some global value names have to be promoted to avoid name collision and linker failure. But in practice, majority of name promotions can be avoided. In [2], the suggestion is that thin-lto pre-link decides whether a particular global value needs name promotion or not. If yes, later on in thinBackend() the name will be promoted. I compiled a particular linux kernel version (latest bpf-next tree) and found 1216 global values with suffix .llvm.<hash>. With this patch, the number of promoted functions is 2, 98% reduction from the original kernel build. If some native objects are not participating with LTO, name promotions have to be done to avoid potential linker issues. So the current implementation cannot be on by default. But in certain cases, e.g., linux kernel build, people can enable lld flag --lto-whole-program-visibility to reduce the number of functions like foo.llvm.<hash>(). For ThinLTOCodeGenerator.cpp which is used by llvm-lto tool and a few other rare cases, reducing the number of renaming due to promotion, is not implemented as lld flag '-lto-whole-program-visibility' is not supported in ThinLTOCodeGenerator.cpp for now. In summary, this pull request only supports llvm-lto2 style workflow. [1] https://lpc.events/event/19/contributions/2212 [2] https://discourse.llvm.org/t/rfc-avoid-functions-like-foo-llvm-for-kernel-live-patch/89400	2026-02-27 09:09:54 -08:00
Peter Collingbourne	943504eb08	IR: Add prefalign attribute for function definitions. The prefalign attribute determines the function's preferred alignment. By default, the function's preferred alignment is set in a target-specific way, but it may be overridden with this attribute. The backend logic will be added in followup patches. Part of this RFC: https://discourse.llvm.org/t/rfc-enhancing-function-alignment-attributes/88019 Reviewers: efriedma-quic, nikic, arsenm Pull Request: https://github.com/llvm/llvm-project/pull/155527	2026-02-20 10:54:01 -08:00
Teresa Johnson	90e8deb1d5	[MemProf] Optimize BitcodeReader stack id lookups (#182097 ) Introduce StackIdToIndex to ModuleSummaryIndexBitcodeReader to cache the mapping from module-local stack id indices to the global index in the ModuleSummaryIndex's StackIds vector. This avoids repeated hash lookups when processing callsite and allocation records. This reduced the thin link time for a large target built with memprof by ~16%. Also add assertions to ensure STACK_IDS records are processed once and that the cache is empty initially.	2026-02-18 11:29:29 -08:00
Sam Elliott	0d08cb0e70	[outliners] Turn nooutline into an Enum Attribute (#163665 ) This change turns the `"nooutline"` attribute into an enum attribute called `nooutline`, and adds an auto-upgrader for bitcode to make the same change to existing IR. This IR attribute disables both the Machine Outliner (enabled at Oz for some targets), and the IR Outliner (disabled by default).	2026-02-10 21:44:17 -08:00
Matt Arsenault	2502e3b7ba	IR: Promote "denormal-fp-math" to a first class attribute (#174293 ) Convert "denormal-fp-math" and "denormal-fp-math-f32" into a first class denormal_fpenv attribute. Previously the query for the effective denormal mode involved two string attribute queries with parsing. I'm introducing more uses of this, so it makes sense to convert this to a more efficient encoding. The old representation was also awkward since it was split across two separate attributes. The new encoding just stores the default and float modes as bitfields, largely avoiding the need to consider if the other mode is set. The syntax in the common cases looks like this: `denormal_fpenv(preservesign,preservesign)` `denormal_fpenv(float: preservesign,preservesign)` `denormal_fpenv(dynamic,dynamic float: preservesign,preservesign)` I wasn't sure about reusing the float type name instead of adding a new keyword. It's parsed as a type but only accepts float. I'm also debating switching the name to subnormal to match the current preferred IEEE terminology (also used by nofpclass and other contexts). This has a behavior change when using the command flag debug options to set the denormal mode. The behavior of the flag ignored functions with an explicit attribute set, per the default and f32 version. Now that these are one attribute, the flag logic can't distinguish which of the two components were explicitly set on the function. Only one test appeared to rely on this behavior, so I just avoided using the flags in it. This also does not perform all the code cleanups this enables. In particular the attributor handling could be cleaned up. I also guessed at how to support this in MLIR. I followed MemoryEffects as a reference; it appears bitfields are expanded into arguments to attributes, so the representation there is a bit uglier with the 2 2-element fields flattened into 4 arguments.	2026-02-05 13:31:26 +00:00
Vladislav Dzhidzhoev	b9cecee3fb	Reland "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" (#165032 ) This is an attempt to merge https://reviews.llvm.org/D144006 with LTO fix. The last merge attempt was https://github.com/llvm/llvm-project/pull/75385. The issue with it was investigated in https://github.com/llvm/llvm-project/pull/75385#issuecomment-2386684121. The problem happens when 1. Several modules are being linked. 2. There are several DISubprograms that initially belong to different modules but represent the same source code function (for example, a function included from the same source code file). 3. Some of such DISubprograms survive IR linking. It may happen if one of them is inlined somewhere or if the functions that have these DISubprograms attached have internal linkage. 4. Each of these DISubprograms has a local type that corresponds to the same source code type. These types are initially from different modules, but have the same ODR identifier. If the same (in the sense of ODR identifier/ODR uniquing rules) local type is present in two modules, and these modules are linked together, the type gets uniqued. A DIType, that happens to be loaded first, survives linking, and the references on other types with the same ODR identifier from the modules loaded later are replaced with the references on the DIType loaded first. Since defintion subprograms, in scope of which these types are located, are not deduplicated, the linker output may contain multiple DISubprogram's having the same (uniqued) type in their retainedNodes lists. Further compilation of such modules causes crashes. To tackle that, * previous solution to handle LTO linking with local types in retainedNodes is removed (cloneLocalTypes() function), * for each loaded distinct (definition) DISubprogram, its retainedNodes list is scanned after loading, and DITypes with a scope of another subprogram are removed. If something from a Function corresponding to the DISubprogram references uniqued type, we rely on cross-CU links. Additionally: * a check is added to Verifier to report about local types located in a wrong retainedNodes list, Original commit message follows. --------- RFC https://discourse.llvm.org/t/rfc-dwarfdebug-fix-and-improve-handling-imported-entities-types-and-static-local-in-subprogram-and-lexical-block-scopes/68544 Similar to imported declarations, the patch tracks function-local types in DISubprogram's 'retainedNodes' field. DwarfDebug is adjusted in accordance with the aforementioned metadata change and provided a support of function-local types scoped within a lexical block. The patch assumes that DICompileUnit's 'enums field' no longer tracks local types and DwarfDebug would assert if any locally-scoped types get placed there. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Co-authored-by: Jeremy Morse <jeremy.morse@sony.com>	2026-02-04 00:34:52 +01:00
Teresa Johnson	b30971c4bb	[ThinLTO] Remove unused relative block frequency support (#177215 ) This removes most of the handling of the relative block frequency support added in 2018 in c73cec84c99e5a63dca961fef67998a677c53a3c, which was disabled by default and never utilized in the thin link as expected. Support for reading old Bitcode containing the record is maintained as required for backwards compatibility requirements, as is the support for parsing old LLVM assembly containing that information. Tests ensure that this backwards compatibility is maintained. This came up in the context of redundant BFI/DT computations which existed largely for the purpose of computing this information and are being addressed in PR176646.	2026-01-21 11:39:57 -08:00
Aiden Grossman	e2d7cd685d	[IR] Make dead_on_return attribute optionally sized This patch makes the dead_on_return parameter attribute optionally require a number of bytes to be passed in to specify the number of bytes known to be dead upon function return/unwind. This is aimed at enabling annotating the this pointer in C++ destructors with dead_on_return in clang. We need this to handle cases like the following: ``` struct X { int n; ~X() { this[n].n = 0; } }; void f() { X xs[] = {42, -1}; } ``` Where we only certain that sizeof(X) bytes are dead upon return of ~X. Otherwise DSE would be able to eliminate the store in ~X which would not be correct. This patch only does the wiring within IR. Future patches will make clang emit correct sizing information and update DSE to only delete stores to objects marked dead_on_return that are provably in bounds of the number of bytes specified to be dead_on_return. Reviewers: nikic, alinas, antoniofrighetto Pull Request: https://github.com/llvm/llvm-project/pull/171712	2026-01-21 08:22:05 -08:00
Jameson Nash	2458387ac1	[NFC] replace getValueType with more specific getFunctionType (#177175 ) When trivially valid already, use the more specific method, instead of casting the result of the less specific method.	2026-01-21 10:30:09 -05:00
Michael Buch	8f90efdee8	[llvm][DebugInfo][NFC] Remove DITypeRefArray in favour of DITypeArray (#177066 ) `DITypeRefArray` is just an alias (since https://github.com/llvm/llvm-project/pull/176938). Remove it in favour of just using `DITypeArray`.	2026-01-21 01:10:58 +00:00
Nikita Popov	af7c10618b	[BitcodeReader] Improve error messages Avoid using "Invalid record" for all errors. At least mention what kind of record it is.	2026-01-19 14:28:40 +01:00
Alexis Engelke	6813f8f037	[IR] Don't store switch case values as operands SwitchInst case values must be ConstantInt, which have no use list. Therefore it is not necessary to store these as Use, instead store them more efficiently as a simple array of pointers after the uses, similar to how PHINode stores basic blocks. After this change, the successors of all terminators are stored consecutively in the operand list. This is preparatory work for improving the performance of successor access. Add new C API functions so that switch case values remain accessible from bindings for other languages. While this could also be achieved by merely changing the order of operands (i.e., first all successors, then all constants), doing so would increase the asymptotic runtime of addCase from O(1) to O(n) (i.e., adding n cases would be O(n^2)), because it would need to shift all constants by one slot. Having null/invalid operands is also a bad idea and would cause much more breakage. Pull Request: https://github.com/llvm/llvm-project/pull/170984	2025-12-11 18:38:39 +01:00
Nikita Popov	53ce8502c6	[Bitcode] Use ConstantInt::getSigned() This is encoded as a signed value, so use getSigned().	2025-12-09 16:02:47 +01:00
Nikita Popov	ec1ea0a4ca	[llvm-c] Deprecate functions working on the global context (#163979 ) One of the most common mistakes when working with the LLVM C API is to mix functions that work on the global context and those that work on an explicit context. This often results in seemingly nonsensical errors because types from different contexts are mixed. We have considered the APIs working on the global context to be obsolete for a long time already, and do not add any new APIs using the global context. However, the fact that these still exist (and have shorter names) continues to cause issues. This PR proposes to deprecate these APIs, with intent to remove them at some point in the future. RFC: https://discourse.llvm.org/t/rfc-deprecate-c-api-functions-using-the-global-context/88639	2025-12-08 08:29:48 +00:00
Vitaly Buka	90e3ac6c55	Revert "[IR] Don't store switch case values as operands" (#170962 ) Reverts llvm/llvm-project#166842 Breaks Mips LLVM tests, and LLD on bots. See llvm/llvm-project#166842	2025-12-06 03:09:58 +00:00
Alexis Engelke	f26360f215	[IR] Don't store switch case values as operands (#166842 ) SwitchInst case values must be ConstantInt, which have no use list. Therefore it is not necessary to store these as Use, instead store them more efficiently as a simple array of pointers after the uses, similar to how PHINode stores basic blocks. After this change, the successors of all terminators are stored consecutively in the operand list. This is preparatory work for improving the performance of successor access.	2025-12-05 17:25:23 +01:00
Mingjie Xu	91531f3208	[ThinLTO] Fix parsing null aliasee in alias summary (#169490 ) In `f8182f1aef`, we add support for printing "null" aliasee in AsmWriter, but missing support in LLParser.	2025-12-02 10:08:50 +08:00
Peter Collingbourne	d2379effe9	Add deactivation symbol operand to ConstantPtrAuth. Deactivation symbol operands are supported in the code generator by building on the previously added support for IRELATIVE relocations. Reviewers: ojhunt, fmayer, ahmedbougacha, nikic, efriedma-quic Reviewed By: fmayer Pull Request: https://github.com/llvm/llvm-project/pull/133537	2025-11-26 12:39:40 -08:00
Jay Foad	d54168013a	[LLVM] Use "syncscope" instead of "synchscope" in comments. NFC. (#134615 ) This matches the spelling of the keyword in LLVM IR.	2025-11-25 14:11:49 +00:00
Shubham Sandeep Rastogi	20ebc7ea82	Add new llvm.dbg.declare_value intrinsic. (#168132 ) For swift async code, we need to use a debug intrinsic that behaves like an llvm.dbg.declare but can take any location type rather than just a pointer or integer. To solve this, a new debug instrinsic called llvm.dbg.declare_value has been created, which behaves exactly like an llvm.dbg.declare but can take non pointer and integer location types. More information here: https://discourse.llvm.org/t/rfc-introduce-new-llvm-dbg-coroframe-entry-intrinsic/88269 This is the first patch as part of a stack of patches, with the one succeeding it being: https://github.com/llvm/llvm-project/pull/168134	2025-11-22 00:49:35 -08:00
Kazu Hirata	4749cc4071	[Bitcode] Use a range-based for loop (NFC) (#168489 ) Identified with modernize-loop-convert.	2025-11-18 07:16:50 -08:00
Jay Foad	f037f41350	[IR] Add new function attribute nocreateundeforpoison (#164809 ) Also add a corresponding intrinsic property that can be used to mark intrinsics that do not introduce poison, for example simple arithmetic intrinsics that propagate poison just like a simple arithmetic instruction. As a smoke test this patch adds the new property to llvm.amdgcn.fmul.legacy.	2025-11-04 12:00:44 +00:00
Andrew Ng	ab487b6378	[BitcodeReader][NFC] Tidy getEnableSplitLTOUnitAndUnifiedFlag (#165732 )	2025-11-04 10:07:54 +00:00
Orlando Cazalet-Hyams	aa5fe56db4	[DebugInfo] Add dataSize to DIBasicType to add DW_AT_bit_size to _BitInt types (#164372 ) DW_TAG_base_type DIEs are permitted to have both byte_size and bit_size attributes "If the value of an object of the given type does not fully occupy the storage described by a byte size attribute" * Add DataSizeInBits to DIBasicType (`DIBasicType(... dataSize: n ...)` in IR). * Change Clang to add DataSizeInBits to _BitInt type metadata. * Change LLVM to add DW_AT_bit_size to base_type DIEs that have non-zero DataSizeInBits. TODO: Do we need to emit DW_AT_data_bit_offset for big endian targets? See discussion on the PR. Fixes [#61952](https://github.com/llvm/llvm-project/issues/61952) --------- Co-authored-by: David Stenberg <david.stenberg@ericsson.com>	2025-10-29 15:23:46 +00:00
Michael Buch	49f918d4c3	[llvm][Bitcode][ObjC] Fix order of setter/getter argument to DIObjCProperty constructor (#165421 ) Depends on: * https://github.com/llvm/llvm-project/pull/165401 We weren't testing `DIObjCProperty` roundtripping. So this was never caught. The consequence of this is that the `setter:` would have the getter name and `getter:` would have the setter name.	2025-10-29 12:14:56 +00:00
Teresa Johnson	eb74d8e03c	[ThinLTO] Add index flag for internalization/promotion status (#164530 ) Add an index-wide flag indicating whether index-based internalization and promotion have completed. This will be used in a follow on change.	2025-10-22 07:30:43 -07:00
Daniel Kiss	048070ba6f	[ARM][AArch64] BTI,GCS,PAC Module flag update. (#86212 ) Module flag is used to indicate the feature to be propagated to the function. As now the frontend emits all attributes accordingly let's help the auto upgrade to only do work when old and new bitcodes are merged. Depends on #82819 and #86031	2025-10-22 09:29:06 +02:00
Teresa Johnson	683e2bf059	[ThinLTO] Make SummaryList private (NFC) (#164355 ) In preparation for a follow on change that will require checking every time a new summary is added to the SummaryList for a GUID, make the SummaryList private and require all accesses to go through one of two new interfaces. Most changes are to access the list via the read only getSummaryList() method, and the few that add new summaries (e.g. while building the combined summary) use the new addSummary() method.	2025-10-21 06:53:40 -07:00
Michael Buch	cf1cdde24e	[llvm][DebugInfo] Add 'sourceLanguageVersion' field support to DICompileUnit (#162632 ) Depends on: * https://github.com/llvm/llvm-project/pull/162445 In preparation to emit DWARFv6's `DW_AT_language_version`.	2025-10-15 16:52:45 +01:00
Mingjie Xu	26eca2439c	Move the preserve-{bc,ll}-uselistorder options out of individual tools, make them global defaults for AsmWriter and BitcodeWriter (#160079 ) This patch moves the `preserve-bc-uselistorder` and `preserve-ll-uselistorder` options out of individual tools(opt, llvm-as, llvm-dis, llvm-link, llvm-extract) and make them global defaults for AsmWriter and BitcodeWriter. These options are useful when we use `-print-*` options to dump LLVM IR.	2025-10-11 09:19:50 +08:00
Michael Buch	c32753a77a	[llvm][DebugInfo] Add 'sourceLanguageName' field support to DICompileUnit (#162445 ) Depends on: * https://github.com/llvm/llvm-project/pull/162255 * https://github.com/llvm/llvm-project/pull/162434 Part of a patch series to support the DWARFv6 `DW_AT_language_name`/`DW_AT_language_version` attributes.	2025-10-10 09:54:04 +01:00
Michael Buch	6cba572d9e	[llvm][DebugInfo][NFC] Abstract DICompileUnit::SourceLanguage to allow alternate DWARF SourceLanguage encoding (#162255 ) This patch sets up `DICompileUnit` to support the DWARFv6 `DW_AT_language_name` and `DW_AT_language_version` attributes (which are set to replace `DW_AT_language`). This patch changes the `DICompileUnit::SourceLanguage` field type to a `DISourceLanguageName` that encapsulates the notion of "versioned vs. unversioned name". A "versioned" name is one that has an associated version stored separately in `DISourceLanguageName::Version`. This patch just changes all the clients of the `getSourceLanguage` API to the expect a `DISourceLanguageName`. Currently they all just `assert` (via `DISourceLanguageName::getUnversionedName`) that we're dealing with "unversioned names" (i.e., the pre-DWARFv6 language codes). In follow-up patches (e.g., draft is at https://github.com/llvm/llvm-project/pull/162261), when we start emitting versioned language codes, the `getUnversionedName` calls can then be adjusted to `getName`. Implementation considerations * We could have added a new member to `DICompileUnit` alongside the existing `SourceLanguage` field. I don't think this would have made the transition any simpler (clients would still need to be aware of "versioned" vs. "unversioned" language names). I felt that encapsulating this inside a `DISourceLanguageName` was easier to reason about for maintainers. * Currently DISourceLanguageName is a `12` byte structure. We could probably pack all the info inside a `uint64_t` (16-bits for the name, 32-bits for the version, 1-bit for answering the `hasVersionedName`). Just to keep the prototype simple I used a `std::optional`. But since the guts of the structure are hidden, we can always change the layout to a more compact representation instead. How to review * The new `DISourceLanguageName` structure is defined in `DebugInfoMetadata.h`. All the other changes fall out from changing the `DICompileUnit::SourceLanguage` from `unsigned` to `DISourceLanguageName`.	2025-10-08 18:27:22 +01:00
Marco Elver	224873d7ac	[AllocToken] Introduce sanitize_alloc_token attribute and alloc_token metadata (#160131 ) In preparation of adding the "AllocToken" pass, add the pre-requisite `sanitize_alloc_token` function attribute and `alloc_token` metadata. --- This change is part of the following series: 1. https://github.com/llvm/llvm-project/pull/160131 2. https://github.com/llvm/llvm-project/pull/156838 3. https://github.com/llvm/llvm-project/pull/162098 4. https://github.com/llvm/llvm-project/pull/162099 5. https://github.com/llvm/llvm-project/pull/156839 6. https://github.com/llvm/llvm-project/pull/156840 7. https://github.com/llvm/llvm-project/pull/156841 8. https://github.com/llvm/llvm-project/pull/156842	2025-10-07 12:51:42 +02:00
Antonio Frighetto	32c6e16246	[IR] Introduce `llvm.errno.tbaa` metadata for errno alias disambiguation Add a new named module-level frontend-annotated metadata that specifies the TBAA node for an integer access, for which, C/C++ `errno` accesses are guaranteed to use (under strict aliasing). This should allow LLVM to prove the involved memory location/ accesses may not alias `errno`; thus, to perform optimizations around errno-writing libcalls (store-to-load forwarding amongst others). Previous discussion: https://discourse.llvm.org/t/rfc-modelling-errno-memory-effects/82972.	2025-09-24 15:59:32 +02:00
Wael Yehia	74bea4c1ad	[IR] enable attaching metadata on ifuncs (#158732 ) Teach the IR parser and writer to support metadata on ifuncs, and update documentation. In PR #153049, we have a use case of attaching the `!associated` metadata to an ifunc. Since an ifunc is similar to a function declaration, it seems natural to allow metadata on ifuncs. Currently, the metadata API allows adding Metadata to llvm::GlobalObject, so the in-memory IR allows for metadata on ifuncs, but the IR reader/writer is not aware of that. --------- Co-authored-by: Wael Yehia <wyehia@ca.ibm.com>	2025-09-19 11:41:57 -04:00
Rahul Joshi	3a695e1abe	[NFC][LLVM] Namespace cleanup in MetadataLoader.cpp (#157595 ) - Remove forward declaration of `llvm::Argument` and include Argument.h instead. - Restrict scope of anonymous namespaces to just class declarations. - Move local static function out of anonymous namespace. - Remove a redundant assert when indexing a vector.	2025-09-15 10:32:45 -07:00
Alexandre Ganea	5cda2424c8	[LLD][COFF] Add more `--time-trace` tags for ThinLTO linking (#156471 ) In order to better see what's going on during ThinLTO linking, this PR adds more profile tags when using `--time-trace` on a `lld-link.exe` invocation. After PR, linking `clang.exe`: <img width="3839" height="2026" alt="Capture d’écran 2025-09-02 082021" src="https://github.com/user-attachments/assets/bf0c85ba-2f85-4bbf-a5c1-800039b56910" /> Linking a custom (Unreal Engine game) binary gives a completly different picture, probably because of using Unity files, and the sheer amount of input files (here, providing over 60 GB of .OBJs/.LIBs). <img width="1940" height="1008" alt="Capture d’écran 2025-09-02 102048" src="https://github.com/user-attachments/assets/60b28630-7995-45ce-9e8c-13f3cb5312e0" />	2025-09-05 15:28:19 -04:00
Peter Collingbourne	a1eeb59027	Bitcode: Stop combining function alignments into MaxAlignment. MaxAlignment is used to produce the abbreviation for MODULE_CODE_GLOBALVAR and is not used for anything related to function alignments, so stop combining function alignments and rename it to make its purpose clearer. Reviewers: teresajohnson Reviewed By: teresajohnson Pull Request: https://github.com/llvm/llvm-project/pull/155341	2025-08-26 11:22:33 -07:00
Alexander Richardson	3a4b351ba1	[IR] Introduce the `ptrtoaddr` instruction This introduces a new `ptrtoaddr` instruction which is similar to `ptrtoint` but has two differences: 1) Unlike `ptrtoint`, `ptrtoaddr` does not capture provenance 2) `ptrtoaddr` only extracts (and then extends/truncates) the low index-width bits of the pointer For most architectures, difference 2) does not matter since index (address) width and pointer representation width are the same, but this does make a difference for architectures that have pointers that aren't just plain integer addresses such as AMDGPU fat pointers or CHERI capabilities. This commit introduces textual and bitcode IR support as well as basic code generation, but optimization passes do not handle the new instruction yet so it may result in worse code than using ptrtoint. Follow-up changes will update capture tracking, etc. for the new instruction. RFC: https://discourse.llvm.org/t/clarifiying-the-semantics-of-ptrtoint/83987/54 Reviewed By: nikic Pull Request: https://github.com/llvm/llvm-project/pull/139357	2025-08-08 10:12:39 -07:00
Austin	c7bacc9f26	[llvm] using wrapper llvm::sort(nfc) (#151000 ) using wrapper llvm::sort(nfc)	2025-08-04 09:27:01 +08:00

1 2 3 4 5 ...

2317 Commits