llvm-project

Author	SHA1	Message	Date
Peter Collingbourne	75bb30ddbf	Move {load,store}(llvm.protected.field.ptr) lowering to InstCombine. The previous position of llvm.protected.field.ptr lowering for loads and stores was problematic as it not only inhibited optimizations such as DSE (as stores to a llvm.protected.field.ptr were not considered to must-alias stores to the non-protected.field pointer) but also required changes to other optimization passes to avoid transformations that would reduce PFP coverage. Address this by moving the load/store part of the lowering to InstCombine, where it will run earlier than the PFP-breaking and AA-relying transformations. The deactivation symbol, null comparison and EmuPAC parts of the lowering remain in PreISelLowering. Now that the transformation inhibitions are no longer needed, remove them (i.e. partially revert #151649, and revert #182976). This change resulted in a 2.4% reduction in Fleetbench .text size and the following improvements to PFP performance overhead for BM_PROTO_Arena on various microarchitectures: before after Apple M2 Ultra 3.5% 3.3% Google Axion C4A 3.3% 2.9% Google Axion N4A 2.7% 2.2% Reviewers: fmayer, nikic, vitalybuka Reviewed By: fmayer Pull Request: https://github.com/llvm/llvm-project/pull/186548	2026-04-06 17:47:24 -07:00
Alexis Engelke	7581430722	[IR] Require well-formed IR for BasicBlock::getTerminator (#189416 ) BasicBlock::getTerminator() is frequently called on valid IR, yet the function has to check that the last instruction is in fact a terminator, even in release builds. This check can only be optimized away when the instruction is dereferenced. Therefore, introduce the functions hasTerminator() and getTerminatorOrNull() as replacement and require (assert) that getTerminator() always returns a valid terminator. As a side effect, this forces explicit expression of intent at call sites when unfinished basic blocks should be supported.	2026-03-30 18:57:37 +02:00
Alexis Engelke	926bea91c1	[Transforms/Utils][NFC] Replace SmallPtrSet with vector (#186664 ) Typically most blocks in a function are reachable, so use a vector indexed by block number instead of a SmallPtrSet.	2026-03-15 16:44:52 +01:00
Alexis Engelke	e2fef479f8	[Transforms/Utils][NFC] Drop uses of BranchInst (#186586 )	2026-03-14 12:47:10 +01:00
Mitchel Dickerson	f932646bf6	[SimplifyCFG][PGO] Add missing overflow check to ConstantFoldTerminator (#178964 ) Branch weight metadata can overflow when folding large branch weights. Updated branch weights to uint64_t, added check for overflow, and then set branch weights using setFittedBranchWeights to ensure branch weight metadata is not lost.	2026-02-11 01:17:12 +00:00
Abhay Kanhere	e39d2822bc	[CodeGen][AArch64] ptrauth intrinsic to safely construct relative ptr (#142047 ) ptrauth intrinsic to safely construct relative ptr for swift coroutines. A ptrauth intrinsic for swift co-routine support that allows creation of signed pointer from offset stored at address relative to the pointer. Following C-like pseudo code (ignoring keys,discriminators) explains its operation: let rawptr = PACauth(inputptr); return PACsign( rawptr + signextend64( (int32)(rawptr+addend) )) What: Authenticate a signed pointer, load a 32bit value at offset 'addend' from pointer, add this value to pointer, sign this new pointer. builtin: __builtin_ptrauth_auth_load_relative_and_sign intrinsic: ptrauth_auth_resign_load_relative	2026-02-03 18:03:37 +01:00
int-zjt	a21001ab41	[JumpThreading] Avoid unnecessary map resizing in gatherIncomingValuesToPhi (#173596 ) Previously, `gatherIncomingValuesToPhi` populated the `IncomingValues` map with all non-undef incoming values from the PHI node. For PHI nodes with a large number of incoming blocks, this caused the `SmallDenseMap` to grow significantly, triggering expensive resizing and rehashing operations, even when the caller (`redirectValuesFromPredecessorsToPhi`) was only interested in a small subset of predecessors. This patch optimizes the logic to prevent this unnecessary map growth. Instead of collecting all values, we now: 1. Initialize the `IncomingValues` map specifically for the blocks in `BBPreds` (setting them to `nullptr` initially). 2. Iterate through the PHI node and update the map entries only if the incoming block is already present in the map. This ensures that the size of the map is bounded by the size of `BBPreds`. Since `BBPreds` is typically small, this change keeps the map within the `SmallDenseMap`'s inline storage in most cases, eliminating heap allocations and resizing overhead for large PHI nodes. The `selectIncomingValueForBlock` and `replaceUndefValuesInPhi` helpers are also updated to handle map entries where the value is `nullptr`.	2026-01-28 09:17:21 +01:00
Matt Arsenault	6934ed51b3	IR: Add !nofpclass metadata (#177140 ) This adds the analogous metadata to the nofpclass attribute to assert values are not a certain set of floating-point classes. This allows the same information to be expressed if a function argument is passed indirectly. This matches the bitmask encoding of nofpclass. I also think this should be allowed for stores to symmetrically handle sret, but leave that for later. Alternatively we could add a more expressive !fprange metadata, but that would be much more complex. It's useful to match the attribute, and more annotations can always be added. Fixes #133560	2026-01-22 20:49:34 +01:00
Peter Collingbourne	e60d62b90f	Utils: Inhibit load/store folding through phis for llvm.protected.field.ptr. Protected pointer field loads/stores should be paired with the intrinsic to avoid unnecessary address escapes. Reviewers: nikic Reviewed By: nikic Pull Request: https://github.com/llvm/llvm-project/pull/151649	2025-12-03 17:42:48 -08:00
Drew Kersnar	9c78bc5de4	Revert "[LSV] Merge contiguous chains across scalar types" (#170381 ) Reverts llvm/llvm-project#154069. I pointed out a number of issues post-merge, most importantly examples of miscompiles: https://github.com/llvm/llvm-project/pull/154069#issuecomment-3603854626. While the motivation of the change is clear, I think the implementation approach is flawed. It seems like the goal is to allow elements like `load <2xi16>` and `load i32` to be vectorized together despite the current algorithm not grouping them into the same equivalence classes. I personally think that if we want to attempt this it should be a more wholistic approach, maybe even redefining the concept of an equivalence class. This current solution seems like it would be really hard to do bug-free, and even if the bugs were not present, it is only able to merge chains that happen to be adjacent to each other after `splitChainByContiguity`, which seems like it is leaving things up to chance whether this optimization kicks in. But we can discuss more in the re-land. Maybe the broader approach I'm proposing is too difficult, and a narrow optimization is worthwhile. Regardless, this should be reverted, it needs more iteration before it is correct.	2025-12-02 18:27:58 -05:00
Anshil Gandhi	fbdf8ab590	[LSV] Merge contiguous chains across scalar types (#154069 ) This change enables the LoadStoreVectorizer to merge and vectorize contiguous chains even when their scalar element types differ, as long as the total bitwidth matches. To do so, we rebase offsets between chains, normalize value types to a common integer type, and insert the necessary casts around loads and stores. This uncovers more vectorization opportunities and explains the expected codegen updates across AMDGPU tests. Key changes: - Chain merging - Build contiguous subchains and then merge adjacent ones when: - They refer to the same underlying pointer object and address space. - They are either all loads or all stores. - A constant leader-to-leader delta exists. - Rebasing one chain into the other's coordinate space does not overlap. - All elements have equal total bit width. - Rebase the second chain by the computed delta and append it to the first. - Type normalization and casting - Normalize merged chains to a common integer type sized to the total bits. - For loads: create a new load of the normalized type, copy metadata, and cast back to the original type for uses if needed. - For stores: bitcast the value to the normalized type and store that. - Insert zext/trunc for integer size changes; use bit-or-pointer casts when sizes match. - Cleanups - Erase replaced instructions and DCE pointer operands when safe. - New helpers: computeLeaderDelta, chainsOverlapAfterRebase, rebaseChain, normalizeChainToType, and allElemsMatchTotalBits. Impact: - Increases vectorization opportunities across mixed-typed but size-compatible access chains. - Large set of expected AMDGPU codegen diffs due to more/changed vectorization. This PR resolves #97715.	2025-12-01 23:05:17 -05:00
Laxman Sole	6fe3eccdf4	[llvm][DebugInfo] Emit 0/1 for constant boolean values (#151225 ) Previously, sign-extending a 1-bit boolean operand in `#DBG_VALUE` would convert `true` to -1 (i.e., 0xffffffffffffffff). However, DWARF treats booleans as unsigned values, so this resulted in the attribute `DW_AT_const_value(0xffffffffffffffff)` being emitted. As a result, the debugger would display the value as `255` instead of `true`. This change modifies the behavior to use zero-extension for 1-bit values instead, ensuring that `true` is represented as 1. Consequently, the DWARF attribute emitted is now `DW_AT_const_value(1)`, which allows the debugger to correctly display the boolean as `true`.	2025-11-03 13:34:44 -08:00
Nikita Popov	6c2781e187	[GVN] Share equality propagation for assume and condition (#161639 ) GVN currently has two different implementation of equality propagation. One of them is used for branch conditions (dominating an edge), which performs replacements across multiple blocks. This is also used for assumes to handle uses outside the current block. However, uses inside the block are handled using a completely separate implementation, which involves populating a replacement map and then checking it for individual instructions during normal GVN. While this approach generally makes sense, it is kind of pointless if we already do a use walk to handle the cross-block case anyway. This PR generalizes propagateEquality() to accept either a BasicBlockEdge or an Instruction* and replace dominated users. This removes the need for special handling of uses in the same block for assumes, as they're covered by instruction dominance, and ensures that both implementations do not go out of sync.	2025-10-10 10:58:58 +02:00
Marco Elver	224873d7ac	[AllocToken] Introduce sanitize_alloc_token attribute and alloc_token metadata (#160131 ) In preparation of adding the "AllocToken" pass, add the pre-requisite `sanitize_alloc_token` function attribute and `alloc_token` metadata. --- This change is part of the following series: 1. https://github.com/llvm/llvm-project/pull/160131 2. https://github.com/llvm/llvm-project/pull/156838 3. https://github.com/llvm/llvm-project/pull/162098 4. https://github.com/llvm/llvm-project/pull/162099 5. https://github.com/llvm/llvm-project/pull/156839 6. https://github.com/llvm/llvm-project/pull/156840 7. https://github.com/llvm/llvm-project/pull/156841 8. https://github.com/llvm/llvm-project/pull/156842	2025-10-07 12:51:42 +02:00
Nikita Popov	63ca8483d0	[IR] Introduce !captures metadata (#160913 ) This introduces `!captures` metadata on stores, which looks like this: ``` store ptr %x, ptr %y, !captures !{!"address", !"read_provenance"} ``` The semantics are the same as replacing the store with a call like this: ``` call void @llvm.store(ptr captures(address, read_provenance) %x, ptr %y) ``` This metadata is intended for annotation by frontends -- it's not something we can feasibly infer at this point, as it would require analyzing uses of the pointer stored in memory. The motivating use case for this is Rust's `println!()` machinery, which involves storing a reference to the value inside a structure. This means that printing code (including conditional debugging code), can inhibit optimizations because the pointer escapes. With the new metadata we can annotate this as a read-only capture, which has less impact on optimizations.	2025-10-01 08:58:47 +02:00
Craig Topper	678dcf13d8	[IR] Fix a few implicit conversions from TypeSize to uint64_t. NFC (#159894 )	2025-09-20 14:18:47 -07:00
Mircea Trofin	8f25ea2d73	[NFC] Leave a comment in `Local.cpp` about debug info & sample profiling (#155296 ) Issue #152767	2025-09-12 15:05:16 -07:00
Kazu Hirata	3e28d3c30e	[Utils] Remove an unnecessary cast (NFC) (#156813 ) getZExtValue() already return uint64_t.	2025-09-04 07:46:19 -07:00
Farzon Lotfi	01c0a8409a	[DirectX] Make dx.RawBuffer an op that can't be replaced (#154620 ) fixes #152348 SimplifyCFG collapses raw buffer store from a if\else load into a select. This change prevents the TargetExtType dx.Rawbuffer from being replace thus preserving the if\else blocks. A further change was needed to eliminate the phi node before we process Intrinsic::dx_resource_getpointer in DXILResourceAccess.cpp	2025-08-29 16:09:03 -04:00
Nikita Popov	24924a8be1	[SimplifyCFG] Move token type check into canReplaceOperandWithVariable() We cannot form phis/selects of token type, so this should be checked inside canReplaceOperandWithVariable().	2025-08-28 15:53:37 +02:00
Kazu Hirata	07eb7b7692	[llvm] Replace SmallSet with SmallPtrSet (NFC) (#154068 ) This patch replaces SmallSet<T , N> with SmallPtrSet<T , N>. Note that SmallSet.h "redirects" SmallSet to SmallPtrSet for pointer element types: template <typename PointeeType, unsigned N> class SmallSet<PointeeType, N> : public SmallPtrSet<PointeeType, N> {}; We only have 140 instances that rely on this "redirection", with the vast majority of them under llvm/. Since relying on the redirection doesn't improve readability, this patch replaces SmallSet with SmallPtrSet for pointer element types.	2025-08-18 07:01:29 -07:00
Nikita Popov	c23b4fbdbb	[IR] Remove size argument from lifetime intrinsics (#150248 ) Now that #149310 has restricted lifetime intrinsics to only work on allocas, we can also drop the explicit size argument. Instead, the size is implied by the alloca. This removes the ability to only mark a prefix of an alloca alive/dead. We never used that capability, so we should remove the need to handle that possibility everywhere (though many key places, including stack coloring, did not actually respect this).	2025-08-08 11:09:34 +02:00
Nikita Popov	e833bb0991	[Local] Do not pass Root to replaceDominatedUsesWith (NFC) Capture it in the lambdas instead.	2025-08-04 14:22:17 +02:00
Nikita Popov	86727fe9a1	[IR] Allow poison argument to lifetime markers (#151148 ) This slightly relaxes the invariant established in #149310, by also allowing the lifetime argument to be poison. This is to support the typical pattern of RAUWing with poison when removing an instruction. It's worth noting that this does not require any conservative assumptions, lifetimes with poison arguments can simply be skipped. Fixes https://github.com/llvm/llvm-project/issues/151119.	2025-08-04 10:02:04 +02:00
Nikita Popov	bdd638a897	[Local] Remove handling for lifetime intrinsic on non-alloca (NFC) After #149310 this is guaranteed to be an alloca.	2025-07-23 14:21:22 +02:00
Nikita Popov	307256ecbd	[GVNSink] Do not sink lifetimes of different allocas (#149818 ) This was always undesirable, and after #149310 it is illegal and will result in a verifier error. Fix this by moving SimplifyCFG's check for this into canReplaceOperandWithVariable(), so it's shared with GVNSink.	2025-07-22 09:44:03 +02:00
Jeremy Morse	c9ceb9b75f	[DebugInfo] Remove intrinsic-flavours of findDbgUsers (#149816 ) This is one of the final remaining debug-intrinsic specific codepaths out there, and pieces of cross-LLVM infrastructure to do with debug intrinsics.	2025-07-21 17:49:25 +01:00
Prabhu Rajasekaran	921c6dbeca	[llvm] Introduce callee_type metadata Introduce `callee_type` metadata which will be attached to the indirect call instructions. The `callee_type` metadata will be used to generate `.callgraph` section described in this RFC: https://lists.llvm.org/pipermail/llvm-dev/2021-July/151739.html Reviewers: morehouse, petrhosek, nikic, ilovepi Reviewed By: nikic, ilovepi Pull Request: https://github.com/llvm/llvm-project/pull/87573	2025-07-18 14:40:54 -07:00
Jeremy Morse	c9d8b68676	[DebugInfo] Suppress lots of users of DbgValueInst (#149476 ) This is another prune of dead code -- we never generate debug intrinsics nowadays, therefore there's no need for these codepaths to run. --------- Co-authored-by: Nikita Popov <github@npopov.com>	2025-07-18 11:31:52 +01:00
Jeremy Morse	2a1869b981	[DebugInfo] Shave even more users of DbgVariableIntrinsic from LLVM (#149136 ) At this stage I'm just opportunistically deleting any code using debug-intrinsic types, largely adjacent to calls to findDbgUsers. I'll get to deleting that in probably one or more two commits.	2025-07-18 08:25:10 +01:00
Jeremy Morse	7eb65f470c	[DebugInfo] Delete a now-unused function after 5328c732a4770	2025-07-16 15:45:36 +01:00
Jeremy Morse	5328c732a4	[DebugInfo] Strip more debug-intrinsic code from local utils (#149037 ) SROA and a few other facilities use generic-lambdas and some overloaded functions to deal with both intrinsics and debug-records at the same time. As part of stripping out intrinsic support, delete a swathe of this code from things in the Utils directory. This is a large diff, but is mostly about removing functions that were duplicated during the migration to debug records. I've taken a few opportunities to replace comments about "intrinsics" with "records", and replace generic lambdas with plain lambdas (I believe this makes it more readable). All of this is chipping away at intrinsic-specific code until we get to removing parts of findDbgUsers, which is the final boss -- we can't remove that until almost everything else is gone.	2025-07-16 14:13:53 +01:00
Jeremy Morse	57a5f9c47e	[DebugInfo][RemoveDIs] Suppress getNextNonDebugInfoInstruction (#144383 ) There are no longer debug-info instructions, thus we don't need this skipping. Horray!	2025-07-15 15:34:10 +01:00
Kunqiu Chen	a6e1700fa6	[Utils][Local] Preserve !nosanitize in combineMetadata when merging instructions (#148376 ) `combineMetadata` helper currently drops `!nosanitize` metadata when merging two instructions, even if both originally carried `!nosanitize`. This is problematic because `!nosanitize` is a key mechanism used by sanitizer (e.g., ASan) to suppress instrumentation. Removing it can lead to unintended sanitizer behavior. This patch adds `nosanitize` to the whitelist in combineMetadata, preserving it only if both instructions carry `!nosanitize`; otherwise, it is dropped. This patch also adds corresponding tests in a test file and regenerates it. --- ### Details Example (see [Godbolt](https://godbolt.org/z/83P5eWczx) for details): ```llvm %v1 = load i32, ptr %p, !nosanitize %v2 = load i32, ptr %p, !nosanitize ``` When merged via `combineMetadata(%v1, %v2, ...)`, the resulting instruction loses its `!nosanitize` metadata. Tools such as UBSan and AFL rely on `nosanitize` to prevent unwanted transformations or checks. However, the current implementation of combineMetadata mistakenly drops !nosanitize. This may lead to unintended behavior during optimization. For example, under `-fsanitize=address,undefined -O2`, IR emitted by UBSan may lose its `!nosanitize` metadata due to the incorrect metadata merging in optimization. As a result, ASan could unexpectedly instrument those instructions. > Note: due to the current UBSan handlers having relatively coarse-grained attributes, this specific case is difficult to reproduce end-to-end from source code—UBSan currently inhibits such optimizations (refer to #135135 for details). Still, I believe it's necessary to fix this now, to support future versions of UBSan that might allow such optimizations, and to support third-party tools (such as AFL-based fuzzers) that rely on the presence of !nosanitize.	2025-07-14 15:45:08 +08:00
Jeremy Morse	9eb0020555	[DebugInfo][RemoveDIs] Remove a swathe of debug-intrinsic code (#144389 ) Seeing how we can't generate any debug intrinsics any more: delete a variety of codepaths where they're handled. For the most part these are plain deletions, in others I've tweaked comments to remain coherent, or added a type to (what was) type-generic-lambdas. This isn't all the DbgInfoIntrinsic call sites but it's most of the simple scenarios. Co-authored-by: Nikita Popov <github@npopov.com>	2025-06-17 15:55:14 +01:00
Kazu Hirata	05cd32adb7	[llvm] Remove unused includes (NFC) (#144293 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-06-16 08:59:18 -07:00
Stephen Tozer	aa8a1fa6f5	[DLCov][NFC] Annotate intentionally-blank DebugLocs in existing code (#136192 ) Following the work in PR #107279, this patch applies the annotative DebugLocs, which indicate that a particular instruction is intentionally missing a location for a given reason, to existing sites in the compiler where their conditions apply. This is NFC in ordinary LLVM builds (each function `DebugLoc::getFoo()` is inlined as `DebugLoc()`), but marks the instruction in coverage-tracking builds so that it will be ignored by Debugify, allowing only real errors to be reported. From a developer standpoint, it also communicates the intentionality and reason for a missing DebugLoc. Some notes for reviewers: - The difference between `I->dropLocation()` and `I->setDebugLoc(DebugLoc::getDropped())` is that the former _may_ decide to keep some debug info alive, while the latter will always be empty; in this patch, I always used the latter (even if the former could technically be correct), because the former could result in some (barely) different output, and I'd prefer to keep this patch purely NFC. - I've generally documented the uses of `DebugLoc::getUnknown()`, with the exception of the vectorizers - in summary, they are a huge cause of dropped source locations, and I don't have the time or the domain knowledge currently to solve that, so I've plastered it all over them as a form of "fixme".	2025-06-11 17:42:10 +01:00
Andrew Rogers	b2584e0b17	[llvm] annotate interfaces in llvm/Transforms for DLL export (#143413 ) ## Purpose This patch is one in a series of code-mods that annotate LLVM’s public interface for export. This patch annotates the `llvm/Transforms` library. These annotations currently have no meaningful impact on the LLVM build; however, they are a prerequisite to support an LLVM Windows DLL (shared library) build. ## Background This effort is tracked in #109483. Additional context is provided in [this discourse](https://discourse.llvm.org/t/psa-annotating-llvm-public-interface/85307), and documentation for `LLVM_ABI` and related annotations is found in the LLVM repo [here](https://github.com/llvm/llvm-project/blob/main/llvm/docs/InterfaceExportAnnotations.rst). The bulk of these changes were generated automatically using the [Interface Definition Scanner (IDS)](https://github.com/compnerd/ids) tool, followed formatting with `git clang-format`. The following manual adjustments were also applied after running IDS on Linux: - Removed a redundant `operator<<` from Attributor.h. IDS only auto-annotates the 1st declaration, and the 2nd declaration being un-annotated resulted in an "inconsistent linkage" error on Windows when building LLVM as a DLL. - `#include` the `VirtualFileSystem.h` in PGOInstrumentation.h and remove the local declaration of the `vfs::FileSystem` class. This is required because exporting the `PGOInstrumentationUse` constructor requires the class be fully defined because it is used by an argument. - Add #include "llvm/Support/Compiler.h" to files where it was not auto-added by IDS due to no pre-existing block of include statements. - Add `LLVM_TEMPLATE_ABI` and `LLVM_EXPORT_TEMPLATE` to exported instantiated templates. ## Validation Local builds and tests to validate cross-platform compatibility. This included llvm, clang, and lldb on the following configurations: - Windows with MSVC - Windows with Clang - Linux with GCC - Linux with Clang - Darwin with Clang	2025-06-10 08:10:17 -07:00
Eli Friedman	9f82ac5738	Remove GlobalObject::getAlign/setAlignment (#143188 ) Currently, GlobalObject has an "alignment" property... but it's basically nonsense: alignment doesn't mean the same thing for variables and functions, and it's completely meaningless for ifuncs. This "removes" (actually marking protected) the methods from GlobalObject, adds the relevant methods to Function and GlobalVariable, and adjusts the code appropriately. This should make future alignment-related cleanups easier.	2025-06-09 13:51:03 -07:00
Jeremy Morse	0e4b8b8f81	[DebugInfo][RemoveDIs] Rip out the UseNewDbgInfoFormat flag (#143207 ) Start removing debug intrinsics support -- starting with the flag that controls production of their replacement, debug records. This patch removes the command-line-flag and with it the ability to switch back to intrinsics. The module / function / block level "IsNewDbgInfoFormat" flags get hardcoded to true, I'll to incrementally remove things that depend on those flags.	2025-06-09 19:36:34 +01:00
Ramkumar Ramachandra	b40e4ceaa6	[ValueTracking] Make Depth last default arg (NFC) (#142384 ) Having a finite Depth (or recursion limit) for computeKnownBits is very limiting, but is currently a load-bearing necessity, as all KnownBits are recomputed on each call and there is no caching. As a prerequisite for an effort to remove the recursion limit altogether, either using a clever caching technique, or writing a easily-invalidable KnownBits analysis, make the Depth argument in APIs in ValueTracking uniformly the last argument with a default value. This would aid in removing the argument when the time comes, as many callers that currently pass 0 explicitly are now updated to omit the argument altogether.	2025-06-03 17:12:24 +01:00
Andreas Jonson	7c080e2677	[InstCombine] Avoid to create bitreverse.i1 for or of trunc to i1 (#142258 )	2025-05-31 10:20:49 +02:00
Florian Hahn	fb923e98d1	[Local] Verify opcodes match for all insts passed to mergeFlags (NFC). (#141231 ) The logic for tracking flags relies on all instructions having the same opcode. Add an assert to check, as suggested in https://github.com/llvm/llvm-project/pull/140406. PR: https://github.com/llvm/llvm-project/pull/141231	2025-05-28 16:03:24 +01:00
Florian Hahn	34813d9d38	[Reassociate] Move Disjoint flag handling to OverflowTracking. (#140406 ) Move disjoint flag tracking to OverflowTracking. This enables preserving disjoint flags in Reassociate. Depends on https://github.com/llvm/llvm-project/pull/140404 PR: https://github.com/llvm/llvm-project/pull/140406	2025-05-23 14:59:18 +01:00
Florian Hahn	c92ff61cee	[Local] Move OverflowTracking to Local.h, move logic to helpers (NFC) (#140403 ) Move parts of the logic used by Reassociate to OverflowTracking (mergeFlags & applyFlags) and move the definition to Local.h. For now it just moves the NUW/NSW handling, as this matches the uses in LICM. I'll look into the FP math handling separately, as it looks like there's a difference between Reassociate (takes all flags from I, while LICM takes the intersection of the flags on both instructions). PR: https://github.com/llvm/llvm-project/pull/140403	2025-05-19 21:47:50 +01:00
Ellis Hoag	78f0af5d89	[SimplifyCFG][swifterror] Don't sink calls with swifterror params (#139015 ) We've encountered an LLVM verification failure when building Swift with the SimplifyCFG pass enabled. I found that https://reviews.llvm.org/D158083 fixed this pass by preventing sinking loads or stores of swifterror values, but it did not implement the same protection for call or invokes. In `Verifier.cpp` [here](`c685355811/llvm/lib/IR/Verifier.cpp (L4360-L4364)`) and [here](`c685355811/llvm/lib/IR/Verifier.cpp (L3661-L3662)`) we can see that swifterror values must also be used directly by call instructions.	2025-05-12 14:37:26 -07:00
Paul Kirth	8404b29b41	[llvm][NFC] Fix bracing from #138414 (#138620 ) I had forgotten to upload the formatting change.	2025-05-05 18:21:57 -07:00
Paul Kirth	43eafc0c4a	[llvm][gvn-sink] Don't try to sink inline asm (#138414 ) Fixes #138345. Before this patch, gvn-sink would try to sink inline assembly statements. Other GVN passes avoid them (see `b4fac94181/llvm/lib/Transforms/Scalar/GVN.cpp (L2932)` Similarly, gvn-sink should skip these instructions, since they are not safe to move. To do this, we update the early exit in canReplaceOperandWithVariable, since it should have caught this case. It's more efficient to also skip numbering in GVNSink if the instruction is InlineAsm, but that should be infrequent. The test added is reduced from a failure when compiling Fuchsia with gvn-sink.	2025-05-05 18:16:33 -07:00
Kazu Hirata	5cfd81b0cc	[llvm] Use range constructors of *Set (NFC) (#137552 )	2025-04-27 15:59:57 -07:00
Kazu Hirata	1f56716a7e	[llvm] Use hash_combine_range with ranges (NFC) (#137530 )	2025-04-27 12:31:28 -07:00

1 2 3 4 5 ...

971 Commits