llvm-project

Author	SHA1	Message	Date
Antonio Frighetto	1247fddf36	[SimplifyCFG] Relax `cttz` cost check in `simplifySwitchOfPowersOfTwo` We should be able to allow `simplifySwitchOfPowersOfTwo` transform to take place, as, on recent X86 targets, the weighted latency-size appears to be 2. This favours computing trailing zeroes and indexing into a smaller value table, over generating a jump table with an indirect branch, which overall should be more efficient.	2025-06-24 09:06:18 +02:00
Jeremy Morse	9eb0020555	[DebugInfo][RemoveDIs] Remove a swathe of debug-intrinsic code (#144389 ) Seeing how we can't generate any debug intrinsics any more: delete a variety of codepaths where they're handled. For the most part these are plain deletions, in others I've tweaked comments to remain coherent, or added a type to (what was) type-generic-lambdas. This isn't all the DbgInfoIntrinsic call sites but it's most of the simple scenarios. Co-authored-by: Nikita Popov <github@npopov.com>	2025-06-17 15:55:14 +01:00
Stephen Tozer	a08a831515	[DLCov][NFC] Propagate annotated DebugLocs through transformations (#138047 ) Part of the coverage-tracking feature, following #107279. In order for DebugLoc coverage testing to work, we firstly have to set annotations for intentionally-empty DebugLocs, and secondly we have to ensure that we do not drop these annotations as we propagate DebugLocs throughout compilation. As the annotations exist as part of the DebugLoc class, and not the underlying DILocation, they will not survive a DebugLoc->DILocation->DebugLoc roundtrip. Therefore this patch modifies a number of places in the compiler to propagate DebugLocs directly rather than via the underlying DILocation. This has no effect on the output of normal builds; it only ensures that during coverage builds, we do not drop incorrectly annotations and therefore create false positives. The bulk of these changes are in replacing DILocation::getMergedLocation(s) with a DebugLoc equivalent, and in changing the IRBuilder to store a DebugLoc directly rather than storing DILocations in its general Metadata array. We also use a new function, `DebugLoc::orElse`, which selects the "best" DebugLoc out of a pair (valid location > annotated > empty), preferring the current DebugLoc on a tie - this encapsulates the existing behaviour at a few sites where we _may_ assign a DebugLoc to an existing instruction, while extending the logic to handle annotation DebugLocs at the same time.	2025-06-12 14:06:27 +01:00
Jeremy Morse	97ac6483aa	[DebugInfo][RemoveDIs] Delete debug-info-format flag (#143746 ) This flag was used to let us incrementally introduce debug records into LLVM, however everything is now using records. It serves no purpose now, so delete it.	2025-06-12 11:51:58 +01:00
Stephen Tozer	aa8a1fa6f5	[DLCov][NFC] Annotate intentionally-blank DebugLocs in existing code (#136192 ) Following the work in PR #107279, this patch applies the annotative DebugLocs, which indicate that a particular instruction is intentionally missing a location for a given reason, to existing sites in the compiler where their conditions apply. This is NFC in ordinary LLVM builds (each function `DebugLoc::getFoo()` is inlined as `DebugLoc()`), but marks the instruction in coverage-tracking builds so that it will be ignored by Debugify, allowing only real errors to be reported. From a developer standpoint, it also communicates the intentionality and reason for a missing DebugLoc. Some notes for reviewers: - The difference between `I->dropLocation()` and `I->setDebugLoc(DebugLoc::getDropped())` is that the former _may_ decide to keep some debug info alive, while the latter will always be empty; in this patch, I always used the latter (even if the former could technically be correct), because the former could result in some (barely) different output, and I'd prefer to keep this patch purely NFC. - I've generally documented the uses of `DebugLoc::getUnknown()`, with the exception of the vectorizers - in summary, they are a huge cause of dropped source locations, and I don't have the time or the domain knowledge currently to solve that, so I've plastered it all over them as a form of "fixme".	2025-06-11 17:42:10 +01:00
Yingwei Zheng	7e1fa09ce2	[SimplifyCFG] Bail out on vector GEPs in `passingValueIsAlwaysUndefined` (#142526 ) Closes https://github.com/llvm/llvm-project/issues/142522.	2025-06-04 12:37:30 +08:00
Ramkumar Ramachandra	b40e4ceaa6	[ValueTracking] Make Depth last default arg (NFC) (#142384 ) Having a finite Depth (or recursion limit) for computeKnownBits is very limiting, but is currently a load-bearing necessity, as all KnownBits are recomputed on each call and there is no caching. As a prerequisite for an effort to remove the recursion limit altogether, either using a clever caching technique, or writing a easily-invalidable KnownBits analysis, make the Depth argument in APIs in ValueTracking uniformly the last argument with a default value. This would aid in removing the argument when the time comes, as many callers that currently pass 0 explicitly are now updated to omit the argument altogether.	2025-06-03 17:12:24 +01:00
Kazu Hirata	54d836a080	[llvm] Use *Set::insert_range (NFC) (#138237 )	2025-06-02 19:48:13 -07:00
Yingwei Zheng	1e08febf0a	[SimplifyCFG] Switch to use `paramHasNonNullAttr` (#125383 )	2025-06-02 12:20:13 +08:00
Nikita Popov	eee958285b	[SimplifyCFG] Only consider provenance capture in store speculation (#138548 ) The capture check here is to protect against concurrent accesses from other threads. This requires the provenance to escape.	2025-05-22 17:01:37 +02:00
Ellis Hoag	78f0af5d89	[SimplifyCFG][swifterror] Don't sink calls with swifterror params (#139015 ) We've encountered an LLVM verification failure when building Swift with the SimplifyCFG pass enabled. I found that https://reviews.llvm.org/D158083 fixed this pass by preventing sinking loads or stores of swifterror values, but it did not implement the same protection for call or invokes. In `Verifier.cpp` [here](`c685355811/llvm/lib/IR/Verifier.cpp (L4360-L4364)`) and [here](`c685355811/llvm/lib/IR/Verifier.cpp (L3661-L3662)`) we can see that swifterror values must also be used directly by call instructions.	2025-05-12 14:37:26 -07:00
Kazu Hirata	85f3d60d81	[Utils] Use range-based for loops (NFC) (#139426 )	2025-05-10 22:50:46 -07:00
Orlando Cazalet-Hyams	74c3025dd5	[KeyInstr][SimplifyCFG] Remap atoms after duplication for threading (#133484 ) Given the same branch condition in `a` and `c` SimplifyCFG converts: +> b -+ \| v --> a --> c --> e --> \| ^ +> d -+ into: +--> bcd ---+ \| v --> a --> c --> e --> Remap source atoms on instructions duplicated from `c` into `bcd`. RFC: https://discourse.llvm.org/t/rfc-improving-is-stmt-placement-for-better-interactive-debugging/82668	2025-05-07 09:56:30 +01:00
Orlando Cazalet-Hyams	f8de1618cc	[KeyInstr][SimplifyCFG] Remap atoms when folding br to common succ into pred (#133482 ) SimplifyCFG folds `d` into preds `b` and `c`. +---------------+ \| \| +--> b --+ \| \| v v --> a d --> e --> f --> \| ^ ^ +--> c --+ \| \| \| +---------------+ Remap source atoms so that the duplicated instructions are analysed independently to determine is_stmt positions. The pull request contains a discussion covering various edge cases here: https://github.com/llvm/llvm-project/pull/133482/files#r2039519348 The summary of the discussion is that we could avoid remapping when there's a single pred, but we decided that it's still a trade off, and not worth the additional complexity right now. RFC: https://discourse.llvm.org/t/rfc-improving-is-stmt-placement-for-better-interactive-debugging/82668	2025-05-06 15:03:20 +01:00
Kazu Hirata	26d71f9ef2	[Transforms] Use range-based for loops (NFC) (#138476 )	2025-05-04 16:08:31 -07:00
Kazu Hirata	5cfd81b0cc	[llvm] Use range constructors of *Set (NFC) (#137552 )	2025-04-27 15:59:57 -07:00
David Green	e183459b8b	[CostModel] Make sure getCmpSelInstrCost is passed a CondTy (#135535 ) It is already required along certain code paths that the CondTy is valid. Fix some of the uses to make sure it is passed.	2025-04-21 05:33:30 +01:00
Kazu Hirata	b01e25deba	[llvm] Call hash_combine_range with ranges (NFC) (#136511 )	2025-04-20 16:36:03 -07:00
Andreas Jonson	ed43207306	[SimplifyCFG] Handle trunc condition in foldBranchToCommonDest. (#135490 ) proof: https://alive2.llvm.org/ce/z/v32Aof	2025-04-13 13:16:15 +02:00
Yingwei Zheng	73e1710a4d	[SimplifyCFG] Remove unused variable. NFC. (#134211 )	2025-04-03 15:22:51 +08:00
Yingwei Zheng	b6c0ce0bb6	[IR][NFC] Use `SwitchInst::defaultDestUnreachable` (#134199 )	2025-04-03 14:47:47 +08:00
Kazu Hirata	f915015a3e	[llvm] Remove extraneous calls to make_range (NFC) (#133551 )	2025-03-28 19:56:02 -07:00
Kazu Hirata	cde58bfc16	[Transforms] Use range constructors of *Set (NFC) (#133203 )	2025-03-27 07:51:58 -07:00
Phoebe Wang	369be311a7	[X86,SimplifyCFG] Support conditional faulting load or store only (#132032 ) This is to fix a bug when a target only support conditional faulting load, see test case hoist_store_without_cstore. Split `-simplifycfg-hoist-loads-stores-with-cond-faulting` into `-simplifycfg-hoist-loads-with-cond-faulting` and `-simplifycfg-hoist-stores-with-cond-faulting` to control conditional faulting load and store respectively.	2025-03-21 21:19:46 +08:00
Phoebe Wang	64555e3d48	[X86][NFCI] Add IsStore parameter to hasConditionalLoadStoreForType (#132153 ) Address https://github.com/llvm/llvm-project/pull/132032#issuecomment-2736936769	2025-03-20 18:25:09 +08:00
Kazu Hirata	0dcc201ac4	[Transforms] Use *Set::insert_range (NFC) (#132056 ) DenseSet, SmallPtrSet, SmallSet, SetVector, and StringSet recently gained C++23-style insert_range. This patch replaces: Dest.insert(Src.begin(), Src.end()); with: Dest.insert_range(Src); This patch does not touch custom begin like succ_begin for now.	2025-03-19 15:35:01 -07:00
Kazu Hirata	1b42be6fe8	[Utils] Avoid repeated hash lookups (NFC) (#131267 ) It's safe to use try_emplace instead of operator[] here because: - PhiPredIVs is empty at the beginning of the loop, and - The elements we are inserting into PhiPredIVs are unique.	2025-03-14 07:22:17 -07:00
Gábor Spaits	a0b175cb34	[SimplifyCFG] Treat `extract oneuse(op.with.overflow),1` pattern as a single instruction (#128021 ) Closes #115683 . Overflow arithmetic instruction plus extract value are usually generated when a division is being replaced, but the zero check may still be there. In that case hoist these two instructions out of this basic block, and let later optimizations take care of the unnecessary zero checks.	2025-03-14 14:18:57 +01:00
Kazu Hirata	cf3aa06c16	[Utils] Avoid repeated hash lookups (NFC) (#130464 )	2025-03-09 00:48:37 -08:00
Kazu Hirata	afe14bb0e4	[Utils] Avoid repeated hash lookups (NFC) (#129990 )	2025-03-06 08:49:36 -08:00
Nikita Popov	9cbdcfcafd	[CaptureTracking] Remove StoreCaptures parameter (NFC) The implementation doesn't use it, and is unlikely to use it in the future. The places that do set StoreCaptures=false, do so incorrectly and would be broken if the parameter actually did anything.	2025-02-24 12:00:57 +01:00
Antonio Frighetto	93b263a01c	[SimplifyCFG] Drop unused `LockstepReverseIterator` class (NFC) Unmaintained code has been removed.	2025-02-22 11:26:13 +01:00
Antonio Frighetto	48a6df3604	Reapply "[Utils] Consolidate `LockstepReverseIterator` into own header (NFC)" Common code has been unified and generalized. Original commit: 123dca9b56e1359d8ec7771ea3bd0afd4b1ea6af Previously reverted due to accidentally merged incompletely. The issue has been addressed by restoring missing code.	2025-02-22 11:21:36 +01:00
Florian Hahn	236fa506d4	Revert "[Utils] Consolidate `LockstepReverseIterator` into own header (NFC) (#116657 )" This reverts commit 123dca9b56e1359d8ec7771ea3bd0afd4b1ea6af. This breaks building on macOS with clang and multiple build bots, including https://lab.llvm.org/buildbot/#/builders/175/builds/13585 llvm-project/llvm/lib/Transforms/Utils/SimplifyCFG.cpp: In function ‘bool sinkCommonCodeFromPredecessors(llvm::BasicBlock, llvm::DomTreeUpdater)’: /b/ml-opt-devrel-x86-64-b1/llvm-project/llvm/lib/Transforms/Utils/SimplifyCFG.cpp:2503:3: error: reference to ‘LockstepReverseIterator’ is ambiguous 2503 \| LockstepReverseIterator<true> LRI(UnconditionalPreds); \| ^~~~~~~~~~~~~~~~~~~~~~~	2025-02-21 21:00:28 +00:00
Antonio Frighetto	123dca9b56	[Utils] Consolidate `LockstepReverseIterator` into own header (NFC) (#116657 ) Common code has been unified and generalized. Not sure if it may be worth to generalize this further, since it looks closely tied to Blocks (might make sense to rename it in `LockstepReverseInstructionIterator`).	2025-02-21 12:21:33 -08:00
Yingwei Zheng	9fbd5fbcc6	[IR][NFC] Switch to use `LifetimeIntrinsic` (#125528 )	2025-02-04 02:18:33 +08:00
Yingwei Zheng	84ba55d18b	[NFC][SimplifyCFG] Refactor `passingValueIsAlwaysUndefined` to work on `Use` (#125519 ) Address comment https://github.com/llvm/llvm-project/pull/125383#discussion_r1938436526	2025-02-04 00:42:25 +08:00
Jeremy Morse	34b139594a	[NFC][DebugInfo] Switch more call-sites to using iterator-insertion (#124283 ) To finalise the "RemoveDIs" work removing debug intrinsics, we're updating call sites that insert instructions to use iterators instead. This set of changes are those where it's not immediately obvious that just calling getIterator to fetch an iterator is correct, and one or two places where more than one line needs to change. Overall the same rule holds though: iterators generated for the start of a block such as getFirstNonPHIIt need to be passed into insert/move methods without being unwrapped/rewrapped, everything else can use getIterator.	2025-01-27 16:44:14 +00:00
Jeremy Morse	81d18ad864	[NFC][DebugInfo] Make some block-start-position methods return iterators (#124287 ) As part of the "RemoveDIs" work to eliminate debug intrinsics, we're replacing methods that use Instruction's as positions with iterators. A number of these (such as getFirstNonPHIOrDbg) are sufficiently infrequently used that we can just replace the pointer-returning version with an iterator-returning version, hopefully without much/any disruption. Thus this patch has getFirstNonPHIOrDbg and getFirstNonPHIOrDbgOrLifetime return an iterator, and updates all call-sites. There are no concerns about the iterators returned being converted to Instruction's and losing the debug-info bit: because the methods skip debug intrinsics, the iterator head bit is always false anyway.	2025-01-27 16:27:54 +00:00
Jeremy Morse	8e70273509	[NFC][DebugInfo] Use iterator moveBefore at many call-sites (#123583 ) As part of the "RemoveDIs" project, BasicBlock::iterator now carries a debug-info bit that's needed when getFirstNonPHI and similar feed into instruction insertion positions. Call-sites where that's necessary were updated a year ago; but to ensure some type safety however, we'd like to have all calls to moveBefore use iterators. This patch adds a (guaranteed dereferenceable) iterator-taking moveBefore, and changes a bunch of call-sites where it's obviously safe to change to use it by just calling getIterator() on an instruction pointer. A follow-up patch will contain less-obviously-safe changes. We'll eventually deprecate and remove the instruction-pointer insertBefore, but not before adding concise documentation of what considerations are needed (very few).	2025-01-24 10:53:11 +00:00
Ryan Mansfield	67efbd0bf1	[LLVM] Fix various cl::desc typos and whitespace issues (NFC) (#121955 )	2025-01-08 11:07:23 +01:00
Yingwei Zheng	a77346bad0	[IRBuilder] Refactor FMF interface (#121657 ) Up to now, the only way to set specified FMF flags in IRBuilder is to use `FastMathFlagGuard`. It makes the code ugly and hard to maintain. This patch introduces a helper class `FMFSource` to replace the original parameter `Instruction *FMFSource` in IRBuilder. To maximize the compatibility, it accepts an instruction or a specified FMF. This patch also removes the use of `FastMathFlagGuard` in some simple cases. Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=f87a9db8322643ccbc324e317a75b55903129b55&to=9397e712f6010be15ccf62f12740e9b4a67de2f4&stat=instructions%3Au	2025-01-06 14:37:04 +08:00
DaPorkchop_	cea738bc9a	[SimplifyCFG] Replace unreachable switch lookup table holes with poison (#94990 ) As discussed in #94468, this causes switch lookup table entries which are unreachable to be poison instead of filling them with a value from one of the reachable cases. --------- Co-authored-by: DianQK <dianqk@dianqk.net>	2024-12-26 07:47:26 +08:00
Florian Hahn	c4a78b6fe3	[SimplifyCFG] Always allow hoisting if all instructions match. (#97158 ) Generalize hoistCommonCodeFromSuccessors's `EqTermsOnly` to `AllInstsEqOnly` and always allow hoisting if all instructions match. In that case, all instructions can be hoisted and the original branch will be replaced and selects for PHIs are added. This allows preserving metadata in more cases, using the existing hoisting logic, whereas previously FoldTwoEntryPHINode would drop the metadata. https://llvm-compile-time-tracker.com/compare.php?from=716360367fbdabac2c374c19b8746f4de49a5599&to=986b2c47df516b31d998c055400e4f62aa76edc6&stat=instructions:u PR: https://github.com/llvm/llvm-project/pull/97158	2024-12-13 21:26:27 +00:00
Antonio Frighetto	d26df32255	[SimplifyCFG] Consider preds to switch in `simplifyDuplicateSwitchArms` Allow a duplicate basic block with multiple predecessors to the jump table to be simplified, by considering that the same basic block may appear in more switch cases.	2024-12-13 09:07:24 +01:00
David Green	18abc7e0c5	[PatternMatch] Introduce m_c_Select (#114328 ) This matches m_Select(m_Value(), L, R) or m_Select(m_Value(), R, L).	2024-11-25 13:47:23 +00:00
Phoebe Wang	2568e52a73	[X86,SimplifyCFG] Support hoisting load/store with conditional faulting (Part II) (#108812 ) This is a follow up of #96878 to support hoisting load/store from BBs have the same predecessor, if load/store are the only instructions and the branch is unpredictable, e.g.: ``` void test (int a, int c, int d) { if (a) c = a; else d = a; } ```	2024-11-25 15:19:28 +08:00
Stephen Tozer	2188a56a75	[DebugInfo][SimplifyCFG] Fully propagate merged invoke DILocations (#114235 ) Currently when we merge invokes as part of SimplifyCFG we apply a merge of the invoke DILocations to the merged invoke. We also insert an unconditional branch to the merged invoke at the positions previously occupied by the original invokes; as this branch is part of the substitution for the invoke it has replaced, we should propagate the original invoke DebugLoc to it.	2024-11-15 17:20:55 +00:00
Michael Maitland	6b9952759f	[SimplifyCFG] Simplify switch instruction that has duplicate arms (#114262 ) I noticed that the two C functions emitted different IR: ``` int switch_duplicate_arms(int switch_val, int v, int w) { switch (switch_val) { default: break; case 0: w = v; break; case 1: w = v; break; } return w; } int if_duplicate_arms(int switch_val, int v, int w) { if (switch_val == 0) w = v; else if (switch_val == 1) w = v; return v0; } ``` We generate IR that looks like this: ``` define i32 @switch_duplicate_arms(i32 %0, i32 %1, i32 %2, i32 %3) { switch i32 %1, label %7 [ i32 0, label %5 i32 1, label %6 ] 5: br label %7 6: br label %7 7: %8 = phi i32 [ %3, %4 ], [ %2, %6 ], [ %2, %5 ] ret i32 %8 } define i32 @if_duplicate_arms(i32 %0, i32 %1, i32 %2, i32 %3) { %5 = icmp ult i32 %1, 2 %6 = select i1 %5, i32 %2, i32 %3 ret i32 %6 } ``` For `switch_duplicate_arms`, taking case 0 and 1 are the same since %5 and %6 branch to the same location and the incoming values for %8 are the same from those blocks. We could remove one on the duplicate switch targets and update the switch with the single target. On RISC-V, prior to this patch, we generate the following code: ``` switch_duplicate_arms: li a4, 1 beq a1, a4, .LBB0_2 mv a0, a3 bnez a1, .LBB0_3 .LBB0_2: mv a0, a2 .LBB0_3: ret if_duplicate_arms: li a4, 2 mv a0, a2 bltu a1, a4, .LBB1_2 mv a0, a3 .LBB1_2: ret ``` After this patch, the O3 code is optimized to the icmp + select pair, which gives us the same code gen as `if_duplicate_arms`, as desired. This results is one less branch instruction in the final assembly. This may help with both code size and further switch simplification. I found that this patch causes no significant impact to spec2006/int/ref and spec2017/intrate/ref. --------- Co-authored-by: Min Hsu <min@myhsu.dev>	2024-11-15 15:38:34 +01:00
Nikita Popov	255a99c29f	[APInt] Fix APInt constructions where value does not fit bitwidth (NFCI) (#80309 ) This fixes all the places that hit the new assertion added in https://github.com/llvm/llvm-project/pull/106524 in tests. That is, cases where the value passed to the APInt constructor is not an N-bit signed/unsigned integer, where N is the bit width and signedness is determined by the isSigned flag. The fixes either set the correct value for isSigned, set the implicitTrunc flag, or perform more calculations inside APInt. Note that the assertion is currently still disabled by default, so this patch is mostly NFC.	2024-10-17 08:48:08 +02:00

1 2 3 4 5 ...

1447 Commits