llvm-project

Author	SHA1	Message	Date
Kazu Hirata	f915015a3e	[llvm] Remove extraneous calls to make_range (NFC) (#133551 )	2025-03-28 19:56:02 -07:00
Kazu Hirata	cde58bfc16	[Transforms] Use range constructors of *Set (NFC) (#133203 )	2025-03-27 07:51:58 -07:00
Phoebe Wang	369be311a7	[X86,SimplifyCFG] Support conditional faulting load or store only (#132032 ) This is to fix a bug when a target only supports conditional faulting load, see test case hoist_store_without_cstore. Split `-simplifycfg-hoist-loads-stores-with-cond-faulting` into `-simplifycfg-hoist-loads-with-cond-faulting` and `-simplifycfg-hoist-stores-with-cond-faulting` to control conditional faulting load and store respectively.	2025-03-21 21:19:46 +08:00
Phoebe Wang	64555e3d48	[X86][NFCI] Add IsStore parameter to hasConditionalLoadStoreForType (#132153 ) Address https://github.com/llvm/llvm-project/pull/132032#issuecomment-2736936769	2025-03-20 18:25:09 +08:00
Kazu Hirata	0dcc201ac4	[Transforms] Use *Set::insert_range (NFC) (#132056 ) DenseSet, SmallPtrSet, SmallSet, SetVector, and StringSet recently gained C++23-style insert_range. This patch replaces: Dest.insert(Src.begin(), Src.end()); with: Dest.insert_range(Src); This patch does not touch custom begin like succ_begin for now.	2025-03-19 15:35:01 -07:00
Kazu Hirata	1b42be6fe8	[Utils] Avoid repeated hash lookups (NFC) (#131267 ) It's safe to use try_emplace instead of operator[] here because: - PhiPredIVs is empty at the beginning of the loop, and - The elements we are inserting into PhiPredIVs are unique.	2025-03-14 07:22:17 -07:00
Gábor Spaits	a0b175cb34	[SimplifyCFG] Treat `extract oneuse(op.with.overflow),1` pattern as a single instruction (#128021 ) Closes #115683 . Overflow arithmetic instruction plus extract value are usually generated when a division is being replaced, but the zero check may still be there. In that case hoist these two instructions out of this basic block, and let later optimizations take care of the unnecessary zero checks.	2025-03-14 14:18:57 +01:00
Kazu Hirata	cf3aa06c16	[Utils] Avoid repeated hash lookups (NFC) (#130464 )	2025-03-09 00:48:37 -08:00
Kazu Hirata	afe14bb0e4	[Utils] Avoid repeated hash lookups (NFC) (#129990 )	2025-03-06 08:49:36 -08:00
Nikita Popov	9cbdcfcafd	[CaptureTracking] Remove StoreCaptures parameter (NFC) The implementation doesn't use it, and is unlikely to use it in the future. The places that do set StoreCaptures=false, do so incorrectly and would be broken if the parameter actually did anything.	2025-02-24 12:00:57 +01:00
Antonio Frighetto	93b263a01c	[SimplifyCFG] Drop unused `LockstepReverseIterator` class (NFC) Unmaintained code has been removed.	2025-02-22 11:26:13 +01:00
Antonio Frighetto	48a6df3604	Reapply "[Utils] Consolidate `LockstepReverseIterator` into own header (NFC)" Common code has been unified and generalized. Original commit: 123dca9b56e1359d8ec7771ea3bd0afd4b1ea6af Previously reverted because it was accidentally merged incompletely. The issue has been addressed by restoring missing code.	2025-02-22 11:21:36 +01:00
Florian Hahn	236fa506d4	Revert "[Utils] Consolidate `LockstepReverseIterator` into own header (NFC) (#116657 )" This reverts commit 123dca9b56e1359d8ec7771ea3bd0afd4b1ea6af. This breaks building on macOS with clang and multiple build bots, including https://lab.llvm.org/buildbot/#/builders/175/builds/13585 llvm-project/llvm/lib/Transforms/Utils/SimplifyCFG.cpp: In function ‘bool sinkCommonCodeFromPredecessors(llvm::BasicBlock, llvm::DomTreeUpdater)’: /b/ml-opt-devrel-x86-64-b1/llvm-project/llvm/lib/Transforms/Utils/SimplifyCFG.cpp:2503:3: error: reference to ‘LockstepReverseIterator’ is ambiguous 2503 \| LockstepReverseIterator<true> LRI(UnconditionalPreds); \| ^~~~~~~~~~~~~~~~~~~~~~~	2025-02-21 21:00:28 +00:00
Antonio Frighetto	123dca9b56	[Utils] Consolidate `LockstepReverseIterator` into own header (NFC) (#116657 ) Common code has been unified and generalized. Not sure if it is worth generalizing this further, since it looks closely tied to Blocks (might make sense to rename it to `LockstepReverseInstructionIterator`).	2025-02-21 12:21:33 -08:00
Yingwei Zheng	9fbd5fbcc6	[IR][NFC] Switch to use `LifetimeIntrinsic` (#125528 )	2025-02-04 02:18:33 +08:00
Yingwei Zheng	84ba55d18b	[NFC][SimplifyCFG] Refactor `passingValueIsAlwaysUndefined` to work on `Use` (#125519 ) Address comment https://github.com/llvm/llvm-project/pull/125383#discussion_r1938436526	2025-02-04 00:42:25 +08:00
Jeremy Morse	34b139594a	[NFC][DebugInfo] Switch more call-sites to using iterator-insertion (#124283 ) To finalise the "RemoveDIs" work removing debug intrinsics, we're updating call sites that insert instructions to use iterators instead. This set of changes are those where it's not immediately obvious that just calling getIterator to fetch an iterator is correct, and one or two places where more than one line needs to change. Overall the same rule holds though: iterators generated for the start of a block such as getFirstNonPHIIt need to be passed into insert/move methods without being unwrapped/rewrapped, everything else can use getIterator.	2025-01-27 16:44:14 +00:00
Jeremy Morse	81d18ad864	[NFC][DebugInfo] Make some block-start-position methods return iterators (#124287 ) As part of the "RemoveDIs" work to eliminate debug intrinsics, we're replacing methods that use Instruction's as positions with iterators. A number of these (such as getFirstNonPHIOrDbg) are sufficiently infrequently used that we can just replace the pointer-returning version with an iterator-returning version, hopefully without much/any disruption. Thus this patch has getFirstNonPHIOrDbg and getFirstNonPHIOrDbgOrLifetime return an iterator, and updates all call-sites. There are no concerns about the iterators returned being converted to Instruction's and losing the debug-info bit: because the methods skip debug intrinsics, the iterator head bit is always false anyway.	2025-01-27 16:27:54 +00:00
Jeremy Morse	8e70273509	[NFC][DebugInfo] Use iterator moveBefore at many call-sites (#123583 ) As part of the "RemoveDIs" project, BasicBlock::iterator now carries a debug-info bit that's needed when getFirstNonPHI and similar feed into instruction insertion positions. Call-sites where that's necessary were updated a year ago; but to ensure some type safety, we'd like to have all calls to moveBefore use iterators. This patch adds a (guaranteed dereferenceable) iterator-taking moveBefore, and changes a bunch of call-sites where it's obviously safe to change to use it by just calling getIterator() on an instruction pointer. A follow-up patch will contain less-obviously-safe changes. We'll eventually deprecate and remove the instruction-pointer insertBefore, but not before adding concise documentation of what considerations are needed (very few).	2025-01-24 10:53:11 +00:00
Ryan Mansfield	67efbd0bf1	[LLVM] Fix various cl::desc typos and whitespace issues (NFC) (#121955 )	2025-01-08 11:07:23 +01:00
Yingwei Zheng	a77346bad0	[IRBuilder] Refactor FMF interface (#121657 ) Up to now, the only way to set specified FMF flags in IRBuilder is to use `FastMathFlagGuard`. It makes the code ugly and hard to maintain. This patch introduces a helper class `FMFSource` to replace the original parameter `Instruction *FMFSource` in IRBuilder. To maximize the compatibility, it accepts an instruction or a specified FMF. This patch also removes the use of `FastMathFlagGuard` in some simple cases. Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=f87a9db8322643ccbc324e317a75b55903129b55&to=9397e712f6010be15ccf62f12740e9b4a67de2f4&stat=instructions%3Au	2025-01-06 14:37:04 +08:00
DaPorkchop_	cea738bc9a	[SimplifyCFG] Replace unreachable switch lookup table holes with poison (#94990 ) As discussed in #94468, this causes switch lookup table entries which are unreachable to be poison instead of filling them with a value from one of the reachable cases. --------- Co-authored-by: DianQK <dianqk@dianqk.net>	2024-12-26 07:47:26 +08:00
Florian Hahn	c4a78b6fe3	[SimplifyCFG] Always allow hoisting if all instructions match. (#97158 ) Generalize hoistCommonCodeFromSuccessors's `EqTermsOnly` to `AllInstsEqOnly` and always allow hoisting if all instructions match. In that case, all instructions can be hoisted and the original branch will be replaced and selects for PHIs are added. This allows preserving metadata in more cases, using the existing hoisting logic, whereas previously FoldTwoEntryPHINode would drop the metadata. https://llvm-compile-time-tracker.com/compare.php?from=716360367fbdabac2c374c19b8746f4de49a5599&to=986b2c47df516b31d998c055400e4f62aa76edc6&stat=instructions:u PR: https://github.com/llvm/llvm-project/pull/97158	2024-12-13 21:26:27 +00:00
Antonio Frighetto	d26df32255	[SimplifyCFG] Consider preds to switch in `simplifyDuplicateSwitchArms` Allow a duplicate basic block with multiple predecessors to the jump table to be simplified, by considering that the same basic block may appear in more switch cases.	2024-12-13 09:07:24 +01:00
David Green	18abc7e0c5	[PatternMatch] Introduce m_c_Select (#114328 ) This matches m_Select(m_Value(), L, R) or m_Select(m_Value(), R, L).	2024-11-25 13:47:23 +00:00
Phoebe Wang	2568e52a73	[X86,SimplifyCFG] Support hoisting load/store with conditional faulting (Part II) (#108812 ) This is a follow up of #96878 to support hoisting load/store from BBs that have the same predecessor, if load/store are the only instructions and the branch is unpredictable, e.g.: ``` void test (int a, int c, int d) { if (a) c = a; else d = a; } ```	2024-11-25 15:19:28 +08:00
Stephen Tozer	2188a56a75	[DebugInfo][SimplifyCFG] Fully propagate merged invoke DILocations (#114235 ) Currently when we merge invokes as part of SimplifyCFG we apply a merge of the invoke DILocations to the merged invoke. We also insert an unconditional branch to the merged invoke at the positions previously occupied by the original invokes; as this branch is part of the substitution for the invoke it has replaced, we should propagate the original invoke DebugLoc to it.	2024-11-15 17:20:55 +00:00
Michael Maitland	6b9952759f	[SimplifyCFG] Simplify switch instruction that has duplicate arms (#114262 ) I noticed that the two C functions emitted different IR: ``` int switch_duplicate_arms(int switch_val, int v, int w) { switch (switch_val) { default: break; case 0: w = v; break; case 1: w = v; break; } return w; } int if_duplicate_arms(int switch_val, int v, int w) { if (switch_val == 0) w = v; else if (switch_val == 1) w = v; return w; } ``` We generate IR that looks like this: ``` define i32 @switch_duplicate_arms(i32 %0, i32 %1, i32 %2, i32 %3) { switch i32 %1, label %7 [ i32 0, label %5 i32 1, label %6 ] 5: br label %7 6: br label %7 7: %8 = phi i32 [ %3, %4 ], [ %2, %6 ], [ %2, %5 ] ret i32 %8 } define i32 @if_duplicate_arms(i32 %0, i32 %1, i32 %2, i32 %3) { %5 = icmp ult i32 %1, 2 %6 = select i1 %5, i32 %2, i32 %3 ret i32 %6 } ``` For `switch_duplicate_arms`, taking case 0 and case 1 is the same since %5 and %6 branch to the same location and the incoming values for %8 are the same from those blocks. We could remove one of the duplicate switch targets and update the switch with the single target. On RISC-V, prior to this patch, we generate the following code: ``` switch_duplicate_arms: li a4, 1 beq a1, a4, .LBB0_2 mv a0, a3 bnez a1, .LBB0_3 .LBB0_2: mv a0, a2 .LBB0_3: ret if_duplicate_arms: li a4, 2 mv a0, a2 bltu a1, a4, .LBB1_2 mv a0, a3 .LBB1_2: ret ``` After this patch, the O3 code is optimized to the icmp + select pair, which gives us the same code gen as `if_duplicate_arms`, as desired. This results in one fewer branch instruction in the final assembly. This may help with both code size and further switch simplification. I found that this patch causes no significant impact to spec2006/int/ref and spec2017/intrate/ref. --------- Co-authored-by: Min Hsu <min@myhsu.dev>	2024-11-15 15:38:34 +01:00
Nikita Popov	255a99c29f	[APInt] Fix APInt constructions where value does not fit bitwidth (NFCI) (#80309 ) This fixes all the places that hit the new assertion added in https://github.com/llvm/llvm-project/pull/106524 in tests. That is, cases where the value passed to the APInt constructor is not an N-bit signed/unsigned integer, where N is the bit width and signedness is determined by the isSigned flag. The fixes either set the correct value for isSigned, set the implicitTrunc flag, or perform more calculations inside APInt. Note that the assertion is currently still disabled by default, so this patch is mostly NFC.	2024-10-17 08:48:08 +02:00
Noah Goldstein	82ac399733	[SimplifyCFG] Allow merging invoke's with different attrs Same logic as other callsites, if the attributes are intersectable, we merge. Closes #111713	2024-10-10 01:07:59 -05:00
Noah Goldstein	4d4beeb43c	[SimplifyCFG] Supporting hoisting/sinking callbases with differing attrs Some (many) attributes can safely be dropped to enable sinking. For example removing `nonnull` on a return/param can't affect correctness. Closes #109472	2024-10-01 18:27:08 -05:00
Simone Campanoni	5d19d55ce1	[SimplifyCFG] Better aligned a comment. (#109307 )	2024-09-30 09:39:35 -07:00
Nikita Popov	f445e39ab2	[SimplifyCFG] Use isWritableObject() API (#110127 ) SimplifyCFG store speculation currently has some homegrown code to check for a writable object, handling the alloca special case only. Switch it to use the generic isWritableObject() API, which means that we also support byval arguments, allocator return values, and writable arguments. I've adjusted isWritableObject() to also check for the noalias attribute when handling writable. Otherwise, I don't think that we can generalize from at-entry writability. This was not relevant for previous uses of the function, because they'd already require noalias for other reasons anyway.	2024-09-30 10:03:46 +02:00
Nikita Popov	6f194a6dea	[SimplifyCFG] Avoid truncation in linear map overflow check This is supposed to test multiplication of the linear multiplier with the largest value it can be multiplied with. However, if we truncate TableSize-1 here, it might not actually be the largest value. I think in practice this still works out, because in cases where we'd truncate the value here we'd also fail the NonMonotonic check. But to match the intent of the code, we should treat the truncating case as overflowing.	2024-09-23 15:13:32 +02:00
Nikita Popov	8a6248b739	[SimplifyCFG] Don't separate a load/store from its gep during sinking (#102318 ) If we can sink a load/store, but not the gep producing its pointer operand, don't sink the load/store either. This may prevent the gep from being folded into an addressing mode, and may also negatively affect further analysis. Fixes https://github.com/llvm/llvm-project/issues/96838.	2024-09-23 09:32:24 +02:00
Phoebe Wang	c9e5c42ad1	[X86,SimplifyCFG][NFC] Refactor code for #108812 (#109398 )	2024-09-21 20:18:29 +08:00
Nikita Popov	30cdf1e959	[SimplifyCFG] Pass context instruction to isSafeToSpeculativelyExecute() (#109132 ) Pass speculation target and assumption cache to isSafeToSpeculativelyExecute() calls. This allows speculating based on dereferenceable/align assumptions, but the primary motivation here is to avoid regressions from planned changes to fix https://github.com/llvm/llvm-project/issues/108854.	2024-09-19 10:19:15 +02:00
Noah Goldstein	37932643ab	[SimplifyCFG] Deduce paths unreachable if they cause div/rem UB Same way we mark a path unreachable if it may cause a nullptr dereference, div/rem by zero or signed div/rem of INT_MIN by -1 cause immediate UB. Closes #109008	2024-09-18 12:59:52 -05:00
Noah Goldstein	419c53477e	[SimplifyCFG] Mark div/rem as not-cheap to sink if we are replacing const denominator Close #109007	2024-09-17 12:04:34 -05:00
Andreas Jonson	a0d00c94c2	[SimplifyCFG] Swap range metadata to attribute for calls. (#108984 ) Among the last usages of range metadata for call before being able to deprecate and only have the range attribute for calls.	2024-09-17 18:25:53 +02:00
Phoebe Wang	af5a45b34b	[X86,SimplifyCFG] Use passthru to reduce select (#108754 )	2024-09-16 20:20:36 +08:00
Kazu Hirata	e99eb89d5d	[SimplifyCFG] Use range-based for loops (NFC) (#107180 )	2024-09-04 01:29:13 -07:00
Shengchen Kan	87c86aa6b9	[X86,SimplifyCFG] Support hoisting load/store with conditional faulting (Part I) (#96878 ) This is the SimplifyCFG part of https://github.com/llvm/llvm-project/pull/95515 In this PR, we support hoisting load/store with conditional faulting in `SimplifyCFGOpt::speculativelyExecuteBB` to eliminate conditional branches. This is for cases like ``` void test (int a, int b) { if (a) b = a; } ``` In the following patches, we will support the hoist in `SimplifyCFGOpt::hoistCommonCodeFromSuccessors`. That is for cases like ``` void test (int a, int c, int d) { if (a) c = a; else d = a; } ```	2024-08-29 10:42:44 +08:00
Nikita Popov	84497c6f4f	[SimplifyCFG] Remove limitation on sinking of load/store of alloca (#104788 ) This is a followup to https://github.com/llvm/llvm-project/pull/104579 to remove the limitation on sinking loads/stores of allocas entirely, even if this would introduce a phi node. Nowadays, SROA supports speculating load/store over select/phi. Additionally, SimplifyCFG with sinking only runs at the end of the function simplification pipeline, after SROA. I checked that the two tests modified here still successfully SROA after the SimplifyCFG transform. We should, however, keep the limitation on lifetime intrinsics. SROA does not have speculation support for these, and I've also found that the way these are handled in the backend is very problematic (https://github.com/llvm/llvm-project/issues/104776), so I think we should leave them alone.	2024-08-26 10:14:43 +02:00
Nikita Popov	4d85285ff6	[SimplifyCFG] Fold switch over ucmp/scmp to icmp and br (#105636 ) If we switch over ucmp/scmp and have two switch cases going to the same destination, we can convert into icmp+br. Fixes https://github.com/llvm/llvm-project/issues/105632.	2024-08-22 16:57:09 +02:00
Nikita Popov	b3fa45b642	[SimplifyCFG] Add support for hoisting commutative instructions (#104805 ) This extends SimplifyCFG hoisting to also hoist instructions with commuted operands, for example a+b on one side and b+a on the other side. This should address the issue mentioned in: https://github.com/llvm/llvm-project/pull/91185#issuecomment-2097447927	2024-08-20 12:48:06 +02:00
Nikita Popov	83879f4f53	[SimplifyCFG] Don't block sinking for allocas if no phi created (#104579 ) SimplifyCFG sinking currently does not sink loads/stores of allocas, because historically SROA was unable to handle the resulting IR. Since then, SROA both learned to speculate loads/stores over selects and phis, and SimplifyCFG sinking has been deferred to the end of the function simplification pipeline, which means that SROA happens before it. As such, I believe that this workaround should no longer be necessary. Given how sensitive SimplifyCFG sinking seems to be, this patch takes a very conservative step towards removing this, by allowing sinking if we don't actually need to form a phi over the pointer argument. This fixes https://github.com/llvm/llvm-project/issues/104567, where sinking a store to an escaped alloca allows converting a switch into arithmetic.	2024-08-19 09:55:30 +02:00
Yingwei Zheng	f364b2ee22	[LLVM] Don't peek through bitcast on pointers and gep with zero indices. NFC. (#102889 ) Since we are using opaque pointers now, we don't need to peek through bitcast on pointers and gep with zero indices.	2024-08-13 22:38:50 +08:00
Shengchen Kan	bb790b8bf2	[NFC] Extract the probability check for the hoisted BB into a local function So that we can early bail out to avoid nested if clauses. This is to extract the NFC change in #96878 into a separate PR.	2024-08-03 00:17:28 +08:00
Shengchen Kan	60054dcd81	[NFC] Lowercase the first letter of functions defined in SimplifyCFG.cpp	2024-08-02 23:54:40 +08:00
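
The "Avoid repeated hash lookups" entry above (1b42be6fe8) argues that `try_emplace` can stand in for `operator[]` because the map starts empty and the inserted keys are unique. A minimal sketch of that reasoning, assuming `std::unordered_map` as a stand-in for LLVM's map types; the helper names are hypothetical and none of this is taken from the patch itself:

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Stand-in map type for illustration only (the commit concerns an LLVM map).
using IndexMap = std::unordered_map<std::string, int>;

// Hypothetical helper: operator[] finds or default-constructs the slot,
// then the assignment overwrites whatever value was there.
void recordWithSubscript(IndexMap &Map, const std::string &Key, int Value) {
  Map[Key] = Value;
}

// Hypothetical helper: try_emplace constructs the value in place only if the
// key is absent, and reports whether an insertion happened.
bool recordWithTryEmplace(IndexMap &Map, const std::string &Key, int Value) {
  return Map.try_emplace(Key, Value).second;
}

// Starting from empty maps and inserting unique keys, both helpers
// produce equal maps.
bool sameResultForUniqueKeys() {
  IndexMap A, B;
  recordWithSubscript(A, "a", 1);
  recordWithSubscript(A, "b", 2);
  recordWithTryEmplace(B, "a", 1);
  recordWithTryEmplace(B, "b", 2);
  return A == B;
}
```

The two helpers only diverge when a key is already present: `operator[]` overwrites the stored value, while `try_emplace` leaves it untouched. That is why the commit message spells out both preconditions.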

1426 Commits