When folding icmp over phi, add a special case for `icmp eq (zext(bool),
0)`, which is known to fold to `!bool` and thus won't increase the
instruction count. This helps convert more phis to i1, especially in loops.
This is based on the existing logic that already supports this for icmp of
ucmp/scmp.
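For reference, a minimal sketch of the fold (names are illustrative):
```
%z = zext i1 %b to i32
%c = icmp eq i32 %z, 0
==>
%c = xor i1 %b, true
```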
The current visitPHINode function in InstCombine can remove dead phi
cycles (each phi has exactly one use, which is another phi). However, it
cannot handle the case where the phis form a web (each phi has one or
more uses, all of which are phis). This change extends the algorithm so
that it can also remove such dead phi webs.
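For illustration, a sketch of a dead phi web (hypothetical names): every phi has one or more uses, but all of those uses are other phis in the web, so the whole web is dead and can be removed:
```
%a = phi i32 [ %b, %bb1 ], [ %c, %bb2 ]
%b = phi i32 [ %a, %bb3 ], [ %c, %bb4 ]
%c = phi i32 [ %a, %bb5 ], [ %b, %bb6 ]
```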
The op of phi transform wants to prevent moving an operation across a
backedge, as this may lead to an infinite combine loop.
Currently, this is done using isPotentiallyReachable(). The problem with
that is that all blocks inside a loop are reachable from each other.
This means that the op of phi transform is effectively completely
disabled for code inside loops, even when it's not actually operating on
a loop phi (just a phi that happens to be in a loop).
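For example, in a sketch like the following (hypothetical names), %phi is not a loop phi and folding %op into it would not move anything across the backedge, yet the reachability-based check rejected it because %then, %else and %merge are all mutually reachable inside the loop:
```
loop:
  br i1 %c, label %then, label %else
then:
  br label %merge
else:
  br label %merge
merge:
  %phi = phi i32 [ 0, %then ], [ %x, %else ]
  %op = add i32 %phi, 1
  br i1 %exit.cond, label %loop, label %exit
```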
Fix this by explicitly computing the backedges inside the function
instead. Do this via RPOT, which is a bit more efficient than using
FindFunctionBackedges() (which does it without any pre-computed
analyses).
For irreducible cycles, there may be multiple possible choices of
backedge, and this just picks one of them. This is still sufficient to
prevent combine loops.
This also removes the last use of LoopInfo in InstCombine -- I'll drop
the analysis in a followup.
This patch replaces all dominated uses of a condition with true/false to
improve context-sensitive optimizations. It eliminates a bunch of
branches in llvm-opt-benchmark.
As a side effect, it may introduce new phi nodes in some corner cases.
See the following case:
```
define i1 @test(i1 %cmp, i1 %cond) {
entry:
br i1 %cond, label %bb1, label %bb2
bb1:
br i1 %cmp, label %if.then, label %if.else
if.then:
br label %bb2
if.else:
br label %bb2
bb2:
%res = phi i1 [%cmp, %entry], [%cmp, %if.then], [%cmp, %if.else]
ret i1 %res
}
```
It will be simplified into:
```
define i1 @test(i1 %cmp, i1 %cond) {
entry:
br i1 %cond, label %bb1, label %bb2
bb1:
br i1 %cmp, label %if.then, label %if.else
if.then:
br label %bb2
if.else:
br label %bb2
bb2:
%res = phi i1 [%cmp, %entry], [true, %if.then], [false, %if.else]
ret i1 %res
}
```
I am planning to fix this in the late pipeline/CGP, since this problem
already exists before this patch.
The idea behind this canonicalization is that it allows us to handle fewer
patterns, because we know that some will be canonicalized away. This is
indeed very useful to e.g. know that constants are always on the right.
However, this is only useful if the canonicalization is actually
reliable. This is the case for constants, but not for arguments: Moving
these to the right makes it look like the "more complex" expression is
guaranteed to be on the left, but this is not actually the case in
practice. It fails as soon as you replace the argument with another
instruction.
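For example, the constant canonicalization is reliable, because a constant operand of a commutative operation is always moved to the right-hand side (illustrative snippet):
```
%r = add i32 7, %x
==>
%r = add i32 %x, 7
```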
The end result is that it looks like things work correctly in tests,
while they actually don't. We use the "thwart complexity-based
canonicalization" trick to handle this in tests, but it's often a
challenge for new contributors to get this right, and based on the
regressions this PR originally exposed, we clearly don't get this right
in many cases.
For this reason, I think that it's better to remove this complexity
canonicalization. It will make it much easier to write tests for
commuted cases and make sure that they are handled.
This patch canonicalizes getelementptr instructions with constant
indices to use the `i8` source element type. This makes it easier for
optimizations to recognize that two GEPs are identical, because they
don't need to see past many different ways to express the same offset.
This is a first step towards
https://discourse.llvm.org/t/rfc-replacing-getelementptr-with-ptradd/68699.
This is limited to constant GEPs only for now, as they have a clear
canonical form, while we're not yet sure how exactly to deal with
variable indices.
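As a sketch of the canonicalization (illustrative values), a GEP with a constant index is rewritten to an equivalent byte offset over i8:
```
%p = getelementptr inbounds i32, ptr %base, i64 3
==>
%p = getelementptr inbounds i8, ptr %base, i64 12
```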
The test llvm/test/Transforms/PhaseOrdering/switch_with_geps.ll gives
two representative examples of the kind of optimization improvement we
expect from this change. In the first test, SimplifyCFG can now realize
that all switch branches are actually the same. In the second test, it
can convert the switch into simple arithmetic. These are representative of
common optimization failures we see in Rust.
Fixes https://github.com/llvm/llvm-project/issues/69841.
This patch folds `switch(zext/sext(X))` into `switch(X)`.
The original motivation of this patch is to optimize a pattern found in
cvc5. For example:
```
%bf.load.i = load i16, ptr %d_kind.i, align 8
%bf.clear.i = and i16 %bf.load.i, 1023
%bf.cast.i = zext nneg i16 %bf.clear.i to i32
switch i32 %bf.cast.i, label %if.else [
  i32 335, label %if.then
  i32 303, label %if.then
]
if.then: ; preds = %entry, %entry
%d_children.i.i = getelementptr inbounds %"class.cvc5::internal::expr::NodeValue", ptr %0, i64 0, i32 3
%cmp.i.i.i.i.i = icmp eq i16 %bf.clear.i, 1023
%cond.i.i.i.i.i = select i1 %cmp.i.i.i.i.i, i32 -1, i32 %bf.cast.i
```
`%cmp.i.i.i.i.i` always evaluates to false because `%bf.clear.i` can
only be 335 or 303.
Folding `switch i32 %bf.cast.i` to `switch i16 %bf.clear.i` will help
`CVP` to handle this case.
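After the fold, the switch operates directly on the narrow value (sketch based on the snippet above):
```
switch i16 %bf.clear.i, label %if.else [
  i16 335, label %if.then
  i16 303, label %if.then
]
```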
See also
https://github.com/llvm/llvm-project/pull/76928#issuecomment-1877055722.
Compile-time impact:
http://llvm-compile-time-tracker.com/compare.php?from=7954c57124b495fbdc73674d71f2e366e4afe522&to=502b13ed34e561d995ae1f724cf06d20008bd86f&stat=instructions:u
|stage1-O3|stage1-ReleaseThinLTO|stage1-ReleaseLTO-g|stage1-O0-g|stage2-O3|stage2-O0-g|stage2-clang|
|--|--|--|--|--|--|--|
|+0.03%|+0.06%|+0.07%|+0.00%|-0.02%|-0.03%|+0.02%|
In InstCombinePHI, currently the only use of the PHI being an icmp is
checked as a requirement for reducing a value when it is known non-zero
(isKnownNonZero). However, this can be extended to include or(icmp).
This is always valid, as OR only adds bits and we are checking against 0.
There is a combine in instcombine that will look for phi cycles that only have
a single incoming value:
```
%0 = phi i64 [ %3, %exit ], [ %othervalue, %preheader ]
%3 = phi i64 [ %0, %body ], [ %othervalue, %body2 ]
```
This currently doesn't handle the case where %othervalue is itself a phi,
though, as the algorithm will recurse into that phi and fail due to multiple
incoming values. This adjusts the algorithm so that the initial value does
not need to be found immediately; it can instead be set from one of the phis
that would otherwise fail due to having multiple incoming values.
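For illustration, a sketch (hypothetical names) of the previously rejected case, where the single incoming value is itself a phi:
```
%other = phi i64 [ %x, %pre1 ], [ %y, %pre2 ]
%0 = phi i64 [ %3, %exit ], [ %other, %preheader ]
%3 = phi i64 [ %0, %body ], [ %other, %body2 ]
; the cycle {%0, %3} has the single incoming value %other,
; so both phis can be replaced by %other
```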
The icmp is currently folded into the phi only if both belong to the same BB.
This patch extends the fold beyond the BB.
We have seen scenarios where this seems to be beneficial.
Differential Revision: https://reviews.llvm.org/D157740
Set phi inputs to poison whenever we find a dead edge (either
during initial worklist population or the main InstCombine run),
instead of only doing this for successors of dead blocks.
This means that the phi operand is set to poison even for
critical edges without an intermediate block.
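For example (sketch with hypothetical names), if the edge from %bb1 to the phi's block is found to be dead, the corresponding input now becomes poison even though no intermediate block exists on that edge:
```
%p = phi i32 [ %x, %bb0 ], [ %y, %bb1 ]
==>
%p = phi i32 [ %x, %bb0 ], [ poison, %bb1 ]
```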
There are quite a few test changes, because the pattern is fairly
common in vectorizer output, for cases where we know the vectorized
loop will be entered.
Improve foldOpIntoPhi() for icmp operator to check if incoming PHI value can be replaced with constant based on implied condition.
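A sketch of the kind of case this enables (hypothetical names): the condition guarding an incoming block implies the icmp result for that incoming value, so it can be replaced with a constant:
```
entry:
  %c = icmp sgt i32 %x, 0
  br i1 %c, label %pos, label %join
pos:
  br label %join
join:
  %phi = phi i32 [ %x, %pos ], [ 0, %entry ]
  %cmp = icmp sgt i32 %phi, -1
; on the %pos edge, %c implies %x > 0 and thus %x > -1, so the
; incoming icmp result is true; the 0 incoming constant-folds to true
```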
Depends on D156619.
Differential Revision: https://reviews.llvm.org/D156620
Combine a binary operator of two phi nodes if, for each shared incoming
block, at least one of phi0's and phi1's incoming values is a specific
constant value, and that constant can be used to simplify the binary
operator.
For example:
```
%phi0 = phi i32 [0, %bb0], [%i, %bb1]
%phi1 = phi i32 [%j, %bb0], [0, %bb1]
%add = add i32 %phi0, %phi1
==>
%add = phi i32 [%j, %bb0], [%i, %bb1]
```
Fixes: https://github.com/llvm/llvm-project/issues/61137
Differential Revision: https://reviews.llvm.org/D145223
Relative to the previous attempt, this is rebased over the
InstSimplify fix in ac74e7a7806480a000c9a3502405c3dedd8810de,
which addresses the miscompile reported in PR58401.
-----
foldOpIntoPhi() currently only folds operations into the phi if all
but one of the operands constant-fold. The two exceptions to this are freeze
and select, where we allow more general simplification.
This patch makes foldOpIntoPhi() generally simplification based and
removes all the instruction-specific logic. We just try to simplify
the instruction for each operand, and for the (potentially) one
non-simplified operand, we move it into the new block with adjusted
operands.
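For illustration, a sketch (hypothetical names) of a case that simplification handles but constant folding alone does not: the zero incoming value simplifies the multiply even though %y is not a constant, and the one remaining case is moved into its incoming block:
```
%phi = phi i32 [ %a, %bb0 ], [ 0, %bb1 ]
%r = mul i32 %phi, %y
==>
; in %bb0: %r.0 = mul i32 %a, %y
%r = phi i32 [ %r.0, %bb0 ], [ 0, %bb1 ]
```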
This fixes https://github.com/llvm/llvm-project/issues/57448, which
was my original motivation for the change.
Differential Revision: https://reviews.llvm.org/D134954
Relative to the previous attempt, this adjusts simplification to
use the correct context instruction: We need to use the terminator
of the incoming block, not the original instruction.
-----
foldOpIntoPhi() currently only folds operations into the phi if all
but one of the operands constant-fold. The two exceptions to this are freeze
and select, where we allow more general simplification.
This patch makes foldOpIntoPhi() generally simplification based and
removes all the instruction-specific logic. We just try to simplify
the instruction for each operand, and for the (potentially) one
non-simplified operand, we move it into the new block with adjusted
operands.
This fixes https://github.com/llvm/llvm-project/issues/57448, which
was my original motivation for the change.
Differential Revision: https://reviews.llvm.org/D134954
The infinite loop seen on buildbots should be fixed by
11897708c0229c92802e747564e7c34b722f045f (assuming there are not
multiple infinite combine loops...)
-----
foldOpIntoPhi() currently only folds operations into the phi if all
but one of the operands constant-fold. The two exceptions to this are freeze
and select, where we allow more general simplification.
This patch makes foldOpIntoPhi() generally simplification based and
removes all the instruction-specific logic. We just try to simplify
the instruction for each operand, and for the (potentially) one
non-simplified operand, we move it into the new block with adjusted
operands.
This fixes https://github.com/llvm/llvm-project/issues/57448, which
was my original motivation for the change.
Differential Revision: https://reviews.llvm.org/D134954
Reapply with a fix for the case where an operand simplified back
to the original phi: We need to map this case to the new phi node.
-----
foldOpIntoPhi() currently only folds operations into the phi if all
but one of the operands constant-fold. The two exceptions to this are freeze
and select, where we allow more general simplification.
This patch makes foldOpIntoPhi() generally simplification based and
removes all the instruction-specific logic. We just try to simplify
the instruction for each operand, and for the (potentially) one
non-simplified operand, we move it into the new block with adjusted
operands.
This fixes https://github.com/llvm/llvm-project/issues/57448, which
was my original motivation for the change.