llvm-project

Author	SHA1	Message	Date
hanbeom	4ef22fce82	[InstCombine] Simplify select if it combinated and/or/xor (#73362 ) `and/or/xor` operations can each be changed to sum of logical operations including operators other than themselves. `x&y -> (x\|y) ^ (x^y)` `x\|y -> (x&y) \| (x^y)` `x^y -> (x\|y) ^ (x&y)` if left of condition of `SelectInst` is `and/or/xor` logical operation and right is equal to `0, -1`, or a `constant`, and if `TrueVal` consist of `and/or/xor` logical operation then we can optimize this case. This patch implements this combination. Proof: https://alive2.llvm.org/ce/z/WW8iRR Fixes https://github.com/llvm/llvm-project/issues/71792.	2024-04-03 14:29:10 +08:00
zhongyunde 00443407	bd9a2afa80	[InstCombine] Add tests for selects with same conditions (NFC)	2024-03-18 17:51:13 +01:00
Noah Goldstein	01d8e1ca01	[ValueTracking] Handle non-canonical operand order in `isImpliedCondICmps` We don't always have canonical order here, so do it manually. Closes #85575	2024-03-17 17:46:06 -05:00
Nikita Popov	9f45c5e1a6	[InstCombine] Fix infinite loop in select equivalence fold (#84036 ) When replacing with a non-constant, it's possible that the result of the simplification is actually more complicated than the original, and may result in an infinite combine loop. Mitigate the issue by requiring that either the replacement or simplification result is constant, which should ensure that it's simpler. While this check is crude, it does not appear to cause optimization regressions in real-world code in practice. Fixes https://github.com/llvm/llvm-project/issues/83127.	2024-03-06 09:33:51 +01:00
Yingwei Zheng	d51fcd4ed8	[InstCombine] Handle scalable splat in `getFlippedStrictnessPredicateAndConstant` (#83980 ) This patch adds support for canonicalization of icmp with a scalable splat. Some optimizations assume that `icmp pred X, APInt C` is in canonical form. Fixes https://github.com/llvm/llvm-project/issues/83931.	2024-03-05 21:08:15 +08:00
Paul Walker	fd07b8f809	[LLVM][tests/Transforms/InstCombine] Convert instances of ConstantExpr based splats to use splat(). This is mostly NFC but some output does change due to consistently inserting into poison rather than undef and using i64 as the index type for inserts.	2024-02-27 13:37:23 +00:00
hanbeom	66eedd1dd3	[InstCombine] Fix worklist management in select fold (#77738 ) `InstCombine` uses `Worklist` to manage change history. `setOperand`, which was previously used to change the `Select` Instruction, does not, so it is `run` twice, which causes an `LLVM ERROR`. This problem is resolved by changing `setOperand` to `replaceOperand` as the change history will be registered in the Worklist. Fixes #77553.	2024-01-11 09:34:30 +01:00
Yingwei Zheng	554feb0058	[InstSimplify] Simplify `select cond, undef, val` to `val` if `val = poison` implies `cond = poison` (#76465 ) This patch folds: ``` select cond, undef, val -> val select cond, val, undef -> val ``` iff `impliesPoison(val, cond)` returns true. Example: ``` define i32 @src1(i32 %retval.0.i.i) { %cmp.i = icmp sgt i32 %retval.0.i.i, -1 %spec.select.i = select i1 %cmp.i, i32 %retval.0.i.i, i32 undef ret i32 %spec.select.i } define i32 @tgt1(i32 %retval.0.i.i) { ret i32 %retval.0.i.i } ``` Alive2: https://alive2.llvm.org/ce/z/okJW3G Compile-time impact: http://llvm-compile-time-tracker.com/compare.php?from=38c9390b59c4d2b9181614d6a909887497d3692f&to=e146f51ba278aa3bb6879a9ec651831ac8938e91&stat=instructions%3Au	2023-12-28 23:37:19 +08:00
Yingwei Zheng	ff76627aeb	[InstCombine] Fix type mismatch between cond and value in `foldSelectToCopysign` (#76343 ) This patch fixes the miscompilation when we try to bitcast a floating point vector into an integer scalar.	2023-12-26 00:04:06 +08:00
Craig Topper	7ec4f6094e	[InstCombine] Infer disjoint flag on Or instructions. (#72912 ) The disjoint flag was recently added to IR in #72583 We already set it when we turn an add into an or. This patch sets it on Ors that weren't converted from an Add.	2023-12-02 14:11:12 -08:00
Nikita Popov	cd31cf5989	[InstSimplify] Fix or disjoint miscompile with op replacement Make sure %x does not get folded to "or disjoint %x, %x" without dropping the flag, as this would be a derefinement.	2023-12-01 11:45:09 +01:00
Nikita Popov	5a1020bb00	[InstSimplify] Add test for disjoint or miscompile (NFC) The absorption case is already handled correctly, but the idempentence case is not.	2023-12-01 11:45:09 +01:00
Nikita Popov	2fda200cd8	[InstCombine] Canonicalize phi order for newly inserted nodes When new phi nodes are inserted at the start of the block, the order of these does not get canonicalized, as we pick the first phi node to canonicalize towards (and the other phi nodes may have already been visited). This results in phi nodes not being deduplicated and thus a fix-point verification failure. Fix this by remembering the predecessors of the first phi node we encounter -- this will usually result in the same order we picked previously. I also considered using predecessors() order instead, but that causes substantial test fallout. Additionally the predecessors() order tends to be the reverse of the "natural" order. Fixes https://github.com/llvm/llvm-project/issues/46688.	2023-09-27 09:56:23 +02:00
Dhruv Chawla	3e992d81af	[InferAlignment] Enable InferAlignment pass by default This gives an improvement of 0.6%: https://llvm-compile-time-tracker.com/compare.php?from=7d35fe6d08e2b9b786e1c8454cd2391463832167&to=0456c8e8a42be06b62ad4c3e3cf34b21f2633d1e&stat=instructions:u Differential Revision: https://reviews.llvm.org/D158600	2023-09-20 12:08:52 +05:30
Nikita Popov	ad7f02010f	[InstCombine] Process blocks in RPO InstComine currently processes blocks in an arbitrary depth-first order. This can break the usual invariant that the operands of an instruction should be simplified before the instruction itself, if uses across basic blocks (particularly inside phi nodes) are involved. This patch switches the initial worklist population to use RPO instead, which will ensure that predecessors are visited before successors (back-edges notwithstanding). This allows us to fold more cases within a single InstCombine iteration, in preparation for D154579. This change by itself is a minor compile-time regression of about 0.1%, which will be more than recovered by switching to single-iteration InstCombine. Differential Revision: https://reviews.llvm.org/D75362	2023-07-30 18:38:45 +02:00
Nikita Popov	547544112b	[InstSimplify] Allow gep inbounds x, 0 -> x in non-refining op replacement After the semantics change from https://reviews.llvm.org/D154051, gep inbounds x, 0 can no longer produce poison. As such, we can also perform this fold during non-refining operand replacement and avoid unnecessary drops of the inbounds flag. The online alive2 version has not been update to the new semantics yet, but we can use the following proof locally: define ptr @src(ptr %base, i64 %offset) { %cmp = icmp eq i64 %offset, 0 %gep = getelementptr inbounds i8, ptr %base, i64 %offset %sel = select i1 %cmp, ptr %base, ptr %gep ret ptr %sel } define ptr @tgt(ptr %base, i64 %offset) { %gep = getelementptr inbounds i8, ptr %base, i64 %offset ret ptr %gep }	2023-07-14 16:14:50 +02:00
Noah Goldstein	13f16f4dea	[InstCombine] Canonicalize `(icmp eq/ne (and x, C), x)` -> `(icmp eq/ne (and x, ~C), 0)` This increases the likelyhood `x` is single-use and is typically easier to analyze. Proofs: https://alive2.llvm.org/ce/z/8ZpS2W Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D154004	2023-06-29 13:14:37 -05:00
Nikita Popov	8378f1f4cd	[InstCombine] Remove adjustMinMax() fold (PR62088) This fold is buggy if the constant adjustment overflows. Additionally, since we now canonicalize to min/max intrinsics, the constants picked here don't actually matter, as long as SPF still recognizes the pattern. Fixes https://github.com/llvm/llvm-project/issues/62088.	2023-05-30 16:06:38 +02:00
Nikita Popov	271b5cf562	[InstCombine] Fix infinite combine loop (PR61361) In the degenerate case where the select is fed by an unsimplified icmp with two constant operands, don't try to replace one constant with another. Wait for the icmp to be simplified first instead. Fixes https://github.com/llvm/llvm-project/issues/61361.	2023-03-14 16:43:00 +01:00
Sanjay Patel	f48f178717	[InstCombine] canonicalize cmp+select as smin/smax (V == SMIN) ? SMIN+1 : V --> smax(V, SMIN+1) (V == SMAX) ? SMAX-1 : V --> smin(V, SMAX-1) https://alive2.llvm.org/ce/z/d5bqjy Follow-up for the unsigned variants added with: 86b4d8645fc1b866 issue #60374	2023-02-12 07:54:43 -05:00
Sanjay Patel	14411adc3d	[InstCombine] add tests for disguised smin/smax; NFC Adapted from the existing umin/umax tests.	2023-02-12 07:54:43 -05:00
Sanjay Patel	86b4d8645f	[InstCombine] canonicalize cmp+select as umin/umax (V == 0) ? 1 : V --> umax(V, 1) (V == UMAX) ? UMAX-1 : V --> umin(V, UMAX-1) https://alive2.llvm.org/ce/z/pfDBAf This is one pair of the variants discussed in issue #60374. Enhancements for the other end of the constant range and signed variants are potential follow-ups, but that may require more work because we canonicalize at least one min/max like that to icmp+zext.	2023-02-08 17:25:58 -05:00
Sanjay Patel	b98813fbb6	[InstCombine] add tests for cmp+select; NFC	2023-02-08 17:25:58 -05:00
Paul Walker	eae26b6640	[IRBuilder] Use canonical i64 type for insertelement index used by vector splats. Instcombine prefers this canonical form (see getPreferredVectorIndex), as does IRBuilder when passing the index as an integer so we may as well use the prefered form from creation. NOTE: All test changes are mechanical with nothing else expected beyond a change of index type from i32 to i64. Differential Revision: https://reviews.llvm.org/D140983	2023-01-11 14:08:06 +00:00
Nikita Popov	c93e7dec7e	[InstCombine] Convert some tests to opaque pointers (NFC) Check lines for these were regenerated, but without any significant changes (mostly different GEP source element types).	2022-12-27 16:29:52 +01:00
Nikita Popov	4b04c30340	[InstCombine] Regenerate test checks (NFC)	2022-12-27 16:21:31 +01:00
Nikita Popov	cd9e2ba0b3	Revert "[InstCombine] Convert some tests to opaque pointers (NFC)" This reverts commit 66cea84681e16f3d4ebdc69031824b114a0d5681. I did not intend to commit all the changes in here, but only the ones with no significant differences.	2022-12-27 16:19:13 +01:00
Nikita Popov	66cea84681	[InstCombine] Convert some tests to opaque pointers (NFC)	2022-12-27 16:12:59 +01:00
Philip Reames	2e999b7dd1	Allow scalable vectors in ComputeNumSignBits and isKnownNonNull This is a follow up to D136470 which extends the same scheme used there to ComputeNumSignBits and isKnownNonNull. As a reminder, for scalable vectors we track a single bit which is implicitly broadcast to all lanes. We do not know how many lanes there are statically, and thus have to be conservative along paths which require exact sizes. Differential Revision: https://reviews.llvm.org/D137046	2022-11-01 09:29:42 -07:00
Philip Reames	3f532e0524	[test] Add coverage for sign bits and known non-zero for scalable vectors	2022-10-31 12:03:29 -07:00
Philip Reames	144e38f5e5	[test] Refresh an autogened test to pickup naming change	2022-10-31 11:36:45 -07:00
Craig Topper	90b695d1f2	[IR][InstCombine] Support scalable vector splats ConstantExprs in Constant::getUniqueInteger(). I've left the getAggregateElement as a fast path for non-ConstantExprs to avoid a call to getSplatValue in release builds. Fixes PR57989. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D134670	2022-09-26 14:55:15 -07:00
Nikita Popov	41dde5d858	[InstSimplify] Support vectors in simplifyWithOpReplaced() We can handle vectors inside simplifyWithOpReplaced(), as long as cross-lane operations are excluded. The equality can hold (or not hold) for each vector lane independently, so we shouldn't use the replacement value from other lanes. I believe the only operations relevant here are shufflevector (where all previous bugs were seen) and calls (which might use shuffle-like intrinsics and would require more careful classification). Differential Revision: https://reviews.llvm.org/D134348	2022-09-22 10:45:42 +02:00
Nikita Popov	a342350b53	[InstSimplify] Add vector tests for simplifyWithOpReplaced (NFC)	2022-09-21 12:20:38 +02:00
Markus Böck	b751da43b2	[InstCombine] Handle integer extension in `select` patterns using the condition as value These patterns were previously only implemented for i1 type but can be extended for any integer type by also handling zext and sext operands. Differential Revision: https://reviews.llvm.org/D134142	2022-09-20 22:25:13 +02:00
Markus Böck	bba25c4c9f	[InstCombine] Precommit test for D134142	2022-09-20 15:10:52 +02:00
Nikita Popov	8a519b3c21	[InstCombine] Ensure constant folding in binop of select fold When folding a binop into a select, we need to ensure that one of the select arms actually does constant fold, otherwise we'll create two binop instructions and perform the reverse transform. Ensure this by performing an explicit constant folding attempt, and failing the transform if neither side simplifies. A simple alternative here would have been to limit the fold to ImmConstants, but given the current representation of scalable vector splats, this wouldn't be ideal.	2022-07-15 11:03:10 +02:00
Nuno Lopes	952e069393	[NFC] remove 'br undef' from InstCombine test cases This is UB and allows the compiler to give any result, so these tests weren't meaningful InstCombine tests are now clean of 'br undef'	2022-06-10 15:28:57 +01:00
Chenbing Zheng	51df77f36d	[InstCombine] Allow undef vectors when foldSelectToCopysign Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D125671	2022-05-19 10:57:49 +08:00
Chenbing Zheng	6c81079edf	[InstCombine] precommit tests for foldSelectToCopysign	2022-05-17 16:51:54 +08:00
Sanjay Patel	c8ed784ee6	[InstCombine] fold freeze of partial undef/poison vector constants We can always replace the undef elements in a vector constant with regular constants to get rid of the freeze: https://alive2.llvm.org/ce/z/nfRb4F The select diffs show that we might do better by adjusting the logic for a frozen select condition. We may also want to refine the vector constant replacement to consider forming a splat. Differential Revision: https://reviews.llvm.org/D123962	2022-04-26 14:16:11 -04:00
Sanjay Patel	5f7c385498	[InstCombine] add tests for freeze of partial undef vector constants; NFC	2022-04-19 12:41:50 -04:00
Sanjay Patel	3a27b51b27	[InstCombine] reduce code for freeze of undef The description was ambiguous about the behavior when boths select arms are constant or both arms are not constant. I don't think there's any evidence to support either way, but this matches the code with a more specified description. We can extend this to deal with vector constants with undef/poison elements. Currently, those don't get folded anywhere.	2022-04-18 15:14:02 -04:00
Sanjay Patel	60de144119	[InstCombine] add tests for select with frozen condition; NFC	2022-04-18 15:14:02 -04:00
Sanjay Patel	84812b9b07	[InstCombine] drop FMF in select->copysign transform It is not correct to propagate flags from the select to the new instructions: https://alive2.llvm.org/ce/z/tNATrd https://alive2.llvm.org/ce/z/VwcVzn Fixes #54077	2022-03-01 08:51:41 -05:00
Sanjay Patel	53dbedcd18	[InstCombine] add test for copysign with FMF propagation; NFC This is a miscompile as noted in #54077.	2022-03-01 08:51:40 -05:00
Sanjay Patel	f422c5d871	[InstCombine] fold select-of-zero-or-ones with negated op (X u< 2) ? -X : -1 --> sext (X != 0) (X u> 1) ? -1 : -X --> sext (X != 0) https://alive2.llvm.org/ce/z/U3y5Bb https://alive2.llvm.org/ce/z/hgi-4p This is part of solving:	2022-02-28 12:07:49 -05:00
Sanjay Patel	614f36fd38	[InstCombine] add tests for select of zero or all ones; NFC See #54053	2022-02-28 11:20:02 -05:00
Nikita Popov	a266af7211	[InstCombine] Canonicalize SPF to min/max intrinsics Now that integer min/max intrinsics have good support in both InstCombine and other passes, start canonicalizing SPF min/max to intrinsic min/max. Once this sticks, we can stop matching SPF min/max in various places, and can remove hacks we have for preventing infinite loops and breaking of SPF canonicalization. Differential Revision: https://reviews.llvm.org/D98152	2022-02-24 09:01:20 +01:00
Bjorn Pettersson	acdc419c89	[test] Use -passes=instcombine instead of -instcombine in lots of tests. NFC Another step moving away from the deprecated syntax of specifying pass pipeline in opt. Differential Revision: https://reviews.llvm.org/D119081	2022-02-07 14:26:59 +01:00

1 2 3 4 5

236 Commits