llvm-project

Author	SHA1	Message	Date
Yingwei Zheng	470c5b8011	[InstSimplify][InstCombine] Remove unnecessary `m_c_` matchers. (#81712 ) This patch removes unnecessary `m_c_` matchers since we always canonicalize `commutive_op Cst, X` into `commutive_op X, Cst`. Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=bfc0b7c6891896ee8e9818f22800472510093864&to=d27b058bb9acaa43d3cadbf3cd889e8f79e5c634&stat=instructions:u	2024-02-14 16:40:36 +08:00
Yingwei Zheng	be2723da5c	[InstSimplify] Fold icmp of `X and/or C1` and `X and/or C2` into constant (#65905 ) This patch simplifies the pattern `icmp X and/or C1, X and/or C2` when one constant mask is the subset of the other. If `C1 & C2 == C1`, `A = X and/or C1`, `B = X and/or C2`, we can do the following folds: `icmp ule A, B -> true` `icmp ugt A, B -> false` We can apply similar folds for signed predicates when `C1` and `C2` are the same sign: `icmp sle A, B -> true` `icmp sgt A, B -> false` Alive2: https://alive2.llvm.org/ce/z/Q4ekP5 Fixes #65833.	2023-09-18 21:32:48 +08:00
Yingwei Zheng	a054d89cd4	[InstSimplify] Add signed version of pre-commit tests for PR65905. NFC.	2023-09-17 22:36:01 +08:00
Yingwei Zheng	2f45b56728	[InstSimplify] Add pre-commit tests for PR65905. NFC.	2023-09-17 20:40:19 +08:00
Nikita Popov	063b37e7b4	Reapply [IR] Mark and/or constant expressions as undesirable Reapply after D156401, which stops PatternMatch from recognizing binop constant expressions, which should avoid the infinite loops and assertion failures this patch previously exposed. ----- In preparation for removing support for and/or expressions, mark them as undesirable. As such, we will no longer implicitly create such expressions, but they still exist.	2023-07-31 09:54:24 +02:00
Matthew Voss	380dbfd8ca	Revert "Reapply [IR] Mark and/or constant expressions as undesirable" This reverts commit 0cab8d20417c0e2ccc1ffc5505e080126f5de8e6. Reverted due to an LTO crash. I've put a reduced test case here: https://github.com/llvm/llvm-project/issues/64114	2023-07-26 12:54:07 -07:00
Nikita Popov	0cab8d2041	Reapply [IR] Mark and/or constant expressions as undesirable This reapplies the change for and, but also marks or as undesirable at the same time. Only handling one of them can cause infinite combine loops due to the asymmetric handling. ----- In preparation for removing support for and/or expressions, mark them as undesirable. As such, we will no longer implicitly create such expressions, but they still exist.	2023-07-25 15:31:45 +02:00
Nathan Chancellor	17f4f262fc	Revert "Reapply [IR] Mark and constant expressions as undesirable" This reverts commit 086ee99564afbb11449c08ea2e094f7f49fadde5. This patch causes an infinite loop when building arch/mips/mm/c-r4k.c in the Linux kernel. See the comment in Phabricator for a reduced reproducer: https://reviews.llvm.org/rG086ee99564afbb11449c08ea2e094f7f49fadde5	2023-07-21 15:57:03 -07:00
Nikita Popov	086ee99564	Reapply [IR] Mark and constant expressions as undesirable Reapply after fixing an issue in canonicalizeLogicFirst() exposed by this change (218f97578b26f7a89f7f8ed0748c31ef0181f80a). ----- In preparation for removing support for and expressions, mark them as undesirable. As such, we will no longer implicitly create such expressions, but they still exist.	2023-07-21 10:10:50 +02:00
Nikita Popov	9dc391e89c	Revert "[IR] Mark add constant expressions as undesirable" This reverts commit f8a36d8c3e264c4fccf8058e699201a452ea7bb7. I believe this is causing an assertion failure on the sanitizer-x86_64-linux buildbot: clang++: /b/sanitizer-x86_64-linux/build/llvm-project/llvm/include/llvm/Support/Casting.h:578: decltype(auto) llvm::cast(From *) [To = llvm::BinaryOperator, From = llvm::Value]: Assertion `isa<To>(Val) && "cast<Ty>() argument of incompatible type!"' failed. #10 0x000055bdd7e82408 canonicalizeLogicFirst(llvm::BinaryOperator&, llvm::IRBuilder<llvm::TargetFolder, llvm::IRBuilderCallbackInserter>&) /b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp:2131:5 #11 0x000055bdd7e80183 llvm::InstCombinerImpl::visitAnd(llvm::BinaryOperator&) /b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp:2661:20 Likely the code is encountering a constant expression in a case it didn't before.	2023-07-20 18:09:17 +02:00
Nikita Popov	f8a36d8c3e	[IR] Mark add constant expressions as undesirable In preparation for removing support for add expressions, mark them as undesirable. As such, we will no longer implicitly create such expressions, but they still exist.	2023-07-20 15:24:19 +02:00
Noah Goldstein	2622b2f409	[ValueTracking] Use `select` condition to help determine if `select` is non-zero In `select c, x, y` the condition `c` dominates the resulting `x` or `y` chosen by the `select`. This adds logic to `isKnownNonZero` to try and use the `icmp` for the `c` condition to see if it implies the select `x` or `y` are known non-zero. For example in: ``` %c = icmp ugt i8 %x, %C %r = select i1 %c, i8 %x, i8 %y ``` The true arm of select `%x` is non-zero (when "returned" by the `select`) because `%c` being true implies `%x` is non-zero. Alive2 Links (with `x {pred} C`): - EQ iff `C != 0`: - https://alive2.llvm.org/ce/z/umLabn - NE iff `C == 0`: - https://alive2.llvm.org/ce/z/DQvy8Y - UGT [always]: - https://alive2.llvm.org/ce/z/HBkjgQ - UGE iff `C != 0`: - https://alive2.llvm.org/ce/z/LDNifB - SGT iff `C s>= 0`: - https://alive2.llvm.org/ce/z/QzWDj3 - SGE iff `C s> 0`: - https://alive2.llvm.org/ce/z/rR4g3D - SLT iff `C s<= 0`: - https://alive2.llvm.org/ce/z/uysayx - SLE iff `C s< 0`: - https://alive2.llvm.org/ce/z/2jYc7e Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D147900	2023-05-23 13:52:40 -05:00
Matt Devereau	da94a2b62a	[InstSimplify] Correct icmp_lshr test to use ult instead of slt	2023-02-20 09:51:11 +00:00
Matt Devereau	8299c764bd	[InstSimplify] Simplify icmp between Shl instructions of the same value define i1 @compare_vscales() { %vscale = call i64 @llvm.vscale.i64() %vscalex2 = shl nuw nsw i64 %vscale, 1 %vscalex4 = shl nuw nsw i64 %vscale, 2 %cmp = icmp ult i64 %vscalex2, %vscalex4 ret i1 %cmp } This IR is currently emitted by LLVM. This icmp is redundant as this snippet can be simplified to true or false as both operands originate from the same @llvm.vscale.i64() call. Differential Revision: https://reviews.llvm.org/D142542	2023-02-20 09:25:34 +00:00
Matt Devereau	d64d5772b1	Add InstSimplify tests for comparisons between known constants	2023-02-15 09:56:02 +00:00
Nikita Popov	5f01a626dd	[ConstantFold] Fix inbounds inference on mismatching source element type When inferring that a GEP of a global variable is inbounds because there is no notional overindexing, we need to check that the global value type and the GEP source element type match. This was not necessary with typed pointers (because we would have a bitcast in between), but is necessary with opaque pointers. We should be able to recover some of the safe cases by performing an offset based inbounds inference in DL-aware ConstantFolding.	2023-01-31 11:33:00 +01:00
Nikita Popov	04b944e230	[InstSimplify] Convert tests to opaque pointers (NFC) The only interesting test change is in @PR31262, where the following fold is now performed, while it previously was not: https://alive2.llvm.org/ce/z/a5Qmr6 llvm/test/Transforms/InstSimplify/ConstProp/gep.ll has not been updated, because there is a tradeoff between folding and inrange preservation there that we may want to discuss. Updates have been performed using: https://gist.github.com/nikic/98357b71fd67756b0f064c9517b62a34	2022-06-10 17:16:28 +02:00
Philip Reames	ff2e4c04c4	[instsimplify] Assume storage for byval args doesn't overlap allocas, globals, or other byval args This allows us to discharge many pointer comparisons based on byval arguments. Differential Revision: https://reviews.llvm.org/D120133	2022-02-18 11:08:01 -08:00
Philip Reames	a259e62bb6	[instsimplify] Add a couple more pointer compare folding tests [NFC]	2022-02-18 08:24:06 -08:00
Philip Reames	1cf790bd04	[instsimplify] Add pointer compare tests for byval args and globals	2022-02-18 07:50:57 -08:00
Philip Reames	cf5e88864b	[instsimplify] When compare allocas, consider their minimal size The code was using exact sizing only, but since what we really need is just to make sure the offsets are in bounds, a minimum bound on the object size is sufficient. To demonstrate the difference, support computing minimum sizes from obects of scalable vector type.	2022-02-17 09:53:24 -08:00
Philip Reames	2404313d80	[instsimplify] Fix a miscompile with zero sized allocas Remove some code which tried to handle the case of comparing two allocas where an object size could not be precisely computed. This code had zero coverage in tree, and at least one nasty bug. The bug comes from the fact that the code uses the size of the result pointer as a proxy for whether the alloca can be of size zero. Since the result of an alloca is always a pointer type, and a pointer type can never be empty, this check was a nop. As a result, we blindly consider a zero offset from two allocas to never be equal. They can in fact be equal when one or more of the allocas is zero sized. This is particularly ugly because instcombine contains the exact opposite rule. If instcombine reaches the allocas first, it combines them into one (making them equal). If instsimplify reaches the compare first, it would consider them not equal. This creates all kinds of fun scenarios for order of optimization reaching different and contradictory conclusions.	2022-02-17 09:27:34 -08:00
Philip Reames	7eb3ce997a	[instsimplify] Precommit a test showing an alloca equality miscompile	2022-02-17 09:16:31 -08:00
Bjorn Pettersson	b280ee1dd7	[test] Use -passes=instsimplify instead of -instsimplify in a number of tests. NFC Another step moving away from the deprecated syntax of specifying pass pipeline in opt. Differential Revision: https://reviews.llvm.org/D119080	2022-02-07 14:26:58 +01:00
Erik Desjardins	53b00b8215	[InstSimplify] Fold X {lshr,udiv} C <u X --> true for nonzero X, non-identity C This eliminates the bounds check in Rust code like pub fn mid(data: &[i32]) -> i32 { if data.is_empty() { return 0; } return data[data.len()/2]; } (from https://blog.sigplan.org/2021/11/18/undefined-behavior-deserves-a-better-reputation/) Alive proofs: lshr https://alive2.llvm.org/ce/z/nyTu8D udiv https://alive2.llvm.org/ce/z/CNUZH7 Differential Revision: https://reviews.llvm.org/D114279	2021-11-26 16:48:33 -05:00
Erik Desjardins	a68af62b42	[InstSimplify] baseline tests for icmp of lshr/udiv fold (NFC) Precommits tests for https://reviews.llvm.org/D114279 Differential Revision: https://reviews.llvm.org/D114280	2021-11-26 15:57:04 -05:00
Nikita Popov	e3cec17b2d	[InstSimplify] Remove incorrect icmp of gep fold (PR52429) As described in https://bugs.llvm.org/show_bug.cgi?id=52429 this fold is incorrect, because inbounds only guarantees that the pointers don't wrap in the unsigned space: It is possible that the sign boundary is crossed by an object. I'm dropping the fold entirely rather than adjusting it, because computePointerICmp() fully subsumes it (just with correct predicate handling). Differential Revision: https://reviews.llvm.org/D113343	2021-11-06 21:03:21 +01:00
Sanjay Patel	00808e321c	[InstSimplify] allow vector folds for (Pow2C << X) == NonPow2C Existing pre-conditions seem to be correct: https://rise4fun.com/Alive/lCLB Name: non-zero C1 Pre: !isPowerOf2(C1) && isPowerOf2(C2) && C1 != 0 %sub = shl i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false Name: one == C2 Pre: !isPowerOf2(C1) && isPowerOf2(C2) && C2 == 1 %sub = shl i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false Name: nuw Pre: !isPowerOf2(C1) && isPowerOf2(C2) %sub = shl nuw i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false Name: nsw Pre: !isPowerOf2(C1) && isPowerOf2(C2) %sub = shl nsw i8 C2, %X %cmp = icmp eq i8 %sub, C1 => %cmp = false	2020-11-08 09:52:05 -05:00
Sanjay Patel	73a5f0b614	[InstSimplify] add tests for icmp with power-of-2 operand; NFC	2020-11-08 09:52:05 -05:00
Sanjay Patel	c74db55ff5	[InstSimplify] allow vector folds for icmp Pred (1 << X), 0x80	2020-11-04 08:12:48 -05:00
Sanjay Patel	5765edbf9e	[InstSimplify] add vector cmp tests; NFC	2020-11-04 08:12:47 -05:00
Sanjay Patel	c72198079d	[ValueTracking] add range limits for cttz As discussed in D89952, instcombine can sometimes find a way to reduce similar patterns, but it is incomplete. InstSimplify uses the computeConstantRange() ValueTracking analysis via simplifyICmpWithConstant(), so we just need to fill in the max value of cttz to process any "icmp pred cttz(X), C" pattern (the min value is initialized to zero automatically). https://alive2.llvm.org/ce/z/Z_SLWZ Follow-up to D89976.	2020-10-23 08:43:45 -04:00
Sanjay Patel	3fb0d6b0d5	[ValueTracking] add range limits for ctlz As discussed in D89952, instcombine can sometimes find a way to reduce similar patterns, but it is incomplete. InstSimplify uses the computeConstantRange() ValueTracking analysis via simplifyICmpWithConstant(), so we just need to fill in the max value of ctlz to process any "icmp pred ctlz(X), C" pattern (the min value is initialized to zero automatically). Follow-up to D89976.	2020-10-23 08:43:45 -04:00
Sanjay Patel	0351bd959f	[InstSimplify] add tests for cttz constant range; NFC This is a search-and-replace of f6cb7f3	2020-10-23 08:43:45 -04:00
Sanjay Patel	9bcb437f46	[InstSimplify] add tests for ctlz constant range; NFC This is a search-and-replace of f6cb7f3.	2020-10-23 08:43:45 -04:00
Sanjay Patel	748ecc6b32	[ValueTracking] add range limits for ctpop As discussed in D89952, instcombine can sometimes find a way to reduce similar patterns, but it is incomplete. InstSimplify uses the computeConstantRange() ValueTracking analysis via simplifyICmpWithConstant(), so we just need to fill in the max value of ctpop to process any "icmp pred ctpop(X), C" pattern (the min value is initialized to zero automatically). Differential Revision: https://reviews.llvm.org/D89976	2020-10-23 08:17:54 -04:00
Sanjay Patel	f6cb7f37ff	[InstSimplify] add tests for ctpop constant range; NFC	2020-10-22 14:16:48 -04:00
Sjoerd Meijer	51d7df3fa1	[InstructionSimplify] icmp (X+Y), (X+Z) simplification This improves simplifications for pattern `icmp (X+Y), (X+Z)` -> `icmp Y,Z` if only one of the operands has NSW set, e.g.: icmp slt (x + 0), (x +nsw 1) We can still safely rewrite this to: icmp slt 0, 1 because we know that the LHS can't overflow if the RHS has NSW set and C1 < C2 && C1 >= 0, or C2 < C1 && C1 <= 0 This simplification is useful because ScalarEvolutionExpander which is used to generate code for SCEVs in different loop optimisers is not always able to put back NSW flags across control-flow, thus inhibiting CFG simplifications. Differential Revision: https://reviews.llvm.org/D89317	2020-10-22 08:55:52 +01:00
Sjoerd Meijer	e86a70ce3d	[InstructionSimplify] And precommit more tests for D89317. NFC.	2020-10-21 11:02:25 +01:00
Sjoerd Meijer	782b8f0d38	[InstructionSimplify] Precommit more tests for D89317. NFC.	2020-10-21 10:14:39 +01:00
Sanjay Patel	7c516504a1	[InstSimplify] allow vector splats for icmp-of-neg folds	2020-10-20 09:24:36 -04:00
Sanjay Patel	b11588b18e	[InstSimplify] add vector icmp tests; NFC	2020-10-20 09:24:35 -04:00
Sjoerd Meijer	66f22411e1	[InstructionSimplify] Precommit tests for D89317. NFC.	2020-10-13 15:40:33 +01:00
Xavier Denis	29fe3fe615	[InstSimplify] Peephole optimization for icmp (urem X, Y), X This revision adds the following peephole optimization and it's negation: %a = urem i64 %x, %y %b = icmp ule i64 %a, %x ====> %b = true With John Regehr's help this optimization was checked with Alive2 which suggests it should be valid. This pattern occurs in the bound checks of Rust code, the program const N: usize = 3; const T = u8; pub fn split_mutiple(slice: &[T]) -> (&[T], &[T]) { let len = slice.len() / N; slice.split_at(len * N) } the method call slice.split_at will check that len * N is within the bounds of slice, this bounds check is after some transformations turned into the urem seen above and then LLVM fails to optimize it any further. Adding this optimization would cause this bounds check to be fully optimized away. ref: https://github.com/rust-lang/rust/issues/74938 Differential Revision: https://reviews.llvm.org/D85092	2020-08-04 20:48:37 +02:00
Xavier Denis	b778b04b69	[InstSimplify] Add tests for icmp with urem divisor (NFC)	2020-08-04 20:45:20 +02:00
Nikita Popov	f89f7da999	[IR] Convert null-pointer-is-valid into an enum attribute The "null-pointer-is-valid" attribute needs to be checked by many pointer-related combines. To make the check more efficient, convert it from a string into an enum attribute. In the future, this attribute may be replaced with data layout properties. Differential Revision: https://reviews.llvm.org/D78862	2020-05-15 19:41:07 +02:00
Simon Pilgrim	6d24dd7ed1	[InstSimplify] Regenerate compares tests to fix issue reported on D77354	2020-04-03 17:34:56 +01:00
Simon Pilgrim	7f764fa18f	[ValueTracking] Add some initial isKnownNonZero DemandedElts support (PR36319)	2020-03-20 13:29:00 +00:00
Simon Pilgrim	95b6f62efb	[InstSimplify] Add some vector shift tests to show lack of DemandedElts support	2020-03-19 22:09:51 +00:00
Simon Pilgrim	0b458d4dca	[ValueTracking] Add computeKnownBits DemandedElts support to ADD/SUB/MUL instructions (PR36319)	2020-03-19 12:41:29 +00:00

1 2 3

138 Commits