llvm-project

Author	SHA1	Message	Date
Christoph Stiller	3af4590506	[InstCombine] Contracting x^2 + 2xy + y^2 to (x + y)^2 (float) Resolves https://github.com/llvm/llvm-project/issues/61296 if https://reviews.llvm.org/D156026 didn't suffice. Reviewed By: goldstein.w.n Differential Revision: https://reviews.llvm.org/D158079	2023-09-01 15:02:12 -05:00
Matt Arsenault	033d6ffb53	IR: Add operator | and & for FastMathFlags We only had |= and &= which was annoying.	2023-08-28 19:25:54 -04:00
Noah Goldstein	9c375a457e	[InstCombine] Transform `(add (shl (neg X), Cnt))` -> `(sub (shl X, Cnt))` Just a simple instruction save. Proof: https://alive2.llvm.org/ce/z/Vb484j Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D154807	2023-08-16 22:43:04 -05:00
Matt Arsenault	f7f5d348a3	InstCombine: Fold fneg (ldexp x, n) -> ldexp (fneg x), n https://reviews.llvm.org/D156579	2023-08-11 19:16:20 -04:00
Christoph Stiller	0bdab96a5a	[InstCombine] Contract x^2 + 2xy + y^2 to (x + y)^2 (integer) There are a number of variations on the pattern, depending on order. Proof: https://alive2.llvm.org/ce/z/ri5iFw Differential Revision: https://reviews.llvm.org/D156026	2023-08-09 14:19:21 +02:00
Matt Arsenault	020d2fb771	InstCombine: Avoid repeated m_OneUse checks The check was repeated for the fmul and fdiv case, and the caller was already checking anyway.	2023-08-02 10:42:56 -04:00
Matt Arsenault	ed0dbfef7a	InstCombine: Remove dead fsub-as-fneg handling The above combine matching m_FNeg to produce a new fneg always would hide this.	2023-08-02 10:42:56 -04:00
Antonio Frighetto	f12a5561b2	[InstCombine] Fold binop of `select` and cast of `select` condition Simplify binary operations, whose operands involve a `select` instruction and a cast of the `select` condition. Specifically, the binop is canonicalized into a `select` with folded arguments as follows: (Binop (zext C), (select C, T, F)) -> (select C, (binop 1, T), (binop 0, F)) (Binop (sext C), (select C, T, F)) -> (select C, (binop -1, T), (binop 0, F)) Proofs: https://alive2.llvm.org/ce/z/c_JwwM Differential Revision: https://reviews.llvm.org/D153963	2023-07-20 19:42:58 +00:00
Nikita Popov	21827268ad	[InstCombine] Fold add of zext and sext of i1 (zext a) + (sext a) is 0 if a is a bool. The regression is in a fuzzer-generated test. Proof: https://alive2.llvm.org/ce/z/KotnN6	2023-07-14 14:52:13 +02:00
Matt Arsenault	0f4eb557e8	ValueTracking: Replace CannotBeNegativeZero This is now just a wrapper around computeKnownFPClass.	2023-07-12 13:14:05 -04:00
Nikita Popov	b45a73f4d0	[InstCombine] Fold binop of shifts with related amounts Fold binop(shift(ShiftedC1, ShAmt), shift(ShiftedC2, add(ShAmt, AddC))) -> shift(binop(ShiftedC1, shift(ShiftedC2, AddC)), ShAmt) where both shifts are the same and AddC is a valid shift amount. Proofs: https://alive2.llvm.org/ce/z/PhVVeg Differential Revision: https://reviews.llvm.org/D152927	2023-06-28 09:54:36 +02:00
Noah Goldstein	91cdffcb2f	[InstCombine] Transform `(binop1 (binop2 (lshift X,Amt),Mask),(lshift Y,Amt))` If `Mask` and `Amt` are not constants and `binop1` and `binop2` are the same we can transform to: `(binop (lshift (binop X, Y), Amt), Mask)` If `binop` is `add`, `lshift` must be `shl`. If `Mask` and `Amt` are constants `C` and `C1` respectively. We can transform to: `(lshift1 (binop1 (binop2 X, (inv_lshift1 C, C1), Y)), C1)` Saving an instruction IFF: `lshift1` is same opcode as `lshift2` Either `bitwise1` and/or `bitwise2` is `and`. Proofs(1/2): https://alive2.llvm.org/ce/z/BjN-m_ Proofs(2/2): https://alive2.llvm.org/ce/z/bZn5QB This is to help fix the regression caused in D151807 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D152568	2023-06-13 20:08:35 -05:00
khei4	116670d192	[InstCombine] add overflow checking on Add ~X + C --> (C-1) - X Differential Revision: https://reviews.llvm.org/D152088	2023-06-06 12:24:45 +09:00
khei4	4db8d4f839	[InstCombine] add overflow checking on AddSub `C-(X+C2) --> (C-C2)-X` Differential Revision: https://reviews.llvm.org/D152068	2023-06-05 20:05:06 +09:00
Serguei Katkov	f38365aef4	[InstCombine] Add support for maximum(a,b) + minimum(a,b) => a + b Unfortunately alive2 cannot prove the correctness due to fails by timeout even for float type half. However it should be correct. If a and b are not NaN, maximum and minimum will just return different values (a and b) and take into account a + b == b + a this is the same. If a or b is NaN, than maximum and minimum are equal to NaN and NaN + NaN is NaN. a + b is also a NaN. In terms of preserving fast flags, we cannot preserve ninf due to minimum(NaN, Infinity) == maximum(NaN, Infinity) == NaN, minimum(NaN, Infinity) +ninf maximum(NaN, Infinity) == NaN +ninf NaN = NaN However transformation will change minimum(NaN, Infinity) + maximum(NaN, Infinity) to NaN +ninf Infinity == poison. But if fadd is marked as nnan, we can preserve because NaN +ninf/nnan NaN = poison as well. The same optimization for maximum(a,b) * minimum(a,b) => a * b is added. All said above for fadd is correct for fmul. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D147299	2023-04-07 12:38:04 +07:00
Serguei Katkov	624973806c	[InstCombine] Add support for max(a,b) + min(a,b) => a + b. Re-land. The same optimization for max(a,b) * min(a,b) => a * b is added. Correctness check: uadd: https://alive2.llvm.org/ce/z/2rXDek sadd: https://alive2.llvm.org/ce/z/zNu_er uadd + nuw/nsw: https://alive2.llvm.org/ce/z/EaiNjB sadd + nuw/nsw: https://alive2.llvm.org/ce/z/w_2Nrs umul: https://alive2.llvm.org/ce/z/dgXRLr smul: https://alive2.llvm.org/ce/z/hBjGzz umul + nuw/nsw: https://alive2.llvm.org/ce/z/EaiNjB smul + nuw/nsw: https://alive2.llvm.org/ce/z/87MNeS Reviewed By: goldstein.w.n Differential Revision: https://reviews.llvm.org/D147296	2023-04-07 11:56:05 +07:00
Serguei Katkov	4665f3c838	Revert "[InstCombine] Add support for max(a,b) + min(a,b) => a + b." Revert commit due to failure on buildbot: error: 'match_combine_or' may not intend to support class template argument deduction This reverts commit b86a06ef284f2637bef89bf5bb20157a8b195568.	2023-04-07 11:14:28 +07:00
Serguei Katkov	b86a06ef28	[InstCombine] Add support for max(a,b) + min(a,b) => a + b. The same optimization for max(a,b) * min(a,b) => a * b is added. Correctness check: uadd: https://alive2.llvm.org/ce/z/2rXDek sadd: https://alive2.llvm.org/ce/z/zNu_er uadd + nuw/nsw: https://alive2.llvm.org/ce/z/EaiNjB sadd + nuw/nsw: https://alive2.llvm.org/ce/z/w_2Nrs umul: https://alive2.llvm.org/ce/z/dgXRLr smul: https://alive2.llvm.org/ce/z/hBjGzz umul + nuw/nsw: https://alive2.llvm.org/ce/z/EaiNjB smul + nuw/nsw: https://alive2.llvm.org/ce/z/87MNeS Reviewed By: goldstein.w.n Differential Revision: https://reviews.llvm.org/D147296	2023-04-07 10:24:07 +07:00
Jun Zhang	3ecf731376	[InstCombine] Reduce absolute diff from min+max+sub This patch implements fold: max(a,b) nsw/nuw - min(a,b) --> abs(a nsw - b) Alive2: https://alive2.llvm.org/ce/z/4yLp7D Fixes: https://github.com/llvm/llvm-project/issues/61228 Signed-off-by: Jun Zhang <jun@junz.org> Differential Revision: https://reviews.llvm.org/D145540	2023-03-10 08:00:40 +08:00
Sanjay Patel	542e70f96c	[InstCombine] reassociate subtract-from-constant to add-constant (C - X) + Y --> (Y - X) + C Moving the constant operand to an 'add' gives more flexibility to subsequent reassociation patterns, and it may be better for codegen on targets that don't have subtract-from-immediate instructions.	2023-02-28 14:21:13 -05:00
Kazu Hirata	f8f3db2756	Use APInt::count{l,r}_{zero,one} (NFC)	2023-02-19 22:04:47 -08:00
Craig Topper	df76ff98e8	[InstCombine][LV] Fold (add (zext (add X, -1)), 1) -> (zext X) if X is non-zero. This artifact can appear from the vectorizer. (add X, -1) is the backedge taken count. It gets zero extended and then 1 is added to it to get the trip count. There is usually a dominating branch that rules out X being zero. Alive: https://alive2.llvm.org/ce/z/NsRDwX	2023-01-30 17:45:01 -08:00
Sanjay Patel	1720ec6da0	[InstCombine] restrict no-wrap propagation for i1/i2 to avoid miscompiles This transform was added with 68c197f07eeae71b9b7, and the post-commit review noted the potential for miscompiles at narrow bitwidths. I'm not sure how to expose the i1 nuw bug because we already simplify that, but other cases show that there are missing transforms to add in follow-up patches.	2023-01-18 10:32:12 -05:00
Sanjay Patel	830ac677b7	[InstCombine] reduce code duplication in visitSub(); NFC	2023-01-18 10:17:07 -05:00
Sanjay Patel	68c197f07e	[InstCombine] factor difference-of-squares to reduce multiplication (X * X) - (Y * Y) --> (X + Y) * (X - Y) https://alive2.llvm.org/ce/z/BAuRCf The no-wrap propagation could be relaxed in some cases, but there does not seem to be an obvious rule for that.	2023-01-17 14:58:40 -05:00
Sanjay Patel	dedc58da49	[InstCombine] canonicalize a signum (spaceship) that ends in add (A s>> (BW - 1)) + (zext (A s> 0)) --> (A s>> (BW - 1)) | (zext (A != 0)) https://alive2.llvm.org/ce/z/V-nM8N This is not the form that we currently match as m_Signum(), but I'm not sure if one is better than the other, so there's a follow-up patch needed either way. For this patch, it should be better for analysis to use a not-null test and bitwise logic rather than >0 with add. Codegen doesn't seem significantly different on any targets that I looked at. Also note that none of these variants is shown in issue #60012 - those generally include at least one 'select', so that's likely where these patterns will end up.	2023-01-16 12:47:21 -05:00
luxufan	561ee10a25	[InstCombine] Combine ZExt (B - A) + ZExt(A) to ZExt(B) Combine ZExt (B - A) + ZExt(A) to ZExt(B) https://alive2.llvm.org/ce/z/ESUwPi Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D139930	2022-12-21 22:53:29 +08:00
Matt Arsenault	8ab032fbe5	InstCombine: Fold fneg (copysign x, y) -> copysign x, (fneg y)	2022-12-20 17:14:34 -05:00
Sanjay Patel	86b4a2355e	[InstCombine] fold flooring sdiv by power-of-2 to ashr It's a bigger match than usual, but I have not found any sub-patterns that reduce: (X / DivC) + sext ((X & (SMin | (DivC - 1)) >u SMin) --> X >>s log2(DivC) https://alive2.llvm.org/ce/z/MJzlhl Fixes issue #55741	2022-12-18 08:17:07 -05:00
Sanjay Patel	93099c7d21	[InstCombine] allow more fast-math-flags to propagate in fneg-of-select fold We were conservatively intersecting flags, but we can take the union here because forbidden special values (nsz/nnan/ninf) are not altered by fneg. So if they were guaranteed not present on the select or fneg, then they are guaranteed not present on the new select. Alive2 appears to agree on the test diffs (reduced to not include irrelevant flags like reassoc): https://alive2.llvm.org/ce/z/ViqqrO This prevents a potential regression if we tighten up the FMF behavior for fabs with NAN as suggested in issue #59279.	2022-12-09 11:07:24 -05:00
chenglin.bi	6703290361	[InstCombine] fold `sub + and` pattern with specific const value `C1 - ((C3 - X) & C2) --> (X & C2) + (C1 - (C2 & C3))` when: (C3 - ((C2 & C3) - 1)) is pow2 && ((C2 + C3) & ((C2 & C3) - 1)) == ((C2 & C3) - 1) && C2 is negative pow2 || (C3 - X) is nuw https://alive2.llvm.org/ce/z/HXQJV- Fix: #58523 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D136582	2022-11-05 12:58:45 +08:00
zhongyunde	f58311796c	[InstCombine] refactor the SimplifyUsingDistributiveLaws NFC Precommit for D136015 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D137019	2022-10-30 21:04:06 +08:00
zhongyunde	620cff096a	[InstCombine] Fold series of instructions into mull for more types Relax the constraint of wide/vectors types. Address the comment https://reviews.llvm.org/D136015?id=469189#inline-1314520 Reviewed By: spatel, chfast Differential Revision: https://reviews.llvm.org/D136661	2022-10-25 23:04:46 +08:00
zhongyunde	81713e893a	[InstCombine] Fold series of instructions into mull The following sequence should be folded into in0 * in1 In0Lo = in0 & 0xffffffff; In0Hi = in0 >> 32; In1Lo = in1 & 0xffffffff; In1Hi = in1 >> 32; m01 = In1Hi * In0Lo; m10 = In1Lo * In0Hi; m00 = In1Lo * In0Lo; addc = m01 + m10; ResLo = m00 + (addc >> 32); Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D136015	2022-10-25 01:09:37 +08:00
Sanjay Patel	ee0bf64722	[InstCombine] try to fold mul by neg-power-of-2 to shl `(A * -2**C) + B --> B - (A << C)` https://alive2.llvm.org/ce/z/A6BWkf This inverts what Negator was doing before: D134310 / 0f32a5dea0e9 Analysis and codegen are generally better without multiply, so we should favor this form even if we trade add for sub (because those are generally equivalent cost operations).	2022-09-21 15:09:39 -04:00
Marc Auberer	f52dd920d4	[InstCombine] Fix bug when folding x + (x | -x) to x & (x - 1) Addresses concern: https://reviews.llvm.org/rG09cdddea0c4d284c2c22f5dfade40a60850c5ea7 There was a copy/paste mistake in the code. Updated code and test ref. Differential Revision: https://reviews.llvm.org/D134135	2022-09-18 13:16:12 -04:00
Marc Auberer	09cdddea0c	[InstCombine] Fold x + (x | -x) to x & (x - 1) Fixes #57531 This transformation may be particularly useful on x86-64, because x & (x - 1) can be performed by a single blsr instruction. Differential Revision: https://reviews.llvm.org/D133362	2022-09-11 06:14:24 -04:00
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Sanjay Patel	ae117e1c1b	[InstCombine] remove dead code for add (select cond, (sub), 0); NFC This pattern is handled more generally in SimplifySelectsFeedingBinaryOp(). Tests to confirm that added to the add.ll test file in the previous commit.	2022-09-06 12:19:50 -04:00
Sanjay Patel	6c39a3aae1	[InstCombine] fold not-shift of signbit to icmp+zext https://alive2.llvm.org/ce/z/j_8Wz9 The arithmetic shift was converted to logical shift with: 246078604c871d43 That does not seem to uncover any other missing/conflicting folds, so convert directly to signbit test + cast. We still need to fold the pattern with logical shift to test + cast. This allows reducing patterns where the output type is not the same as the input value: https://alive2.llvm.org/ce/z/nydwFV Fixes #57394	2022-08-29 10:06:31 -04:00
Sanjay Patel	246078604c	[InstCombine] fold inc-of-signbit-splat to not+lshr (iN X s>> (N - 1)) + 1 --> (~X) u>> (N - 1) https://alive2.llvm.org/ce/z/wzS474	2022-08-29 08:48:22 -04:00
Sanjay Patel	f7ab70cf8d	[InstCombine] reduce disguised mul+add factorization ~(A * C1) + A --> (A * (1 - C1)) - 1 This is a non-obvious mix of bitwise logic and math: https://alive2.llvm.org/ce/z/U7ACVT The pattern may be produced by Negator from the more typical code seen in issue #57255.	2022-08-24 16:02:12 -04:00
Simon Pilgrim	2f217c1214	[InstCombine] Canonicalize ((X & -X) - 1) --> ((X - 1) & ~X) (PR51784) Enables the ctpop((x & -x ) - 1) -> cttz(x, false) fold Alive2: https://alive2.llvm.org/ce/z/EDk4h7 (((X & -X) - 1) --> (~X & (X - 1)) ) Alive2: https://alive2.llvm.org/ce/z/8Yr3XG (CTPOP -> CTTZ) Fixes #51126 Differential Revision: https://reviews.llvm.org/D110488	2022-08-24 16:50:43 +01:00
Simon Pilgrim	80cc8f0f62	Revert rGc360955c4804e9b25017372cb4c6be7adcb216ce "[InstCombine] Canonicalize ((X & -X) - 1) --> (~X & (X - 1)) (PR51784)" The test changes are failing on some buildbots (but not others.....).	2022-08-24 16:26:28 +01:00
Simon Pilgrim	c360955c48	[InstCombine] Canonicalize ((X & -X) - 1) --> (~X & (X - 1)) (PR51784) Enables the ctpop((x & -x ) - 1) -> cttz(x, false) fold Alive2: https://alive2.llvm.org/ce/z/EDk4h7 (((X & -X) - 1) --> (~X & (X - 1)) ) Alive2: https://alive2.llvm.org/ce/z/8Yr3XG (CTPOP -> CTTZ) Fixes #51126 Differential Revision: https://reviews.llvm.org/D110488	2022-08-24 15:31:15 +01:00
Jay Foad	f82c55fa08	[InstCombine] Change order of canonicalization of ADD and AND Canonicalize ((x + C1) & C2) --> ((x & C2) + C1) for suitable constants C1 and C2, instead of the other way round. This should allow more constant ADDs to be matched as part of addressing modes for loads and stores. Differential Revision: https://reviews.llvm.org/D130080	2022-08-22 20:03:53 +01:00
Sanjay Patel	fa68d93d54	[InstCombine] fold reassociative fadd with negated operand We manage to iteratively achieve this result with no extra uses, and the reassociate pass can also do this, but this pattern falls through the cracks in the example from issue #57053.	2022-08-11 11:43:36 -04:00
Nikita Popov	5eaeeed8cb	[InstCombine] Avoid ConstantExpr::getFNeg() calls (NFCI) Instead call the constant folding API, which can fail. For now, this should be NFC, as we still allow the creation of fneg constant expressions.	2022-07-29 16:01:46 +02:00
Sanjay Patel	08091a99ae	Revert "[InstCombine] enhance fold for subtract-from-constant -> xor" This reverts commit 79bb915fb60b2cd220d89e3bb54f67abb8cdb7bd. This caused regressions because SCEV works better with sub.	2022-07-22 15:56:24 -04:00
Nikita Popov	fc18a88231	[InstCombine] Avoid creating float binop ConstantExprs Replace ConstantExpr:getFAdd etc with call to ConstantFoldBinaryOpOperands(). I'm using the constant folding API rather than IRBuilder here to ensure that this does actually constant fold. These transforms don't use m_ImmConstant(), so this would not otherwise be guaranteed (and apparently, they can't use m_ImmConstant because they want to handle scalable vector splats). There is an opportunity here to further migrate these to the ConstantFoldFPInstOperands() API, which would respect the denormal mode. I've held off on doing so here, because some of this code explicitly checks for denormal results, and I don't want to touch it in a mostly NFC change.	2022-07-08 16:36:04 +02:00


404 Commits
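
Several of the integer folds named in the commit messages above are plain identities in wrapping arithmetic, so they can be sanity-checked by brute force. The sketch below does that for 8-bit values only. It is an illustration, not a substitute for the Alive2 proofs linked in the commits: the bit width, the helper names (`wrap`, `ashr`, `FOLDS`, `failing_folds`) and the choice of folds are assumptions made here, and the check ignores `nsw`/`nuw` flags and poison entirely.

```python
# Illustrative only: BITS, FOLDS and failing_folds are hypothetical names,
# not part of LLVM. Models wrapping i8 arithmetic; ignores nsw/nuw and poison.
BITS = 8
MASK = (1 << BITS) - 1


def wrap(v):
    """Truncate a Python int to an unsigned BITS-wide value."""
    return v & MASK


def ashr(v, amt):
    """Arithmetic shift right of a BITS-wide value."""
    v = wrap(v)
    signed = v - (1 << BITS) if v >> (BITS - 1) else v
    return wrap(signed >> amt)


# (name, original expression, folded expression); y doubles as the constant C.
FOLDS = [
    ("x + (x | -x) -> x & (x - 1)",
     lambda x, y: x + (x | -x), lambda x, y: x & (x - 1)),
    ("(X & -X) - 1 -> (X - 1) & ~X",
     lambda x, y: (x & -x) - 1, lambda x, y: (x - 1) & ~x),
    ("(X * X) - (Y * Y) -> (X + Y) * (X - Y)",
     lambda x, y: x * x - y * y, lambda x, y: (x + y) * (x - y)),
    ("x^2 + 2xy + y^2 -> (x + y)^2",
     lambda x, y: x * x + 2 * x * y + y * y, lambda x, y: (x + y) * (x + y)),
    ("~X + C -> (C - 1) - X",
     lambda x, y: ~x + y, lambda x, y: (y - 1) - x),
    ("(X s>> (N - 1)) + 1 -> (~X) u>> (N - 1)",
     lambda x, y: ashr(x, BITS - 1) + 1, lambda x, y: wrap(~x) >> (BITS - 1)),
    ("~(A * C1) + A -> (A * (1 - C1)) - 1",
     lambda x, y: ~(x * y) + x, lambda x, y: x * (1 - y) - 1),
]


def failing_folds():
    """Return the names of folds that disagree for some pair of inputs."""
    failed = []
    for name, before, after in FOLDS:
        if any(wrap(before(x, y)) != wrap(after(x, y))
               for x in range(1 << BITS) for y in range(1 << BITS)):
            failed.append(name)
    return failed


print(failing_folds())
```

An empty list means every listed fold agreed on all 8-bit inputs. Agreement at one width says nothing about other widths or about flag propagation, which is where several of the listed follow-up fixes came from (for example the i1/i2 no-wrap restriction in 1720ec6da0).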