llvm-project

Author	SHA1	Message	Date
zhongyunde	620cff096a	[InstCombine] Fold series of instructions into mull for more types Relax the constraint of wide/vectors types. Address the comment https://reviews.llvm.org/D136015?id=469189#inline-1314520 Reviewed By: spatel, chfast Differential Revision: https://reviews.llvm.org/D136661	2022-10-25 23:04:46 +08:00
Sanjay Patel	5dcfc32822	[InstCombine] allow more commutative matches for logical-and to select fold This is a sibling transform to the fold just above it. That was changed to allow the corresponding commuted patterns with: 307307456277 e1bd759ea567 8628e6df7000	2022-10-24 16:40:43 -04:00
Craig Topper	1edc51b56a	[InstCombine] Explicitly check for scalable TypeSize. Instead of assuming it is a fixed size. Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D136517	2022-10-24 12:29:06 -07:00
zhongyunde	81713e893a	[InstCombine] Fold series of instructions into mull The following sequence should be folded into in0 * in1 In0Lo = in0 & 0xffffffff; In0Hi = in0 >> 32; In1Lo = in1 & 0xffffffff; In1Hi = in1 >> 32; m01 = In1Hi * In0Lo; m10 = In1Lo * In0Hi; m00 = In1Lo * In0Lo; addc = m01 + m10; ResLo = m00 + (addc >> 32); Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D136015	2022-10-25 01:09:37 +08:00
Ahmed Bougacha	bddd9b6b91	[InstCombine] Combine ptrauth sign/resign + auth/resign intrinsics. (sign\|resign) + (auth\|resign) can be folded by omitting the middle sign+auth component if the key and discriminator match. Differential Revision: https://reviews.llvm.org/D132383	2022-10-24 08:03:14 -07:00
Mike Hommey	86e57e66da	[InstCombine] Bail out of casting calls when a conversion from/to byval is involved. Fixes #58307 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D135738	2022-10-23 09:49:48 +02:00
Sanjay Patel	8628e6df70	[InstCombine] use freeze to enable poison-safe logic->select fold Without a freeze, this transform can leak poison to the output: https://alive2.llvm.org/ce/z/GJuF9i This makes the transform as uniform as possible, and it can help reduce patterns like issue #58313 (although that particular example probably still needs another transform). Differential Revision: https://reviews.llvm.org/D136527	2022-10-22 10:42:14 -04:00
Sanjay Patel	e1bd759ea5	[InstCombine] allow more matches for logical-ands --> select This allows patterns with real 'and' instructions because those are safe to transform: https://alive2.llvm.org/ce/z/7-U_Ak	2022-10-22 08:15:50 -04:00
Sanjay Patel	3073074562	[InstCombine] allow more commutative matches for logical-and to select fold When the common value is part of either select condition, this is safe to reduce. Otherwise, it is not poison-safe (with the select form of the pattern): https://alive2.llvm.org/ce/z/FxQTzB This is another patch motivated by issue #58313.	2022-10-21 13:29:13 -04:00
Sanjay Patel	d7fecf26f4	[InstCombine] allow some commutative matches for logical-and to select fold This is obviously correct for real logic instructions, and it also works for the poison-safe variants that use selects: https://alive2.llvm.org/ce/z/wyHiwX This is motivated by the lack of 'xor' folding seen in issue #58313. This more general fold should help reduce some of those patterns, but I'm not sure if this specific case does anything for that particular example.	2022-10-21 11:28:38 -04:00
Sanjay Patel	f6fc3e23b9	[InstCombine] refactor matching code for logical ands; NFCI Separating the matches makes it easier to enhance for commutative patterns.	2022-10-21 11:28:38 -04:00
Sanjay Patel	bf75e937bb	[InstCombine] match logical and/or more generally in fold to select This allows the regular bitwise logic opcodes in addition to the poison-safe select variants: https://alive2.llvm.org/ce/z/8xB9gy Handling commuted variants safely is likely trickier, so that's left to another patch.	2022-10-21 09:03:36 -04:00
William Huang	6c767cef5a	[InstCombine] Canonicalize GEP of GEP by swapping constant-indexed GEP to the back Canonicalize GEP of GEP by swapping GEP with some suffix constant indices to the back (and GEP with all constant indices to the back of that), this allows more constant index GEP merging to happen. Exceptions are: If swapping violates use-def relations, or anti-optimizes LICM For constant indexed GEP of GEP, if they cannot be merged directly, they will be casted to i8* and merged. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D125845	2022-10-20 17:41:26 +00:00
Nabeel Omer	e1fd6d49a3	[InstCombine] Fix assert condition in `foldSelectShuffleOfSelectShuffle` Bug introduced in e239198cdbbf. The assert() is making an assumption that the resulting shuffle mask will always select elements from both vectors, this is untrue in the case of two shuffles being folded if the former shuffle has a mask with undef elements in it. In such a case folding the shuffles might result in a mask which only selects from one of the vectors because the other elements (in the mask) are undef. Differential Revision: https://reviews.llvm.org/D136256	2022-10-20 12:10:54 +00:00
Sanjay Patel	44b7da89d7	[InstCombine] fmul nnan X, 0.0 --> copysign(0.0, X) https://alive2.llvm.org/ce/z/ybgM5F Differential Revision: https://reviews.llvm.org/D136166	2022-10-18 11:34:02 -04:00
Sanjay Patel	d16989607b	[InstCombine] reduce code duplication in visitBranchInst(); NFCI	2022-10-18 11:34:02 -04:00
Daniel Sanders	021e6e05d3	[instsimplify] Move (extelt (inselt Vec, Value, Index), Index) -> Value from InstCombine As requested in https://reviews.llvm.org/D135625#3858141 Differential Revision: https://reviews.llvm.org/D136099	2022-10-17 15:22:06 -07:00
Nikita Popov	779fd39684	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify Relative to the previous attempt, this is rebased over the InstSimplify fix in ac74e7a7806480a000c9a3502405c3dedd8810de, which addresses the miscompile reported in PR58401. ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change. Differential Revision: https://reviews.llvm.org/D134954	2022-10-17 16:11:05 +02:00
Florian Hahn	699396131f	Revert "Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit 333246b48ea4a70842e78c977cc92d365720465f. It looks like this patch causes a mis-compile: https://github.com/llvm/llvm-project/issues/58401 Fixes #58401.	2022-10-17 12:56:28 +01:00
Sanjay Patel	e5ee0b06d6	[InstCombine] try to determine "exact" for sdiv If the divisor is a power-of-2 or negative-power-of-2 and the dividend is known to have >= trailing zeros than the divisor, the division is exact: https://alive2.llvm.org/ce/z/UGBksM (general proof) https://alive2.llvm.org/ce/z/D4yPS- (examples based on regression tests) This isn't the most direct optimization (we could create ashr in these examples instead of relying on existing folds for exact divides), but it's possible that there's a more general constraint than just a pow2 divisor, so this might be extended in the future. This should solve issue #58348. Differential Revision: https://reviews.llvm.org/D135970	2022-10-16 10:59:56 -04:00
Sanjay Patel	340ae45be0	[InstCombine] use isKnownNonNegative() for readability; NFCI This should be functionally equivalent - both calls are thin wrappers around computeKnownBits(). We'll probably want to use known-bits directly in follow-up patches because that could determine "exact" for example (see issue #58348).	2022-10-16 10:59:56 -04:00
Sanjay Patel	d85505a932	[InstCombine] fold logical and/or to xor (A \| B) & ~(A & B) --> A ^ B https://alive2.llvm.org/ce/z/qpFMns We already have the equivalent fold for real logic instructions, but this pattern may occur with selects too. This is part of solving issue #58313.	2022-10-13 16:12:20 -04:00
Sanjay Patel	7b9482df3d	[InstCombine] fold sdiv with common shl amount in operands (X << Z) / (Y << Z) --> X / Y https://alive2.llvm.org/ce/z/CLKzqT This requires a surprising "nuw" constraint because we have to guard against immediate UB via signed-div overflow with -1 divisor. This extends 008a89037a49ca0d9 and is another transform derived from issue #58137.	2022-10-12 11:32:15 -04:00
Sanjay Patel	008a89037a	[InstCombine] fold udiv with common shl amount in operands (X << Z) / (Y << Z) --> X / Y https://alive2.llvm.org/ce/z/E5eaxU This fixes the motivating example from issue #58137, but it is not the most general transform. We should probably also convert left-shift in the divisor to right-shift in the dividend for that, but that exposes another missed canonicalization for shifts and adds.	2022-10-12 11:12:26 -04:00
Sanjay Patel	fe97f95036	[InstCombine] propagate "exact" through folds of div These folds were added recently with: 6b869be8100d 8da2fa856f1b ...but they didn't account for the "exact" attribute, and that can be safely propagated: https://alive2.llvm.org/ce/z/F_WhnR https://alive2.llvm.org/ce/z/ft9Cgr	2022-10-12 09:25:05 -04:00
Sanjay Patel	d117ee25b8	[InstCombine] add helper function for div+shl folds; NFC There are at least 2 similar patterns that could be added here, and the existing fold can be improved because it fails to propagate "exact".	2022-10-12 09:25:04 -04:00
Sanjay Patel	7ec604a317	[InstCombine] try harder to cancel out mul/div ((Op1 * X) / Y) / Op1 --> X / Y https://alive2.llvm.org/ce/z/JYxWjA InstSimplify handles the more basic mul+div pattern with shared operand, but we don't seem to have any reassociation folds to handle cases where the common op is further away. This is a generalization of 9cff4711ac72 and another transform derived from issue #58137.	2022-10-11 09:51:51 -04:00
Daniel Sanders	4a95a64e4a	[instcombine] (extelt (inselt Vec, Value, Index), Index) -> Value When Index is variable but still trivially known to be equal we can use Value from before the insertion, possibly eliminating the vector. Reverts a functional change from: Author: Philip Reames <listmail@philipreames.com> Date: Wed Dec 8 12:21:10 2021 -0800 [instcombine] A couple style tweaks to visitExtractElementInst [nfc] Thanks to Michele Scandale for identifying the bug Differential Revision: https://reviews.llvm.org/D135625	2022-10-10 15:41:53 -07:00
Sanjay Patel	9cff4711ac	[InstCombine] fold udiv with common factor ((X *nuw Y) >> Z) / X --> Y >> Z https://alive2.llvm.org/ce/z/x3kKnq This is similar to 6b869be8100d / 8da2fa856f1b, but I have not found a signed equivalent, so it's just an unsigned match for now.	2022-10-10 08:12:06 -04:00
Sanjay Patel	eccb9a77c6	[InstCombine] fold exact sdiv to ashr (2nd try) The 1st attempt failed to updated the test checks as expected. Original commit message: sdiv exact X, (1<<ShAmt) --> ashr exact X, ShAmt (if shl is non-negative) https://alive2.llvm.org/ce/z/kB6VF7 It would probably be better to use ValueTracking to replace this and the existing transform above it, but the analysis does not account for the no-wrap properly, and it's not immediately clear to me how to fix it.	2022-10-08 10:09:44 -04:00
Sanjay Patel	68d4dbc2c1	Revert "[InstCombine] fold exact sdiv to ashr" This reverts commit fe15290e0cf5d2bcdefca2e81ef6ff8155a2f7a8. The test checks were not updated as expected.	2022-10-08 10:02:03 -04:00
Sanjay Patel	fe15290e0c	[InstCombine] fold exact sdiv to ashr sdiv exact X, (1<<ShAmt) --> ashr exact X, ShAmt (if shl is non-negative) https://alive2.llvm.org/ce/z/kB6VF7 It would probably be better to use ValueTracking to replace this and the existing transform above it, but the analysis does not account for the no-wrap properly, and it's not immediately clear to me how to fix it.	2022-10-08 09:23:46 -04:00
Sanjay Patel	3e6767ed5f	[InstCombine] propagate 'exact' when converting ashr to lshr The shift amount is not changing, so if we guaranteed shifting out zeros before, those bits are still zeros. https://alive2.llvm.org/ce/z/sokQca	2022-10-07 13:17:19 -04:00
Sanjay Patel	bdfefac9a4	[InstCombine] refactor sdiv by (negative) power-of-2 folds; NFCI It's probably better to try harder on this kind of pattern by using ValueTracking.	2022-10-07 11:35:17 -04:00
Nikita Popov	333246b48e	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify Relative to the previous attempt, this adjusts simplification to use the correct context instruction: We need to use the terminator of the incoming block, not the original instruction. ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change. Differential Revision: https://reviews.llvm.org/D134954	2022-10-07 11:04:19 +02:00
Alina Sbirlea	b9898e7ed1	Revert "Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit e94619b955104841cc2a4a6febe4025ee140194e.	2022-10-06 13:12:24 -07:00
Sanjay Patel	8da2fa856f	[InstCombine] fold sdiv with hidden common factor (X * Y) s/ (X << Z) --> Y s/ (1 << Z) https://alive2.llvm.org/ce/z/yRSddG issue #58137	2022-10-06 13:11:50 -04:00
Sanjay Patel	6b869be810	[InstCombine] fold udiv with hidden common factor (X * Y) u/ (X << Z) --> Y u>> Z https://alive2.llvm.org/ce/z/4G9D_W	2022-10-06 11:35:27 -04:00
Nikita Popov	e94619b955	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify The infinite loop seen on buildbots should be fixed by 11897708c0229c92802e747564e7c34b722f045f (assuming there are not multiple infinite combine loops...) ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change. Differential Revision: https://reviews.llvm.org/D134954	2022-10-05 14:00:19 +02:00
Nikita Popov	11897708c0	[InstCombine] Directly replace instr in foldIntegerTypedPHI() (NFCI) Rather than inserting a ptrtoint + inttoptr pair, directly replace the inttoptr with the new phi node. This ensures that no other transform can undo it before the pair gets folded away. This avoids the infinite loop when combined with D134954. This is NFCI in the sense that it shouldn't make a difference, but could due to different worklist order.	2022-10-05 13:28:23 +02:00
Gulfem Savrun Yeniceri	d7592bbb03	Revert "Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit e1dd2cd063785ea3a6004c8d173f13113b1b8265 because the original commit b20e34b39f72f2be035dfb7367b6880fd2cf213a had a dramatic increase in the build time of RTfuzzer, which caused Fuchsia Clang toolchain builders to timeout: https://luci-milo.appspot.com/ui/p/fuchsia/builders/toolchain.ci/clang-linux-x64/b8801248587754572961/overview	2022-10-04 20:57:34 +00:00
Nikita Popov	e1dd2cd063	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify Reapply with a fix for the case where an operand simplified back to the original phi: We need to map this case to the new phi node. ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change.	2022-10-04 15:18:34 +02:00
Nikita Popov	0f32f0e147	Revert "[InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit b20e34b39f72f2be035dfb7367b6880fd2cf213a. This causes RAUW type mismatch assertions on some buildbots, reverting for now.	2022-10-04 11:17:09 +02:00
Nikita Popov	b20e34b39f	[InstCombine] Switch foldOpIntoPhi() to use InstSimplify foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change.	2022-10-04 10:12:14 +02:00
Sanjay Patel	2e87333bfe	[InstCombine] convert mul by negative-pow2 to negate and shift This is an unusual canonicalization because we create an extra instruction, but it's likely better for analysis and codegen (similar reasoning as D133399). InstCombine::Negator may create this kind of multiply from negate and shift, but this should not conflict because of the narrow negation. I don't know how to create a fully general proof for this kind of transform in Alive2, but here's an example with bitwidths similar to one of the regression tests: https://alive2.llvm.org/ce/z/J3jTjR Differential Revision: https://reviews.llvm.org/D133667	2022-10-02 12:22:25 -04:00
Sanjay Patel	e239198cdb	[InstCombine] fold select shuffles with shared operand together We don't combine generic shuffles together in IR, but select shuffles are a special-case because a select shuffle of a select shuffle is just another select shuffle; codegen is expected to efficiently lower those (select shuffles are also the canonical form of a vector select with constant condition).	2022-09-28 11:56:27 -04:00
Sanjay Patel	def6cbd2bd	[InstCombine] add assert/test for zext to i1 This is a test to verify that we do not crash with the problem noted in issue #57986. The root problem should be fixed with a prior change to InstSimplify.	2022-09-26 16:01:25 -04:00
Nikita Popov	8df376db72	[InstCombine] Remove buggy zext of icmp eq with pow2 fold (PR57899) For the case where the constant is a power of two rather than zero, the fold is incorrect, because it fails to check that the bit set in the LHS matches the bit in the RHS. Rather than fixing this, remove the power of two handling entirely, as a different fold will already canonicalize such comparisons to use a zero constant. Fixes https://github.com/llvm/llvm-project/issues/57899.	2022-09-22 16:37:10 +02:00
Nikita Popov	c2e76f914c	[InstCombine] Use simplifyWithOpReplaced() for non-bool selects Perform the simplifyWithOpReplaced() fold even for non-bool selects. This subsumes a number of recently added folds for zext/sext of the condition. We still need to manually handle variations with both sext/zext and not, because simplifyWithOpReplaced() only performs one level of replacements.	2022-09-22 15:46:00 +02:00
Nikita Popov	41dde5d858	[InstSimplify] Support vectors in simplifyWithOpReplaced() We can handle vectors inside simplifyWithOpReplaced(), as long as cross-lane operations are excluded. The equality can hold (or not hold) for each vector lane independently, so we shouldn't use the replacement value from other lanes. I believe the only operations relevant here are shufflevector (where all previous bugs were seen) and calls (which might use shuffle-like intrinsics and would require more careful classification). Differential Revision: https://reviews.llvm.org/D134348	2022-09-22 10:45:42 +02:00

1 2 3 4 5 ...

5178 Commits