llvm-project

Author	SHA1	Message	Date
Nikita Popov	a759745169	[InstCombine] Support multiple comparisons in foldAllocaCmp() foldAllocaCmp() needs to fold all comparisons of an alloca at the same time, to ensure that there is a consistent view of the alloca address. Currently, it folds "all" comparisons by limiting to the case where there is only one. This patch switches the algorithm to instead actually collect and fold all comparisons. Something we need to be careful about here is that there may be comparisons where both sides of the icmp are based on the alloca. Such comparisons are comparing offsets of the alloca, and as such can be ignored here, but shouldn't be folded to false. Differential Revision: https://reviews.llvm.org/D144492	2023-04-14 11:32:58 +02:00
Nikita Popov	cf9f1a8203	[InstCombine] Remove visitGEPOfBitcast() fold (NFC) This does not apply to opaque pointers, and as such is no longer necessary.	2023-04-06 09:04:31 +02:00
Nikita Popov	3cbdcd6ebf	[InstCombine] Remove PromoteCastOfAllocation() fold (NFC) This fold does not apply to opaque pointers, and as such is no longer needed.	2023-04-05 15:55:43 +02:00
chenglin.bi	dd31a3b3a5	[InstCombine] fold icmp of the sum of ext bool based on limited range For the pattern `(zext i1 X) + (sext i1 Y)`, the constant range is [-1, 1]. We can simplify the pattern by logical operations. Like: ``` (zext i1 X) + (sext i1 Y) == -1 --> ~X & Y (zext i1 X) + (sext i1 Y) == 0 --> ~(X ^ Y) (zext i1 X) + (sext i1 Y) == 1 --> X & ~Y ``` And other predicates can the combination of these results: ``` (zext i1 X) + (sext i1 Y)) != -1 --> X \| ~Y (zext i1 X) + (sext i1 Y)) s> -1 --> X \| ~Y (zext i1 X) + (sext i1 Y)) u< -1 --> X \| ~Y (zext i1 X) + (sext i1 Y)) s> 0 --> X & ~Y (zext i1 X) + (sext i1 Y)) s< 0 --> ~X & Y (zext i1 X) + (sext i1 Y)) != 1 --> ~X \| Y (zext i1 X) + (sext i1 Y)) s< 1 --> ~X \| Y (zext i1 X) + (sext i1 Y)) u> 1 --> ~X & Y ``` All alive proofs: https://alive2.llvm.org/ce/z/KmgDpF https://alive2.llvm.org/ce/z/fLwWa9 https://alive2.llvm.org/ce/z/ZKQn2P Fix: https://github.com/llvm/llvm-project/issues/59666 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D143373	2023-02-15 10:34:00 +08:00
chenglin.bi	6f149a17d4	[InstCombine] Look through truncate to fold icmp with intrinsics The output of intrinsic functions like ctpop, cttz, ctlz have limited range from 0 to bitwidth. So if the truncate destination type can hold the source bitwidth size, we can just ignore the truncate and use the truncate src to do combination. Alive2 proofs: https://alive2.llvm.org/ce/z/9D_-qP Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D143368	2023-02-10 10:33:07 +08:00
Matt Arsenault	e9f3034feb	InstCombine: Perform basic isnan combines on llvm.is.fpclass is.fpclass(x, qnan\|snan) -> fcmp uno x, 0.0 is.fpclass(nnan x, qnan\|snan\|other) -> is.fpclass(x, other) Start porting the existing combines from llvm.amdgcn.class to the generic intrinsic. Start with the ones which aren't dependent on the FP mode.	2023-02-05 08:36:09 -04:00
Sanjay Patel	c09c90b90b	[InstCombine] rename variables for readability; NFC There's no reason to use "CI" (cast instruction) when we know that the value is a more specific (exact) type of instruction (although we might want to common-ize some of this code to eliminate duplication or logic diffs). It's also visually difficult to distinguish between "CI", "ICI", and "IC" acronyms (and those could change meaning depending on context). This was partially changed in earlier commits, so this makes this pair of functions consistent.	2023-01-24 14:18:40 -05:00
Pierre van Houtryve	b3fdb7b0cb	[InstCombine] Combine lshr of add -> (a + b < a) Tries to perform (lshr (add (zext X), (zext Y)), K) -> (icmp ult (add X, Y), X) where - The add's operands are zexts from a K-bits integer to a bigger type. - The add is only used by the shr, or by iK (or narrower) truncates. - The lshr type has more than 2 bits (other types are boolean math). - K > 1 This seems to be a pattern that just comes from OpenCL front-ends, so adding DAG/GISel combines doesn't seem to be worth the complexity. Original patch D107552 by @abinavpp - adapted to use (a + b < a) instead of uaddo following discussion on the review. See this issue https://github.com/RadeonOpenCompute/ROCm/issues/488 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D138814	2023-01-10 03:37:23 -05:00
chenglin.bi	87b2c760d0	[Instcombine] fold logic ops to select (C & X) \| ~(C \| Y) -> C ? X : ~Y https://alive2.llvm.org/ce/z/4yLh_i Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D139080	2023-01-05 12:04:35 +08:00
Roman Lebedev	cf58063a40	[InstCombine] Canonicalize math-y conditional negation into a `select` https://alive2.llvm.org/ce/z/vPs-gZ This is a larger pattern than would seem necessary, with minimal being: * `and` https://alive2.llvm.org/ce/z/q9-MqK * `or` https://alive2.llvm.org/ce/z/AUUEMZ * `xor` https://alive2.llvm.org/ce/z/dm3Ume .. so for all others, we canonicalize away from math to `select`, but there we canonicalize in the opposite direction. Fixes https://github.com/llvm/llvm-project/issues/59791	2023-01-02 21:26:37 +03:00
Roman Lebedev	b20ccccda2	[InstCombine] Support sinking `not` into logical operand with invertible hands The important bit here is that we gracefully handle other uses, iff they can be adapted to inversion. I'll note, the previous logic was actively bad, it increased instruction count since it didn't actually ensure that the inversions happened.	2022-12-19 04:11:16 +03:00
Roman Lebedev	9f0c9e4725	[InstCombine] Try to sink `not` of one operand of logical operation into another hand Matches what we do for binary operations, but a special care needs is needed to preserve operand order, as the logical operations are not strictly commutative!	2022-12-19 01:10:16 +03:00
Craig Topper	ad476fb217	[InstCombine] Remove code duplication between InstCombiner.h and InstCombineInternal.h. The class in InstCombineInternal.h inherits from InstCombiner.h. I think this split was created when target specific InstCombines were moved to go through TTI. I had to update some of the code in InstCombiner.h to match changes that had been made to InstCombineInternal.h. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D140230	2022-12-16 11:42:23 -08:00
Nikita Popov	379de1239e	[InstCombine] Preserve instruction name in replaceInstUsesWith() Currently InstCombine folds using the `return replaceInstUsesWith(V, Builder.CreateFoo())` pattern do not preserve the original name of the instruction. To preserve the name, you either have to use something like `return FooInst::Create(...)` which is usually less nice, or go out of the way to preserve the name with takeName(). We often don't do that. This patch instead preserves the name in replaceInstUsesWith() when replacing a named instruction with an unnamed instruction. To be conservative, I also added a zero-use check, which is a proxy for the case where the instruction was just created, rather than an existing one reused. Possibly we could drop that part. As InstCombine tests are robust against renames this does not cause any test diffs, so I regenerated a random test to show the effects. Differential Revision: https://reviews.llvm.org/D140192	2022-12-16 16:01:25 +01:00
Vasileios Porpodas	32b38d248f	[NFC] Rename Instruction::insertAt() to Instruction::insertInto(), to be consistent with BasicBlock::insertInto() Differential Revision: https://reviews.llvm.org/D140085	2022-12-15 12:27:45 -08:00
Nikita Popov	43b5fbae3b	Revert "[InstCombine] Handle logical op in simplifyRangeCheck() (PR59484)" This reverts commit 492c471839a66e354ebe696bd3e15f7477c63613. As pointed out by nloped, the transform in f2 is not correct: If %shr is poison, then freeze may result in a negative value. The transform is correct in the case where the freeze is pushed through the operation in a way that guarantees the result is non-negative, which is the case I had tested.	2022-12-14 12:04:21 +01:00
Matt Arsenault	8fc25caae5	InstCombine: Fold logic of fp_classes together Move logical operators on pairs of llvm.is.fpclass on the same value into the test mask of a single is_fpclass. or (class x, mask0), (class x, mask1) -> class x, (mask0 \| mask1) and (class x, mask0), (class x, mask1) -> class x, (mask0 & mask1) xor (class x, mask0), (class x, mask1) -> class x, (mask0 ^ mask1) The and/or cases should appear frequently in the builtin math libraries; haven't seen the xor case but handle it for completeness.	2022-12-13 10:51:41 -05:00
Nikita Popov	492c471839	[InstCombine] Handle logical op in simplifyRangeCheck() (PR59484) We need to freeze to avoid propagating a potentially poison upper bound (https://alive2.llvm.org/ce/z/MsD38k). This resolves the existing TODO in the code. Fixes https://github.com/llvm/llvm-project/issues/59484.	2022-12-13 09:51:18 +01:00
Fangrui Song	21cd58baa1	[Transforms/InstCombine] llvm::Optional => std::optional	2022-12-13 08:26:08 +00:00
Vasileios Porpodas	06911ba6ea	[NFC] Cleanup: Replaces BB->getInstList().insert() with I->insertAt(). This is part of a series of cleanup patches towards making BasicBlock::getInstList() private. Differential Revision: https://reviews.llvm.org/D138877	2022-12-12 13:33:05 -08:00
Matt Arsenault	e661185fb3	InstCombine: Fold fdiv nnan x, 0 -> copysign(inf, x) https://alive2.llvm.org/ce/z/gLBFKB	2022-11-07 22:00:15 -08:00
Matt Devereau	a8c24d57b8	[InstCombine] Remove redundant splats in InstCombineVectorOps Splatting the first vector element of the result of a BinOp, where any of the BinOp's operands are the result of a first vector element splat can be simplified to splatting the first vector element of the result of the BinOp Differential Revision: https://reviews.llvm.org/D135876	2022-11-07 15:39:05 +00:00
Peter Waller	e1790c8c29	Revert "[InstCombine] Remove redundant splats in InstCombineVectorOps" This reverts commit 957eed0b1af2cb88edafe1ff2643a38165c67a40.	2022-11-03 07:56:03 +00:00
Matt Devereau	957eed0b1a	[InstCombine] Remove redundant splats in InstCombineVectorOps Splatting the first vector element of the result of a BinOp, where any of the BinOp's operands are the result of a first vector element splat can be simplified to splatting the first vector element of the result of the BinOp Differential Revision: https://reviews.llvm.org/D135876	2022-11-02 11:57:05 +00:00
Sanjay Patel	4299b28a9b	[InstCombine] add helper function for select-of-bools folds; NFC This set of folds keeps growing, and it contains bugs like issue #58552, so make it easier to spot those via backtrace.	2022-11-01 11:06:18 -04:00
zhongyunde	f58311796c	[InstCombine] refactor the SimplifyUsingDistributiveLaws NFC Precommit for D136015 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D137019	2022-10-30 21:04:06 +08:00
Nikita Popov	11897708c0	[InstCombine] Directly replace instr in foldIntegerTypedPHI() (NFCI) Rather than inserting a ptrtoint + inttoptr pair, directly replace the inttoptr with the new phi node. This ensures that no other transform can undo it before the pair gets folded away. This avoids the infinite loop when combined with D134954. This is NFCI in the sense that it shouldn't make a difference, but could due to different worklist order.	2022-10-05 13:28:23 +02:00
Sanjay Patel	6bfe5361b7	[InstCombine] add helper function for extract of with-overflow-intrinsic; NFC We can do more with these patterns, so this block is going to grow.	2022-08-09 12:38:11 -04:00
Fangrui Song	fa66789d06	[llvm] LLVM_NODISCARD => [[nodiscard]]. NFC With C++17 there is no Clang pedantic warning.	2022-08-07 00:26:33 +00:00
Alexander Shaposhnikov	4220ef2be1	[InstCombine] Add fold for redundant sign bits count comparison For power-of-2 C: ((X s>> ShiftC) ^ X) u< C --> (X + C) u< (C << 1) ((X s>> ShiftC) ^ X) u> (C - 1) --> (X + C) u> ((C << 1) - 1) (https://github.com/llvm/llvm-project/issues/56479) Test plan: 0/ ninja check-llvm check-clang + bootstrap LLVM/Clang 1/ https://alive2.llvm.org/ce/z/eEUfx3 Differential revision: https://reviews.llvm.org/D130433	2022-07-30 09:06:53 +00:00
Kazu Hirata	3f3930a451	Remove redundaunt virtual specifiers (NFC) Identified with tidy-modernize-use-override.	2022-07-25 23:00:59 -07:00
Nikita Popov	c81dff3c30	[MemoryBuiltins] Add getFreedOperand() function (NFCI) We currently assume in a number of places that free-like functions free their first argument. This is true for all hardcoded free-like functions, but with the new attribute-based design, the freed argument is supposed to be indicated by the allocptr attribute. To make sure we handle this correctly once allockind(free) is respected, add a getFreedOperand() helper which returns the freed argument, rather than just indicating whether the call frees some argument. This migrates most but not all users of isFreeCall() to the new API. The remaining users are a bit more tricky.	2022-07-21 12:39:35 +02:00
Nikita Popov	c6b88cb918	[InstCombine] Push freeze through recurrence phi We really want to push freezes through recurrence phis, so that we freeze only the start value, rather than the IV value on every iteration. foldOpIntoPhi() already handles this for the case where the transfer function doesn't produce poison, e.g. %iv.next = add %iv, 1. However, this does not work if nowrap flags are present, e.g. the very common %iv.next = add nuw %iv, 1 case. This patch adds a fold that pushes freeze instructions to the start value by checking whether all backedge values will be non-poison after poison generating flags have been dropped. This allows pushing freezes out of loops in most cases. I suspect that this also obsoletes the CanonicalizeFreezeInLoops pass, and we can probably drop it. Fixes https://github.com/llvm/llvm-project/issues/56048. Differential Revision: https://reviews.llvm.org/D127960	2022-06-17 15:01:41 +02:00
Nuno Lopes	e5c5f92e12	[InstCombine] switch synthetic unreachable to use undef instead of poison (NFC)	2022-06-10 21:54:09 +01:00
Nikita Popov	45226d04f0	[InstCombine] Reuse icmp of and/or folds for logical and/or Similarly to a change recently done for fcmps, add a flag that indicates whether the and/or is logical to foldAndOrOfICmps, and reuse the function when folding logical and/or. We were already calling some parts of it, but this gives us a clearer indication of which parts may need poison-safe variants, and would also allow to fold combinations of bitwise and logical and/or. This change should be close to NFC, because all folds this enables were either already called previously, or can make use of implied poison reasoning.	2022-05-23 15:37:07 +02:00
Chenbing Zheng	ffaaf2498b	[InstCombine] (rot X, ?) == 0/-1 --> X == 0/-1 In this patch we add a function foldICmpInstWithConstantAllowUndef to fold integer comparisons with a constant operand: icmp Pred X, C where X is some kind of instruction and C is AllowUndef. We move this fold to the new function, so that it can solve undef elts in a vector. Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D125220	2022-05-19 11:22:26 +08:00
Chenbing Zheng	acbad5086a	[InstCombine] [NFC] separate a function foldICmpBinOpWithConstant There is a long function foldICmpInstWithConstant, we can separate a function foldICmpBinOpWithConstant from it. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D125457	2022-05-14 10:54:15 +08:00
Nikita Popov	6001bfcedc	[InstCombine] Freeze other uses of frozen value If there is a freeze %x, we currently replace all other uses of %x with freeze %x -- as long as they are dominated by the freeze instruction. This patch extends this behavior to cases where we did not originally dominate the use by moving the freeze instruction directly after the definition of the frozen value. The motivation can be seen in test @combine_and_after_freezing_uses: Canonicalizing everything to freeze %x allows folds that are based on value identity (i.e. same operand occurring in two places) to trigger. This also covers the case from D125248. Differential Revision: https://reviews.llvm.org/D125321	2022-05-11 16:47:12 +02:00
Nikita Popov	b457ac4240	[InstCombine] Extract icmp of select transform (NFC) To make it either to extend to the case where the other operand is not a constant.	2022-05-06 14:46:44 +02:00
Nikita Popov	982cbed819	[InstCombine] Fold logical and/or of range icmps with nowrap flags This is an edge-case where we don't convert to bitwise and/or based on implies poison reasoning, so explicitly try to perform the fold in logical form. The transform itself is poison-safe, as both icmps are based on the same value and any nowrap flags are discarded as part of the fold (https://alive2.llvm.org/ce/z/aCwC8b for the used example).	2022-04-29 14:42:42 +02:00
Sanjay Patel	903aa5e0f8	[InstCombine] try to fold icmp with mismatched extended operands If a value is known to be non-negative and zexted, that's the same thing as sexted. So for the purpose of looking past the casts with an icmp, treat it as if it was a sext: https://alive2.llvm.org/ce/z/_BDsGV This is necessary, but not enough to solve the motivating problem: https://github.com/llvm/llvm-project/issues/55013 Differential Revision: https://reviews.llvm.org/D124419	2022-04-26 14:26:36 -04:00
Nikita Popov	ba46ae7bd8	[InstCombine] Merge foldAndOfICmps() and foldOrOfICmps() (NFCI) Folds are supposed to always be added in conjugated pairs for and and or. Merge the two functions to make folds for which this is currently not the case more obvious.	2022-04-22 12:48:03 +02:00
Chenbing Zheng	467cbb6249	[InstCombine] fold more constant divisor to select-of-constants divisor By adding a parameter to function FoldOpIntoSelect， we can fold more Ops to Select. For this example, we tend to fold the division instruction, so we no longer care whether SelectInst is one use. This patch slove TODO left in InstCombine/div.ll. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D122967	2022-04-08 10:19:24 +08:00
Augie Fackler	f3c702fbd1	InstCombineCalls: fix annotateAnyAllocCallSite to report changes Spotted during review of D123052. Differential Revision: https://reviews.llvm.org/D123232	2022-04-07 13:49:09 -04:00
Craig Topper	ce78e68261	[InstCombine] Fold select based logic of fcmps with same operands when FMF is present. If we have a logical and/or in select form and the true/false operand is an fcmp with poison generating FMF, we won't be able to fold it to an and/or instruction. This prevents us from optimizing the case where it is a logical operation of two fcmps with identical operands. This patch adds explicit checks for this case that doesn't rely on converting to and/or to do the optimization. It reuses the existing foldLogicOfFCmps, but adds a new flag to disable the other combine that is inside that function. FMF flags from the two FCmps are intersected using the logic added in D121243. The FIXME has been updated to indicate that we can only use a union for the non-select form. This allows us to optimize cases like this from compare-fp-3.c in the gcc torture suite with fast math. void test1 (float x, float y) { if ((x==y) && (x!=y)) link_error0(); } Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D121323	2022-03-14 14:45:07 -07:00
Nikita Popov	9353ed6a53	[InstCombine] Don't call matchSAddSubSat() for SPF (NFC) Only call it for intrinsic min/max. The moved implementation is unchanged apart from the one-use check: It is now hardcoded to one-use, without the two-use special case for SPF.	2022-02-28 10:41:56 +01:00
Philip Reames	6f9d557e08	[instcombine] Cleanup foldAllocaCmp slightly [NFC]	2022-02-18 18:49:39 -08:00
Nikita Popov	e714b98fff	[InstCombine] Check type compatibility in indexed load fold This fold could use a rewrite to an offset-based implementation, but for now make sure it doesn't crash with opaque pointers.	2022-02-11 10:16:27 +01:00
Kazu Hirata	3a3cb929ab	[llvm] Use = default (NFC)	2022-02-06 22:18:35 -08:00
Nikita Popov	648faa3b5d	[InstCombine] Mark element type access as non-opaque (NFC) Also make the function static to make it more obvious that it is only used in the one place.	2022-01-27 11:40:29 +01:00

1 2 3 4 5 ...

333 Commits