llvm-project

Author	SHA1	Message	Date
Nikita Popov	dcdf44aca7	[InstCombine] Remove foldSelectICmpEq() fold (#122098 ) This fold matches complex patterns, for which we have no proof of real-world relevance, and which does not actually handle the originally motivating cases from https://github.com/llvm/llvm-project/issues/71792 either. In https://github.com/llvm/llvm-project/pull/121708 and https://github.com/llvm/llvm-project/pull/121753 we have handled some simpler variants by extending existing folds. I propose to remove this code until we have evidence that it is useful for something.	2025-01-09 12:33:01 +01:00
Yingwei Zheng	03e7862962	[ValueTracking] Move `getFlippedStrictnessPredicateAndConstant` into ValueTracking. NFC. (#122064 ) Needed by https://github.com/llvm/llvm-project/pull/121958.	2025-01-08 20:02:49 +08:00
Yingwei Zheng	231d113c7e	[InstCombine] Handle commuted patterns in `foldSelectWithSRem` (#121896 ) Closes https://github.com/llvm/llvm-project/issues/121771.	2025-01-07 17:09:58 +08:00
Yingwei Zheng	a77346bad0	[IRBuilder] Refactor FMF interface (#121657 ) Up to now, the only way to set specified FMF flags in IRBuilder is to use `FastMathFlagGuard`. It makes the code ugly and hard to maintain. This patch introduces a helper class `FMFSource` to replace the original parameter `Instruction *FMFSource` in IRBuilder. To maximize the compatibility, it accepts an instruction or a specified FMF. This patch also removes the use of `FastMathFlagGuard` in some simple cases. Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=f87a9db8322643ccbc324e317a75b55903129b55&to=9397e712f6010be15ccf62f12740e9b4a67de2f4&stat=instructions%3Au	2025-01-06 14:37:04 +08:00
Yingwei Zheng	a37dbc1f51	[InstCombine] Drop noundef in `foldSelectCttzCtlz` (#121692 ) Close https://github.com/llvm/llvm-project/issues/121428	2025-01-06 00:04:28 +08:00
Rajat Bajpai	76a4c4593b	[InstCombine] Fix constant swap case of fcmp + fadd + sel xfrm (#119419 ) The fcmp + fadd + sel => fcmp + sel + fadd xfrm performs incorrect transformation when select branch values are swapped. This change fixes this.	2025-01-02 19:03:10 +08:00
Veera	6f8afafd30	[InstCombine] Fold `A == MIN_INT ? B != MIN_INT : A < B` to `A < B` (#120177 ) This PR folds: `A == MIN_INT ? B != MIN_INT : A < B` to `A < B` `A == MAX_INT ? B != MAX_INT : A > B` to `A > B` Proof: https://alive2.llvm.org/ce/z/bR6E2s This helps in optimizing comparison of optional unsigned non-zero types in https://github.com/rust-lang/rust/issues/49892. Rust compiler's current output: https://rust.godbolt.org/z/9fxfq3Gn8	2024-12-19 22:52:55 +08:00
Yingwei Zheng	7d25bcef09	[InstCombine] Recursively replace condition with constant in select arms (#120011 ) This patch is proposed to reduce the number of selects with undefs introduced by https://github.com/llvm/llvm-project/pull/119884.	2024-12-16 21:11:59 +08:00
Ramkumar Ramachandra	4a0d53a0b0	PatternMatch: migrate to CmpPredicate (#118534 ) With the introduction of CmpPredicate in 51a895a (IR: introduce struct with CmpInst::Predicate and samesign), PatternMatch is one of the first key pieces of infrastructure that must be updated to match a CmpInst respecting samesign information. Implement this change to Cmp-matchers. This is a preparatory step in migrating the codebase over to CmpPredicate. Since no functional changes are desired at this stage, we have chosen not to migrate CmpPredicate::operator==(CmpPredicate) calls to use CmpPredicate::getMatching(), as that would have visible impact on tests that are not yet written: instead, we call CmpPredicate::operator==(Predicate), preserving the old behavior, while also inserting a few FIXME comments for follow-ups.	2024-12-13 14:18:33 +00:00
Rajat Bajpai	de415fbb45	[InstCombine][FP] Fix nnan preservation for transform fcmp + sel => fmax/fmin (#117977 ) Preserve `nnan` constraint only if present on both `fcmp` and `select`. Alive2: https://alive2.llvm.org/ce/z/ZNDjzt	2024-12-03 14:01:36 +08:00
Nikita Popov	7bbc049688	[InstCombine] Consolidate another fold into select value equivalence (#117746 ) We had a separate fold that handled just the trivial case where we're replacing exactly the argument of the select. Handle this in select value equivalence by relaxing the infinite loop protection to allow a replacement of a non-constant with a constant. This also fixes https://github.com/llvm/llvm-project/issues/113301, as the separate fold did not handle undef values correctly.	2024-12-02 09:45:39 +01:00
Veera	979a0356d4	[InstCombine] Fold `X Pred C2 ? X BOp C1 : C2 BOp C1` to `min/max(X, C2) BOp C1` (#116888 ) Fixes #82414. General Proof: https://alive2.llvm.org/ce/z/ERjNs4 Proof for Tests: https://alive2.llvm.org/ce/z/K-934G This PR transforms `select` instructions of the form `select (Cmp X C1) (BOp X C2) C3` to `BOp (min/max X C1) C2` iff `C3 == BOp C1 C2`. This helps in eliminating a noop loop in https://github.com/rust-lang/rust/issues/123845 but does not improve optimizations.	2024-12-02 09:33:45 +01:00
Yingwei Zheng	a6fefc8245	[InstCombine] Convert logical and/or with `icmp samesign` into bitwise ops (#116983 ) See the following case: ``` define i1 @test_logical_and_icmp_samesign(i8 %x) { %cmp1 = icmp ne i8 %x, 9 %cmp2 = icmp samesign ult i8 %x, 11 %and = select i1 %cmp1, i1 %cmp2, i1 false ret i1 %and } ``` Currently we cannot convert this logical and into a bitwise and due to the `samesign` flag. But if `%cmp2` evaluates to `poison`, we can infer that `%cmp1` is either `poison` or `true` (`samesign` violation indicates that X is negative). Therefore, `%and` still evaluates to `poison`. This patch converts a logical and into a bitwise and iff TV is poison implies that Cond is either poison or true. Likewise, we convert a logical or into a bitwise or iff FV is poison implies that Cond is either poison or false. Note: 1. This logic is implemented in InstCombine. Not sure whether it is profitable to move it into ValueTracking and call `impliesPoison(TV/FV, Sel)` instead. 2. We only handle the case that `ValAssumedPoison` is `icmp samesign pred X, C1` and `V` is `icmp pred X, C2`. There are no suitable variants for `isImpliedCondition` to pass the fact that X is [non-]negative. Alive2: https://alive2.llvm.org/ce/z/eorFfa Motivation: fix [a major regression](https://github.com/dtcxzyw/llvm-opt-benchmark/pull/1724#discussion_r1849663863) to unblock https://github.com/llvm/llvm-project/pull/112742.	2024-11-21 15:33:18 +08:00
Yingwei Zheng	2c094ac761	[InstCombine] Drop range attributes in `foldBitCeil` (#116641 ) Closes https://github.com/llvm/llvm-project/issues/112076	2024-11-20 21:15:26 +08:00
Ramkumar Ramachandra	9568f88b7f	InstCombine: support floating-point equivalences (#114975 ) Since cd16b07 (IR: introduce CmpInst::isEquivalence), there is now an isEquivalence routine in CmpInst that we can use to determine equivalence in foldSelectValueEquivalence. Implement this, extending it to include floating-point equivalences as well.	2024-11-20 09:44:14 +00:00
Rajat Bajpai	ef2d6dafc4	[InstCombine] Transform (fcmp + fadd + sel) into (fcmp + sel + fadd) (#106492 ) Transform `fcmp + fadd + sel` into `fcmp + sel + fadd` which enables the possibility of transforming `fcmp + sel` into `maxnum/minnum` intrinsics. Alive2 results: https://alive2.llvm.org/ce/z/2cmimW https://alive2.llvm.org/ce/z/Qh9ZJt https://alive2.llvm.org/ce/z/vtLj3R	2024-11-11 12:11:43 -08:00
Yingwei Zheng	e577f14b67	[InstCombine] Use `m_NotForbidPoison` when folding `(X u< Y) ? -1 : (~X + Y) --> uadd.sat(~X, Y)` (#114345 ) Alive2: https://alive2.llvm.org/ce/z/mTGCo- We cannot reuse `~X` if `m_AllOnes` matches a vector constant with some poison elts. An alternative solution is to create a new not instead of reusing `~X`. But it isn't worth the effort because we need to add a one-use check. Fixes https://github.com/llvm/llvm-project/issues/113869.	2024-11-01 22:18:44 +08:00
Yingwei Zheng	96b14f2ccb	[Reland][InstCombine] Fix FMF propagation in `foldSelectIntoOp` (#114499 ) Relands #114356. Compared to the last version, this patch only merges poison-generating/nsz flags from the select to fix LV regression in `llvm/test/Transforms/PhaseOrdering/AArch64/predicated-reduction.ll`.	2024-11-01 12:22:57 +08:00
gulfemsavrun	d183dc7c24	Revert "[InstCombine] Fix FMF propagation in `foldSelectIntoOp`" (#114458 ) Reverts llvm/llvm-project#114356 because it caused test failures. https://lab.llvm.org/buildbot/#/builders/190/builds/8601 https://luci-milo.appspot.com/ui/p/fuchsia/builders/toolchain.ci/clang-base-linux-x64/b8732549597609293617/overview	2024-10-31 13:21:52 -07:00
Yingwei Zheng	cf1963afad	[InstCombine] Fix FMF propagation in `foldSelectIntoOp` (#114356 ) Closes https://github.com/llvm/llvm-project/issues/113423.	2024-10-31 23:26:45 +08:00
Nikita Popov	0f7d148db4	[InstCombine] Add shared helper for logical and bitwise and/or (NFC) Add a helper for shared folds between logical and bitwise and/or and move the and/or of icmp and fcmp folds in there. This makes it easier to extend to more folds. A possible extension would be to base the current and/or of icmp reassociation logic on this helper, so that it for example also applies to fcmp.	2024-10-17 14:25:44 +02:00
Ramkumar Ramachandra	682fa797b7	InstCombine/Select: remove redundant code (NFC) (#112388 ) InstCombinerImpl::foldSelectInstWithICmp has some inlined code for select-icmp-xor simplification, but this simplification is already done by other code, via another path: (X & Y) == 0 ? X : X ^ Y -> ((X & Y) == 0 ? 0 : Y) ^ X -> (X & Y) ^ X -> X & ~Y Cover the cases that it claims to simplify, and demonstrate that stripping it doesn't cause test changes.	2024-10-16 12:44:09 +01:00
Yingwei Zheng	0936195311	[InstCombine] Drop `samesign` in InstCombine (#112480 ) Closes https://github.com/llvm/llvm-project/issues/112476.	2024-10-16 19:13:52 +08:00
Alexey Bader	583fa4f5b7	[InstCombine] Extend fcmp+select folding to minnum/maxnum intrinsics (#112088 ) Today, InstCombine can fold fcmp+select patterns to minnum/maxnum intrinsics when the nnan and nsz flags are set. The ordering of the operands in both the fcmp and select instructions is important for the folding to occur. maxnum patterns: 1. (a op b) ? a : b -> maxnum(a, b), where op is one of {ogt, oge} 2. (a op b) ? b : a -> maxnum(a, b), where op is one of {ule, ult} The second pattern is supposed to make the order of the operands in the select instruction irrelevant. However, the pattern matching code uses the CmpInst::getInversePredicate method to invert the comparison predicate. This method doesn't take into account the fast-math flags, which can lead to missing the folding opportunity. The patch extends the pattern matching code to handle unordered fcmp instructions. This allows the folding to occur even when the select instruction has the operands in the inverse order. New maxnum patterns: 1. (a op b) ? a : b -> maxnum(a, b), where op is one of {ugt, uge} 2. (a op b) ? b : a -> maxnum(a, b), where op is one of {ole, olt} The same changes are applied to the minnum intrinsic.	2024-10-15 22:05:16 +04:00
Ramkumar Ramachandra	1c6c850937	InstCombine: extend select-equiv to support vectors (#111966 ) foldSelectEquivalence currently doesn't support GVN-like replacements on vector types. Put in the checks for potentially lane-crossing operations, and lift the limitation.	2024-10-15 11:10:45 +01:00
Rahul Joshi	fa789dffb1	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752 ) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).	2024-10-11 05:26:03 -07:00
Marina Taylor	d0d12fc78a	[InstCombine] Fold (X==Z) ? (Y==Z) : (!(Y==Z) && X==Y) --> X==Y (#108619 ) This corresponds to the canonicalized form of some logic that was seen in Swift-generated code for comparing optional pointers: `(X==Z || Y==Z) ? (X==Z && Y==Z) : X==Y --> X==Y` where `Z` was the constant `0`. https://alive2.llvm.org/ce/z/J_3aa9	2024-10-03 15:33:30 +01:00
Nikita Popov	b8d1bae648	[CmpInstAnalysis] Return decomposed bit test as struct (NFC) (#109819 ) decomposeBitTestICmp() currently returns the result via two out parameters plus an in-place modification of Pred. This changes it to return an optional struct instead. The motivation here is twofold. First, I'd like to extend this code to handle cases where the comparison is against a value other than zero, which would mean yet another out parameter. Second, while doing that I was badly bitten by the in-place modification, so I'd like to get rid of it.	2024-09-25 10:14:15 +02:00
Volodymyr Vasylkun	d4798498c4	[InstCombine] Fold `(x == y) ? 0 : (x > y ? 1 : -1)` into `ucmp/scmp(x,y)` (#107314 ) This also handles commuted cases of the same fold, with either the condition or the true/false values of the inner select being swapped.	2024-09-23 15:41:22 +01:00
Nikita Popov	3bc38fb27a	[InstCombine] Generalize and consolidate phi translation check (#106051 ) The foldOpIntoPhi() transform requires all operands to be phi-translatable. This can be the case either because they are phi nodes in the same block, or because the operand dominates the block. Currently, most callers of foldOpIntoPhi() satisfy this pre-condition by requiring a constant operand, which trivially dominates everything. Only selects had handling for variable operands. Move this logic into foldOpIntoPhi(), so things are handled correctly if other callers are generalized. Also make the implementation a bit more general by querying the dominator tree.	2024-09-04 16:22:43 +02:00
c8ef	43c6fb29a6	[InstCombine] Update the `select` operand when the `cond` is `trunc` and has the `nuw` or `nsw` property. (#105914 ) This patch updates the select operand when the cond has the nuw or nsw property. Considering the semantics of the nuw and nsw flag, if there is no poison value in this expression, this code assumes that X can only be 0, 1 or -1. close: #96765 alive2: https://alive2.llvm.org/ce/z/3n3n2Q	2024-08-24 19:56:59 +08:00
Volodymyr Vasylkun	da6f423251	[InstCombine] Fold `(x < y) ? -1 : zext(x > y)` and `(x > y) ? 1 : sext(x < y)` to `ucmp/scmp(x, y)` (#105272 ) This patch expands already existing functionality to include these two additional folds, which are nearly identical to the ones already implemented. Proofs: https://alive2.llvm.org/ce/z/Xy7s4j	2024-08-23 22:31:03 +01:00
Volodymyr Vasylkun	abf69a167b	[InstCombine] Fold `(x < y) ? -1 : zext(x != y)` into `u/scmp(x,y)` (#101049 ) This patch adds the aforementioned fold to InstCombine. This pattern is produced after naive implementations of 3-way comparison in high-level languages are transformed into LLVM IR and then optimized. Proofs: https://alive2.llvm.org/ce/z/w4QLq_	2024-08-19 13:02:29 +01:00
Nikita Popov	5d28678277	[InstCombine] Fix incorrect zero ext in select of lshr/ashr fold The -1 constant should be sign extended, not zero extended. Split out from https://github.com/llvm/llvm-project/pull/80309.	2024-08-16 15:02:16 +02:00
Bjorn Pettersson	145aff6d92	Clean up pointer casts etc after opaque pointers transition. NFC (#102631 )	2024-08-12 13:28:53 +02:00
Rose	ede49fa5f3	[InstCombine] Canonicalize more saturated-add variants (#100008 ) LLVM is not evaluating X u > C, a, b the same way it evaluates X <= C, b, a. To fix this, let's move the folds to after the canonicalization of -1 to TrueVal. Let's allow splat vectors with poison elements to be recognized too! Finally, for completion, handle the one case that isn't caught by the above checks because it is canonicalized to eq: X == -1 ? -1 : X + 1 -> uadd.sat(X, 1) Alive2 Proof: https://alive2.llvm.org/ce/z/WEcgYH	2024-08-07 14:16:50 -07:00
Rose Silicon	278c0ad99e	[InstCombine] (NFC) Remove improper TODO for a - UMIN (#101076 ) It is already handled in a different method, especially as a - UMIN(a, b) cannot be handled by a select statement, unless it means something like: "(c < b) ? b - ((b > c) ? c : b) : 0;" but LLVM handles that case as well.	2024-08-06 13:39:03 -07:00
Yingwei Zheng	6def5170e8	[InstCombine] Fold `(X & Mask) == 0 ? TC : FC -> TC binop (X & Mask)` (#100437 ) Alive2: https://alive2.llvm.org/ce/z/d9wV7N	2024-08-06 22:04:24 +08:00
Yingwei Zheng	62e9f40949	[PatternMatch] Use `m_SpecificCmp` matchers. NFC. (#100878 ) Compile-time improvement: http://llvm-compile-time-tracker.com/compare.php?from=13996378d81c8fa9a364aeaafd7382abbc1db83a&to=861ffa4ec5f7bde5a194a7715593a1b5359eb581&stat=instructions:u baseline: 803eaf29267c6aae9162d1a83a4a2ae508b440d3 ``` Top 5 improvements: stockfish/movegen.ll 2541620819 2538599412 -0.12% minetest/profiler.cpp.ll 431724935 431246500 -0.11% abc/luckySwap.c.ll 581173720 580581935 -0.10% abc/kitTruth.c.ll 2521936288 2519445570 -0.10% abc/extraUtilTruth.c.ll 1216674614 1215495502 -0.10% Top 5 regressions: openssl/libcrypto-shlib-sm4.ll 1155054721 1155943201 +0.08% openssl/libcrypto-lib-sm4.ll 1155054838 1155943063 +0.08% spike/vsm4r_vv.ll 1296430080 1297039258 +0.05% spike/vsm4r_vs.ll 1312496906 1313093460 +0.05% nuttx/lib_rand48.c.ll 126201233 126246692 +0.04% Overall: -0.02112308% ```	2024-07-29 10:04:06 +08:00
Yingwei Zheng	248fcab2fc	[InstCombine] Do not use operand info in `replaceInInstruction` (#99492 ) Consider the following case: ``` %cmp = icmp eq ptr %p, null %load = load i32, ptr %p, align 4 %sel = select i1 %cmp, i32 %load, i32 0 ``` `foldSelectValueEquivalence` converts `load i32, ptr %p, align 4` into `load i32, ptr null, align 4`, which causes immediate UB. `%load` is speculatable, but it doesn't hold after operand substitution. This patch introduces a new helper `isSafeToSpeculativelyExecuteWithVariableReplaced`. It ignores operand info in these instructions since their operands will be replaced later. Fixes #99436. --------- Co-authored-by: Nikita Popov <github@npopov.com>	2024-07-22 11:59:54 +08:00
Tim Gymnich	c034c44362	[InstCombine] Fold select of symmetric selects (#99245 ) fixes #98800 Fold patterns like: select c2 (select c1 a b) (select c1 b a) into: select (xor c1 c2) b a Alive2 proofs: https://alive2.llvm.org/ce/z/4QAm4K https://alive2.llvm.org/ce/z/vTVRnC	2024-07-17 22:57:42 +08:00
Nikita Popov	3969d2c3b5	[InstCombine] Disable select known bits fold for vectors This is not safe if the simplification ends up looking through lane-crossing operations. For now, we don't have a good way to limit this in computeKnownBits(), so just disable vector handling entirely. Fixes https://github.com/llvm/llvm-project/issues/97475.	2024-07-03 09:56:48 +02:00
Alex MacLean	8361d9065e	[InstCombine] disable select folding resulting in extra instructions (#97184 ) Disable conversion of a `(select (icmp))` when it would result in more instructions `(xor (lshr (and)))`. This transformation produces more instructions and can interfere with other more profitable folds for `select`. For example before this change the following folding would occur: ```llvm %1 = icmp slt i32 %X, 0 %2 = select i1 %1, i64 0, i64 8 ``` to ```llvm %1 = lshr i32 %X, 28 %2 = and i32 %1, 8 %3 = xor i32 %2, 8 %4 = zext nneg i32 %3 to i64 ```	2024-07-01 08:10:56 -07:00
Nikita Popov	77eb056830	[InstCombine] Simplify select using KnownBits of condition (#95923 ) Simplify the arms of a select based on the KnownBits implied by its condition. For now this only handles the case where the select arm folds to a constant, but this can be generalized to handle other patterns by using SimplifyDemandedBits instead (in that case we would also have to limit to non-undef conditions). This is implemented by adding a new member to SimplifyQuery that can be used to inject an additional condition. The affected values are pre-computed and we don't call computeKnownBits() if the select arms don't contain affected values. This reduces the cost in some pathological cases.	2024-07-01 09:26:01 +02:00
Nikita Popov	2d209d964a	[IR] Add getDataLayout() helpers to BasicBlock and Instruction (#96902 ) This is a helper to avoid writing `getModule()->getDataLayout()`. I regularly try to use this method only to remember it doesn't exist... `getModule()->getDataLayout()` is also a common (the most common?) reason why code has to include the Module.h header.	2024-06-27 16:38:15 +02:00
Stephen Tozer	d75f9dd1d2	Revert "[IR][NFC] Update IRBuilder to use InsertPosition (#96497 )" Reverts the above commit, as it updates a common header function and did not update all callsites: https://lab.llvm.org/buildbot/#/builders/29/builds/382 This reverts commit 6481dc57612671ebe77fe9c34214fba94e1b3b27.	2024-06-24 18:00:22 +01:00
Stephen Tozer	6481dc5761	[IR][NFC] Update IRBuilder to use InsertPosition (#96497 ) Uses the new InsertPosition class (added in #94226) to simplify some of the IRBuilder interface, and removes the need to pass a BasicBlock alongside a BasicBlock::iterator, using the fact that we can now get the parent basic block from the iterator even if it points to the sentinel. This patch removes the BasicBlock argument from each constructor or call to setInsertPoint. This has no functional effect, but later on as we look to remove the `Instruction *InsertBefore` argument from instruction-creation (discussed [here](https://discourse.llvm.org/t/psa-instruction-constructors-changing-to-iterator-only-insertion/77845)), this will simplify the process by allowing us to deprecate the InsertPosition constructor directly and catch all the cases where we use instructions rather than iterators.	2024-06-24 17:27:43 +01:00
Noah Goldstein	b37a4b9991	[InstCombine] Improve coverage of `foldSelectValueEquivalence` for non-constants If f(Y) simplifies to Y, replace with Y. This requires Y to be non-undef. Closes #94719	2024-06-23 11:15:47 +08:00
Nikita Popov	9e6625d6a2	[InstCombine] Preserve all gep flags in another select of gep fold	2024-06-19 12:18:01 +02:00
Nikita Popov	4c8ce5d301	[InstCombine] Preserve all flags in select of gep fold Preserve the flag intersection.	2024-06-19 12:01:48 +02:00

587 Commits