llvm-project

Author	SHA1	Message	Date
Ryan Buchner	1f1fd07c32	[InstCombine] Optimize (select %x, op(%x), 0) to op(%x) for operations where op(0) == 0 (#147605 ) Currently this optimization only occurs for `mul`, but this generalizes that for any operation that has a fixed point of `0`. There is similar logic within `EarlyCSE` pass, but that is stricter in terms of `poison` propagation so will not optimize for many operations. Alive2 Proofs: `and`: https://alive2.llvm.org/ce/z/RraasX ; base-case https://alive2.llvm.org/ce/z/gzfFTX ; commuted-case https://alive2.llvm.org/ce/z/63XaoX ; compare against undef https://alive2.llvm.org/ce/z/MVRVNd ; select undef https://alive2.llvm.org/ce/z/2bsoYG ; vector https://alive2.llvm.org/ce/z/xByeX- ; vector compare against undef https://alive2.llvm.org/ce/z/zNdzmZ ; vector select undef `fshl`: https://alive2.llvm.org/ce/z/U3_PG3 ; base-case https://alive2.llvm.org/ce/z/BWCnxT ; compare against undef https://alive2.llvm.org/ce/z/8HGAE_ ; select undef ; vector times out `fshr`: https://alive2.llvm.org/ce/z/o6F47G ; base-case https://alive2.llvm.org/ce/z/fVnBXy ; compare against undef https://alive2.llvm.org/ce/z/suymYJ ; select undef ; vector times out `umin`: https://alive2.llvm.org/ce/z/GGMqf6 ; base-case https://alive2.llvm.org/ce/z/6cx5-k ; commuted-case https://alive2.llvm.org/ce/z/W5d9tz ; compare against undef https://alive2.llvm.org/ce/z/nKbaUn ; select undef https://alive2.llvm.org/ce/z/gxEGqc ; vector https://alive2.llvm.org/ce/z/_SDpi_ ; vector compare against undef `sdiv`: https://alive2.llvm.org/ce/z/5XGs3q `srem`: https://alive2.llvm.org/ce/z/vXAnQM `udiv`: https://alive2.llvm.org/ce/z/e6_8Ug `urem`: https://alive2.llvm.org/ce/z/VmM2SL `shl`: https://alive2.llvm.org/ce/z/aCZr3u ; Argument with range https://alive2.llvm.org/ce/z/YgDy8C ; Instruction with known bits https://alive2.llvm.org/ce/z/6pIxR6 ; Constant `lshr`: https://alive2.llvm.org/ce/z/WCCBej `ashr: https://alive2.llvm.org/ce/z/egV4TR --------- Co-authored-by: Ryan Buchner <rbuchner@ventanamicro.com> Co-authored-by: Yingwei Zheng <dtcxzyw@qq.com>	2025-07-16 19:42:41 -07:00
Alex MacLean	59388fb0b9	[InstCombine] Preserve NSW/NUW flags when folding const BOp with min/max (#143471 ) When folding `X Pred C2 ? X BOp C1 : C2 BOp C1` to `min/max(X, C2) BOp C1`, if NUW/NSW flags are present on `X BOp C1` and could be safely applied to `C2 BOp C1`, then they may be added on the BOp after the fold is complete. https://alive2.llvm.org/ce/z/n_3aNJ Preserving these flags can allow subsequent transforms to re-order the min/max and BOp, which in the case of NVPTX would allow for some potential future transformations which would improve instruction-selection.	2025-06-13 11:16:44 -07:00
Yash Solanki	a361a3dc7a	[llvm][InstCombine] Fold select to cmp for weak and inverted inequalities (#143445 )	2025-06-13 21:53:34 +08:00
Alex MacLean	09029045a8	[InstCombine] Fold max/min when incrementing/decrementing by 1 (#142466 ) Add the following folds for integer min max folding in InstCombine: - (X > Y) ? X : (Y - 1) ==> MIN(X, Y - 1) - (X < Y) ? X : (Y + 1) ==> MAX(X, Y + 1) These are safe when overflow corresponding to the sign of the comparison is poison. (proof https://alive2.llvm.org/ce/z/oj5iiI). The most common of these patterns is likely the minimum case which occurs in some internal library code when clamping an integer index to a range (The maximum cases are included for completeness). Here is a simplified example: int clampToWidth(int idx, int width) { if (idx >= width) return width - 1; return idx; } https://cuda.godbolt.org/z/nhPzWrc3W	2025-06-10 07:55:56 -07:00
Alex MacLean	107601ed06	[InstCombine] Allow min/max in constant BOp min/max folding (#142878 ) Extend folding for `X Pred C2 ? X BOp C1 : C2 BOp C1` to `min/max(X, C2) BOp C1` to allow min and max as `BOp`. This ensures a constant clamping pattern is folded into a pair of min/max instructions. Here is a simplified example of a case where this folding is not occurring currently. int clampToU8(int v) { if (v < 0) return 0; if (v > 255) return 255; return v; } https://godbolt.org/z/78jhKPWbv Generic proof: https://alive2.llvm.org/ce/z/cdpLYy	2025-06-06 12:44:04 -07:00
Yingwei Zheng	5e2dcfe42c	[InstCombine] Avoid infinite loop in `foldSelectValueEquivalence` (#142754 ) Before this patch, InstCombine hung because it replaced a value with a more complex one: ``` %sel = select i1 %cmp, i32 %smax, i32 0 -> %sel = select i1 %cmp, i32 %masked, i32 0 -> %sel = select i1 %cmp, i32 %smax, i32 0 -> ... ``` This patch makes this replacement more conservative. It only performs the replacement iff the new value is one of the operands of the original value. Closes https://github.com/llvm/llvm-project/issues/142405.	2025-06-04 19:42:56 +08:00
Ramkumar Ramachandra	b40e4ceaa6	[ValueTracking] Make Depth last default arg (NFC) (#142384 ) Having a finite Depth (or recursion limit) for computeKnownBits is very limiting, but is currently a load-bearing necessity, as all KnownBits are recomputed on each call and there is no caching. As a prerequisite for an effort to remove the recursion limit altogether, either using a clever caching technique, or writing a easily-invalidable KnownBits analysis, make the Depth argument in APIs in ValueTracking uniformly the last argument with a default value. This would aid in removing the argument when the time comes, as many callers that currently pass 0 explicitly are now updated to omit the argument altogether.	2025-06-03 17:12:24 +01:00
Yingwei Zheng	3ec0c5c7fe	[InstCombine] Propagate FMF from select instead of fcmp (#141010 ) Previously, `3d6b53980c` propagates FMF from fcmp to avoid performance regressions. With the help of https://github.com/llvm/llvm-project/pull/139861, https://github.com/llvm/llvm-project/pull/141015, and https://github.com/llvm/llvm-project/pull/141914, we can still convert SPF into fabs/minnum/maxnum intrinsics even if some flags are missing. This patch propagates FMF from select to address the long-standing issue. Closes https://github.com/llvm/llvm-project/issues/140994.	2025-05-31 16:25:10 +08:00
Yingwei Zheng	87fd352d91	[InstCombine] Use `canIgnoreSignBitOfZero` in `spf->minmax` fold (#141914 ) Alive2: https://alive2.llvm.org/ce/z/dCZBB_ Fix remaining regressions caused by https://github.com/llvm/llvm-project/pull/141010.	2025-05-30 14:18:05 +08:00
Yingwei Zheng	6c86b7d7d8	[ValueTracking][InstCombine] Generalize ignoreSignBitOfZero/NaN to handle more cases (#141015 ) This patch was originally part of https://github.com/llvm/llvm-project/pull/139861. It generalizes `ignoreSignBitOfZero/NaN` to handle more instructions/intrinsics. BTW, I find it mitigates performance regressions caused by https://github.com/llvm/llvm-project/pull/141010 (IR diff https://github.com/dtcxzyw/llvm-opt-benchmark/pull/2365/files). We don't need to propagate FMF from fcmp into select, since we can infer demanded properties from the user of select.	2025-05-28 19:17:51 +08:00
Yingwei Zheng	d13947bd6c	[InstCombine] Enable more fabs fold when the user ignores sign bit of zero/NaN (#139861 ) When the only user of select is a fcmp or a fp operation with nnan/nsz, the sign bit of zero/NaN can be ignored. Alive2: https://alive2.llvm.org/ce/z/ZcxeIv Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=7add1bcd02b1f72d580bb2e64a1fe4a8bdc085d9&to=cb419c7cbddce778673f3d4b414ed9b8064b8d6e&stat=instructions:u Closes https://github.com/llvm/llvm-project/issues/133367.	2025-05-21 23:50:00 +08:00
Yingwei Zheng	a0c4876eed	[InstCombine] Fix ninf propagation for fcmp+sel -> minmax (#136433 ) Proof: https://alive2.llvm.org/ce/z/nCrvfr Closes https://github.com/llvm/llvm-project/issues/136430	2025-04-28 17:24:46 +08:00
Yingwei Zheng	3e1e4062e1	[InstCombine] Preserve signbit semantics of NaN with fold to fabs (#136648 ) As per the LangRef and IEEE 754-2008 standard, the sign bit of NaN is preserved if there is no floating-point operation being performed. See also `862e35e25a` for reference. Alive2: https://alive2.llvm.org/ce/z/QYtEGj Closes https://github.com/llvm/llvm-project/issues/136646	2025-04-26 14:03:12 +08:00
Andreas Jonson	2d1e64669e	[InstCombine] Reuse common code between foldSelectICmpAndBinOp and foldSelectICmpAnd. (#131902 ) The commit that was removed from https://github.com/llvm/llvm-project/pull/127905 due to the conflict with https://github.com/llvm/llvm-project/pull/128741. The use of common code results in that the foldSelectICmpAndBinOp also use knownbits in the same way as was added for foldSelectICmpAnd in https://github.com/llvm/llvm-project/pull/128741. proof for the use of knowbits in foldSelectICmpAndBinOp: https://alive2.llvm.org/ce/z/RYXr_k	2025-03-19 19:57:48 +01:00
Andreas Jonson	b326cb6792	[InstCombine] Support trunc to i1 in foldSelectICmpAnd (#127905 ) proof: https://alive2.llvm.org/ce/z/Ey6BoT	2025-03-18 18:41:34 +01:00
Julian Nagele	beb4a48297	[InstCombine] Use known bits to simplify mask in foldSelectICmpAnd (#128741 ) Make use of known bits when trying to decompose a select/icmp bittest and folding it into an and. This means we can fold when additional information, for instance via a range attribute or metadata, allows us to conclude that the resulting mask is in fact a power of two.	2025-03-14 16:34:04 +00:00
Yingwei Zheng	c5f40bf024	[InstCombine] Fold `X!=Y ? ctz(X^Y, true) : BW -> ctz(X^Y, false)` (#128483 ) Proof: https://alive2.llvm.org/ce/z/mzL6W2 Closes https://github.com/llvm/llvm-project/issues/128441.	2025-02-24 17:35:46 +08:00
Andreas Jonson	aa847ced07	[InstCombine] handle trunc to i1 in foldSelectICmpAndBinOp (#127390 ) for `trunc nuw` saves a instruction and otherwise only other instructions without the select, same behavior as for bit test before. proof: https://alive2.llvm.org/ce/z/a6QmyV	2025-02-19 18:29:47 +01:00
Andreas Jonson	8fc03e4ff1	[InstCombine] avoid extra instructions in foldSelectICmpAnd (#127398 ) Disable fold when it will result in more instructions.	2025-02-19 18:09:24 +01:00
Yingwei Zheng	b2659ca44b	[InstCombine] Propagate flags in `foldSelectICmpAndBinOp` (#127437 ) It is always safe to add poison-generating flags for `BinOp Y, Identity`. Proof: https://alive2.llvm.org/ce/z/8BLEpq and https://alive2.llvm.org/ce/z/584Bb4 Then we can propagate flags from one of the arms: ``` select Cond, Y, (BinOp flags Y, Z) -> select Cond, (BinOp flags Y, Identity), (BinOp flags Y, Z) -> BinOp flags Y, (select Cond, Identity, Z) ``` This patch is proposed to avoid information loss caused by https://github.com/llvm/llvm-project/pull/127390.	2025-02-19 09:22:15 +08:00
Yingwei Zheng	922ab6650d	[InstCombine] Drop nowrap flags in `foldBitCeil` (#125817 ) For convenience this patch drops nsw for `sub`. It also allows this fold with `ctlz_zero_undef`. Alive2: https://alive2.llvm.org/ce/z/VmvqSt	2025-02-05 16:49:39 +08:00
Yingwei Zheng	caeefe7b94	[InstCombine] Extend `foldSelectInstWithICmpConst` to handle minmax (#125346 ) This patch extends `f6bb156fb1` to handle minmax intrinsics. Motivating case: https://alive2.llvm.org/ce/z/JFKbYn Addresses a regression caused by https://github.com/llvm/llvm-project/pull/121958. It also works for `*.sat`. But no real-world benefit is observed.	2025-02-02 17:05:16 +08:00
Yingwei Zheng	3ec6a6b85a	[InstCombine] Fix FMF propagation in `foldSelectWithFCmpToFabs` (#121580 ) Consider the following pattern: ``` %cmp = fcmp <pred> double %x, 0.000000e+00 %negX = fneg <fmf> double %x %sel = select i1 %cmp, double %x, double %negX ``` We cannot propagate ninf from fneg to select since `%negX` may not be chosen. Similarly, we cannot propagate nnan unless `%negX` is guaranteed to be selected when `%x` is NaN. This patch also propagates nnan/ninf from fcmp to avoid regression in `PhaseOrdering/generate-fabs.ll`. Alive2: https://alive2.llvm.org/ce/z/t6U-tA Closes https://github.com/llvm/llvm-project/issues/121430 and https://github.com/llvm/llvm-project/issues/113989.	2025-02-01 15:14:17 +08:00
Ramkumar Ramachandra	d76ea250c8	Reland [InstCombine] Teach foldSelectOpOp about samesign (#124320 ) Changes: There was a serious bug in the previous patch, leading to a miscompile. See #122723 for the miscompile report from Alexander, and the follow-up investigation by Nikita. The patch has since been reworked, and now includes the testcase from the miscompile. Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of one of the FIXMEs it introduced by replacing a predicate comparison with CmpPredicate::getMatching. Co-authored-by: Nikita Popov <npopov@redhat.com>	2025-01-28 16:53:01 +00:00
Alexander Kornienko	788318484d	Revert "[InstCombine] Teach foldSelectOpOp about samesign" (#124123 ) Reverts llvm/llvm-project#122723 due to a miscompilation See https://github.com/llvm/llvm-project/pull/122723#issuecomment-2608777844 for details and the test case.	2025-01-24 01:08:10 +01:00
Ramkumar Ramachandra	48757e02ba	[InstCombine] Teach foldSelectOpOp about samesign (#122723 ) Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of one of the FIXMEs it introduced by replacing a predicate comparison with CmpPredicate::getMatching.	2025-01-14 19:58:17 +00:00
Nikita Popov	dcdf44aca7	[InstCombine] Remove foldSelectICmpEq() fold (#122098 ) This fold matches complex patterns, for which we have no proof of real-world relevance, and which does not actually handle the originally motivating cases from https://github.com/llvm/llvm-project/issues/71792 either. In https://github.com/llvm/llvm-project/pull/121708 and https://github.com/llvm/llvm-project/pull/121753 we have handled some simpler variants by extending existing folds. I propose to remove this code until we have evidence that it is useful for something.	2025-01-09 12:33:01 +01:00
Yingwei Zheng	03e7862962	[ValueTracking] Move `getFlippedStrictnessPredicateAndConstant` into ValueTracking. NFC. (#122064 ) Needed by https://github.com/llvm/llvm-project/pull/121958.	2025-01-08 20:02:49 +08:00
Yingwei Zheng	231d113c7e	[InstCombine] Handle commuted patterns in `foldSelectWithSRem` (#121896 ) Closes https://github.com/llvm/llvm-project/issues/121771.	2025-01-07 17:09:58 +08:00
Yingwei Zheng	a77346bad0	[IRBuilder] Refactor FMF interface (#121657 ) Up to now, the only way to set specified FMF flags in IRBuilder is to use `FastMathFlagGuard`. It makes the code ugly and hard to maintain. This patch introduces a helper class `FMFSource` to replace the original parameter `Instruction *FMFSource` in IRBuilder. To maximize the compatibility, it accepts an instruction or a specified FMF. This patch also removes the use of `FastMathFlagGuard` in some simple cases. Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=f87a9db8322643ccbc324e317a75b55903129b55&to=9397e712f6010be15ccf62f12740e9b4a67de2f4&stat=instructions%3Au	2025-01-06 14:37:04 +08:00
Yingwei Zheng	a37dbc1f51	[InstCombine] Drop noundef in `foldSelectCttzCtlz` (#121692 ) Close https://github.com/llvm/llvm-project/issues/121428	2025-01-06 00:04:28 +08:00
Rajat Bajpai	76a4c4593b	[InstCombine] Fix constant swap case of fcmp + fadd + sel xfrm (#119419 ) The fcmp + fadd + sel => fcmp + sel + fadd xfrm performs incorrect transformation when select branch values are swapped. This change fixes this.	2025-01-02 19:03:10 +08:00
Veera	6f8afafd30	[InstCombine] Fold `A == MIN_INT ? B != MIN_INT : A < B` to `A < B` (#120177 ) This PR folds: `A == MIN_INT ? B != MIN_INT : A < B` to `A < B` `A == MAX_INT ? B != MAX_INT : A > B` to `A > B` Proof: https://alive2.llvm.org/ce/z/bR6E2s This helps in optimizing comparison of optional unsigned non-zero types in https://github.com/rust-lang/rust/issues/49892. Rust compiler's current output: https://rust.godbolt.org/z/9fxfq3Gn8	2024-12-19 22:52:55 +08:00
Yingwei Zheng	7d25bcef09	[InstCombine] Recursively replace condition with constant in select arms (#120011 ) This patch is proposed to reduce the number of selects with undefs introduced by https://github.com/llvm/llvm-project/pull/119884.	2024-12-16 21:11:59 +08:00
Ramkumar Ramachandra	4a0d53a0b0	PatternMatch: migrate to CmpPredicate (#118534 ) With the introduction of CmpPredicate in 51a895a (IR: introduce struct with CmpInst::Predicate and samesign), PatternMatch is one of the first key pieces of infrastructure that must be updated to match a CmpInst respecting samesign information. Implement this change to Cmp-matchers. This is a preparatory step in migrating the codebase over to CmpPredicate. Since we no functional changes are desired at this stage, we have chosen not to migrate CmpPredicate::operator==(CmpPredicate) calls to use CmpPredicate::getMatching(), as that would have visible impact on tests that are not yet written: instead, we call CmpPredicate::operator==(Predicate), preserving the old behavior, while also inserting a few FIXME comments for follow-ups.	2024-12-13 14:18:33 +00:00
Rajat Bajpai	de415fbb45	[InstCombine][FP] Fix nnan preservation for transform fcmp + sel => fmax/fmin (#117977 ) Preserve `nnan` constraint only if present on both `fcmp` and `select`. Alive2: https://alive2.llvm.org/ce/z/ZNDjzt	2024-12-03 14:01:36 +08:00
Nikita Popov	7bbc049688	[InstCombine] Consolidate another fold into select value equivalence (#117746 ) We had a separate fold that handled just the trivial case where we're replacing exactly the argument of the select. Handle this in select value equivalence by relaxing the infinite loop protection to allow a replacement of a non-constant with a constant. This also fixes https://github.com/llvm/llvm-project/issues/113301, as the separate fold did not handle undef values correctly.	2024-12-02 09:45:39 +01:00
Veera	979a0356d4	[InstCombine] Fold `X Pred C2 ? X BOp C1 : C2 BOp C1` to `min/max(X, C2) BOp C1` (#116888 ) Fixes #82414. General Proof: https://alive2.llvm.org/ce/z/ERjNs4 Proof for Tests: https://alive2.llvm.org/ce/z/K-934G This PR transforms `select` instructions of the form `select (Cmp X C1) (BOp X C2) C3` to `BOp (min/max X C1) C2` iff `C3 == BOp C1 C2`. This helps in eliminating a noop loop in https://github.com/rust-lang/rust/issues/123845 but does not improve optimizations.	2024-12-02 09:33:45 +01:00
Yingwei Zheng	a6fefc8245	[InstCombine] Convert logical and/or with `icmp samesign` into bitwise ops (#116983 ) See the following case: ``` define i1 @test_logical_and_icmp_samesign(i8 %x) { %cmp1 = icmp ne i8 %x, 9 %cmp2 = icmp samesign ult i8 %x, 11 %and = select i1 %cmp1, i1 %cmp2, i1 false ret i1 %and } ``` Currently we cannot convert this logical and into a bitwise and due to the `samesign` flag. But if `%cmp2` evaluates to `poison`, we can infer that `%cmp1` is either `poison` or `true` (`samesign` violation indicates that X is negative). Therefore, `%and` still evaluates to `poison`. This patch converts a logical and into a bitwise and iff TV is poison implies that Cond is either poison or true. Likewise, we convert a logical or into a bitwise or iff FV is poison implies that Cond is either poison or false. Note: 1. This logic is implemented in InstCombine. Not sure whether it is profitable to move it into ValueTracking and call `impliesPoison(TV/FV, Sel)` instead. 2. We only handle the case that `ValAssumedPoison` is `icmp samesign pred X, C1` and `V` is `icmp pred X, C2`. There are no suitable variants for `isImpliedCondition` to pass the fact that X is [non-]negative. Alive2: https://alive2.llvm.org/ce/z/eorFfa Motivation: fix [a major regression](https://github.com/dtcxzyw/llvm-opt-benchmark/pull/1724#discussion_r1849663863) to unblock https://github.com/llvm/llvm-project/pull/112742.	2024-11-21 15:33:18 +08:00
Yingwei Zheng	2c094ac761	[InstCombine] Drop range attributes in `foldBitCeil` (#116641 ) Closes https://github.com/llvm/llvm-project/issues/112076	2024-11-20 21:15:26 +08:00
Ramkumar Ramachandra	9568f88b7f	InstCombine: support floating-point equivalences (#114975 ) Since cd16b07 (IR: introduce CmpInst::isEquivalence), there is now an isEquivalence routine in CmpInst that we can use to determine equivalence in foldSelectValueEquivalence. Implement this, extending it to include floating-point equivalences as well.	2024-11-20 09:44:14 +00:00
Rajat Bajpai	ef2d6dafc4	[InstCombine] Transform (fcmp + fadd + sel) into (fcmp + sel + fadd) (#106492 ) Transform `fcmp + fadd + sel` into `fcmp + sel + fadd` which enables the possibility of transforming `fcmp + sel` into `maxnum/minnum` intrinsics. Alive2 results: https://alive2.llvm.org/ce/z/2cmimW https://alive2.llvm.org/ce/z/Qh9ZJt https://alive2.llvm.org/ce/z/vtLj3R	2024-11-11 12:11:43 -08:00
Yingwei Zheng	e577f14b67	[InstCombine] Use `m_NotForbidPoison` when folding `(X u< Y) ? -1 : (~X + Y) --> uadd.sat(~X, Y)` (#114345 ) Alive2: https://alive2.llvm.org/ce/z/mTGCo- We cannot reuse `~X` if `m_AllOnes` matches a vector constant with some poison elts. An alternative solution is to create a new not instead of reusing `~X`. But it doesn't worth the effort because we need to add a one-use check. Fixes https://github.com/llvm/llvm-project/issues/113869.	2024-11-01 22:18:44 +08:00
Yingwei Zheng	96b14f2ccb	[Reland][InstCombine] Fix FMF propagation in `foldSelectIntoOp` (#114499 ) Relands #114356. Compared to the last version, this patch only merges poison-generating/nsz flags from the select to fix LV regression in `llvm/test/Transforms/PhaseOrdering/AArch64/predicated-reduction.ll`.	2024-11-01 12:22:57 +08:00
gulfemsavrun	d183dc7c24	Revert "[InstCombine] Fix FMF propagation in `foldSelectIntoOp`" (#114458 ) Reverts llvm/llvm-project#114356 because it caused test failures. https://lab.llvm.org/buildbot/#/builders/190/builds/8601 https://luci-milo.appspot.com/ui/p/fuchsia/builders/toolchain.ci/clang-base-linux-x64/b8732549597609293617/overview	2024-10-31 13:21:52 -07:00
Yingwei Zheng	cf1963afad	[InstCombine] Fix FMF propagation in `foldSelectIntoOp` (#114356 ) Closes https://github.com/llvm/llvm-project/issues/113423.	2024-10-31 23:26:45 +08:00
Nikita Popov	0f7d148db4	[InstCombine] Add shared helper for logical and bitwise and/or (NFC) Add a helper for shared folds between logical and bitwise and/or and move the and/or of icmp and fcmp folds in there. This makes it easier to extend to more folds. A possible extension would be to base the current and/or of icmp reassociation logic on this helper, so that it for example also applies to fcmp.	2024-10-17 14:25:44 +02:00
Ramkumar Ramachandra	682fa797b7	InstCombine/Select: remove redundant code (NFC) (#112388 ) InstCombinerImpl::foldSelectInstWithICmp has some inlined code for select-icmp-xor simplification, but this simplification is already done by other code, via another path: (X & Y) == 0 ? X : X ^ Y -> ((X & Y) == 0 ? 0 : Y) ^ X -> (X & Y) ^ X -> X & ~Y Cover the cases that it claims to simplify, and demonstrate that stripping it doesn't cause test changes.	2024-10-16 12:44:09 +01:00
Yingwei Zheng	0936195311	[InstCombine] Drop `samesign` in InstCombine (#112480 ) Closes https://github.com/llvm/llvm-project/issues/112476.	2024-10-16 19:13:52 +08:00
Alexey Bader	583fa4f5b7	[InstCombine] Extend fcmp+select folding to minnum/maxnum intrinsics (#112088 ) Today, InstCombine can fold fcmp+select patterns to minnum/maxnum intrinsics when the nnan and nsz flags are set. The ordering of the operands in both the fcmp and select instructions is important for the folding to occur. maxnum patterns: 1. (a op b) ? a : b -> maxnum(a, b), where op is one of {ogt, oge} 2. (a op b) ? b : a -> maxnum(a, b), where op is one of {ule, ult} The second pattern is supposed to make the order of the operands in the select instruction irrelevant. However, the pattern matching code uses the CmpInst::getInversePredicate method to invert the comparison predicate. This method doesn't take into account the fast-math flags, which can lead missing the folding opportunity. The patch extends the pattern matching code to handle unordered fcmp instructions. This allows the folding to occur even when the select instruction has the operands in the inverse order. New maxnum patterns: 1. (a op b) ? a : b -> maxnum(a, b), where op is one of {ugt, uge} 2. (a op b) ? b : a -> maxnum(a, b), where op is one of {ole, olt} The same changes are applied to the minnum intrinsic.	2024-10-15 22:05:16 +04:00

1 2 3 4 5 ...

613 Commits