llvm-project

Author	SHA1	Message	Date
Noah Goldstein	37ca6fa1e2	[ValueTracking] Add support for overflow detection functions is `isKnownNonZero` Adds support for: `{s,u}{add,sub,mul}.with.overflow` The logic is identical to the the non-overflow binops, we where just missing the cases. Closes #87701	2024-04-10 10:40:48 -05:00
Noah Goldstein	a02b3c0182	[ValueTracking] Add tests for overflow detection functions is `isKnownNonZero`; NFC	2024-04-10 10:40:48 -05:00
Noah Goldstein	41c52217b0	[ValueTracking] Add support for `vector_reduce_{s,u}{min,max}` in `computeKnownBits` Previously missing. We compute by just applying the reduce function on the knownbits of each element. Closes #88169	2024-04-10 10:40:48 -05:00
Noah Goldstein	77d668451a	[ValueTracking] Add support for `vector_reduce_{s,u}{min,max}` in `isKnownNonZero` Previously missing, proofs for all implementations: https://alive2.llvm.org/ce/z/G8wpmG	2024-04-10 10:40:48 -05:00
Noah Goldstein	f9f4aba547	[InstCombine] Add tests for non-zero/knownbits of `vector_reduce_{s,u}{min,max}`; NFC	2024-04-10 10:40:48 -05:00
XChy	313a33b9df	[InstCombine] Reduce nested logical operator if poison is implied (#86823 ) Fixes #76623 Alive2 proof: https://alive2.llvm.org/ce/z/gX6znJ (I'm not sure how to write a proof for such transform, maybe there are mistakes) In most cases, `icmp(a, C1) && (other_cond && icmp(a, C2))` will be reduced to `icmp(a, C1) & (other_cond && icmp(a, C2))`, since latter icmp always implies the poison of the former. After reduction, it's easier to simplify the icmp chain. Similarly, this patch does the same thing for `(A && B) && C --> A && (B & C)`. Maybe we could constraint such reduction only on icmps if there is regression in benchmarks.	2024-04-10 14:19:44 +08:00
hanbeom	44c79da3ae	[InstCombine] Remove shl if we only demand known signbits of shift source (#79014 ) This patch resolve TODO written in commit: `5909c67883` Proof: https://alive2.llvm.org/ce/z/C3VNoR	2024-04-10 11:19:09 +09:00
Noah Goldstein	9170e38575	Add support for `nneg` flag with `uitofp` As noted when #82404 was pushed (canonicalizing `sitofp` -> `uitofp`), different signedness on fp casts can have dramatic performance implications on different backends. So, it makes to create a reliable means for the backend to pick its cast signedness if either are correct. Further, this allows us to start canonicalizing `sitofp`- > `uitofp` which may easy middle end analysis. Closes #86141	2024-04-09 18:12:33 -05:00
Noah Goldstein	71ef04d7cd	[InstCombine] fold `(icmp eq/ne (or disjoint x, C0), C1)` -> `(icmp eq/ne x, C0^C1)` Proof: https://alive2.llvm.org/ce/z/m3xoo_ Closes #87734	2024-04-09 15:38:18 -05:00
Noah Goldstein	5b58eb68ed	[InstCombine] Add tests for folding `(icmp eq/ne (or disjoint x, C0), C1)`; NFC	2024-04-09 15:38:18 -05:00
Noah Goldstein	7599d478ef	[InstCombine] Fold `(icmp eq/ne (add nuw x, y), 0)` -> `(icmp eq/ne (or x, y), 0)` `(icmp eq/ne (or x, y), 0)` is probably easier to analyze than `(icmp eq/ne x, -y)` Proof: https://alive2.llvm.org/ce/z/2-VTb6 Closes #88088	2024-04-09 13:56:28 -05:00
Noah Goldstein	759bab0681	[InstCombine] Add tests for folding `(icmp eq/ne (add nuw x, y), 0)`; NFC	2024-04-09 13:56:28 -05:00
Noah Goldstein	964df099e1	[ValueTracking] Support non-constant idx for `computeKnownBits` of `insertelement` Its same logic as before, we just need to intersect what we know about the new Elt and the entire pre-existing Vec. Closes #87707	2024-04-09 01:01:41 -05:00
Noah Goldstein	3a2367561d	[ValueTracking] Add tests for non-constant idx in `computeKnownBits` of `insertelement`; NFC	2024-04-09 01:01:41 -05:00
Noah Goldstein	678f32ab66	[ValueTracking] Add more conditions in to `isTruePredicate` There is one notable "regression". This patch replaces the bespoke `or disjoint` logic we a direct match. This means we fail some simplification during `instsimplify`. All the cases we fail in `instsimplify` we do handle in `instcombine` as we add `disjoint` flags. Other than that, just some basic cases. See proofs: https://alive2.llvm.org/ce/z/_-g7C8 Closes #86083	2024-04-04 12:42:58 -05:00
Noah Goldstein	74447cf46f	[ValueTracking] Add tests for deducing more conditions in `isTruePredicate`; NFC	2024-04-04 12:42:58 -05:00
Noah Goldstein	05cff99a29	[ValueTracking] Infer known bits fromfrom `(icmp eq (and/or x,y), C)` In `(icmp eq (and x,y), C)` all 1s in `C` must also be set in both `x`/`y`. In `(icmp eq (or x,y), C)` all 0s in `C` must also be set in both `x`/`y`. Closes #87143	2024-04-04 12:42:58 -05:00
Noah Goldstein	02b49d14a5	[ValueTracking] Add tests for computing known bits from `(icmp eq (and/or x,y), C)`; NFC	2024-04-04 12:42:58 -05:00
hanbeom	4ef22fce82	[InstCombine] Simplify select if it combinated and/or/xor (#73362 ) `and/or/xor` operations can each be changed to sum of logical operations including operators other than themselves. `x&y -> (x\|y) ^ (x^y)` `x\|y -> (x&y) \| (x^y)` `x^y -> (x\|y) ^ (x&y)` if left of condition of `SelectInst` is `and/or/xor` logical operation and right is equal to `0, -1`, or a `constant`, and if `TrueVal` consist of `and/or/xor` logical operation then we can optimize this case. This patch implements this combination. Proof: https://alive2.llvm.org/ce/z/WW8iRR Fixes https://github.com/llvm/llvm-project/issues/71792.	2024-04-03 14:29:10 +08:00
Vitaly Buka	c0cabfbdaf	[InstCombiner] Remove trivially dead `llvm.allow.{runtime,ubsan}.check()` (#84851 ) Intrinsic declared to have sideeffects, but it's done only to prevent moving it. Removing unused ones is OK. Exacted from #84850 for easier review.	2024-04-01 00:42:18 -07:00
Monad	56b3222b79	[InstCombine] Remove the canonicalization of `trunc` to `i1` (#84628 ) Remove the canonicalization of `trunc` to `i1` according to the suggestion of https://github.com/llvm/llvm-project/pull/83829#issuecomment-1986801166 `a84e66a92d/llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp (L737-L745)` Alive2: https://alive2.llvm.org/ce/z/cacYVA	2024-03-29 21:47:35 +08:00
elhewaty	7d3924cee3	[IR] Add nowrap flags for trunc instruction (#85592 ) This patch adds the nuw (no unsigned wrap) and nsw (no signed wrap) poison-generating flags to the trunc instruction. Discourse thread: https://discourse.llvm.org/t/rfc-add-nowrap-flags-to-trunc/77453	2024-03-29 14:08:49 +08:00
Noah Goldstein	637421cb88	[ValueTracking] Tracking `or disjoint` conditions as `add` in Assumption/DomCondition Cache We can definitionally treat `or disjoint` as `add` anywhere. Closes #86302	2024-03-28 13:49:05 -05:00
Noah Goldstein	cba6df99e0	[ValueTracking] Add basic tests tracking `or disjoint` conditions as `add`; NFC	2024-03-28 13:49:04 -05:00
Florian Hahn	b9cd48f96a	Revert "[TBAA] Add verifier for tbaa.struct metadata (#86709 )" This reverts commit df75183d70e029352a49c93f275db703c81a65c1. Revert for now as this appears to cause failures on some buildbots, e.g.: https://lab.llvm.org/buildbot/#/builders/93/builds/19428/steps/10/logs/stdio	2024-03-27 21:22:15 +00:00
Julian Nagele	df75183d70	[TBAA] Add verifier for tbaa.struct metadata (#86709 ) Adds logic to the IR verifier that checks whether !tbaa.struct nodes are well-formed. That is, it checks that the operands of !tbaa.struct nodes are in groups of three, that each group of three operands consists of two integers and a valid tbaa node, and that the regions described by the offset and size operands are non-overlapping. PR: https://github.com/llvm/llvm-project/pull/86709	2024-03-27 10:30:27 +01:00
zhongyunde 00443407	bd9bb31bce	[InstCombine] add restrict reassoc for the powi(X,Y) / X add restrict reassoc for the powi(X,Y) / X according the discuss on PR69998.	2024-03-27 16:47:03 +08:00
Marc Auberer	b3fe27f2be	[InstCombine] Copy flags of extractelement for extelt -> icmp combine (#86366 ) Fixes #86164	2024-03-24 16:14:56 +01:00
Michele Scandale	536cb1fad3	[InstCombine] Fix for folding select-like `shufflevector` into floating point binary operators. (#85452 ) Folding a select-like `shufflevector` into a floating point binary operators can only be done if the result is preserved for both case. In particular, if the common operand of the `shufflevector` and the floating point binary operator can be a NaN, then the transformation won't preserve the result value.	2024-03-21 12:40:18 -07:00
Noah Goldstein	b3ee127e7d	[InstCombine] integrate `N{U,S}WAddLike` into existing folds Just went a quick replacement of `N{U,S}WAdd` with the `Like` variant that old matches `or disjoint` Closes #86082	2024-03-21 13:03:38 -05:00
Noah Goldstein	ac13e5c0f6	[InstCombine] Add tests for integrating `N{U,S}WAddLike`; NFC	2024-03-21 13:03:38 -05:00
Yingwei Zheng	2bfa7d0e16	[InstCombine] Fold `fmul X, -0.0` into `copysign(0.0, -X)` (#85772 ) `fneg + copysign` is better than fmul for analysis/codegen. godbolt: https://godbolt.org/z/eEs6dGd1G Alive2: https://alive2.llvm.org/ce/z/K3M5BA	2024-03-21 21:48:10 +08:00
Nikita Popov	eefef900c6	Revert "Enable exp10 libcall on linux (#68736 )" This reverts commit 9848fa4aa2a83b0fc3a93e5f905ef09c8dd85226. Causes buildbot failures.	2024-03-20 11:08:52 +01:00
Nikita Popov	0f46e31cfb	[IR] Change representation of getelementptr inrange (#84341 ) As part of the migration to ptradd (https://discourse.llvm.org/t/rfc-replacing-getelementptr-with-ptradd/68699), we need to change the representation of the `inrange` attribute, which is used for vtable splitting. Currently, inrange is specified as follows: ``` getelementptr inbounds ({ [4 x ptr], [4 x ptr] }, ptr @vt, i64 0, inrange i32 1, i64 2) ``` The `inrange` is placed on a GEP index, and all accesses must be "in range" of that index. The new representation is as follows: ``` getelementptr inbounds inrange(-16, 16) ({ [4 x ptr], [4 x ptr] }, ptr @vt, i64 0, i32 1, i64 2) ``` This specifies which offsets are "in range" of the GEP result. The new representation will continue working when canonicalizing to ptradd representation: ``` getelementptr inbounds inrange(-16, 16) (i8, ptr @vt, i64 48) ``` The inrange offsets are relative to the return value of the GEP. An alternative design could make them relative to the source pointer instead. The result-relative format was chosen on the off-chance that we want to extend support to non-constant GEPs in the future, in which case this variant is more expressive. This implementation "upgrades" the old inrange representation in bitcode by simply dropping it. This is a very niche feature, and I don't think trying to upgrade it is worthwhile. Let me know if you disagree.	2024-03-20 10:59:45 +01:00
Yingwei Zheng	f0420c7bc6	[ValueTracking] Handle `not` in `isImpliedCondition` (#85397 ) This patch handles `not` in `isImpliedCondition` to enable more fold in some multi-use cases.	2024-03-20 16:16:42 +08:00
Krishna Narayanan	9848fa4aa2	Enable exp10 libcall on linux (#68736 )	2024-03-20 13:03:49 +05:30
Noah Goldstein	6960ace534	Revert "[InstCombine] Canonicalize `(sitofp x)` -> `(uitofp x)` if `x >= 0`" This reverts commit d80d5b923c6f611590a12543bdb33e0c16044d44. It wasn't a particularly important transform to begin with and caused some codegen regressions on targets that prefer `sitofp` so dropping. Might re-visit along with adding `nneg` flag to `uitofp` so its easily reversable for the backend.	2024-03-20 00:50:45 -05:00
Noah Goldstein	b60cf84e09	[InstCombine] Add more cases for simplifying `(icmp (and/or x, Mask), y)` This cleans up basically all the regressions assosiated from #84688 Proof of all new cases: https://alive2.llvm.org/ce/z/5yYWLb Closes #85445	2024-03-19 17:17:35 -05:00
Noah Goldstein	23047dfbf1	[InstCombine] Add more tests for simplifying `(icmp (and/or x, Mask), y)`; NFC	2024-03-19 17:17:35 -05:00
Yingwei Zheng	0b59af4d86	[InstCombine] Clear sign-bit of the constant magnitude in copysign (#85787 ) Alive2: https://alive2.llvm.org/ce/z/vFykcZ Address the comment https://github.com/llvm/llvm-project/pull/85772#discussion_r1530179048. Unfortunately, non-splat vector constants are not supported because we haven't implemented constant folding of fabs with vector operands.	2024-03-20 03:28:19 +08:00
Michele Scandale	09eb9f1136	[InstCombine] Fix for folding `select` into floating point binary operators. (#83200 ) Folding a `select` into a floating point binary operators can only be done if the result is preserved for both case. In particular, if the other operand of the `select` can be a NaN, then the transformation won't preserve the result value.	2024-03-19 09:47:07 -07:00
Yingwei Zheng	a747e86caa	[InstCombine] Fold `fpto{s\|u}i non-norm` to zero (#85569 ) This patch enables more optimization after canonicalizing `fmul X, 0.0` into a copysign. I decide to implement this fold in InstCombine because `computeKnownFPClass` may be expensive. Alive2: https://alive2.llvm.org/ce/z/ASM8tQ	2024-03-19 17:16:48 +08:00
zhongyunde 00443407	bd9a2afa80	[InstCombine] Add tests for selects with same conditions (NFC)	2024-03-18 17:51:13 +01:00
Noah Goldstein	01d8e1ca01	[ValueTracking] Handle non-canonical operand order in `isImpliedCondICmps` We don't always have canonical order here, so do it manually. Closes #85575	2024-03-17 17:46:06 -05:00
Noah Goldstein	31775e1894	[ValueTracking] Add tests for implied cond with swapped operands; NFC	2024-03-17 17:46:06 -05:00
Yingwei Zheng	252d01952c	[InstCombine] Drop UB-implying attrs/metadata after speculating an instruction (#85542 ) When speculating an instruction in `InstCombinerImpl::FoldOpIntoSelect`, the call may result in undefined behavior. This patch drops all UB-implying attrs/metadata to fix this. Fixes #85536.	2024-03-17 14:15:27 +08:00
Yingwei Zheng	cf5cd98e74	[InstCombine] Support and/or in `getFreelyInvertedImpl` using DeMorgan's Law (#85193 ) This patch adds the support for and/or in `getFreelyInvertedImpl` using DeMorgan's Law: ``` (~(A \| B)) -> (~A & ~B) (~(A & B)) -> (~A \| ~B) ``` Alive2: https://alive2.llvm.org/ce/z/Uig8-j	2024-03-15 19:10:02 +08:00
SahilPatidar	e61e26091c	[InstCombine] Fold `mul (sext bool X), Y` into `select X, -Y, 0` (#84792 ) Alive2: https://alive2.llvm.org/ce/z/n_ns-W Resolve #84608	2024-03-15 16:08:46 +08:00
Noah Goldstein	70d0ebb279	[InstCombine] Fix behavior for `(fmul (sitfp x), 0)` Bug was introduced in #82555 We where missing check that the constant was non-zero for signed + mul transform. Closes #85298	2024-03-14 17:41:25 -05:00
Noah Goldstein	3236d97499	[InstCombine] Add test for `(fmul (sitfp x), 0)`; NFC	2024-03-14 17:41:25 -05:00

1 2 3 4 5 ...

8548 Commits