llvm-project

Author	SHA1	Message	Date
XChy	313a33b9df	[InstCombine] Reduce nested logical operator if poison is implied (#86823 ) Fixes #76623 Alive2 proof: https://alive2.llvm.org/ce/z/gX6znJ (I'm not sure how to write a proof for such transform, maybe there are mistakes) In most cases, `icmp(a, C1) && (other_cond && icmp(a, C2))` will be reduced to `icmp(a, C1) & (other_cond && icmp(a, C2))`, since latter icmp always implies the poison of the former. After reduction, it's easier to simplify the icmp chain. Similarly, this patch does the same thing for `(A && B) && C --> A && (B & C)`. Maybe we could constraint such reduction only on icmps if there is regression in benchmarks.	2024-04-10 14:19:44 +08:00
hanbeom	44c79da3ae	[InstCombine] Remove shl if we only demand known signbits of shift source (#79014 ) This patch resolve TODO written in commit: `5909c67883` Proof: https://alive2.llvm.org/ce/z/C3VNoR	2024-04-10 11:19:09 +09:00
Noah Goldstein	71ef04d7cd	[InstCombine] fold `(icmp eq/ne (or disjoint x, C0), C1)` -> `(icmp eq/ne x, C0^C1)` Proof: https://alive2.llvm.org/ce/z/m3xoo_ Closes #87734	2024-04-09 15:38:18 -05:00
Noah Goldstein	7599d478ef	[InstCombine] Fold `(icmp eq/ne (add nuw x, y), 0)` -> `(icmp eq/ne (or x, y), 0)` `(icmp eq/ne (or x, y), 0)` is probably easier to analyze than `(icmp eq/ne x, -y)` Proof: https://alive2.llvm.org/ce/z/2-VTb6 Closes #88088	2024-04-09 13:56:28 -05:00
hanbeom	4ef22fce82	[InstCombine] Simplify select if it combinated and/or/xor (#73362 ) `and/or/xor` operations can each be changed to sum of logical operations including operators other than themselves. `x&y -> (x\|y) ^ (x^y)` `x\|y -> (x&y) \| (x^y)` `x^y -> (x\|y) ^ (x&y)` if left of condition of `SelectInst` is `and/or/xor` logical operation and right is equal to `0, -1`, or a `constant`, and if `TrueVal` consist of `and/or/xor` logical operation then we can optimize this case. This patch implements this combination. Proof: https://alive2.llvm.org/ce/z/WW8iRR Fixes https://github.com/llvm/llvm-project/issues/71792.	2024-04-03 14:29:10 +08:00
Monad	56b3222b79	[InstCombine] Remove the canonicalization of `trunc` to `i1` (#84628 ) Remove the canonicalization of `trunc` to `i1` according to the suggestion of https://github.com/llvm/llvm-project/pull/83829#issuecomment-1986801166 `a84e66a92d/llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp (L737-L745)` Alive2: https://alive2.llvm.org/ce/z/cacYVA	2024-03-29 21:47:35 +08:00
zhongyunde 00443407	bd9bb31bce	[InstCombine] add restrict reassoc for the powi(X,Y) / X add restrict reassoc for the powi(X,Y) / X according the discuss on PR69998.	2024-03-27 16:47:03 +08:00
zhongyunde 00443407	2938f1cff9	[InstCombine] Refactor powi(X,Y) / X to call foldPowiReassoc, NFC	2024-03-27 16:47:03 +08:00
Yingwei Zheng	caa2258250	[LLVM] Remove nuw neg (#86295 ) This patch removes APIs that creating NUW neg. It is a trivial case because `sub nuw 0, X` always gets simplified into zero. I believe there is no optimization opportunities in the real-world applications that we can take advantage of the nuw flag. Motivated by https://github.com/llvm/llvm-project/pull/84792#discussion_r1524891134. Compile-time improvement: https://llvm-compile-time-tracker.com/compare.php?from=d1f182c895728d89c5c3d198b133e212a5d9d4a3&to=da7b7478b7cbb32c09d760f6b8d0e67901e0d533&stat=instructions:u	2024-03-26 20:56:16 +08:00
Marc Auberer	b3fe27f2be	[InstCombine] Copy flags of extractelement for extelt -> icmp combine (#86366 ) Fixes #86164	2024-03-24 16:14:56 +01:00
Michele Scandale	536cb1fad3	[InstCombine] Fix for folding select-like `shufflevector` into floating point binary operators. (#85452 ) Folding a select-like `shufflevector` into a floating point binary operators can only be done if the result is preserved for both case. In particular, if the common operand of the `shufflevector` and the floating point binary operator can be a NaN, then the transformation won't preserve the result value.	2024-03-21 12:40:18 -07:00
Noah Goldstein	b3ee127e7d	[InstCombine] integrate `N{U,S}WAddLike` into existing folds Just went a quick replacement of `N{U,S}WAdd` with the `Like` variant that old matches `or disjoint` Closes #86082	2024-03-21 13:03:38 -05:00
Yingwei Zheng	2bfa7d0e16	[InstCombine] Fold `fmul X, -0.0` into `copysign(0.0, -X)` (#85772 ) `fneg + copysign` is better than fmul for analysis/codegen. godbolt: https://godbolt.org/z/eEs6dGd1G Alive2: https://alive2.llvm.org/ce/z/K3M5BA	2024-03-21 21:48:10 +08:00
Noah Goldstein	6960ace534	Revert "[InstCombine] Canonicalize `(sitofp x)` -> `(uitofp x)` if `x >= 0`" This reverts commit d80d5b923c6f611590a12543bdb33e0c16044d44. It wasn't a particularly important transform to begin with and caused some codegen regressions on targets that prefer `sitofp` so dropping. Might re-visit along with adding `nneg` flag to `uitofp` so its easily reversable for the backend.	2024-03-20 00:50:45 -05:00
Noah Goldstein	b60cf84e09	[InstCombine] Add more cases for simplifying `(icmp (and/or x, Mask), y)` This cleans up basically all the regressions assosiated from #84688 Proof of all new cases: https://alive2.llvm.org/ce/z/5yYWLb Closes #85445	2024-03-19 17:17:35 -05:00
Stephen Tozer	ffd08c7759	[RemoveDIs][NFC] Rename DPValue -> DbgVariableRecord (#85216 ) This is the major rename patch that prior patches have built towards. The DPValue class is being renamed to DbgVariableRecord, which reflects the updated terminology for the "final" implementation of the RemoveDI feature. This is a pure string substitution + clang-format patch. The only manual component of this patch was determining where to perform these string substitutions: `DPValue` and `DPV` are almost exclusively used for DbgRecords, except for: - llvm/lib/target, where 'DP' is used to mean double-precision, and so appears as part of .td files and in variable names. NB: There is a single existing use of `DPValue` here that refers to debug info, which I've manually updated. - llvm/tools/gold, where 'LDPV' is used as a prefix for symbol visibility enums. Outside of these places, I've applied several basic string substitutions, with the intent that they only affect DbgRecord-related identifiers; I've checked them as I went through to verify this, with reasonable confidence that there are no unintended changes that slipped through the cracks. The substitutions applied are all case-sensitive, and are applied in the order shown: ``` DPValue -> DbgVariableRecord DPVal -> DbgVarRec DPV -> DVR ``` Following the previous rename patches, it should be the case that there are no instances of any of these strings that are meant to refer to the general case of DbgRecords, or anything other than the DPValue class. The idea behind this patch is therefore that pure string substitution is correct in all cases as long as these assumptions hold.	2024-03-19 20:07:07 +00:00
Yingwei Zheng	0b59af4d86	[InstCombine] Clear sign-bit of the constant magnitude in copysign (#85787 ) Alive2: https://alive2.llvm.org/ce/z/vFykcZ Address the comment https://github.com/llvm/llvm-project/pull/85772#discussion_r1530179048. Unfortunately, non-splat vector constants are not supported because we haven't implemented constant folding of fabs with vector operands.	2024-03-20 03:28:19 +08:00
Michele Scandale	09eb9f1136	[InstCombine] Fix for folding `select` into floating point binary operators. (#83200 ) Folding a `select` into a floating point binary operators can only be done if the result is preserved for both case. In particular, if the other operand of the `select` can be a NaN, then the transformation won't preserve the result value.	2024-03-19 09:47:07 -07:00
Yingwei Zheng	a747e86caa	[InstCombine] Fold `fpto{s\|u}i non-norm` to zero (#85569 ) This patch enables more optimization after canonicalizing `fmul X, 0.0` into a copysign. I decide to implement this fold in InstCombine because `computeKnownFPClass` may be expensive. Alive2: https://alive2.llvm.org/ce/z/ASM8tQ	2024-03-19 17:16:48 +08:00
Yingwei Zheng	252d01952c	[InstCombine] Drop UB-implying attrs/metadata after speculating an instruction (#85542 ) When speculating an instruction in `InstCombinerImpl::FoldOpIntoSelect`, the call may result in undefined behavior. This patch drops all UB-implying attrs/metadata to fix this. Fixes #85536.	2024-03-17 14:15:27 +08:00
Yingwei Zheng	cf5cd98e74	[InstCombine] Support and/or in `getFreelyInvertedImpl` using DeMorgan's Law (#85193 ) This patch adds the support for and/or in `getFreelyInvertedImpl` using DeMorgan's Law: ``` (~(A \| B)) -> (~A & ~B) (~(A & B)) -> (~A \| ~B) ``` Alive2: https://alive2.llvm.org/ce/z/Uig8-j	2024-03-15 19:10:02 +08:00
Artem Tyurin	141145232f	[IRBuilder] Fold binary intrinsics (#80743 ) Fixes https://github.com/llvm/llvm-project/issues/61240.	2024-03-15 09:58:25 +01:00
SahilPatidar	e61e26091c	[InstCombine] Fold `mul (sext bool X), Y` into `select X, -Y, 0` (#84792 ) Alive2: https://alive2.llvm.org/ce/z/n_ns-W Resolve #84608	2024-03-15 16:08:46 +08:00
Noah Goldstein	70d0ebb279	[InstCombine] Fix behavior for `(fmul (sitfp x), 0)` Bug was introduced in #82555 We where missing check that the constant was non-zero for signed + mul transform. Closes #85298	2024-03-14 17:41:25 -05:00
zhongyunde 00443407	2d6988a45e	[InstCombine] Add restrict reassoc for the operands of fmul According the discussion, except the fmul itself, all its operands should also have reassoc flag. Add new API m_AllowReassoc to check reassoc flag	2024-03-14 22:05:21 +08:00
zhongyunde 00443407	1752b9e4c7	[InstCombine] create a helper function foldPowiReassoc, NFC	2024-03-14 22:05:20 +08:00
zhongyunde 00443407	098520244f	[InstCombine] optimize powi(X,Y) * X with Ofast Try to transform the powi(X, Y) * X into powi(X, Y+1) with Ofast For this case, when the Y is 3, then powi(X, 4) is replaced by X2 = X * X; X2 * X2 in the further step. Similar to D109954, who requires reassoc. Fixes https://github.com/llvm/llvm-project/issues/69862.	2024-03-14 22:05:20 +08:00
Stephen Tozer	2e865353ed	[RemoveDIs][NFC] Move DPValue::filter -> filterDbgVars (#85208 ) This patch changes DPValue::filter to be a non-member method filterDbgVars. There are two reasons for this: firstly, the name of DPValue is about to change to DbgVariableRecord, which will result in every `for` loop that uses DPValue::filter to require a line break. This is a small thing, but it makes the rename patch more difficult to review, and is just generally more awkward for what is a fairly common loop. Secondly, the intent is to later break up the DPValue class into subclasses, at which point it would be better to have a non-member function that allows template arguments for the cases we want to filter with greater specificity.	2024-03-14 12:19:15 +00:00
Yingwei Zheng	fef62be09c	[InstCombine] Canonicalize `extractvalue + select` (#84686 ) This patch canonicalizes `extractvalue (select Cond, TV, FV)` into `select Cond, (extractvalue TV), (extractvalue FV)`. The latter form may enable more optimizations.	2024-03-14 14:06:40 +08:00
Noah Goldstein	d80d5b923c	[InstCombine] Canonicalize `(sitofp x)` -> `(uitofp x)` if `x >= 0` Just a standard canonicalization. Proofs: https://alive2.llvm.org/ce/z/9W4VFm Closes #82404	2024-03-13 18:26:21 -05:00
Nikita Popov	628a79dad3	[InstCombine] Don't generate crash dialog for fixpoint verification failure (NFC) Fixpoint verification failures outside our tests are usually not indicative of a bug -- don't be pushy about having people report them.	2024-03-13 16:11:11 +01:00
Yingwei Zheng	83d178843f	[InstCombine] Set zero_is_poison for ctlz/cttz if they are only used as shift amounts (#85035 ) Alive2: https://alive2.llvm.org/ce/z/r-67t9 It would improve the codegen if the target doesn't provide a defined value for ctlz/cttz with zero.	2024-03-13 21:52:40 +08:00
Yingwei Zheng	c18e1215c4	[InstCombine] Simplify `zext nneg i1 X` to zero (#85043 ) Alive2: https://alive2.llvm.org/ce/z/Wm6kCk	2024-03-13 20:15:29 +08:00
Noah Goldstein	5ca325e49c	[InstCombine] Detect `(x ^ -x)` as a ~Mask Proof: https://alive2.llvm.org/ce/z/TAFmPw This is a lemma for clearing up some of the regressions that #84688 causes. Closes #84868	2024-03-12 13:26:18 -05:00
Stephen Tozer	15f3f446c5	[RemoveDIs][NFC] Rename common interface functions for DPValues->DbgRecords (#84793 ) As part of the effort to rename the DbgRecord classes, this patch renames the widely-used functions that operate on DbgRecords but refer to DbgValues or DPValues in their names to refer to DbgRecords instead; all such functions are defined in one of `BasicBlock.h`, `Instruction.h`, and `DebugProgramInstruction.h`. This patch explicitly does not change the names of any comments or variables, except for where they use the exact name of one of the renamed functions. The reason for this is reviewability; this patch can be trivially examined to determine that the only changes are direct string substitutions and any results from clang-format responding to the changed line lengths. Future patches will cover renaming variables and comments, and then renaming the classes themselves.	2024-03-12 14:53:13 +00:00
elhewaty	3f302eaca4	[InstCombine] Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2, A) or 0 (#82280 ) - Fixes: https://github.com/llvm/llvm-project/issues/82177 - Alive2: https://alive2.llvm.org/ce/z/Q7mMC3	2024-03-11 15:10:40 +01:00
Noah Goldstein	60dda1fc6e	[InstCombine] fold `(icmp eq/ne (and (shl -1, X), Y), 0)` -> `(icmp eq/ne (lshr Y, X), 0)` Proofs: https://alive2.llvm.org/ce/z/oSRGBt Closes #84691	2024-03-10 18:16:00 -05:00
Noah Goldstein	193b3d6733	[InstCombine] Recognize `(icmp eq/ne (and X, ~Mask), 0)` pattern in `foldICmpWithLowBitMaskedVal` `(icmp eq/ne (and X, ~Mask), 0)` is equivilent to `(icmp eq/ne (and X, Mask), X` and we sometimes generate the former pattern intentionally to reduce number of uses of `X`. Proof: https://alive2.llvm.org/ce/z/3u-usC Differential Revision: https://reviews.llvm.org/D159329 Closes #81562	2024-03-10 14:33:34 -05:00
Noah Goldstein	d77eb9ea59	[InstCombine] Improve mask detection in `foldICmpWithLowBitMaskedVal` Make recursive matcher that is able to detect a lot more patterns. Proofs for all supported patterns: https://alive2.llvm.org/ce/z/fSQ3nZ Differential Revision: https://reviews.llvm.org/D159058	2024-03-10 14:33:34 -05:00
Noah Goldstein	f89e4e339f	[InstCombine] Move `foldICmpWithLowBitMaskedVal` to `foldICmpCommutative`; NFC	2024-03-10 14:33:34 -05:00
Zain Jaffal	f5811494b0	check if operand is div in fold FDivSqrtDivisor (#81970 ) This patch fixes the issues introduced in `bb5c3899d1`. I moved the check for the instruction to be div before I check for the fast math flags which resolves the crash in ``` float a, b; double sqrt(); void c() { b = a / sqrt(a); } ``` --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2024-03-09 17:15:14 +00:00
Noah Goldstein	8d976c7f20	[InstCombine] Make `(binop ({s\|u}itofp),({s\|u}itofp))` transform more flexible to mismatched signs Instead of taking the sign of the cast operation as the required since for the transform, only force a sign if an operation is maybe negative. This gives us more flexability when checking if the floats are safely converable to integers. Closes #84389	2024-03-09 11:06:02 -06:00
Jon Chesterfield	6157538d9e	[InstCombine] ptrmask of gep for dynamic pointer aligment (#80002 ) Targets the dynamic realignment pattern of `(Ptr + Align - 1) & -Align;` as implemented by gep then ptrmask. Specifically, when the pointer already has alignment information, dynamically realigning it to less than is already known should be a no-op. Discovered while writing test cases for another patch. For the zero low bits of a known aligned pointer, adding the gep index then removing it with a mask is a no-op. Folding the ptrmask effect entirely into the gep is the ideal result as that unblocks other optimisations that are not aware of ptrmask. In some other cases the gep is known to be dead and is removed without changing the ptrmask. In the least effective case, this transform creates a new gep with a rounded-down index and still leaves the ptrmask unchanged. That simplified gep is still a minor improvement, geps are cheap and ptrmask occurs in address calculation contexts so I don't think it's worth special casing to avoid the extra instruction.	2024-03-07 17:36:28 +00:00
Noah Goldstein	946ea4e3ca	[InstCombine] Add folds for `(fp_binop ({s\|u}itofp x), ({s\|u}itofp y))` The full fold is one of the following: 1) `(fp_binop ({s\|u}itofp x), ({s\|u}itofp y))` -> `({s\|u}itofp (int_binop x, y))` 2) `(fp_binop ({s\|u}itofp x), FpC)` -> `({s\|u}itofp (int_binop x, (fpto{s\|u}i FpC)))` And support the following binops: `fmul` -> `mul` `fadd` -> `add` `fsub` -> `sub` Proofs: https://alive2.llvm.org/ce/z/zuacA8 The proofs timeout, so they must be reproduced locally. Closes #82555	2024-03-06 13:28:04 -06:00
Noah Goldstein	0f5849eeee	[InstCombine] Move folding `(add (sitofp x), (sitofp y))` impl to InstructionCombiner; NFC	2024-03-06 13:28:04 -06:00
hanbeom	6cdf596c52	[InstCombine] If inst in unreachable refers to an inst change it to poison (#78444 ) Instructions in unreachable basic blocks are removed, but terminators are not. In this case, even instructions that are only referenced by a terminator, such as a return instruction, cannot be processed properly. This patch changes the operand of a return instruction in an unreachable basic block to poison if it refers to the instruction, allowing the instruction to be properly processed. Fixes #65107.	2024-03-06 09:42:33 +01:00
Nikita Popov	9f45c5e1a6	[InstCombine] Fix infinite loop in select equivalence fold (#84036 ) When replacing with a non-constant, it's possible that the result of the simplification is actually more complicated than the original, and may result in an infinite combine loop. Mitigate the issue by requiring that either the replacement or simplification result is constant, which should ensure that it's simpler. While this check is crude, it does not appear to cause optimization regressions in real-world code in practice. Fixes https://github.com/llvm/llvm-project/issues/83127.	2024-03-06 09:33:51 +01:00
Quentin Dian	e96c0c1d5e	[InstCombine] Fix shift calculation in InstCombineCasts (#84027 ) Fixes #84025.	2024-03-06 06:16:28 +08:00
Noah Goldstein	61c06775c9	[KnownBits] Add API for `nuw` flag in `computeForAddSub`; NFC	2024-03-05 12:59:58 -06:00
Jeremy Morse	2fe81edef6	[NFC][RemoveDIs] Insert instruction using iterators in Transforms/ As part of the RemoveDIs project we need LLVM to insert instructions using iterators wherever possible, so that the iterators can carry a bit of debug-info. This commit implements some of that by updating the contents of llvm/lib/Transforms/Utils to always use iterator-versions of instruction constructors. There are two general flavours of update: * Almost all call-sites just call getIterator on an instruction * Several make use of an existing iterator (scenarios where the code is actually significant for debug-info) The underlying logic is that any call to getFirstInsertionPt or similar APIs that identify the start of a block need to have that iterator passed directly to the insertion function, without being converted to a bare Instruction pointer along the way. Noteworthy changes: * FindInsertedValue now takes an optional iterator rather than an instruction pointer, as we need to always insert with iterators, * I've added a few iterator-taking versions of some value-tracking and DomTree methods -- they just unwrap the iterator. These are purely convenience methods to avoid extra syntax in some passes. * A few calls to getNextNode become std::next instead (to keep in the theme of using iterators for positions), * SeparateConstOffsetFromGEP has it's insertion-position field changed. Noteworthy because it's not a purely localised spelling change. All this should be NFC.	2024-03-05 15:12:22 +00:00

1 2 3 4 5 ...

6055 Commits