llvm-project

Author	SHA1	Message	Date
Nikita Popov	fe4dbbb467	[DAGCombiner] Fold add (mul x, C), x to mul x, C+1 While this is normally non-canonical IR, this pattern can appear during SDAG lowering if the add is actually a getelementptr, as illustrated in `@test_ptr`. This pattern comes up when doing provenance-aware high-bit pointer tagging. Proof: https://alive2.llvm.org/ce/z/DLoEcs Fixes https://github.com/llvm/llvm-project/issues/62093. Differential Revision: https://reviews.llvm.org/D148341	2023-04-17 12:33:46 +02:00
Archibald Elliott	17cd511007	[DAGCombiner] Fix (shl (ctlz x) n) for non-power-of-two Data This DAGCombine is not valid for some combinations of the known bits of x and non-power-of-two widths of x. As shown in the bug: - The bitwidth of x is 35 (n=5) - The unknown bits of x is only the least significant bit - This gives the result of the ctlz two possible values: 34 or 35, both of which will give 1 when left-shifted 5 bits. - So the `eor x, 1` that this optimisation would give is not correct. A similar instcombine optimisation is only applied when the width of x is a power-of-two. GlobalISel does not have this bug, as shown by the testcase. Fixes #61549 Differential Revision: https://reviews.llvm.org/D147518	2023-04-12 17:38:39 +01:00
Amaury Séchet	91105df3df	[DAG] Peek through zext/trunc when matching (or (and X, (not Y)), Y). This shows up in the wild, notably as a regression in D127115 . Depends on D147821 Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D147827	2023-04-11 13:48:22 +00:00
Dinar Temirbulatov	7f05bdf4ee	[AArch64][SME] Fix an infinite loop in DAGCombine related to adding -force-streaming-compatible-sve flag. Compiler hits infinite loop in DAGCombine. For force-streaming-compatible-sve mode we have custom lowering for 128-bit vector splats and later in DAGCombiner::SimplifyVCastOp() we scalarized SPLAT because we have custom lowering for SME. Later, we restored SPLAT opertion via performMulCombine().	2023-04-05 10:10:55 +00:00
Jun Zhang	7657e50fef	[DAGCombiner] Fold avg(x, x) --> x Signed-off-by: Jun Zhang <jun@junz.org> Differential Revision: https://reviews.llvm.org/D147404	2023-04-03 16:57:50 +08:00
Nemanja Ivanovic	e7c35d7100	[SelectionDAG] Correctly reduce BV to shuffle with zero on big endian This DAG combine is correct on little endian targets but is incorrect on big endian targets. Add big endian code to correct it. Differential revision: https://reviews.llvm.org/D146460	2023-03-24 10:57:17 -04:00
Xiang1 Zhang	cc86e6b0a8	[BugFix] Fix VSELECT ISel fail Reviewed By: Luo yuanke Differential Revision: https://reviews.llvm.org/D146683	2023-03-24 13:13:35 +08:00
Kazu Hirata	7bb6d1b32e	[llvm] Skip getAPIntValue (NFC) ConstantSDNode provides some convenience functions like isZero, getZExtValue, and isMinSignedValue that are named identically to those provided by APInt, so we can "skip" getAPIntValue.	2023-03-22 22:10:25 -07:00
Simon Pilgrim	651b4054a0	[DAG] visitABS - use FoldConstantArithmetic to perform constant folding. Avoid needing to perform extra isConstantIntBuildVectorOrConstantInt checks	2023-03-21 17:23:27 +00:00
Simon Pilgrim	a6a788bdfb	[DAG] foldBinOpIntoSelect - use FoldConstantArithmetic instead of getNode() + constant check. This prevents unused nodes from being created if the constant check fails. Noticed while triaging D127115 regressions	2023-03-21 12:59:16 +00:00
Simon Pilgrim	2d4042f4b7	[DAG] visitTRUNCATE - use FoldConstantArithmetic to perform constant folding. Avoid needing to perform extra isConstantIntBuildVectorOrConstantInt checks	2023-03-20 11:14:14 +00:00
Simon Pilgrim	e9a86b7813	[DAG] foldBinOpIntoSelect - remove !CanFoldNonConst check. NFC. These checks are in an if-else chain where CanFoldNonConst is already guaranteed to be false.	2023-03-20 11:14:14 +00:00
Simon Pilgrim	28a0d0e85a	[DAG] Don't fold zext(logicalshift(zext(x),c)) -> logicalshift(zext(x),c) if the outer zext is free Avoid widening the shift to a bigger type if the zext would be free anyway Pulled out of D146121	2023-03-15 17:45:12 +00:00
Simon Pilgrim	c1f81e7604	[DAG] mergeStore - peek through truncates when finding dead store(trunc(load())) patterns Extend the existing store(load()) removal code to account for intermediate truncates that some targets won't remove with canCombineTruncStore - we only care about the load/store MemoryVT. Fixes regression from D146121	2023-03-15 11:54:13 +00:00
Simon Pilgrim	70562607ab	[DAG] Fold multiple insert_vector_elt of zero values into an AND mask This also allows us to make use of the existing isVectorClearMaskLegal shuffle canonicalization Differential Revision: https://reviews.llvm.org/D145939	2023-03-15 09:56:26 +00:00
Simon Pilgrim	da570ef1b4	[DAG] Match select(icmp(x,y),sub(x,y),sub(y,x)) -> abd(x,y) patterns Pulled out of PowerPC, and added ABDS support as well (hence the additional v4i32 PPC matches) Differential Revision: https://reviews.llvm.org/D144789	2023-03-14 15:10:30 +00:00
Simon Pilgrim	4bf004e07e	[DAG] Fold (bitcast (logicop (bitcast x), (c))) -> (logicop x, (bitcast c)) iff the current logicop type is illegal Try to remove extra bitcasts around logicops if we're dealing with illegal types Fixes the regressions in D145939 Differential Revision: https://reviews.llvm.org/D146032	2023-03-14 14:41:11 +00:00
Simon Pilgrim	c7d844ea0f	[DAG] Use ISD::isBitwiseLogicOp in AND/OR/XOR checks. NFCI. There's additional cases we can cleanup (mainly in target code), but this tries to cleanup generic code and PPC which had an equivalent helper.	2023-03-13 13:39:02 +00:00
Chen Zheng	4f0ed16a46	Reland rGf35a09daebd0a90daa536432e62a2476f708150d and rG63854f91d3ee1056796a5ef27753648396cac6ec [DAGCombiner] handle more store value forwarding When lowering calls on target like PPC, some stack loads will be generated for by value parameters. Node CALLSEQ_START prevents such loads from being combined. Suggested by @RolandF, this patch removes the unnecessary loads for the byval parameter by extending ForwardStoreValueToDirectLoad Reviewed By: nemanjai, RolandF Differential Revision: https://reviews.llvm.org/D138899	2023-03-12 21:59:18 -04:00
Jun Ma	00eef4f7c3	[SelectionDAG] Fix mismatched truncate when combine BUILD_VECTOR with EXTRACT_SUBVECTOR Just use correct type for truncation. Fixes PR59625 Differential Revision: https://reviews.llvm.org/D145757	2023-03-13 08:59:52 +08:00
Simon Pilgrim	82dc04befd	[DAG] visitZERO_EXTEND - pull out the repeated SDLoc(N) variables	2023-03-12 15:18:46 +00:00
Simon Pilgrim	4d7da0e711	[DAG] Cleanup the (zext (shl (zext x), cst)) -> (shl (zext x), cst) fold. NFC. Preliminary cleanup before adding some additional legality and value tracking handling.	2023-03-12 15:01:33 +00:00
Simon Pilgrim	b53ea2b9c5	[DAG] visitAND - fold (and (any_ext V), c) -> (zero_ext (and (trunc V), c)) if profitable. Try to more aggressively narrow masks of extended values. This is mainly for cases where the mask is trying to zero out any_extended upper bits, assuming we can zext/trunc the values for free. This catches a few actual missed folds, as well as helps canonicalize a number of other cases which were being caught in isel etc. Differential Revision: https://reviews.llvm.org/D145866	2023-03-12 13:25:23 +00:00
Simon Pilgrim	fad852efe4	[DAG] combineShiftAnd1ToBitTest - improve support for peeking through truncations Allows us to handle shift amounts that exceed the original bitwidth	2023-03-11 16:37:47 +00:00
Juneyoung Lee	a66bc1c4a3	[DAGCombiner] Avoid converting (x or/xor const) + y to (x + y) + const if benefit is unclear This patch resolves suboptimal code generation reported by https://github.com/llvm/llvm-project/issues/60571 . DAGCombiner currently converts `(x or/xor const) + y` to `(x + y) + const` if this is valid. However, if `.. + const` is broken down into a sequences of adds with carries, the benefit is not clear, introducing two more add(-with-carry) ops (total 6) in the case of the reported issue whereas the optimal sequence must only have 4 add(-with-carry)s. This patch resolves this issue by allowing this conversion only when (1) `.. + const` is legal or promotable, or (2) `const` is a sign bit because it does not introduce more adds. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D144116	2023-03-08 18:13:57 +00:00
Chen Zheng	fc26ab36a2	[DAGCombiner] don't use the pointer info for widen store The merged store touches memory for other underlying objects, so mapping the merged store to the first underlying object is not correct. For example in https://github.com/llvm/llvm-project/issues/60744, the merged store is not correctly analyzed as dependent with memory operations which are also part of the merged store. Fixes #60744 Reviewed By: foad Differential Revision: https://reviews.llvm.org/D144711	2023-03-07 20:31:09 -05:00
Noah Goldstein	c1ecd0a3f4	[DAGCombiner] Add fold for `~x + x` -> `-1` This is generally done by the InstCombine, but can be emitted as an intermediate step and is cheap to handle. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D145177	2023-03-06 20:30:27 -06:00
Noah Goldstein	d4b24b4a55	[DAGCombiner] Add fold for `~x & x` -> `0` This is generally done by the InstCombine, but can be emitted as an intermediate step and is cheap to handle. Differential Revision: https://reviews.llvm.org/D145143	2023-03-06 20:30:20 -06:00
Craig Topper	c546f13f1f	[DAGCombiner] Replace LegalOperations check in visitSIGN_EXTEND with LegalTypes. This is guarding a check for isTypeLegal so it should check is LegalTypes. Fixes PR61111. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D145139	2023-03-02 07:52:53 -08:00
Sander de Smalen	170e7a0ec2	[AArch64][SME2] Add CodeGen support for target("aarch64.svcount"). This patch adds AArch64 CodeGen support such that the type can be passed and returned to/from functions, and also adds support to use this type in load/store operations and PHI nodes. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D136862	2023-03-02 12:07:41 +00:00
David Green	337215ddf9	[DAG] ABD is not reassociative I'm not sure how I missed this in the testing, but as far as I understand whilst ABDS and ABDU are commutive they are not associative. This patch disables reassociateOps from visitABD, fixing the problems found in #61069. ABDU: https://alive2.llvm.org/ce/z/eiT5QG ABDS: https://alive2.llvm.org/ce/z/HzE29l Differential Revision: https://reviews.llvm.org/D145064	2023-03-01 16:22:13 +00:00
Caroline Concatto	cb96eba27c	[IR][Legalization] Split illegal deinterleave and interleave vectors To make legalization easier, the operands and outputs have the same size for these ISD Nodes. When legalizing the results in SplitVectorResult the operands are legalized to the same size as the outputs. The ISD Node has two output/results, therefore the legalizing functions update both results/outputs. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D144744	2023-03-01 08:30:16 +00:00
Noah Goldstein	e981e6d10e	Add transform for `(and/or (icmp eq/ne A,-1),(icmp eq/ne A,-1+C))`->`(and/or (icmp eq/ne (and ~A,-1+C),0))` This works of `-1+C` is a negative power of 2. This can be more useful than the `AddAnd` case as `~A` does not necessarily require materializing a constant. This makes the transform worth it for X86 vector types. Alive2 Links: EQ: https://alive2.llvm.org/ce/z/P6u8cq NE: https://alive2.llvm.org/ce/z/_Kkqp1 Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D144284	2023-02-24 15:22:09 -06:00
Noah Goldstein	8c74c5402f	Make `(and/or (icmp eq/ne A,C0), (icmp eq/ne A,C1))` where `IsPow(dif(C0,C1))` work for more patterns. `(and/or (icmp eq/ne A,C0), (icmp eq/ne A,C1))` can be lowered to `(icmp eq/ne (and (sub A, (smin C0, C1)), (not (sub (smax C0, C1), (smin C0, C1)))), 0)` generically if `(sub (smax C0, C1), (smin C0,C1))` is a power of 2. This covers the existing case of `(and/or (icmp eq/ne A, C_Pow2),(icmp eq/ne A, -C_Pow2))` as well as other cases. Alive2 Links: EQ: https://alive2.llvm.org/ce/z/mLJiUW NE: https://alive2.llvm.org/ce/z/TKnzUr Differential Revision: https://reviews.llvm.org/D144283	2023-02-24 15:22:09 -06:00
Samuel Parker	f48d3b6f46	Revert "[DAGCombine] Fold redundant select" This reverts commit c7f9344d0f8f6a00adab138037e2e7b406ef2b69.	2023-02-23 17:59:41 +00:00
Cameron McInally	af4c4f4e21	[DAGCombine] Fix an ICE in combineMinNumMaxNum(...) 65420c8041f4 introduced an ICE in combineMinNumMaxNum(...) when combineMinNumMaxNumImpl(...) returns an SDValue(). Make sure to check that a value is returned before trying to perform an FNEG on it. GitHub Issue: #60924 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D144571	2023-02-22 11:00:51 -08:00
Ricardo Jesus	272bd573dc	[AArch64] Fix abs(sub nsw) -> absd This partially reverts a regression introduced in 8f25e382c5b1 for AArch64 targets. In particular, we restore the logic of `(abs (sub nsw x, y)) -> abds(x, y)` for all targets except X86, which keeps the logic introduced in 8f25e382c5b1. See also https://reviews.llvm.org/D142288. Differential Revision: https://reviews.llvm.org/D144379	2023-02-22 09:17:25 +00:00
Kazu Hirata	a28b252d85	Use APInt::getSignificantBits instead of APInt::getMinSignedBits (NFC) Note that getMinSignedBits has been soft-deprecated in favor of getSignificantBits.	2023-02-19 23:56:52 -08:00
Kazu Hirata	4a05edd410	[llvm] Use APInt::getZero instead of APInt::getNullValue (NFC) Note that APInt::getNullValue has been soft-deprecated in favor of APInt::getZero.	2023-02-19 22:42:01 -08:00
Kazu Hirata	a7baaab952	Use APInt::isZero instead of APInt::isNulLValue (NFC) Note that APInt::isNullValue has been soft-deprecated in favor of APInt::isZero.	2023-02-19 22:23:58 -08:00
Kazu Hirata	f8f3db2756	Use APInt::count{l,r}_{zero,one} (NFC)	2023-02-19 22:04:47 -08:00
Kazu Hirata	cbde2124f1	Use APInt::popcount instead of APInt::countPopulation (NFC) This is for consistency with the C++20-style bit manipulation functions in <bit>.	2023-02-19 11:29:12 -08:00
Yeting Kuo	a96cbeb450	[DAGCombiner] Teach MatchContextClass classes to use TargetLowering::isOperationLegalOrCustom(). Some of TargetLowering functions needed opcodes are often used in DAGCombiner. The patch make those MatchContextClass classes have TargetLowering members and pass specific opcodes for those TargetLowering functions. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D144075	2023-02-17 15:58:47 +08:00
Kazu Hirata	7e6e636fb6	Use llvm::has_single_bit<uint32_t> (NFC) This patch replaces isPowerOf2_32 with llvm::has_single_bit<uint32_t> where the argument is wider than uint32_t.	2023-02-15 22:17:27 -08:00
Samuel Parker	c7f9344d0f	[DAGCombine] Fold redundant select Recommit bbdf24357932b064f2aa18ea1356b474e0220dde. Original commit message: If a chain of two selects share a true/false value and are controlled by two setcc nodes, that are never both true, we can fold away one of the selects. So, the following: (select (setcc X, const0, eq), Y, (select (setcc X, const1, eq), Z, Y)) Can be combined to: select (setcc X, const1, eq) Z, Y Differential Revision: https://reviews.llvm.org/D142535	2023-02-15 10:32:16 +00:00
Noah Goldstein	42e11a6ea3	Add transform (and/or (icmp eq/ne (A, C)), (icmp eq/ne (A, -C))) -> (icmp eq/ne (ABS A), ABS(C)) This can be beneficial if there is a fast `ABS` (For example with X86 `vpabs`) or if there is a dominating ABS(A) in the `DAG`. Note `C` is constant so `ABS(C)` is just a constant. Alive2 Links: EQ: https://alive2.llvm.org/ce/z/829F-c NE: https://alive2.llvm.org/ce/z/tsS8bU Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D142601	2023-02-14 18:59:04 -06:00
Noah Goldstein	54a9e992c8	Add Transform for `(and/or (eq/ne A,Pow2),(eq/ne A,-Pow2))`->`(eq/ne (and (and A,Pow2),~(Pow2*2)), 0)` In many instances this can be preferable if the `icmp` -> `i1` cannot be done in one instruction (such as X86 for scalars). At the moment guarded behind `TLI.isDesirableToCombineLogicOpOfSETCC`. alive2 links: https://alive2.llvm.org/ce/z/nLm5sN https://alive2.llvm.org/ce/z/moEcyE Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D142344	2023-02-14 18:59:04 -06:00
Matt Arsenault	09dd4d870e	DAG: Remove hasBitPreservingFPLogic This doesn't make sense as an option. fneg and fabs are bit preserving by definition. If a target has some fneg or fabs instruction that are not bitpreserving it's incorrect to lower fneg/fabs to use it.	2023-02-14 10:25:24 -04:00
Matt Arsenault	f3c008ca77	DAG: Relax foldBitcastedFPLogic conditions Requiring a bitcast to exist was unhelpful. The most basic cases are always going to be a CopyFromReg or load, so they would need a new cast inserted. Don't require a bitcast if it's a free operation. I don't think this logic makes particularly much sense (it seems to be imparting special interpretation of bitcast), but this needs to be in sync with foldSignChangeInBitcast. We should also get rid of this hasBitPreservingFPLogic hook. fabs/fneg are bitpreserving or incorrectly implemented, so this should just be a regular legality check.	2023-02-14 07:59:10 -04:00
pvanhout	04f6934589	[DAG] Handle build_vector with all undefs in reduceBuildVecTruncToBitCast While working on D143731 I hit a case where a build_vector with 2 undef operands could be generated (with one undef hidden behind a bitcast). That made `reduceBuildVecTruncToBitCast` crash because it seems to assume there is at least one good operand. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D143886	2023-02-14 08:52:28 +01:00

1 2 3 4 5 ...

3554 Commits