llvm-project

Author	SHA1	Message	Date
Simon Pilgrim	da570ef1b4	[DAG] Match select(icmp(x,y),sub(x,y),sub(y,x)) -> abd(x,y) patterns Pulled out of PowerPC, and added ABDS support as well (hence the additional v4i32 PPC matches) Differential Revision: https://reviews.llvm.org/D144789	2023-03-14 15:10:30 +00:00
Simon Pilgrim	4bf004e07e	[DAG] Fold (bitcast (logicop (bitcast x), (c))) -> (logicop x, (bitcast c)) iff the current logicop type is illegal Try to remove extra bitcasts around logicops if we're dealing with illegal types Fixes the regressions in D145939 Differential Revision: https://reviews.llvm.org/D146032	2023-03-14 14:41:11 +00:00
Simon Pilgrim	c7d844ea0f	[DAG] Use ISD::isBitwiseLogicOp in AND/OR/XOR checks. NFCI. There's additional cases we can cleanup (mainly in target code), but this tries to cleanup generic code and PPC which had an equivalent helper.	2023-03-13 13:39:02 +00:00
Chen Zheng	4f0ed16a46	Reland rGf35a09daebd0a90daa536432e62a2476f708150d and rG63854f91d3ee1056796a5ef27753648396cac6ec [DAGCombiner] handle more store value forwarding When lowering calls on target like PPC, some stack loads will be generated for by value parameters. Node CALLSEQ_START prevents such loads from being combined. Suggested by @RolandF, this patch removes the unnecessary loads for the byval parameter by extending ForwardStoreValueToDirectLoad Reviewed By: nemanjai, RolandF Differential Revision: https://reviews.llvm.org/D138899	2023-03-12 21:59:18 -04:00
Jun Ma	00eef4f7c3	[SelectionDAG] Fix mismatched truncate when combine BUILD_VECTOR with EXTRACT_SUBVECTOR Just use correct type for truncation. Fixes PR59625 Differential Revision: https://reviews.llvm.org/D145757	2023-03-13 08:59:52 +08:00
Simon Pilgrim	82dc04befd	[DAG] visitZERO_EXTEND - pull out the repeated SDLoc(N) variables	2023-03-12 15:18:46 +00:00
Simon Pilgrim	4d7da0e711	[DAG] Cleanup the (zext (shl (zext x), cst)) -> (shl (zext x), cst) fold. NFC. Preliminary cleanup before adding some additional legality and value tracking handling.	2023-03-12 15:01:33 +00:00
Simon Pilgrim	b53ea2b9c5	[DAG] visitAND - fold (and (any_ext V), c) -> (zero_ext (and (trunc V), c)) if profitable. Try to more aggressively narrow masks of extended values. This is mainly for cases where the mask is trying to zero out any_extended upper bits, assuming we can zext/trunc the values for free. This catches a few actual missed folds, as well as helps canonicalize a number of other cases which were being caught in isel etc. Differential Revision: https://reviews.llvm.org/D145866	2023-03-12 13:25:23 +00:00
Simon Pilgrim	fad852efe4	[DAG] combineShiftAnd1ToBitTest - improve support for peeking through truncations Allows us to handle shift amounts that exceed the original bitwidth	2023-03-11 16:37:47 +00:00
Juneyoung Lee	a66bc1c4a3	[DAGCombiner] Avoid converting (x or/xor const) + y to (x + y) + const if benefit is unclear This patch resolves suboptimal code generation reported by https://github.com/llvm/llvm-project/issues/60571 . DAGCombiner currently converts `(x or/xor const) + y` to `(x + y) + const` if this is valid. However, if `.. + const` is broken down into a sequences of adds with carries, the benefit is not clear, introducing two more add(-with-carry) ops (total 6) in the case of the reported issue whereas the optimal sequence must only have 4 add(-with-carry)s. This patch resolves this issue by allowing this conversion only when (1) `.. + const` is legal or promotable, or (2) `const` is a sign bit because it does not introduce more adds. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D144116	2023-03-08 18:13:57 +00:00
Chen Zheng	fc26ab36a2	[DAGCombiner] don't use the pointer info for widen store The merged store touches memory for other underlying objects, so mapping the merged store to the first underlying object is not correct. For example in https://github.com/llvm/llvm-project/issues/60744, the merged store is not correctly analyzed as dependent with memory operations which are also part of the merged store. Fixes #60744 Reviewed By: foad Differential Revision: https://reviews.llvm.org/D144711	2023-03-07 20:31:09 -05:00
Noah Goldstein	c1ecd0a3f4	[DAGCombiner] Add fold for `~x + x` -> `-1` This is generally done by the InstCombine, but can be emitted as an intermediate step and is cheap to handle. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D145177	2023-03-06 20:30:27 -06:00
Noah Goldstein	d4b24b4a55	[DAGCombiner] Add fold for `~x & x` -> `0` This is generally done by the InstCombine, but can be emitted as an intermediate step and is cheap to handle. Differential Revision: https://reviews.llvm.org/D145143	2023-03-06 20:30:20 -06:00
Craig Topper	c546f13f1f	[DAGCombiner] Replace LegalOperations check in visitSIGN_EXTEND with LegalTypes. This is guarding a check for isTypeLegal so it should check is LegalTypes. Fixes PR61111. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D145139	2023-03-02 07:52:53 -08:00
Sander de Smalen	170e7a0ec2	[AArch64][SME2] Add CodeGen support for target("aarch64.svcount"). This patch adds AArch64 CodeGen support such that the type can be passed and returned to/from functions, and also adds support to use this type in load/store operations and PHI nodes. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D136862	2023-03-02 12:07:41 +00:00
David Green	337215ddf9	[DAG] ABD is not reassociative I'm not sure how I missed this in the testing, but as far as I understand whilst ABDS and ABDU are commutive they are not associative. This patch disables reassociateOps from visitABD, fixing the problems found in #61069. ABDU: https://alive2.llvm.org/ce/z/eiT5QG ABDS: https://alive2.llvm.org/ce/z/HzE29l Differential Revision: https://reviews.llvm.org/D145064	2023-03-01 16:22:13 +00:00
Caroline Concatto	cb96eba27c	[IR][Legalization] Split illegal deinterleave and interleave vectors To make legalization easier, the operands and outputs have the same size for these ISD Nodes. When legalizing the results in SplitVectorResult the operands are legalized to the same size as the outputs. The ISD Node has two output/results, therefore the legalizing functions update both results/outputs. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D144744	2023-03-01 08:30:16 +00:00
Noah Goldstein	e981e6d10e	Add transform for `(and/or (icmp eq/ne A,-1),(icmp eq/ne A,-1+C))`->`(and/or (icmp eq/ne (and ~A,-1+C),0))` This works of `-1+C` is a negative power of 2. This can be more useful than the `AddAnd` case as `~A` does not necessarily require materializing a constant. This makes the transform worth it for X86 vector types. Alive2 Links: EQ: https://alive2.llvm.org/ce/z/P6u8cq NE: https://alive2.llvm.org/ce/z/_Kkqp1 Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D144284	2023-02-24 15:22:09 -06:00
Noah Goldstein	8c74c5402f	Make `(and/or (icmp eq/ne A,C0), (icmp eq/ne A,C1))` where `IsPow(dif(C0,C1))` work for more patterns. `(and/or (icmp eq/ne A,C0), (icmp eq/ne A,C1))` can be lowered to `(icmp eq/ne (and (sub A, (smin C0, C1)), (not (sub (smax C0, C1), (smin C0, C1)))), 0)` generically if `(sub (smax C0, C1), (smin C0,C1))` is a power of 2. This covers the existing case of `(and/or (icmp eq/ne A, C_Pow2),(icmp eq/ne A, -C_Pow2))` as well as other cases. Alive2 Links: EQ: https://alive2.llvm.org/ce/z/mLJiUW NE: https://alive2.llvm.org/ce/z/TKnzUr Differential Revision: https://reviews.llvm.org/D144283	2023-02-24 15:22:09 -06:00
Samuel Parker	f48d3b6f46	Revert "[DAGCombine] Fold redundant select" This reverts commit c7f9344d0f8f6a00adab138037e2e7b406ef2b69.	2023-02-23 17:59:41 +00:00
Cameron McInally	af4c4f4e21	[DAGCombine] Fix an ICE in combineMinNumMaxNum(...) 65420c8041f4 introduced an ICE in combineMinNumMaxNum(...) when combineMinNumMaxNumImpl(...) returns an SDValue(). Make sure to check that a value is returned before trying to perform an FNEG on it. GitHub Issue: #60924 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D144571	2023-02-22 11:00:51 -08:00
Ricardo Jesus	272bd573dc	[AArch64] Fix abs(sub nsw) -> absd This partially reverts a regression introduced in 8f25e382c5b1 for AArch64 targets. In particular, we restore the logic of `(abs (sub nsw x, y)) -> abds(x, y)` for all targets except X86, which keeps the logic introduced in 8f25e382c5b1. See also https://reviews.llvm.org/D142288. Differential Revision: https://reviews.llvm.org/D144379	2023-02-22 09:17:25 +00:00
Kazu Hirata	a28b252d85	Use APInt::getSignificantBits instead of APInt::getMinSignedBits (NFC) Note that getMinSignedBits has been soft-deprecated in favor of getSignificantBits.	2023-02-19 23:56:52 -08:00
Kazu Hirata	4a05edd410	[llvm] Use APInt::getZero instead of APInt::getNullValue (NFC) Note that APInt::getNullValue has been soft-deprecated in favor of APInt::getZero.	2023-02-19 22:42:01 -08:00
Kazu Hirata	a7baaab952	Use APInt::isZero instead of APInt::isNulLValue (NFC) Note that APInt::isNullValue has been soft-deprecated in favor of APInt::isZero.	2023-02-19 22:23:58 -08:00
Kazu Hirata	f8f3db2756	Use APInt::count{l,r}_{zero,one} (NFC)	2023-02-19 22:04:47 -08:00
Kazu Hirata	cbde2124f1	Use APInt::popcount instead of APInt::countPopulation (NFC) This is for consistency with the C++20-style bit manipulation functions in <bit>.	2023-02-19 11:29:12 -08:00
Yeting Kuo	a96cbeb450	[DAGCombiner] Teach MatchContextClass classes to use TargetLowering::isOperationLegalOrCustom(). Some of TargetLowering functions needed opcodes are often used in DAGCombiner. The patch make those MatchContextClass classes have TargetLowering members and pass specific opcodes for those TargetLowering functions. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D144075	2023-02-17 15:58:47 +08:00
Kazu Hirata	7e6e636fb6	Use llvm::has_single_bit<uint32_t> (NFC) This patch replaces isPowerOf2_32 with llvm::has_single_bit<uint32_t> where the argument is wider than uint32_t.	2023-02-15 22:17:27 -08:00
Samuel Parker	c7f9344d0f	[DAGCombine] Fold redundant select Recommit bbdf24357932b064f2aa18ea1356b474e0220dde. Original commit message: If a chain of two selects share a true/false value and are controlled by two setcc nodes, that are never both true, we can fold away one of the selects. So, the following: (select (setcc X, const0, eq), Y, (select (setcc X, const1, eq), Z, Y)) Can be combined to: select (setcc X, const1, eq) Z, Y Differential Revision: https://reviews.llvm.org/D142535	2023-02-15 10:32:16 +00:00
Noah Goldstein	42e11a6ea3	Add transform (and/or (icmp eq/ne (A, C)), (icmp eq/ne (A, -C))) -> (icmp eq/ne (ABS A), ABS(C)) This can be beneficial if there is a fast `ABS` (For example with X86 `vpabs`) or if there is a dominating ABS(A) in the `DAG`. Note `C` is constant so `ABS(C)` is just a constant. Alive2 Links: EQ: https://alive2.llvm.org/ce/z/829F-c NE: https://alive2.llvm.org/ce/z/tsS8bU Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D142601	2023-02-14 18:59:04 -06:00
Noah Goldstein	54a9e992c8	Add Transform for `(and/or (eq/ne A,Pow2),(eq/ne A,-Pow2))`->`(eq/ne (and (and A,Pow2),~(Pow2*2)), 0)` In many instances this can be preferable if the `icmp` -> `i1` cannot be done in one instruction (such as X86 for scalars). At the moment guarded behind `TLI.isDesirableToCombineLogicOpOfSETCC`. alive2 links: https://alive2.llvm.org/ce/z/nLm5sN https://alive2.llvm.org/ce/z/moEcyE Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D142344	2023-02-14 18:59:04 -06:00
Matt Arsenault	09dd4d870e	DAG: Remove hasBitPreservingFPLogic This doesn't make sense as an option. fneg and fabs are bit preserving by definition. If a target has some fneg or fabs instruction that are not bitpreserving it's incorrect to lower fneg/fabs to use it.	2023-02-14 10:25:24 -04:00
Matt Arsenault	f3c008ca77	DAG: Relax foldBitcastedFPLogic conditions Requiring a bitcast to exist was unhelpful. The most basic cases are always going to be a CopyFromReg or load, so they would need a new cast inserted. Don't require a bitcast if it's a free operation. I don't think this logic makes particularly much sense (it seems to be imparting special interpretation of bitcast), but this needs to be in sync with foldSignChangeInBitcast. We should also get rid of this hasBitPreservingFPLogic hook. fabs/fneg are bitpreserving or incorrectly implemented, so this should just be a regular legality check.	2023-02-14 07:59:10 -04:00
pvanhout	04f6934589	[DAG] Handle build_vector with all undefs in reduceBuildVecTruncToBitCast While working on D143731 I hit a case where a build_vector with 2 undef operands could be generated (with one undef hidden behind a bitcast). That made `reduceBuildVecTruncToBitCast` crash because it seems to assume there is at least one good operand. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D143886	2023-02-14 08:52:28 +01:00
Arthur Eubanks	7c6b46e87e	Revert "[DAGCombiner] handle more store value forwarding" This reverts commit f35a09daebd0a90daa536432e62a2476f708150d. Causes miscompiles, see D138899	2023-02-13 19:07:28 -08:00
Arthur Eubanks	ac6219d0ae	Revert "[DAGCombiner] fix comments for D138899; NFC" This reverts commit 63854f91d3ee1056796a5ef27753648396cac6ec. Dependent commit to be reverted.	2023-02-13 19:07:27 -08:00
Dinar Temirbulatov	d44b31eca2	[DAGCombine] Allow DAGCombine to remove dead masked stores. Remove a dead masked store if another one has the same base pointer and mask or the following store has all true constant mask and size if equal or bigger to the first store. Differential Revision: https://reviews.llvm.org/D143069	2023-02-13 16:11:11 +00:00
Simon Pilgrim	ce63cd3bf1	[DAG] Fold freeze(concat_vectors(x,y,...)) -> concat_vectors(freeze(x),freeze(y),...) Another of the cleanups necessary for D136529	2023-02-08 20:26:43 +00:00
Simon Pilgrim	b7deb71ef5	[DAG] Fold freeze(build_pair(x,y)) -> build_pair(freeze(x),freeze(y)) One of the cleanups necessary for D136529 - another being how we're going to handle moving freeze through multiple result nodes (like uaddo and subcarry)	2023-02-08 17:54:03 +00:00
David Green	1af3f596f6	[DAG] Fold Op(vecreduce(a), vecreduce(b)) into vecreduce(Op(a,b)) So long as the operation is reassociative, we can reassociate the double vecreduce from for example fadd(vecreduce(a), vecreduce(b)) to vecreduce(fadd(a,b)). This will in general save a few instructions, but some architectures (MVE) require the opposite fold, so a shouldExpandReduction is added to account for it. Only targets that use shouldExpandReduction will be affected. Differential Revision: https://reviews.llvm.org/D141870	2023-02-08 11:43:36 +00:00
Fangrui Song	a13645cf8c	DAGCombiner: fix -Wunused-private-field. NFC	2023-02-07 22:33:56 -08:00
Yeting Kuo	7bc2cd614e	[VP][DAGCombiner] Introduce generalized pattern match for vp sdnodes. The patch tries to solve duplicated combine work for vp sdnodes. The idea is to introduce MatchConext that verifies specific patterns and generate specific node infromation. There is two MatchConext in DAGCombiner. EmptyMatcher is for normal nodes and VPMatcher is for vp nodes. The idea of this patch is come form Simon Moll's proposal [0]. I only fixed some minor issues and added few new features in this patch. [0]: `c38a14484a` Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D141891	2023-02-08 13:45:35 +08:00
Dinar Temirbulatov	8fdc3ff220	[DAGCombine] Allow scalable type dead store elimination. Add support to allow removing a dead store for scalable types. Avoid to remove scalable type store in favor of fixed type store, since scalable type size is unknown at the compile time. Differential Revision: https://reviews.llvm.org/D142100	2023-02-08 00:57:26 +00:00
Samuel Parker	91f8289ff0	Revert "[DAGCombine] Fold redundant select" This reverts commit bbdf24357932b064f2aa18ea1356b474e0220dde.	2023-02-07 10:37:20 +00:00
Chen Zheng	63854f91d3	[DAGCombiner] fix comments for D138899; NFC	2023-02-07 00:32:34 -05:00
David Green	120ce83660	[DAG] Add visitABD optimizations This adds basic a visitABD to optimize ABDS and ABDU nodes, similar to the existing visitAVG method. The fold I was initially interested in was folding shuffles though the binop. This also: - Marks ABDS and ABDU as commutative binops (https://alive2.llvm.org/ce/z/oCDogb and https://alive2.llvm.org/ce/z/7zrs86). - Add reassociative folds. - Add constant folding using max(x,y)-min(x,y) - Canonicalizes constants to the RHS - Folds abds x, 0 -> abs(x) (https://alive2.llvm.org/ce/z/4ZEibv) - Folds abdu x, 0 -> x (https://alive2.llvm.org/ce/z/J_rKqx) - Folds abd x, undef -> 0 (https://alive2.llvm.org/ce/z/NV6Nsv and https://alive2.llvm.org/ce/z/vs92hu). Differential Revision: https://reviews.llvm.org/D143193	2023-02-05 10:28:54 +00:00
Simon Pilgrim	723b6cf7a8	[DAG] visitFREEZE - handle case where the folded node merges with another existing node Fixes #60413	2023-02-04 20:53:47 +00:00
Simon Pilgrim	8f25e382c5	[X86] Add basic vector handling for ISD::ABDS/ABDU (absolute difference) nodes I'm intending to add generic legalization in the future, but for now I've added basic support to targets that have the necessary MIN/MAX support to expand to SUB(MAX(X,Y),MIN(X,Y)). This exposed a couple of issues with the DAG combines - in particular we need to catch trunc(abs(sub(ext(x),ext(y)))) patterns earlier before the SSE/AVX vector trunc expansion folds trigger. Differential Revision: https://reviews.llvm.org/D142288	2023-02-04 11:25:51 +00:00
Samuel Parker	bbdf243579	[DAGCombine] Fold redundant select If a chain of two selects share a true/false value and are controlled by two setcc nodes, that are never both true, we can fold away one of the selects. So, the following: (select (setcc X, const0, eq), Y, (select (setcc X, const1, eq), Z, Y)) Can be combined to: select (setcc X, const1, eq) Z, Y Differential Revision: https://reviews.llvm.org/D142535	2023-02-02 09:43:21 +00:00

1 2 3 4 5 ...

3539 Commits