llvm-project

Author	SHA1	Message	Date
Shourya Goel	92764c99e9	[DAG] Matched Fixedwidth Pattern for ISD::AVGCEILU (#85031 ) Fixes: #84753	2024-03-19 13:02:37 +00:00
Simon Pilgrim	97fc16e14b	[DAG] visitEXTRACT_VECTOR_ELT - share the same SDLoc instead of recreating it. NFC.	2024-03-19 12:39:47 +00:00
Shourya Goel	703920d413	[DAG] Matched FixedWidth pattern for ISD::AVGFLOORU (#84903 ) Fixes: #84749	2024-03-19 08:29:55 +00:00
zicwangupa	bc70f60418	[SelectionDAG] Add m_Neg and m_Not pattern matcher and update DAGCombiner (#85365 ) Resolves #85065 --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2024-03-18 18:34:31 +05:30
David Green	601e102bdb	[CodeGen] Use LocationSize for MMO getSize (#84751 ) This is part of #70452 that changes the type used for the external interface of MMO to LocationSize as opposed to uint64_t. This means the constructors take LocationSize, and convert ~UINT64_C(0) to LocationSize::beforeOrAfter(). The getSize methods return a LocationSize. This allows us to be more precise with unknown sizes, not accidentally treating them as unsigned values, and in the future should allow us to add proper scalable vector support but none of that is included in this patch. It should mostly be an NFC. Global ISel is still expected to use the underlying LLT as it needs, and are not expected to see unknown sizes for generic operations. Most of the changes are hopefully fairly mechanical, adding a lot of getValue() calls and protecting them with hasValue() where needed.	2024-03-17 18:15:56 +00:00
Simon Pilgrim	f1d0a48b27	[DAG] foldABSToABD - share the same SDLoc argument instead of recreating it over and over again.	2024-03-15 11:25:33 +00:00
Simon Pilgrim	cdb36d47b7	[DAG] foldAndToUsubsat/foldSubToUSubSat - share the same SDLoc argument instead of recreating it over and over again.	2024-03-15 11:25:33 +00:00
Simon Pilgrim	fe753f77c3	[DAG] visitTRUNCATE - pull out repeated SDLoc(N). NFC.	2024-03-15 10:24:19 +00:00
Simon Pilgrim	d59256992e	[DAG] visitAND - pull out repeated SDLoc(N). NFC.	2024-03-15 10:24:19 +00:00
Simon Pilgrim	c7cf1350b4	[DAG] foldAndToUsubsat - convert to use sd_match pattern. NFC.	2024-03-14 12:20:49 +00:00
Simon Pilgrim	63180ba444	[DAG] Use SelectionDAG::getNOT helper where possible. NFC.	2024-03-13 14:47:35 +00:00
Simon Pilgrim	a7af53e99b	[DAG] visitSUB - convert some folds to use SDPatternMatch General cleanup and allows us to handle several commutable matches with a single pattern	2024-03-13 12:00:24 +00:00
XChy	c8cc7903b3	[SelectionDAG] Replace some basic patterns in visitADDLike with SDPatternMatch (#84759 ) Resolves #84745. Based on SDPatternMatch introduced by #78654, this patch replaces some of basic patterns in `visitADDLike` with corresponding patterns in SDPatternMatch. This patch only replaces original folds, instead of introducing new ones.	2024-03-13 00:33:50 +08:00
SahilPatidar	9e0f5909d0	[DAG] Fix Failure to reassociate SMAX/SMIN/UMAX/UMIN (#82175 ) Resolve #58110	2024-03-07 15:15:17 +00:00
Jay Foad	571d5af5aa	[DAGCombiner] Improve comment on reassociateOps and its helper	2024-03-06 16:40:12 +00:00
elhewaty	26058e68ea	[DAG] select (sext m), (add X, C), X --> (add X, (and C, (sext m)))) (#83640 ) - [DAG][X86] Add tests for Folding select m, add(X, C), X --> add (X, and(C, m))(NFC) - [DAG][X86] Fold select (sext m), (add X, C), X --> (add X, (and C, (sext m)))) - Fixes: https://github.com/llvm/llvm-project/issues/66101	2024-03-05 16:41:41 +00:00
Luke Lau	a668846202	[DAGCombiner] Handle extending EXTRACT_VECTOR_ELTs in calculateByteProvider (#83963 ) An EXTRACT_VECTOR_ELT can extend the element to the width of its result type, leaving the high bits undefined. Previously if we attempted to query the bytes in these high bits we would recurse and hit an assertion. This fixes it by bailing if the index is outside of the vector element size. I think the assertion Index < ByteWidth may still be incorrect, since ByteWidth is calculated from Op.getValueSizeInBits(). I believe this should be Op.getScalarValueSizeInBits() whenever VectorIndex is set since we're querying the element now, not the vector. But I couldn't think of a test case to trigger it. It can be addressed in a follow-up patch. Fixes #83920	2024-03-05 18:31:33 +08:00
David Green	6e41d60a71	[SelectionDAG] Change computeAliasing signature from optional<uint64> to LocationSize. (#83017 ) This is another smaller step of #70452, changing the signature of computeAliasing() from optional<uint64_t> to LocationSize, and follow-up changes in DAGCombiner::mayAlias(). There are some test change due to the previous AA->isNoAlias call incorrectly using an unknown size (~UINT64_T(0)). This should then be improved again in #70452 when the types are known to be scalable.	2024-02-28 09:43:05 +00:00
David Green	257cbea20d	[DAG] Format DAGCombiner::mayAlias. NFC	2024-02-26 18:22:35 +00:00
Yeting Kuo	850dde063b	[RISCV][VP] Introduce vp saturating addition/subtraction and RISC-V support. (#82370 ) This patch also pick the MatchContext framework from DAGCombiner to an indiviual header file to make the framework be used from other files in llvm/lib/CodeGen/SelectionDAG/.	2024-02-23 14:17:15 +08:00
Craig Topper	c1716e3fcf	[DAGCombiner][RISCV] CSE zext nneg and sext. (#82597 ) If we have a sext and a zext nneg with the same types and operand we should combine them into the sext. We can't go the other way because the nneg flag may only be valid in the context of the uses of the zext nneg.	2024-02-22 09:06:49 -08:00
Craig Topper	f8cbb67b10	[DAGCombiner] Preserve nneg flag from inner zext when we combine (z/s/aext (zext X)) (#82199 )	2024-02-19 12:21:17 -08:00
Craig Topper	f668a08e00	[DAGCombiner][RISCV] Optimize (zext nneg (truncate X)) if X has known sign bits. (#82227 ) This treats the zext nneg as sext if X is known to have sufficient sign bits to allow the zext or truncate or both to removed. This code is taken from the same optimization for sext.	2024-02-19 10:45:11 -08:00
Craig Topper	d5167c84f9	[DAGCombiner] Allow tryToFoldExtOfLoad to use a sextload for zext nneg. (#81714 ) If the load is used by any signed setccs, we can use a sextload instead of zextload. Then we don't have to give up on extending the load.	2024-02-17 11:37:13 -08:00
Simon Pilgrim	b279ca2783	[DAG] visitCTPOP - CTPOP(SHIFT(X)) -> CTPOP(X) iff the shift doesn't affect any non-zero bits If the source is being (logically) shifted, but doesn't affect any active bits, then we can call CTPOP on the shift source directly.	2024-02-15 10:41:08 +00:00
Craig Topper	86ce491f30	[DAGCombiner] Remove unneeded commonAlignment from reduceLoadWidth. (#81707 ) We already have the PtrOff factored into MachinePointerInfo. Any calls to getAlign on the new load with do commonAlignment with the MachinePointerInfo offset and the base alignment.	2024-02-13 23:26:25 -08:00
Craig Topper	e6253102a7	[DAGCombiner] Remove unnecessary commonAlignment from CombineExtLoad. (#81705 ) The getAlign function for a load returns the commonAlignment of the "base align" and the offset stored in the MachinePointerInfo. We're splitting a load here, so we should take the base alignment from the original load without any offset that may already exist in the original load. The new load can then maintain its own alignment using just the base alignment and its own offset. Noticed by inspection.	2024-02-13 23:26:08 -08:00
Nikita Popov	25b9ed6e49	[DAGCombine] Fix multi-use miscompile in load combine (#81586 ) The load combine replaces a number of original loads with one new loads and also replaces the output chains of the original loads with the output chain of the new load. This is incorrect if the original load is retained (due to multi-use), as it may get incorrectly reordered. Fix this by using makeEquivalentMemoryOrdering() instead, which will create a TokenFactor with both chains. Fixes https://github.com/llvm/llvm-project/issues/80911.	2024-02-13 16:41:00 +01:00
Simon Pilgrim	b35c519762	[DAG] tryToFoldExtendOfConstant - share the same SDLoc argument instead of recreating it over and over again.	2024-02-08 11:43:29 +00:00
Simon Pilgrim	670c2529bb	[DAG] Use DAGCombiner::SimplifyDemandedBits wrappers with default (all) DemandedElts. NFC. Don't call TLI.SimplifyDemandedVectorElts directly from every SimplifyDemandedBits call, use the more expressive wrappers instead first. This reduces the number of places we call TLI.SimplifyDemandedVectorElts and CommitTargetLoweringOpt to make it easier to track. Part of the work to process DAG nodes in topological order.	2024-02-07 11:12:29 +00:00
Simon Pilgrim	b8cdc2638e	[DAG] visitCTPOP - if only the upper half of the ctpop operand is zero then see if its profitable to only count the lower half. (#80473 )	2024-02-06 12:19:31 +00:00
Philip Reames	e722d9662d	[DAG] Avoid a crash when checking size of scalable type in visitANDLike Fixes https://github.com/llvm/llvm-project/issues/80744. This transform doesn't handled vectors at all, The fixed length ones pass the first check, but would fail the constant operand checks which immediate follow. This patch takes the simplest approach, and just guards the transform for scalar integers.	2024-02-05 14:30:10 -08:00
Craig Topper	6590d0fed5	[DAGCombiner][ARM] Teach reduceLoadWidth to handle (and (srl (load), C, ShiftedMask)) (#80342 ) If we have a shifted mask, we may be able to reduce the load width to the width of the non-zero part of the mask and use an offset to the base address to remove the srl. The offset is given by C+trailingzeros(ShiftedMask). Then we add a final shl to restore the trailing zero bits. I've use the ARM test because that's where the existing (and (srl (load))) tests were. The X86 test was modified to keep the H register.	2024-02-04 16:05:51 -08:00
Liao Chunyu	45188c64db	[DAGCombiner] Use generalized pattern matcher in foldBoolSelectToLogic (#79101 ) support vp.select TODO: Possibly other functions could be supported, eg: SimplifySelect()	2024-01-30 10:26:51 +08:00
Kazu Hirata	faf555f93f	Revert "[DAGCombiner] Use SmallDenseMap (NFC) (#79681 )" This reverts commit 863b2c84c0fbcfb02d969fa36af4932d410a827b. A compile-time regression has been reported: https://github.com/llvm/llvm-project/pull/79681#issuecomment-1913325915	2024-01-27 19:29:47 -08:00
Kazu Hirata	863b2c84c0	[DAGCombiner] Use SmallDenseMap (NFC) (#79681 ) The use of SmallDenseMap saves 0.48% of heap allocations during the compilation of a large preprocessed file, namely X86ISelLowering.cpp, for the X86 target. During the experiment, the maximum size of WorklistMap was 24 or less 74% of the time. (Note that DenseMap has the maximum occupancy rate of 3/4.)	2024-01-27 08:46:02 -08:00
David Green	7f518ee9ea	[DAG] Add a one-use check to concat -> scalar_to_vector fold. (#79510 ) Without this we can end up with multiple copies from gpr->fpr.	2024-01-26 18:17:17 +00:00
Nico Weber	184ca39529	[llvm] Move CodeGenTypes library to its own directory (#79444 ) Finally addresses https://reviews.llvm.org/D148769#4311232 :) No behavior change.	2024-01-25 12:01:31 -05:00
Simon Pilgrim	e1aa5b1fd1	[DAG] visitSCALAR_TO_VECTOR - don't fold scalar_to_vector(bin(extract(x),extract(y)) -> bin(x,y) if extracts have other uses Fixes #78897 - although the test case still has a number of poor codegen issues (in particular for i686 triples) that will need addressing (combining the nodes in topological order should help).	2024-01-23 16:28:43 +00:00
Nikita Popov	cde780c18f	[DAGCombine] Add debug counter (#78259 ) Add a debug counter for DAGCombine. This can help with bisecting which DAG combine introduced a miscompile.	2024-01-17 09:31:56 +01:00
Simon Pilgrim	7bf13fe812	[DAG] Fold (sext (sext_inreg x)) -> (sext (trunc x)) if the trunc is free (#77616 )	2024-01-11 09:39:30 +00:00
Simon Pilgrim	c1173e4e05	[DAG] Use FoldConstantArithmetic for unary bitops constant folding. BSWAP/BITREVERSE/CTPOP/CTLZ/CTLZ_ZERO_UNDEF/CTTZ/CTTZ_ZERO_UNDEF are all handled by FoldConstantArithmetic - so use directly instead of testing for isConstantIntBuildVectorOrConstantInt and relying on DAG.getNode() to perform the constant fold.	2024-01-09 18:55:05 +00:00
Simon Pilgrim	e9ac2dc68d	[DAG] XformToShuffleWithZero - use dyn_cast instead of isa/cast pair. NFCI.	2024-01-09 15:08:25 +00:00
Alex Bradbury	2d54ec36f7	[SelectionDAG] Add and use SDNode::getAsAPIntVal() helper (#77455 ) This is the logical equivalent for #76710 for APInt and uses the same naming scheme. Converted existing users through: `git grep -l "cast<ConstantSDNode>\(.\).getAPIntValueValue" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.*)\)->getAPIntValue/\1->getAsAPIntVal/'`	2024-01-09 14:27:07 +00:00
Alex Bradbury	197214e39b	[RFC][SelectionDAG] Add and use SDNode::getAsZExtVal() helper (#76710 ) This follows on from #76708, allowing `cast<ConstantSDNode>(N)->getZExtValue()` to be replaced with just `N->getAsZextVal();` Introduced via `git grep -l "cast<ConstantSDNode>\(.\).getZExtValue" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.*)\)->getZExtValue/\1->getAsZExtVal/'` and then using `git clang-format` on the result.	2024-01-09 12:25:17 +00:00
Craig Topper	bdcd7c0ba0	[DAGCombiner][RISCV] Preserve disjoint flag in folding (shl (or x, c1), c2) -> (or (shl x, c2), c1 << c2) (#76860 ) Since we are shifting both inputs to the original Or by the same amount and inserting zeros in the LSBs, the result should still be disjoint.	2024-01-03 13:14:13 -08:00
Craig Topper	47a1704ac9	[SelectionDAG][X86] Use disjoint flag in SelectionDAG::isADDLike. (#76847 ) Keep the haveNoCommonBitsSet check because we haven't started inferring the flag yet. I've added tests for two transforms, but these are not the only transforms that use isADDLike.	2024-01-03 11:54:29 -08:00
Shao-Ce SUN	9f6bf00b25	[DAGCombine] Add DAG optimisation for BF16_TO_FP (#69426 ) fold bf16_to_fp(op & 0xffff) -> bf16_to_fp(op)	2023-12-27 17:20:54 +08:00
Simon Pilgrim	1e710cfc80	[DAG] Add TLI::isTruncateFree(SDValue, EVT) wrapper. Similar to the existing isZExtFree(SDValue, EVT) wrapper, this will allow targets to override for specific cases (e.g. free truncation of an ext/extload node). But for now its just used to wrap the existing isTruncateFree(EVT, EVT) call.	2023-12-24 13:19:10 +00:00
Jonas Paulsson	e32e147d6c	[DAGCombiner] Don't drop alignment info of original load. (#75626 ) Pass the original MMO instead of different individual values. getAlign() was used before where actually getOriginalAlign() would have been better, and this patch has the same effect.	2023-12-19 16:30:47 +01:00

1 2 3 4 5 ...

3753 Commits