llvm-project

Author	SHA1	Message	Date
Alexey Bataev	855178af99	[SLP]Fix/improve getSpillCost analysis Previous implementation may took some extra time, when walked over the same instructions several times. And also it did not include proper analysis for cross-basic-block use of the vectorized values. This version fixes it. It walks over the tree and checks the deps between entries and their operands. If there are non-vectorized calls in between, it adds a single(!) spill cost, because the vector value should be spilled/reloaded only once. Also, this version caches analysis for each entries, which are detected, and do not repeats it, uses data, found during previous analysis for previous nodes. Also, it has the internal limit. If the number of instructions between nodes and their operands is too big (> than ScheduleRegionSizeBudget / VectorizableTree.size()), it is considered that the spill is required. It allows to improve compile time. Reviewers: preames, RKSimon, mikhailramalho Reviewed By: preames Pull Request: https://github.com/llvm/llvm-project/pull/129258	2025-03-04 15:47:23 -05:00
Mikhail R. Gadelha	e55f1a7ef8	[SLP] Add test for getSpillCost fix	2025-02-24 16:54:53 -03:00
Alexey Bataev	7dca2c628c	[SLP]Gather scalarized calls If the calls won't be vectorized, but will be scalarized after vectorization, they should be build as buildvector nodes, not vector nodes. Vectorization of such calls leads to incorrect cost estimation, does not allow to calculate correctly spills costs. Reviewers: lukel97, preames Reviewed By: preames Pull Request: https://github.com/llvm/llvm-project/pull/125070	2025-02-04 19:09:57 -05:00
Alexey Bataev	5ca136d0e7	[SLP][NFC]Replace undefs with just poison in the test	2025-02-04 08:24:45 -08:00
Alexey Bataev	466217eb03	[SLP]Fix graph traversal in getSpillCost getSpill cost relies on def-use order when performs the analysis for the vectorized instructions live-over-calls spills. Patch fixes it to check the dependencies based on TreeEntries and performs actual vectorized type analysis. Reviewers: RKSimon, preames Reviewed By: preames Pull Request: https://github.com/llvm/llvm-project/pull/124984	2025-01-31 06:27:47 -05:00
Simon Pilgrim	ceb613a8be	[RISCV] Add full test coverage for acos/asin/atan and cosh/sinh/tanh intrinsics to support #106584	2024-08-30 14:01:15 +01:00
Alexey Bataev	d70963a762	[SLP]Fix the cost of the adjusted extracts in per-register analysis. Previous patch did not pass the list of the extract indices by reference, so the compiler just ignored them. Pass indices by reference and fix the per-register analysis. Reviewers: RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/96808	2024-06-28 14:33:08 -07:00
Alexey Bataev	a9c12e481b	Revert "[SLP]Fix the cost of the adjusted extracts in per-register analysis." This reverts commit 784152056ea40a800a8fd9f4157a428dfb7a6de8 to fix buildbots issues reported in https://lab.llvm.org/buildbot/#/builders/4/builds/315 and https://lab.llvm.org/buildbot/#/builders/35/builds/481	2024-06-28 13:41:51 -07:00
Alexey Bataev	784152056e	[SLP]Fix the cost of the adjusted extracts in per-register analysis. Previous patch did not pass the list of the extract indices by reference, so the compiler just ignored them. Pass indices by reference and fix the per-register analysis. Reviewers: RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/96808	2024-06-28 15:49:47 -04:00
Alexey Bataev	70a54bca6f	[SLP]Improve/fix extracts calculations for non-power-of-2 elements. One of the previous patches introduced initial support for non-power-of-2 number of elements but some parts of the SLP vectorizer still were not adjusted to handle the costs correctly. Patch fixes it by improving analysis of the non-power-of-2 number of elements and fixes in the cost of the extractelements instructions. Reviewers: RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/93213	2024-05-24 09:33:36 -04:00
Alexey Bataev	019aee8327	[SLP]Improve costs in computeExtractCost() to avoid crash after D158449. Need to consider the length of the original vector for extractelements, not the length, matched number of the scalars. It fixes 2 issues: 1) improves cost estimation; 2) Fixes crashes after D158449.	2023-09-29 07:48:02 -07:00
Hans Wennborg	06f3b0ed43	Revert "[SLP]Improve costs in computeExtractCost() to avoid crash after D158449." This caused asserts: Assertion failed: NumElts > 1 && "Expected at least 2-element fixed length vector(s).", file C:\b\s\w\ir\cache\builder\src\third_party\llvm\llvm\lib\Transforms\Vectorize\SLPVectorizer.cpp, line 7096 see comment on `59a67ea35d` > Need to consider the length of the original vector for extractelements, > not the length, matched number of the scalars. It fixes 2 issues: 1) > improves cost estimation; 2) Fixes crashes after D158449. This reverts commit 59a67ea35d608480257fc64ec3e5106ef50de740.	2023-09-29 10:42:19 +02:00
Alexey Bataev	59a67ea35d	[SLP]Improve costs in computeExtractCost() to avoid crash after D158449. Need to consider the length of the original vector for extractelements, not the length, matched number of the scalars. It fixes 2 issues: 1) improves cost estimation; 2) Fixes crashes after D158449.	2023-09-28 09:36:08 -07:00
David Spickett	8f548610a6	Revert "[SLP]Use source vector type as the original vector type instead of" This reverts commit 9a99944df068b29b905cd8ba9a2132cc6382b6fb. Due to test suite failures on all our SVE buildbots e.g.: https://lab.llvm.org/buildbot/#/builders/184/builds/7375 clang: ../llvm/llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp:3565: InstructionCost llvm::AArch64TTIImpl::getShuffleCost(TTI::ShuffleKind, VectorType , ArrayRef<int>, TTI::TargetCostKind, int, VectorType , ArrayRef<const Value *>): Assertion `Mask.size() == TpNumElts && "Expected Mask and Tp size to match!"' failed.	2023-09-22 07:52:16 +00:00
Alexey Bataev	9a99944df0	[SLP]Use source vector type as the original vector type instead of artificial for better cost estimation. Need to use original source vector type, not the one artificially constructed, based on the number of vectorized scalars. It affect the cost significantly.	2023-09-21 11:34:02 -07:00
Philip Reames	7f26c27e03	[RISCV] Enable SLP by default (when vectors are available) I propose that we go ahead and enabled SLP by default. Over the last few weeks, @luke and I have been working through codegen issues seen at small VLs from a couple of SPEC workloads. We still have a ways to go to get optimal codegen, but we're at the point where having a single configuration we're all tuning against is probably the right default. As a bit of history, I introduced this TTI hook back in a310637132 back in August of last year to unblock enabling LoopVectorizer. At the time, we had a couple known issues: constant materialization, address generation, and a general lack of maturity of small fixed vector codegen. By now, each of these has had significant investment. I can't say any of them are completely fixed, but we're no longer seeing instances of them every place we look. What we're mostly seeing at this point is a long tail of code gen opportunities, many involving build vectors, shuffles, and extract patterns. I have a couple patches up to continue iterating on those issues, but I don't think they need to be blockers for enabling SLP. Differential Revision: https://reviews.llvm.org/D152750	2023-06-14 09:49:58 -07:00
ManuelJBrito	8b56da5e9f	[IR] Change shufflevector undef mask to poison With this patch an undefined mask in a shufflevector will be printed as poison. This change is done to support the new shufflevector semantics for undefined mask elements. Differential Revision: https://reviews.llvm.org/D149210	2023-04-27 14:41:10 +01:00
Luke Lau	f23ea4cbd4	[RISCV] Model select and insertsubvector shuffle kinds Selects get lowered to a vmerge with a mask, and insertsubvectors get lowered to a vslideup. Differential Revision: https://reviews.llvm.org/D146747	2023-03-24 17:30:32 +00:00
Ben Shi	013235a200	[RISCV][NFC] Add tests for SLP vectorization of math functions RISCV has "vfabs.v" and "vfsqrt.v" so math functions abs and sqrt can be SLP vectorized. But others exp/log/sin/asin/sinh/asinh/... can not. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D145562	2023-03-10 07:34:21 +08:00

19 Commits