llvm-project

Author	SHA1	Message	Date
Florian Hahn	a861ed411a	[VPlan] Add initial loop-invariant code motion transform. (#107894 ) Add initial transform to move out loop-invariant recipes. This also helps to fix a divergence between legacy and VPlan-based cost model due to legacy using ScalarEvolution::isLoopInvariant in some cases. Fixes https://github.com/llvm/llvm-project/issues/107501. PR: https://github.com/llvm/llvm-project/pull/107894	2024-09-20 11:22:03 +01:00
Florian Hahn	96ba9d372d	[VPlan] Only consider recipes in loop region in planContainsSimp. (NFCI) Limit checks in planContainsAdditionalSimplifications to recipes in the vector loop region. Preparation for https://github.com/llvm/llvm-project/pull/107894.	2024-09-19 20:11:13 +01:00
Jay Foad	e03f427196	[LLVM] Use {} instead of std::nullopt to initialize empty ArrayRef (#109133 ) It is almost always simpler to use {} instead of std::nullopt to initialize an empty ArrayRef. This patch changes all occurrences I could find in LLVM itself. In future the ArrayRef(std::nullopt_t) constructor could be deprecated or removed.	2024-09-19 16:16:38 +01:00
Florian Hahn	256100489d	[VPlan] Rename isDefinedOutside[Vector]Regions -> [Loop] (NFC) Clarify name of helper, split off from https://github.com/llvm/llvm-project/pull/95842/files#r1765556732.	2024-09-19 11:20:31 +01:00
David Sherwood	e762d4dac7	[LoopVectorize] Teach LoopVectorizationLegality about more early exits (#107004 ) This patch is split off from PR #88385 and concerns only the code related to the legality of vectorising early exit loops. It is the first step in adding support for vectorisation of a simple class of loops that typically involves searching for something, i.e. for (int i = 0; i < n; i++) { if (p[i] == val) return i; } return n; or for (int i = 0; i < n; i++) { if (p1[i] != p2[i]) return i; } return n; In this initial commit LoopVectorizationLegality will only consider early exit loops legal for vectorising if they follow these criteria: 1. There are no stores in the loop. 2. The loop must have only one early exit like those shown in the above example. I have referred to such exits as speculative early exits, to distinguish from existing support for early exits where the exit-not-taken count is known exactly at compile time. 3. The early exit block dominates the latch block. 4. The latch block must have an exact exit count. 5. There are no loads after the early exit block. 6. The loop must not contain reductions or recurrences. I don't see anything fundamental blocking vectorisation of such loops, but I just haven't done the work to support them yet. 7. We must be able to prove at compile-time that loops will not contain faulting loads. Tests have been added here: Transforms/LoopVectorize/AArch64/simple_early_exit.ll	2024-09-19 09:41:25 +01:00
Florian Hahn	0d736e296c	[VPlan] Add getSCEVExprForVPValue util, use to get trip count SCEV (NFC) (#94464 ) Add a new getSCEVExprForVPValue utility which can be used to get a SCEV expression for a VPValue. The initial implementation only returns SCEVs for live-in IR values (by constructing a SCEV based on the live-in IR value) and VPExpandSCEVRecipe. This is enough to serve its first use, getting a SCEV for a VPlan's trip count, but will be extended in the future. It also removes createTripCountSCEV, as the new helper can be used to retrieve the SCEV from the VPlan. PR: https://github.com/llvm/llvm-project/pull/94464	2024-09-18 14:41:56 +01:00
David Sherwood	b29c5b66fd	[NFC][LoopVectorize] Dont pass LLVMContext to VPTypeAnalysis constructor (#108540 ) We already pass a Type object into the VPTypeAnalysis constructor, which can be used to obtain the context. While in the same area it also made sense to avoid passing the context into the VPTransformState and VPCostContext constructors.	2024-09-16 09:12:11 +01:00
Florian Hahn	012dbec604	[VPlan] Handle ForceTargetInstructionCost in during precomputeCosts. Make sure ForceTargetInstructionCost is respected in precomputeCosts.	2024-09-15 10:53:43 +01:00
Florian Hahn	cfe3f5fa61	[VPlan] Remove unneeded ExitBB variable after f0c5caa814. Fix buildbot failures due to an unused variable, e.g. https://lab.llvm.org/buildbot/#/builders/186/builds/2329	2024-09-14 21:35:45 +01:00
Florian Hahn	f0c5caa814	[VPlan] Add VPIRInstruction, use for exit block live-outs. (#100735 ) Add a new VPIRInstruction recipe to wrap existing IR instructions not to be modified during execution, except for PHIs. For PHIs, a single VPValue operand is allowed, and it is used to add a new incoming value for the single predecessor VPBB. Except for PHIs, VPIRInstructions cannot have any operands. Depends on https://github.com/llvm/llvm-project/pull/100658. PR: https://github.com/llvm/llvm-project/pull/100735	2024-09-14 21:21:55 +01:00
Ramkumar Ramachandra	75a57edadc	VPlan/Builder: inline VPBuilder::createICmp (NFC) (#105650 ) Inline VPBuilder::createICmp in the header, in line with the other VPBuilder functions.	2024-09-13 20:08:11 +01:00
Florian Hahn	76fd69be74	[VPlan] Simplify VPBuilder insert point when live outs for FORs. Simplifies setting the insert point, addressing a TODO.	2024-09-13 13:21:23 +01:00
David Sherwood	f3029b330a	[NFC][LoopVectorize] Avoid passing ScalarEvolution to VPlanTransforms::optimize (#108380 ) Whilst trying to write some VPlan unit tests I realised that we don't need to pass a ScalarEvolution object into VPlanTransforms::optimize because the only thing we actually need is a LLVMContext.	2024-09-13 12:09:00 +01:00
Florian Hahn	08d294df55	[VPlan] Simplify VPBuilder insert point when adding users in exit block. Simplifies setting the insert point, addressing a TODO.	2024-09-12 22:47:03 +01:00
Florian Hahn	71cb7811bb	[LV] Remove stale completeLoopSkeleton (NFCI). The function was removed a while ago; also remove the stale declaration.	2024-09-12 21:55:43 +01:00
Florian Hahn	ea83e1c05a	[LV] Assign cost to all interleave members when not interleaving. At the moment, the full cost of all interleave group members is assigned to the instruction at the group's insert position, even if the decision was to not form an interleave group. This can lead to inaccurate cost estimates, e.g. if the instruction at the insert position is dead. If the decision is to not vectorize but scalarize or scatter/gather, then the cost will be the total cost for all members. In those cases, assign the cost individually per member, to more closely reflect the choice per instruction. This fixes a divergence between legacy and VPlan-based cost model. Fixes https://github.com/llvm/llvm-project/issues/108098.	2024-09-11 21:04:34 +01:00
Hari Limaye	7858e14547	[LV] Amend check for IV increments in collectUsersInEntryBlock (#108020 ) The check for IV increments in collectUsersInEntryBlock currently triggers for exit-block PHIs which use the IV start value, resulting in us failing to add the input value for the middle block to these PHIs. Fix this by amending the check for IV increments to only include incoming values that are instructions inside the loop. Fixes #108004	2024-09-11 16:43:34 +01:00
Florian Hahn	e3c537ff90	[VPlan] Consider non-header phis in planContainsAdditionalSimp. Update planContainsAdditionalSimplifications to also check phis not in the loop header. This ensures we don't miss cases where VPBlendRecipes (which correspond to such phis) have been simplified. Fixes https://github.com/llvm/llvm-project/issues/107473.	2024-09-10 21:37:14 +01:00
Florian Hahn	a794ee4559	[VPlan] Add VPValue for VF, use it for VPWidenIntOrFpInductionRecipe. (#95305 ) Similar to VFxUF, also add a VF VPValue to VPlan and use it to get the runtime VF in VPWidenIntOrFpInductionRecipe. Code for VF is only generated if there are users of VF, to avoid unnecessary test changes. PR: https://github.com/llvm/llvm-project/pull/95305	2024-09-10 10:41:35 +01:00
Simon Pilgrim	97e6f92d31	Fix GCC Wparentheses warning. NFC.	2024-09-08 13:34:34 +01:00
Ramkumar Ramachandra	a6577791d4	LV: fix style after cursory reading (NFC) (#105830 )	2024-09-06 18:41:56 +01:00
Florian Hahn	cf2ecc7c1c	[LV] Remove over-aggressive assert from 3fe6a064f15c. There are some cases where only the first operand is marked for truncation. In that case, the compare won't be truncated which would incorrectly trigger the assertion. It also shows that the check pre 3fe6a064f15c also considered compares truncated that cannot be truncated.	2024-09-05 18:20:16 +01:00
Florian Hahn	3fe6a064f1	[LV] Check if compare is truncated directly in getInstructionCost. The current check for truncated compares in getInstructionCost misses cases where either the first or both operands are constants. Check directly if the compare is marked for truncation. In that case, the minimum bitwidth is that of the operands. The patch also adds asserts to ensure that. This fixes a divergence between legacy and VPlan-based cost model, where the legacy cost model incorrectly estimated the cost of compares with truncated operands. Fixes https://github.com/llvm/llvm-project/issues/107171.	2024-09-04 20:50:06 +01:00
Florian Hahn	3bd161e98d	[LV] Honor forced scalars in setVectorizedCallDecision. Similarly to dd94537b4, setVectorizedCallDecision also did not consider ForcedScalars. This led to VPlans not reflecting the decision by the legacy cost model (cost computation would use scalar cost, VPlan would have VPWidenCallRecipe). To fix this, check if the call has been forced to scalar in setVectorizedCallDecision. Note that this requires moving setVectorizedCallDecision after collectLoopUniforms (which sets ForcedScalars). collectLoopUniforms does not depend on call decisions and can safely be moved. Fixes https://github.com/llvm/llvm-project/issues/107051.	2024-09-03 21:06:32 +01:00
Florian Hahn	dd94537b40	[LV] Update call widening decision when scalarzing calls. collectInstsToScalarize may decide to scalarize a call. If so, we have to update the widening decision for the call, otherwise the call won't be scalarized as expected during VPlan construction. This issue was uncovered by f82543d509.	2024-09-03 14:12:41 +01:00
Florian Hahn	954ed05c10	[VPlan] Simplify MUL operands at recipe construction. This moves the logic to create simplified operands using SCEV to MUL recipe creation. This is needed to match the behavior of the legacy cost model. TODOs are to extend to other opcodes and move to a transform. Note that this also restricts the number of SCEV simplifications we apply to more precisely match the cases handled by the legacy cost model. Fixes https://github.com/llvm/llvm-project/issues/107015.	2024-09-02 21:25:31 +01:00
Florian Hahn	654bb4e9f2	[LV] Don't consider branches leaving loop in collectValuesToIgnore. Branches exiting the loop will remain regardless, so don't consider them in collectValuesToIgnore. This fixes another divergence between legacy and VPlan-based cost model. Fixes https://github.com/llvm/llvm-project/issues/106780.	2024-09-01 20:35:36 +01:00
Florian Hahn	f0e34f3818	[VPlan] Don't skip optimizable truncs in planContainsAdditionalSimps. An optimizable cast can also be removed by VPlan simplifications. Remove the restriction from planContainsAdditionalSimplifications, as this causes it to miss relevant simplifications, triggering false positives for the cost decision verification. Also adds debug output for printing additional cost-precomputations. Fixes https://github.com/llvm/llvm-project/issues/106641.	2024-08-30 11:29:30 +01:00
Florian Hahn	c4906588ce	[VPlan] Use skipCostComputation when pre-computing induction costs. This ensures we skip any instructions identified to be ignored by the legacy cost model as well. Fixes a divergence between legacy and VPlan-based cost model. Fixes https://github.com/llvm/llvm-project/issues/106417.	2024-08-29 21:20:00 +01:00
Florian Hahn	0a272d3a17	[LV] Use SCEV to analyze second operand for cost query. Improve operand analysis using SCEV for cost purposes. This fixes a divergence between legacy and VPlan-based cost-modeling after 533e6bbd0d34. Fixes https://github.com/llvm/llvm-project/issues/106248.	2024-08-29 12:08:27 +01:00
Michael Maitland	18c79ca360	[LV][NFC] Remove unnecessary space in comment	2024-08-28 14:23:44 -07:00
Florian Hahn	4b84288f00	[VPlan] Pass live-ins used as exit values straight to live-out. Live-ins that are used as exit values don't need to be extracted, they can be passed through directly. This fixes a crash when trying to extract from a live-in. Fixes https://github.com/llvm/llvm-project/issues/106257.	2024-08-28 19:12:05 +01:00
Florian Hahn	16910a21ee	[VPlan] Move logic to create interleave groups to VPlanTransforms (NFC). This is a step towards further breaking up the rather large tryToBuildVPlanWithVPRecipes. It moves the logic to create interleave groups to VPlanTransforms.cpp, where similar replacements for other recipes are defined as well (e.g. EVL-based ones).	2024-08-28 15:56:09 +01:00
Ramkumar Ramachandra	71ede8d831	VPlan: factor out VPlanUtils into its own file (NFC) (#105857 )	2024-08-28 13:54:41 +01:00
Florian Hahn	d853b3f4b6	[VPlan] Remove unneeded Plan arg from getVPValueOrAddLiveIn (NFC). The helper can simply use VPRecipeBuilder::Plan.	2024-08-25 20:00:30 +01:00
Florian Hahn	d66cbecb33	[VPlan] Use getVPValueOrAddLiveIn in mapToVPValues (NFC). Use existing helper.	2024-08-25 19:54:17 +01:00
Florian Hahn	40975da950	[VPlan] Wrap planContainsAdditionalSimplifications in NDEBUG (NFC) Only used for an assertion.	2024-08-24 13:22:54 +01:00
Florian Hahn	885c4365c1	[VPlan] Skip branches marked as dead in cost precomputation. Don't consider the cost of branches marked to be skipped in VPlan cost pre-computation. Those aren't included in the legacy cost, so they should not be included in the VPlan cost.	2024-08-23 15:58:29 +01:00
Kazu Hirata	4e6ff75efa	[Vectorize] Fix a warning This patch fixes: llvm/lib/Transforms/Vectorize/LoopVectorize.cpp:7245:1: error: unused function 'planContainsAdditionalSimplifications' [-Werror,-Wunused-function]	2024-08-22 15:35:05 -07:00
Florian Hahn	768dba71fe	[VPlan] Fix typo in cb4efe1d.	2024-08-22 21:42:16 +01:00
Florian Hahn	cb4efe1d07	[VPlan] Don't trigger VF assertion if VPlan has extra simplifications. There are cases where VPlans contain some simplifications that are very hard to accurately account for up-front in the legacy cost model. Those cases are caused by un-simplified inputs, which trigger the assert ensuring both the legacy and VPlan-based cost model agree on the VF. To avoid false positives due to missed simplifications in general, only trigger the assert if the chosen VPlan doesn't contain any additional simplifications. Fixes https://github.com/llvm/llvm-project/issues/104714. Fixes https://github.com/llvm/llvm-project/issues/105713.	2024-08-22 21:38:06 +01:00
Florian Hahn	e454d31037	[VPlan] Factor out precomputing costs from LVP::cost (NFC). Move the logic for pre-computing costs of certain instructions to a separate helper function, allowing re-use in a follow-up patch.	2024-08-22 20:40:38 +01:00
Florian Hahn	1fa6c99a09	[VPlan] Move EVL memory recipes to VPlanRecipes.cpp (NFC) Move VPWiden[Load|Store]EVLRecipe::execute to VPlanRecipes.cpp in line with other ::execute implementations that don't depend on anything defined in LoopVectorization.cpp	2024-08-22 18:30:49 +01:00
Paul Walker	4f075086e7	[LLVM][VPlan] Keep all VPBlend masks until VPlan transformation. (#104015 ) It's not possible to pick the best mask to remove when optimising VPBlend at construction and so this patch refactors the code to move the decision (and thus transformation) to VPlanTransforms. NOTE: This patch does not change the decision of which mask to pick. That will be done in a following PR to keep this patch as NFC from an output point of view.	2024-08-21 12:51:40 +01:00
Florian Hahn	4e04286d61	[VPlan] Only use selectVectorizationFactor for cross-check (NFCI). (#103033 ) Use getBestVF to select VF up-front and only use selectVectorizationFactor to get the legacy VF to check that the vectorization decision matches the VPlan-based cost model. PR: https://github.com/llvm/llvm-project/pull/103033	2024-08-21 13:09:01 +02:00
Florian Hahn	99741ac285	[VPlan] Introduce explicit ExtractFromEnd recipes for live-outs. (#100658 ) Introduce explicit ExtractFromEnd recipes to extract the final values for live-outs instead of implicitly extracting in VPLiveOut::fixPhi. This is a follow-up to the recent changes of modeling extracts for recurrences and consolidates live-out extract creation for fixed-order recurrences at a single place: addLiveOutsForFirstOrderRecurrences. It is also in preparation of replacing VPLiveOut with VPIRInstructions wrapping the original scalar phis. PR: https://github.com/llvm/llvm-project/pull/100658	2024-08-21 10:06:44 +02:00
Florian Hahn	7452014c95	[LV] Simplify !UserVF.isZero() -> UserVF (NFC). Address post-commit comment for b8dccb7d56c to simplify code.	2024-08-20 09:40:35 +01:00
Florian Hahn	f2fcd9cb97	[VPlan] Rename getBestPlanFor -> getPlanFor (NFC). As suggested in https://github.com/llvm/llvm-project/pull/103033, more accurately rename to getPlanFor, as it simply returns the VPlan for VF, relying on the fact that there is a single VPlan for each VF at the moment.	2024-08-19 13:05:19 +01:00
Florian Hahn	b8dccb7d56	[VPlan] Emit note when UserVF > MaxUserVF (NFCI). As suggested in https://github.com/llvm/llvm-project/pull/103033, add a remark when the UserVF is ignored due to it being larger than MaxUserVF. Only changes behavior of diagnostic/debug output.	2024-08-19 12:40:20 +01:00
Florian Hahn	740f055451	[VPlan] Rename getBestVF -> computeBestVF (NFC). As suggested in https://github.com/llvm/llvm-project/pull/103033, more accurately rename to computeBestVF, as it now does not simply return the best VF, but directly computes it.	2024-08-19 10:44:50 +01:00

2218 Commits