llvm-project

Author	SHA1	Message	Date
Florian Hahn	040d9c94be	[VPlan] Collect FMFs for in-loop reduction chain in VPlan. (NFC) Replace retrieving FMFs for in-loop reduction via underlying instruction + legal by collecting the flags during reduction chain traversal in VPlan.	2025-11-19 22:11:21 +00:00
Mikhail Gudim	12131d5cd3	[SLPVectorizer] Widen constant strided loads. (#162324 ) Given a set of pointers, check if they can be rearranged as follows (%s is a constant): %b + 0 * %s + 0 %b + 0 * %s + 1 %b + 0 * %s + 2 ... %b + 0 * %s + w %b + 1 * %s + 0 %b + 1 * %s + 1 %b + 1 * %s + 2 ... %b + 1 * %s + w ... If the pointers can be rearanged in the above pattern, it means that the memory can be accessed with a strided loads of width `w` and stride `%s`.	2025-11-19 15:11:09 -05:00
Rahul Joshi	4703195c8d	[NFC][LLVM] Namespace cleanup in SLPVectorizer (#168623 ) - Remove file local functions out of `llvm` or anonymous namespace and make them static. - Use namespace qualifier to define `BoUpSLP` class and several template specializations.	2025-11-19 07:34:09 -08:00
Luke Lau	5da0445420	[LV] Consolidate shouldOptimizeForSize and remove unused BFI/PSI. NFC (#168697 ) #158690 plans on passing BFI as a lazy lambda to avoid computing BlockFrequencyInfo when not needed. In preparation for that, this PR removes BFI and PSI from some constructors that aren't used. It also consolidates the two calls to llvm::shouldOptimizeForSize so that the result is computed once and passed where needed. This also renames OptForSize in LoopVectorizationLegality to clarify that it's to prevent runtime SCEV checks, see https://reviews.llvm.org/D68082	2025-11-19 21:29:26 +08:00
Florian Hahn	7b94dd336e	[VPLan] Reduce duplication in VPHeaderPHIRecipe::classof. (NFCI) Implement VPHeaderPHIRecipe::classof(const VPValue *V) in terms of the variant taking VPRecipeBase. Reduces some duplication, split off from https://github.com/llvm/llvm-project/pull/141431.	2025-11-19 12:46:53 +00:00
Florian Hahn	0730913529	[VPlan] Print debug info for all recipes. (#168454 ) Use the recently refactored VPRecipeBase::print to print debug location for all recipes. PR: https://github.com/llvm/llvm-project/pull/168454	2025-11-19 10:10:08 +00:00
Hassnaa Hamdi	f7f41350b4	[LV]: Skip Epilogue scalable VF greater than RemainingIterations. (#156724 ) Consider skipping epilogue scalable VF when they are greater than RemainingIterations same as fixed VF. And skip scalable RemainingIterations from that comparison because SCEV ATM can't evaluate non-canonical vscale-based expressions.	2025-11-19 05:11:17 +00:00
Shih-Po Hung	961940e1a7	[TTI] Use MemIntrinsicCostAttributes for getMaskedMemoryOpCost (#168029 ) - Split from #165532. This is a step toward a unified interface for masked/gather-scatter/strided/expand-compress cost modeling. - Replace the ad-hoc parameter list with a single attributes object. API change: ``` - InstructionCost getMaskedMemoryOpCost(Opcode, Src, Alignment, - AddressSpace, CostKind); + InstructionCost getMaskedMemoryOpCost(MemIntrinsicCostAttributes, + CostKind); ``` Notes: - NFCI intended: callers populate MemIntrinsicCostAttributes with the same information as before. - Follow-up: migrate gather/scatter, strided, and expand/compress cost queries to the same attributes-based entry point.	2025-11-19 09:51:12 +08:00
Florian Hahn	1e3ea03293	[VPlan] VPIRFlags kind for FCmp with predicate + fast-math flags (NFCI). FCmp instructions have both a predicate and fast-math flags. Introduce a new FCmp kind, that combines both to model this correctly in the current system. This should be NFC modulo VPlan printing which now includes the correct fast-math flags.	2025-11-18 22:09:53 +00:00
Ramkumar Ramachandra	507f236f5e	[VPlan] Fix OpType-mismatch in getFlagsFromIndDesc (#168560 ) Follow up on a cse OpType-mismatch crash reported due to ef023cae388d (Reland [VPlan] Expand WidenInt inductions with nuw/nsw), setting the OpType correctly when returning from getFlagsFromIndDesc.	2025-11-18 20:41:57 +00:00
Florian Hahn	2befda2225	[VPlan] Populate and use VPIRFlags from initial VPInstruction. (#168450 ) Update VPlan to populate VPIRFlags during VPInstruction construction and use it when creating widened recipes, instead of constructing VPIRFlags from the underlying IR instruction each time. The VPRecipeWithIRFlags constructor taking an underlying instruction and setting the flags based on it has been removed. This centralizes initial VPIRFlags creation and ensures flags are consistently available throughout VPlan transformations and makes sure we don't accidentally re-add flags from the underlying instruction that already got dropped during transformations. Follow-up to https://github.com/llvm/llvm-project/pull/167253, which did the same for VPIRMetadata. Should be NFC w.r.t. to the generated IR. PR: https://github.com/llvm/llvm-project/pull/168450	2025-11-18 15:15:14 +00:00
Florian Hahn	2432465d99	[VPlan] Support isa/dyn_cast from VPRecipeBase to VPIRMetadata (NFC). (#166245 ) Implement CastInfo from VPRecipeBase to VPIRMetadata to support isa/dyn_Cast. This is similar to CastInfoVPPhiAccessors, supporting dyn_cast by down-casting to the concrete recipe types inheriting from VPIRMetadata. Can be used for more generalized VPIRMetadata printing following https://github.com/llvm/llvm-project/pull/165825. PR: https://github.com/llvm/llvm-project/pull/166245	2025-11-18 11:31:11 +00:00
Florian Hahn	7c34848ae1	[VPlan] Hoist loads with invariant addresses using noalias metadata. (#166247 ) This patch implements a transform to hoists single-scalar replicated loads with invariant addresses out of the vector loop to the preheader when scoped noalias metadata proves they cannot alias with any stores in the loop. This enables hosting of loads we can prove do not alias any stores in the loop due to memory runtime checks added during vectorization. PR: https://github.com/llvm/llvm-project/pull/166247	2025-11-18 09:35:48 +00:00
Michael Bedy	a61889580e	[SLP] Invariant loads cannot have a memory dependency on stores. (#167929 )	2025-11-18 09:35:29 +01:00
Florian Hahn	3cba379e3d	[VPlan] Populate and use VPIRMetadata from VPInstructions (NFC) (#167253 ) Update VPlan to populate VPIRMetadata during VPInstruction construction and use it when creating widened recipes, instead of constructing VPIRMetadata from the underlying IR instruction each time. This centralizes VPIRMetadata in VPInstructions and ensures metadata is consistently available throughout VPlan transformations. PR: https://github.com/llvm/llvm-project/pull/167253	2025-11-17 21:28:49 +00:00
Florian Hahn	321b9d190b	[VPlan] Replace VPIRMetadata::addMetadata with setMetadata. (NFC) Replace addMetadata with setMetadata, which sets metadata, updating existing entries or adding a new entry otherwise. This isn't strictly needed at the moment, but will be needed for follow-up patches.	2025-11-17 20:55:18 +00:00
Ramkumar Ramachandra	ef023cae38	Reland [VPlan] Expand WidenInt inductions with nuw/nsw (#168354 ) Changes: The previous patch had to be reverted to a mismatching-OpType assert in cse. The reduced-test has now been added corresponding to a RVV pointer-induction, and the pointer-induction case has been updated to use createOverflowingBinaryOp. While at it, record VPIRFlags in VPWidenInductionRecipe.	2025-11-17 13:44:25 +00:00
Florian Hahn	7e730da128	[VPlan] Add printRecipe, prepare printing metadata in ::print (NFC) (#166244 ) Add a new pinrRecipe which handles printing the recipe without common info like debug info or metadata. Prepares to print them once, in ::print(), after/in combination with https://github.com/llvm/llvm-project/pull/165825. PR: https://github.com/llvm/llvm-project/pull/166244	2025-11-17 12:01:40 +00:00
Luke Lau	4d4a60cde0	[VPlan] Fix LastActiveLane assertion on scalar VF (#167897 ) For a scalar only VPlan with tail folding, if it has a phi live out then legalizeAndOptimizeInductions will scalarize the widened canonical IV feeding into the header mask: <x1> vector loop: { vector.body: EMIT vp<%4> = CANONICAL-INDUCTION ir<0>, vp<%index.next> vp<%5> = SCALAR-STEPS vp<%4>, ir<1>, vp<%0> EMIT vp<%6> = icmp ule vp<%5>, vp<%3> EMIT vp<%index.next> = add nuw vp<%4>, vp<%1> EMIT branch-on-count vp<%index.next>, vp<%2> No successors } Successor(s): middle.block middle.block: EMIT vp<%8> = last-active-lane vp<%6> EMIT vp<%9> = extract-lane vp<%8>, vp<%5> Successor(s): ir-bb<exit> The verifier complains about this but this should still generate the correct last active lane, so this fixes the assert by handling this case in isHeaderMask. There is a similar pattern already there for ActiveLaneMask, which also expects a VPScalarIVSteps recipe. Fixes #167813	2025-11-17 11:03:38 +00:00
Ramkumar Ramachandra	54fdf67bdb	[VPlan] Mark getPredicatedMask static (NFC) (#168067 )	2025-11-17 08:22:00 +00:00
Ramkumar Ramachandra	daa30ae263	[VPlan] Improve code in RemoveMask_match (NFC) (#168065 )	2025-11-17 08:21:34 +00:00
Florian Hahn	67c8e38b8b	[VPlan] Delegate to other VPInstruction constructors. (NFCI) Update VPInstruction constructor to delegate to constructor with more comprehensive checking and validation. This required updating some unit tests, to make sure the constructed VPInstructions are valid.	2025-11-16 22:21:00 +00:00
Alexey Bataev	306b5a3d64	[SLP]Do not consider split nodes, when checking parent PHI-based nodes The compiler should not consider split vectorize nodes, when checking for non-schedulable PHI-based parent nodes. Only pure PHI nodes must be considered, they only can be considered as explicit users, split nodes are not. Fixes #168268	2025-11-16 12:39:58 -08:00
Florian Hahn	e009de26b6	[LV] Use VPlan pattern matching in adjustRecipesForReductions (NFC) Replace the assert checking if CurrentLinkI is a CmpInst with a pattern matching check in the if condition. This uses VPlan-level pattern matching instead of inspecting the underlying instruction type.	2025-11-15 21:45:40 +00:00
Florian Hahn	67f61df22b	[VPlan] Always set trip count when creating plan for unit tests (NFC). Simplifies some tests which no do not need to pass TC, and future changes will require to always have a trip count available.	2025-11-15 16:42:28 +00:00
Kazu Hirata	3a7876d789	[llvm] Delete pointers without null checks (NFC) (#168183 ) Identified with readability-delete-null-pointer.	2025-11-15 08:06:08 -08:00
Florian Hahn	820daa5c1e	[VPlan] Support VPWidenIntOrFpInduction in getSCEVExprForVPValue. (NFCI) Construct SCEVs for VPWidenIntOrFpInductionRecipe analogous to VPCanonicalInductionPHIRecipe: create an AddRec with start + step from the recipe. Currently the only impact should be computing more costs of replicating stores directly in VPlan.	2025-11-15 13:35:11 +00:00
Ramkumar Ramachandra	85db92884c	[VPlan] Strip outdated comment in optimizeForVFAndUF (NFC) (#168068 )	2025-11-15 10:09:49 +00:00
Alexey Bataev	326d4e9033	[SLP]Check if the copyable element is a sub instruciton with abs in isCommutable Need to check if the non-copyable element is an instruction before actually trying to check its NSW attribute.	2025-11-14 16:09:50 -08:00
Alexey Bataev	e8cc0d2207	Revert "[SLP]Check if the copyable element is a sub instruciton with abs in isCommutable" This reverts commit ddf5bb0a2e2d2dd77bce66173387d62ab7174d9f to fix buildbots https://lab.llvm.org/buildbot/#/builders/11/builds/28083.	2025-11-14 15:22:55 -08:00
Alexey Bataev	ddf5bb0a2e	[SLP]Check if the copyable element is a sub instruciton with abs in isCommutable Need to check if the non-copyable element is an instruction before actually trying to check its NSW attribute.	2025-11-14 14:53:42 -08:00
Gang Chen	a407d02752	Revert "[Transform][LoadStoreVectorizer] allow redundant in Chain (#1… (#168105 ) …63019)" This reverts commit 92e5608ffa6ff39ac3707f29418cc9482471f5d9.	2025-11-14 11:49:09 -08:00
Alexey Bataev	0a5be0f997	[SLP]Enable Sub as a base instruction in copyables Patch adds support for sub instructions as main instruction in copyables elements. Also, adds a check if the base instruction is not profitable for the selection if at least one instruction with the main opcode is used as an immediate operand. Reviewers: RKSimon, hiraditya Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/163231	2025-11-14 12:30:38 -05:00
Alex Bradbury	f2336d4c7e	Revert "[VPlan] Expand WidenInt inductions with nuw/nsw" (#168080 ) Reverts llvm/llvm-project#163538 This is causing build failures on the two-stage RVV buildbots. e.g. https://lab.llvm.org/buildbot/#/builders/214/builds/1363. I've shared a reproducer and more information at https://github.com/llvm/llvm-project/pull/163538#issuecomment-3533482822 This reverts commit 355e0f94af5adabe90ac57110ce1b47596afd4cd.	2025-11-14 16:11:48 +00:00
Ramkumar Ramachandra	355e0f94af	[VPlan] Expand WidenInt inductions with nuw/nsw (#163538 ) While at it, record VPIRFlags in VPWidenInductionRecipe.	2025-11-14 12:10:55 +00:00
Mel Chen	3277f6caef	[LV] Explicitly disable in-loop reductions for AnyOf and FindIV. nfc (#163541 ) Currently, in-loop reductions for AnyOf and FindIV are not supported. They were implicitly blocked. This happened because RecurrenceDescriptor::getReductionOpChain could not detect their recurrence chain. The reason is that RecurrenceDescriptor::getOpcode was set to Instruction::Or, but the recurrence chains of AnyOf and FindIV do not actually contain an Instruction::Or. This patch explicitly disables in-loop reductions for AnyOf and FindIV instead of relying on getReductionOpChain to implicitly prevent them.	2025-11-14 09:14:07 +00:00
Luke Lau	851f8f7984	[VPlan] Disable partial reductions again with EVL tail folding (#167863 ) VPPartialReductionRecipe doesn't yet support an EVL variant, and we guard against this by not calling convertToAbstractRecipes when we're tail folding with EVL. However recently some things got shuffled around which means we may detect some scaled reductions in collectScaledReductions and store them in ScaledReductionMap, where outside of convertToAbstractRecipes we may look them up and start e.g. adding a scale factor to an otherwise regular VPReductionPHI. This fixes it by skipping collectScaledReductions, and fixes #167861	2025-11-14 06:30:12 +00:00
Florian Hahn	4e71530dcb	[VPlan] Add findComputeReductionResult helper. (NFC) Move utility to helper for re-use in follow-up patches.	2025-11-13 23:20:54 +00:00
Florian Hahn	a6edeedbfa	Revert "[LV] Use ExtractLane(LastActiveLane, V) live outs when tail-folding. (#149042 )" This reverts commit 62d1a080e69e3c5e98840e000135afa7c688a77b. This appears to be causing some runtime failures on RISCV https://lab.llvm.org/buildbot/#/builders/210/builds/5221	2025-11-13 22:34:55 +00:00
Gang Chen	92e5608ffa	[Transform][LoadStoreVectorizer] allow redundant in Chain (#163019 ) This can absorb redundant loads when forming vector load. Can be used to fix the situation created by VectorCombine. See: https://discourse.llvm.org/t/what-is-the-purpose-of-vectorizeloadinsert-in-the-vectorcombine-pass/88532	2025-11-13 12:19:29 -08:00
Fabrice de Gans	23f6a8aeaf	Add missing `LLVM_ABI` annotations (#167718 ) This patch updates various LLVM headers to properly add the `LLVM_ABI` and `LLVM_ABI_FOR_TEST` annotations to build LLVM as a DLL on Windows. This effort is tracked in #109483.	2025-11-13 09:49:39 -08:00
Ryan Buchner	a04c6b5512	[LV] Update LoopVectorizationPlanner::emitInvalidCostRemarks to handle reduction plans (#165913 ) The TypeSwitch for extracting the Opcode now handles the `VPReductionRecipe` case. Fixes #165359.	2025-11-13 06:12:40 -10:00
Luke Lau	c0f7d51e8a	[VPlan] Simplify ExplicitVectorLength(%AVL) -> %AVL when AVL <= VF (#167647 ) [`llvm.experimental.get.vector.length`](https://llvm.org/docs/LangRef.html#id2399) has the property that if the AVL (%cnt) is less than or equal to VF (%max_lanes) then the return value is just AVL. This patch uses SCEV to simplify this in optimizeForVFAndUF, and adds `ExplicitVectorLength` to `VPInstruction::opcodeMayReadOrWriteFromMemory` so it gets removed once dead.	2025-11-13 13:17:01 +00:00
Florian Hahn	b6bcfdea40	[VPlan] Get opcode & type from recipe in adjustRecipesForReduction (NFC) Replace direct access to underlying IR instructions with VPlan-level equivalents, i.e. VPTypeAnalysis and pattern matching on the recipe. Removes a few uses of accessing underlying IR.	2025-11-12 22:37:15 +00:00
Florian Hahn	53a65ba6b9	[VPlan] Don't look up recipe for IV step via RecipeBuilder. (NFC) Directly update induction increments with step value created for wide inductions in createWidenInductionRecipes, which does not require looking up via RecipeBuilder.	2025-11-12 22:08:56 +00:00
Ramkumar Ramachandra	9ba738af2c	[VPlan] Fix assert in store-user in narrowToSingleScalars (#167686 ) Follow up on c2d4c7c18b96 ([VPlan] Permit more users in narrowToSingleScalars) to fix an assert related to WidenStore users of the recipe being narrowed in narrowToSingleScalars.	2025-11-12 17:53:24 +00:00
Julian Nagele	8280070a73	[VectorCombine] Try to scalarize vector loads feeding bitcast instructions. (#164682 ) This change aims to convert vector loads to scalar loads, if they are only converted to scalars after anyway. alive2 proof: https://alive2.llvm.org/ce/z/U_rvht	2025-11-12 15:35:03 +00:00
Florian Hahn	62d1a080e6	[LV] Use ExtractLane(LastActiveLane, V) live outs when tail-folding. (#149042 ) Building on top of https://github.com/llvm/llvm-project/pull/148817, introduce a new abstract LastActiveLane opcode that gets lowered to Not(Mask) → FirstActiveLane(NotMask) → Sub(result, 1). When folding the tail, update all extracts for uses outside the loop the extract the value of the last actice lane. See also https://github.com/llvm/llvm-project/issues/148603 PR: https://github.com/llvm/llvm-project/pull/149042	2025-11-12 15:11:00 +00:00
Luke Lau	02c68b3ef7	[VPlan] Plumb scalable register size through narrowInterleaveGroups (#167505 ) On RISC-V narrowInterleaveGroups doesn't kick in because the wrong VectorRegWidth is passed to isConsecutiveInterleaveGroup. narrowInterleaveGroups is always passed the RGK_FixedWidthVector register size, but on RISC-V the RGK_ScalableVector size is twice as large because we want to use LMUL 2. This causes the `GroupSize == VectorRegWidth` check to fail. This fixes it by using the scalable register size whenever the VF is scalable and plumbing it through as a potentially scalable TypeSize. Note that this only makes a difference when tail folding is disabled, as narrowInterleaveGroups can't handle EVL based IVs yet.	2025-11-12 11:14:53 +00:00
Florian Hahn	b9f0dadc10	[VPlan] Merge fcmp uno feeding Or. (#167251 ) Fold or (fcmp uno %A, %A), (fcmp uno %B, %B), ... -> or (fcmp uno %A, %B), ... This pattern is generated to check if any vector lane is NaN, and combining multiple compares is beneficial on architectures that have dedicated instructions. Alive2 Proof: https://alive2.llvm.org/ce/z/vA_aoM Combine suggested as part of #161735 PR: https://github.com/llvm/llvm-project/pull/167251	2025-11-12 10:15:59 +00:00

1 2 3 4 5 ...

6790 Commits