llvm-project

Author	SHA1	Message	Date
Florian Hahn	f8fe67c998	[VPlan] Expose cloneFrom and mergeBlocksIntoPredecessors. (NFC) (#188818 ) Move cloneFrom from a file-static function in VPlan.cpp to a public static method VPBlockUtils::cloneFrom, and move mergeBlocksIntoPredecessors from a file-static function in VPlanTransforms.cpp to a public static method VPlanTransforms::mergeBlocksIntoPredecessors. This is in preparation for dissolving replicate regions which needs both utilities. Split off from approved https://github.com/llvm/llvm-project/pull/170212. PR: https://github.com/llvm/llvm-project/pull/188818	2026-03-26 19:07:33 +00:00
Florian Hahn	0ea2e5813f	[VPlan] Account for early-exit dispatch blocks when updating LI. (#185618 ) Now that we can vectorize loops with multiple early exits, we emit dispatch blocks after the middle block to go to a specific exit or continue in the dispatch chain. With that, we need to be a bit more careful when it comes to picking the loop the dispatch block belongs to. The dispatch block will belong to the innermost loop of all exit blocks reachable from the current block. Fixes https://github.com/llvm/llvm-project/issues/185362 PR: https://github.com/llvm/llvm-project/pull/185618	2026-03-18 18:37:34 +00:00
Florian Hahn	3e3d2b6fbc	[VPlan] Add hasPredecessors and hasSuccessors to VPBlockBase (NFC). Add/move helpers to VPBlockBase, and use in a few more places. Split off from https://github.com/llvm/llvm-project/pull/156262 as suggested.	2026-03-14 20:07:42 +00:00
Alexis Engelke	efcd3b6108	[IPO][InstCombine][Vectorize][NFCI] Drop uses of BranchInst (#186596 ) Refactor remaining parts of Transforms apart from Scalar and Utils.	2026-03-14 17:49:00 +00:00
Florian Hahn	579aca8755	[VPlan] Prevent uses of materialized VPSymbolicValues. (NFC) (#182318 ) After VPSymbolicValues (like VF and VFxUF) are materialized via replaceAllUsesWith, they should not be accessed again. This patch: 1. Tracks materialization state in VPSymbolicValue. 2. Asserts if the materialized VPValue is used again. Currently it adds asserts to various member functions, preventing calling them on materialized symbolic values. Note that this still allows some uses (e.g. comparing VPSymbolicValue references or pointers), but this should be relatively harmless given that it is impossible to (re-)add any users. If we want to further tighten the checks, we could add asserts to the accessors or override operator&, but that will require more changes and not add much extra guards I think. Depends on https://github.com/llvm/llvm-project/pull/182146 to fix a current access violation. PR: https://github.com/llvm/llvm-project/pull/182318	2026-03-13 14:39:46 +00:00
Alexis Engelke	4fd826d1f9	[IR] Split Br into UncondBr and CondBr (#184027 ) BranchInst currently represents both unconditional and conditional branches. However, these are quite different operations that are often handled separately. Therefore, split them into separate opcodes and classes to allow distinguishing these operations in the type system. Additionally, this also slightly improves compile-time performance.	2026-03-11 12:31:10 +00:00
Luke Lau	825129378e	[VPlan] Move tail folding out of VPlanPredicator. NFC (#176143 ) Currently the logic for introducing a header mask and predicating the vector loop region is done inside introduceMasksAndLinearize. This splits the tail folding part out into an individual VPlan transform so that VPlanPredicator.cpp doesn't need to worry about tail folding, which seemed to be a temporary measure according to a comment in VPlanTransforms.h. To perform tail folding independently, this splits the "body" of the vector loop region between the phis in the header and the branch + iv increment in the latch: Before: ``` +-------------------------------------------+ \|%iv = ... \| \|... \| \|%iv.next = add %iv, vfxuf \| \|branch-on-count %iv.next, vector-trip-count\| +-------------------------------------------+ ``` After: ``` +-------------------------------------------+ \|%iv = ... \| \|%wide.iv = widen-canonical-iv ... \| \|%header-mask = icmp ule %wide.iv, BTC \|---+ \|branch-on-cond %header-mask \| \| +-------------------------------------------+ \| \| \| v \| +-------------------------------------------+ \| \|... \| \| +-------------------------------------------+ \| \| \| v \| +-------------------------------------------+ \| \|%iv.next = add %iv, vfxuf \|<--+ \|branch-on-count %iv.next, vector-trip-count\| +-------------------------------------------+ ``` Phis are then inserted in the latch for any value in the loop body that have outside uses, with poison as their incoming value from the header edge. The motivation for this is to allow us to share the same "predicate all successor blocks" type of predication we do for tail folding, but for early-exit loops in #172454. This may also allow us to directly emit an EVL based header mask, instead of having to match + transform the existing header mask in addExplicitVectorLength. This also allows us to eventually handle recurrences in the same transform, avoiding the need to special case tail folding in addReductionResultComputation.	2026-03-05 08:17:37 +00:00
Florian Hahn	7b26069828	[VPlan] Pass ForceTargetInstructionCost insted of NumOccurences. Update code incorrectly passing ForceTargetInstructionCost.getNumOccurrences(). Fixes a cost-divergence with legacy cost model.	2026-03-01 15:41:40 +00:00
Florian Hahn	c4e8adf7bb	[VPlan] Skip gather/scatters in useEmulatedMaskMemRefHack. The legacy cost model skips gather/scatters when determining the predicated stores in useEmulatedMaskMemRefHack. Match the behavior in the VPlan-based implementation. This fixes a cost divergence in the attached test.	2026-02-16 21:01:16 +00:00
Florian Hahn	f3a816598d	[VPlan] Add VPSymbolicValue for UF. (NFC) Add a symbolic unroll factor (UF) to VPlan similar to VF & VFxUF that gets replaced with the concrete UF during plan execution, similar to how VF is used for the vectorization factor. This is a preparatory change that allows transforms to use the symbolic UF before the concrete UF is determined. Note that the old getUF that returns the concrete UF after unrolling has been renamed to getConcreteUF. Split off from the re-commit of 8d29d093096 (https://github.com/llvm/llvm-project/pull/149706) as suggested.	2026-02-15 15:24:35 +00:00
Florian Hahn	b3dcf485d2	[VPlan] Compute NumPredStores for VPReplicateRecipe costs in VPlan. Compute the number of predicated stores directly in VPlan instead of using CM.useEmulatedMaskMemRefHack(), which will only account for the number of predicated stores for the last VF the legacy cost model considered. Fixes https://github.com/llvm/llvm-project/issues/181183	2026-02-13 21:16:53 +00:00
Andrei Elovikov	d8621d665d	Reapply "[VPlan] Add hidden `-vplan-print-after-all` option" (#178547 ) Re-commit of https://github.com/llvm/llvm-project/pull/175839 after fixing build without `LLVM_ENABLE_DUMP`. This consists of the following changes: * Merge several overloads of `VPlanTransforms::runPass` into a single function to avoid code duplication. * Add helper macro `RUN_VPLAN_PASS` to capture the transformation name and pass it to the helper above for printing. * Add new `-vplan-print-after-all` option (somewhat similar to existing `-vplan-verify-each`). * Add two empty passes `printAfterInitialConstruction`/`printFinalVPlan` so that initial/final VPlans would be supported in `-vplan-print-after-all` This follows the original future plans in https://github.com/llvm/llvm-project/pull/123640.	2026-01-30 19:55:09 +00:00
Florian Hahn	eabcdb572b	Revert "[VPlan] Add hidden `-vplan-print-after-all` option (#175839 )" (#178544 ) This reverts commit 97e1df149de213b760aae4060ee9e25dc9908125. It looks like the commit caused some build bot failures. Revert back to green so the failures can be investigated. https://lab.llvm.org/buildbot/#/builders/159/builds/39803 https://lab.llvm.org/buildbot/#/builders/2/builds/43204	2026-01-28 23:49:24 +00:00
Andrei Elovikov	97e1df149d	[VPlan] Add hidden `-vplan-print-after-all` option (#175839 ) This consists of the following changes: * Merge several overloads of `VPlanTransforms::runPass` into a single function to avoid code duplication. * Add helper macro `RUN_VPLAN_PASS` to capture the transformation name and pass it to the helper above for printing. * Add new `-vplan-print-after-all` option (somewhat similar to existing `-vplan-verify-each`). * Add two empty passes `printAfterInitialConstruction`/`printFinalVPlan` so that initial/final VPlans would be supported in `-vplan-print-after-all` This follows the original future plans in https://github.com/llvm/llvm-project/pull/123640.	2026-01-28 22:25:54 +00:00
Florian Hahn	b794baf8e7	[TTI] Add VectorInstrContext for context-aware insert/extract costs. (#175982 ) This commit introduces the VectorInstrContext (VIC) infrastructure to improve cost estimates for insert/extracts based on the context instruction in which the insert/extract is used. This is similar to CastContextHint, and allows providing context on how the insert/extract is going to be used before creating IR. This is useful in the LoopVectorizer, where costs need to estimated before creating IR. The new hint currently only replaces an existing check in AArch64, but new uses will be introduced in follow-ups, including https://github.com/llvm/llvm-project/pull/177201. PR: https://github.com/llvm/llvm-project/pull/175982	2026-01-27 16:30:29 +00:00
Florian Hahn	a871b707b7	Reapply "[VPlan] Move VDef subclass ID to VPRecipeBase (NFC). (#174282 )" Move SubclassID to VPRecipeBase, and store VPRecipeBase directly in VPRecipeValue, instead of VPDef. This allows for some additional simplifications and VPDef now just holds various helpers to deal with removing and adding VPValues. This reverts commit 16395da0ff577750571b99fe28281ce6fb6a3ae8. PR: https://github.com/llvm/llvm-project/pull/174282	2026-01-24 13:22:48 +00:00
Florian Hahn	16395da0ff	Revert "[VPlan] Fold VPDef into VPRecipeBase (NFC). (#174282 )" This reverts commit f3ae334f4b7a8cf4fe0eb6ee7b2f2ef0879f522d. Committed with out-of-date message, revert to reland with updated message.	2026-01-24 13:16:45 +00:00
Florian Hahn	f3ae334f4b	[VPlan] Fold VPDef into VPRecipeBase (NFC). (#174282 ) A separate VDef is not needed any longer, fold i into VPRecipeBase to simplify code and class hierarchy. Depends on https://github.com/llvm/llvm-project/pull/172758. PR: https://github.com/llvm/llvm-project/pull/174282	2026-01-24 13:16:12 +00:00
Mircea Trofin	b39568d782	[LV] capture branch weights for constant trip counts (#175096 ) When a vectorized loop has constant trip, it's important to update the profile information accordingly. Hotness analysis will only look at profile info. For example, in the `tripcount.ll` test, without producing the profile info, in the `const_trip_over_profile` function, the BFI of the `vector.body` would be 32 (this is the expected value when synthetic branch weights are used, in loops). The real value is 250. The `for.body`value was _very_ incorrect before, too (and detrimentally so, as it would have appeared as "very hot" when it wasn't): The table below was obtained by printing BFI in the RUN: command, i.e. `build/bin/opt < llvm/test/Transforms/LoopVectorize/tripcount.ll -passes="loop-vectorize,print<block-freq>" -loop-vectorize-with-block-frequency -S -o /dev/null`. Showing only the `float` value, i.e. the BFI relative to the function entry BB. ``` Printing analysis results of BFI for function 'const_trip_over_profile': block-frequency-info: const_trip_over_profile ``` \| Block \| Before \| After \| \| ----- \| ------ \| ----- \| \| `entry` \| float = 1.0 \| float = 1.0 \| \| `vector.ph` \| float = 1.0 \| float = 1.0 \| \| `vector.body` \| float = 32.0 \| float = 250.0 \| \| `middle.block` \| float = 1.0 \| float = 1.0 \| \| `scalar.ph` \| float = 1.0 \| float = 1.0 \| \| `for.body` \| float = 2147483647.8 \| float = 1.0 \| \| `for.end` \| float = 1.0 \| float = 1.0 \|	2026-01-23 14:31:05 -08:00
Florian Hahn	31b93d6e38	[VPlan] Add specialized VPValue subclasses for different types (NFC) (#172758 ) This patch adds VPValue sub-classes for the different cases we currently have: * VPIRValue: A live-in VPValue that wraps an underlying IR value * VPSymbolicValue: A symbolic VPValue not tied to an underlying value, e.g. the vector trip count or VF VPValues * VPRecipeValue: A VPValue defined by a VPDef/VPRecipeBase. This has multiple benefits: * clearer constructors for each kind of VPValue * limited scope: for example allows moving VPDef member to VPRecipeValue, reducing size of other VPValues. * stricter type checking for member variables (e.g. using VPLiveIn in the Value -> live-in map in VPlan, or using VPSymbolicValue for symbolic member VPValues) There probably are additional opportunities for cleanups as follow-ups. PR: https://github.com/llvm/llvm-project/pull/172758	2026-01-07 20:29:05 +00:00
Florian Hahn	524b1788c4	[VPlan] Add BranchOnTwoConds, use for early exit plans. (#172750 ) This PR introduces a new BranchOnTwoConds VPInstruction, that takes 2 boolean operands and must be placed in a block with 3 successors. If condition I is true, branches to successor I, otherwise falls through to check the next condition. If both conditions are false, branch to the third successor. This new branch recipe is used for early-exit loops, to simplify the representation in VPlan initially, by avoid the need for splitting the middle block early on, in a way that preserves the single-exit block property of regions. All exits still go through the latch block, but they can go to more than 2 successors. This idea was part of one of the original proposals for how to model early exits in VPlan, but at that point in time, there was no good way to handle this during code-gen, and we went with the early split-middle block approach initially. Now that we dissolve regions before ::execute, the new recipe can be lowered nicely after regions have been removed, to a set of VPBBs and BranchOnCond recipes. The initial lowering preserves the original structure with the split middle blocks. Follow-ups will improve the lowering to avoid this splitting, providing performance gains. PR: https://github.com/llvm/llvm-project/pull/172750	2025-12-29 19:39:38 +00:00
Florian Hahn	9cc1585b13	[VPlan] Add VPBlockUtils::transferSuccessors (NFCI). Add a new helper to transfer successors to a new, unconnected VPBB. Helps to simplify existing code, and prepare for upcoming changes.	2025-12-17 22:48:22 +00:00
Kazu Hirata	3a7876d789	[llvm] Delete pointers without null checks (NFC) (#168183 ) Identified with readability-delete-null-pointer.	2025-11-15 08:06:08 -08:00
Florian Hahn	519cf3c2b8	[VPlan] Remove unneeded getDefiningRecipe with isa/cast/dyn_cast. (NFC) Classof for most recipes directly supports VPValue, so there is no need to call getDefiningRecipe when using isa/cast/dyn_cast.	2025-11-11 22:07:48 +00:00
Florian Hahn	0767c64043	[VPlan] Use getDefiningRecipe instead of directly accessing Def. (NFC) Use getDefiningRecipe to future-proof the code. Split off from https://github.com/llvm/llvm-project/pull/156262 as suggested.	2025-11-10 21:55:19 +00:00
Kazu Hirata	3ce5df408b	[Vectorize] Remove a redundant declaration (NFC) (#167188 ) EnableVPlanNativePath is declared in LoopVectorizationPlanner.h. Identified with readability-redundant-declaration.	2025-11-08 22:28:00 -08:00
Ramkumar Ramachandra	c1ca4a55d4	[VPlan] Strip redundant code in VPTransformState::get (NFC) (#166145 ) vputils::isSingleScalar is sufficient.	2025-11-05 21:59:47 +00:00
Florian Hahn	bfc322dd72	Revert "[VPlan] Run narrowInterleaveGroups during general VPlan optimizations. (#149706 )" This reverts commit 8d29d09309654541fb2861524276ada6a3ebf84c. There have been reports of mis-compiles in https://github.com/llvm/llvm-project/pull/149706. Revert while I investigate.	2025-10-22 21:27:11 +01:00
Florian Hahn	82b59345fe	[VPlan] Clarify naming for helpers to create loop&replicate regions (NFC) Split off to clarify naming, as suggested in https://github.com/llvm/llvm-project/pull/156262.	2025-10-21 20:41:54 +01:00
Ramkumar Ramachandra	2ec01e430a	[VPlan] Move two VPBlockUtils members (NFC) (#162507 )	2025-10-21 16:40:13 +01:00
Florian Hahn	8d29d09309	[VPlan] Run narrowInterleaveGroups during general VPlan optimizations. (#149706 ) Move narrowInterleaveGroups to to general VPlan optimization stage. To do so, narrowInterleaveGroups now has to find a suitable VF where all interleave groups are consecutive and saturate the full vector width. If such a VF is found, the original VPlan is split into 2: a) a new clone which contains all VFs of Plan, except VFToOptimize, and b) the original Plan with VFToOptimize as single VF. The original Plan is then optimized. If a new copy for the other VFs has been created, it is returned and the caller has to add it to the list of candidate plans. Together with https://github.com/llvm/llvm-project/pull/149702, this allows to take the narrowed interleave groups into account when computing costs to choose the best VF and interleave count. One example where we currently miss interleaving/unrolling when narrowing interleave groups is https://godbolt.org/z/Yz77zbacz PR: https://github.com/llvm/llvm-project/pull/149706	2025-10-21 11:37:42 +01:00
Ramkumar Ramachandra	0a4702407b	[VPlan] Improve code around canConstantBeExtended (NFC) (#161652 ) Follow up on 7c4f188 ([LV] Support multiplies by constants when forming scaled reductions), introducing m_APInt, and improving code around canConstantBeExtended: we change canConstantBeExtended to take an APInt.	2025-10-16 13:03:13 +01:00
Ramkumar Ramachandra	869c76dda3	[VPlan] Allow zero-operand m_BranchOn(Cond\|Count) (NFC) (#162721 )	2025-10-13 08:50:09 +01:00
Florian Hahn	ae7b15f2e2	[VPlan] Return invalid for scalable VF in VPReplicateRecipe::computeCost Replication is currently not supported for scalable VFs. Make sure VPReplicateRecipe::computeCost returns an invalid cost early, for scalable VFs if the recipe is not a single-scalar. Note that this moves the existing invalid-costs.ll out of the AArch64 subdirectory, as it does not use a target triple. Fixes https://github.com/llvm/llvm-project/issues/160792.	2025-10-11 19:28:02 +01:00
Florian Hahn	74af5784a5	Reapply "[VPlan] Compute cost of more replicating loads/stores in ::computeCost. (#160053 )" (#162157 ) This reverts commit f80c0baf058dbdc5 and 94eade61a02ae5. Recommit a small fix for targets using prefersVectorizedAddressing. Original message: Update VPReplicateRecipe::computeCost to compute costs of more replicating loads/stores. There are 2 cases that require extra checks to match the legacy cost model: 1. If the pointer is based on an induction, the legacy cost model passes its SCEV to getAddressComputationCost. In those cases, still fall back to the legacy cost. SCEV computations will be added as follow-up 2. If a load is used as part of an address of another load, the legacy cost model skips the scalarization overhead. Those cases are currently handled by a usedByLoadOrStore helper. Note that getScalarizationOverhead also needs updating, because when the legacy cost model computes the scalarization overhead, scalars have not been collected yet, so we can't each for replicating recipes to skip their cost, except other loads. This again can be further improved by modeling inserts/extracts explicitly and consistently, and compute costs for those operations directly where needed. PR: https://github.com/llvm/llvm-project/pull/160053	2025-10-06 22:16:08 +01:00
Alexey Bataev	f80c0baf05	Revert "Reapply "[VPlan] Compute cost of more replicating loads/stores in ::computeCost. (#160053 )" (#161724 )" This reverts commit 8f2466bc72a5ab163621cb1bf4bf53a27f1cefe7 to fix crashes reported in commits	2025-10-05 08:38:17 -07:00
Florian Hahn	8f2466bc72	Reapply "[VPlan] Compute cost of more replicating loads/stores in ::computeCost. (#160053 )" (#161724 ) This reverts commit f61be4352592639a0903e67a9b5d3ec664ad4d23. Recommit a small fix handling scalarization overhead consistently with legacy cost model if a load is used directly as operand of another memory operation, which fixes https://github.com/llvm/llvm-project/issues/161404. Original message: Update VPReplicateRecipe::computeCost to compute costs of more replicating loads/stores. There are 2 cases that require extra checks to match the legacy cost model: 1. If the pointer is based on an induction, the legacy cost model passes its SCEV to getAddressComputationCost. In those cases, still fall back to the legacy cost. SCEV computations will be added as follow-up 2. If a load is used as part of an address of another load, the legacy cost model skips the scalarization overhead. Those cases are currently handled by a usedByLoadOrStore helper. Note that getScalarizationOverhead also needs updating, because when the legacy cost model computes the scalarization overhead, scalars have not been collected yet, so we can't each for replicating recipes to skip their cost, except other loads. This again can be further improved by modeling inserts/extracts explicitly and consistently, and compute costs for those operations directly where needed. PR: https://github.com/llvm/llvm-project/pull/160053	2025-10-02 22:00:22 +01:00
Florian Hahn	7c4f188f27	[LV] Support multiplies by constants when forming scaled reductions. (#161092 ) We can create partial reductions for multiplies with constants, if the constant is small enough to be extended from source to destination type w/o changing the value. This only handles constant on the right side of a multiply, relying on other passes to canonicalize the input. Alive2 Proofs: https://alive2.llvm.org/ce/z/iWRMr6 PR: https://github.com/llvm/llvm-project/pull/161092	2025-10-02 10:53:17 +00:00
Florian Hahn	94b2617590	[VPlan] Remove VPIRPhis in exit blocks when deleting scalar loop BBs. DeleteDeadBlocks will remove single-entry phis. Remove them from the exit VPIRBBs in VPlan as well, otherwise we would retain references to deleted IR instructions. Fixes MSan failures after 8907b6d39 https://lab.llvm.org/buildbot/#/builders/164/builds/14013	2025-10-01 22:01:23 +01:00
Florian Hahn	8907b6d393	[VPlan] Remove original loop blocks if dead. (#155497 ) Build on top of https://github.com/llvm/llvm-project/pull/154510 to completely remove the blocks of dead scalar loops. Depends on https://github.com/llvm/llvm-project/pull/154510. PR: https://github.com/llvm/llvm-project/pull/155497	2025-10-01 16:53:59 +00:00
Florian Hahn	f61be43525	Revert "[VPlan] Compute cost of more replicating loads/stores in ::computeCost. (#160053 )" This reverts commit b4be7ecaf06bfcb4aa8d47c4fda1eed9bbe4ae77. See https://github.com/llvm/llvm-project/issues/161404 for a crash exposed by the change. Revert while I investigate.	2025-09-30 22:13:06 +01:00
Florian Hahn	b4be7ecaf0	[VPlan] Compute cost of more replicating loads/stores in ::computeCost. (#160053 ) Update VPReplicateRecipe::computeCost to compute costs of more replicating loads/stores. There are 2 cases that require extra checks to match the legacy cost model: 1. If the pointer is based on an induction, the legacy cost model passes its SCEV to getAddressComputationCost. In those cases, still fall back to the legacy cost. SCEV computations will be added as follow-up 2. If a load is used as part of an address of another load, the legacy cost model skips the scalarization overhead. Those cases are currently handled by a usedByLoadOrStore helper. Note that getScalarizationOverhead also needs updating, because when the legacy cost model computes the scalarization overhead, scalars have not been collected yet, so we can't each for replicating recipes to skip their cost, except other loads. This again can be further improved by modeling inserts/extracts explicitly and consistently, and compute costs for those operations directly where needed. PR: https://github.com/llvm/llvm-project/pull/160053	2025-09-29 08:08:09 +00:00
Florian Hahn	41f3438362	[VPlan] Remove dead code for scalar VFs in VPRegionBlock::cost (NFC). The VPlan cost model is not used to compute costs of scalar VFs currently, as conversion to replicate regions makes accurately computing the original scalar cost difficult. Remove left over, dead code.	2025-09-28 17:30:57 +01:00
Shih-Po Hung	0d22f8344a	[LV][EVL] Remove metadata on EVL vectorized loops (#155760 ) This patch removes the metadata emission for EVL‑vectorized loops, since there is no current in-tree consumer: 1) after VPlan performs canonical IV replacement #147222 and 2) RISCV dropped EVLIndVarSimplifyPass #151483, which was the only user of this metadata.	2025-09-23 07:39:33 +08:00
Florian Hahn	50b9ca4dda	[VPlan] Simplify Plan's entry in removeBranchOnConst. (#154510 ) After https://github.com/llvm/llvm-project/pull/153643, there may be a BranchOnCond with constant condition in the entry block. Simplify those in removeBranchOnConst. This removes a number of redundant conditional branch from entry blocks. In some cases, it may also make the original scalar loop unreachable, because we know it will never execute. In that case, we need to remove the loop from LoopInfo, because all unreachable blocks may dominate each other, making LoopInfo invalid. In those cases, we can also completely remove the loop, for which I'll share a follow-up patch. Depends on https://github.com/llvm/llvm-project/pull/153643. PR: https://github.com/llvm/llvm-project/pull/154510	2025-09-18 19:25:05 +01:00
Florian Hahn	30e9cbacab	[VPlan] Move logic to compute scalarization overhead to cost helper(NFC) Extract the logic to compute the scalarization overhead to a helper for easy re-use in the future.	2025-09-13 20:41:44 +01:00
Florian Hahn	b8eaceb39b	[VPlan] Explicitly replicate VPInstructions by VF. (#155102 ) Extend replicateByVF added in #142433 (aa240293190) to also explicitly unroll replicating VPInstructions. Now the only remaining case where we replicate for all lanes is VPReplicateRecipes in replicate regions. PR: https://github.com/llvm/llvm-project/pull/155102	2025-09-12 17:06:26 +01:00
Florian Hahn	8796dfdcba	[VPlan] Consolidate logic to update loop metadata and profile info. This patch consolidates updating loop metadata and profile info for both the remainder and vector loops in a single place. This is NFC, modulo consistently applying vectorization specific metadata also in the experimental VPlan-native path. Split off from https://github.com/llvm/llvm-project/pull/154510.	2025-09-04 21:50:40 +01:00
Florian Hahn	507ff082c2	[VPlan] Move runtime check blocks to correct position during exec (NFC). Move adjusting the position of completely disconnected IR blocks to VPIRBasicBlock::execute.	2025-09-01 16:15:02 +01:00
Florian Hahn	a53a5ed65d	[VPlan] Add VPBlockBase::hasPredecessors (NFC). Split off from https://github.com/llvm/llvm-project/pull/154510/, add helper to check if a block has any predecessors.	2025-09-01 09:44:49 +01:00

1 2 3 4 5 ...

474 Commits