llvm-project

Author	SHA1	Message	Date
Alexey Bataev	2672c6e4dc	[SLP][NFC]Add processBuildVector member function, NFC. Introduce processBuildVector as a next step to generalize code for cost estimation and code emission for gather/buildvector nodes. Differential Revision: https://reviews.llvm.org/D149973	2023-05-05 11:00:53 -07:00
Florian Hahn	8bd02e5aef	[VPlan] Assert instead checking if VF is vec when widening calls (NFC) VPWidenCallRecipe should not be generated for scalar VFs. Replace check with an assert.	2023-05-05 18:21:57 +01:00
Vasileios Porpodas	7749f6e976	[SLP][NFC] Cleanup: Outline the code that vectorizes CmpInsts into a seaparate function. Differential Revision: https://reviews.llvm.org/D149919	2023-05-05 09:56:41 -07:00
Alexey Bataev	ca3f4236e4	[SLP][NFC]Add/use gather and createFreeeze member functions in ShuffleInstructionBuilder, NFC.	2023-05-05 09:12:54 -07:00
Florian Hahn	e3afe0b89d	[VPlan] Add VPWidenCastRecipe, split off from VPWidenRecipe (NFCI). To generate cast instructions, the result type is needed. To allow creating widened casts without underlying instruction, introduce a new VPWidenCastRecipe that also holds the result type. This functionality will be used in a follow-up patch to implement truncateToMinimalBitwidths as VPlan-to-VPlan transform. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D149081	2023-05-05 13:20:16 +01:00
Florian Hahn	29712ccda6	[VPlan] Assert instead of check if VF is vector when widening casts. VPWidenRecipes should not be generated for scalar VFs. Replace check with an assert. Suggested in preparation for D149081.	2023-05-05 09:02:33 +01:00
Alexey Bataev	d726f99d43	[SLP][NFC]Do not try to revectorize instructions with constant operands, NFC. The pass should not try to revectorize instructions with constant operands, which were not folded by the IRBuilder. It prevents the non-terminating loop in the SLP vectorizer for non foldable constant operations.	2023-05-04 13:52:42 -07:00
Florian Hahn	1b05e74982	[VPlan] Reorder cases in switch (NFC). Reorder cases to make sure they are ordered properly in preparation for D149081.	2023-05-04 21:40:22 +01:00
Florian Hahn	c2bef381fa	[VPlan] Remove setEntry to avoid leaks when replacing entry. Update the HCFG builder to directly connect the created CFG to the existing Plan's entry. This allows removing `setEntry`, which can cause leaks when the existing entry is replaced. Should fix https://lab.llvm.org/buildbot/#/builders/5/builds/33455/steps/13/logs/stdio	2023-05-04 19:12:02 +01:00
Alexey Bataev	c0e5e7db9a	[SLP]Fix a crash trying finding insert point for GEP nodes with non-gep insts. If the vectorizable GEP node is built, which should not be scheduled, and at least one node is a non-gep instruction, need to insert the vectorized instructions before the last instruction in the list, not before the first one, otherwise the instructions may be emitted in the wrong order.	2023-05-04 09:43:37 -07:00
Florian Hahn	147a56149c	[VPlan] Clean up preheader block after b85a402dd899fc. Fix a leak introduced in b85a402dd899fc and flagged by LSan https://lab.llvm.org/buildbot#builders/5/builds/33452	2023-05-04 16:29:57 +01:00
Florian Hahn	b85a402dd8	[VPlan] Introduce new entry block to VPlan for early SCEV expansion. This patch adds a new preheader block the VPlan to place SCEV expansions expansions like the trip count. This preheader block is disconnected at the moment, as the bypass blocks of the skeleton are not yet modeled in VPlan. The preheader block is executed before skeleton creation, so the SCEV expansion results can be used during skeleton creation. At the moment, the trip count expression and induction steps are expanded in the new preheader. The remainder of SCEV expansions will be moved gradually in the future. D147965 will update skeleton creation to use the steps expanded in the pre-header to fix #58811. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D147964	2023-05-04 14:00:13 +01:00
Florian Hahn	79692750d2	[LV] Use VPValue for SCEV expansion in fixupIVUsers. The step is already expanded in the VPlan. Use this expansion instead. This is a step towards modeling fixing up IV users in VPlan. It also fixes a crash casued by SCEV-expanding the Step expression in fixupIVUsers, where the IR is in an incomplete state Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D147963	2023-05-04 09:25:59 +01:00
Vasileios Porpodas	ce46e1aa76	[NFC][SLP] Cleanup: Simplify traversal loop in SLPVectorizerPass::vectorizeHorReduction(). This includes a couple of changes: 1. Moves the code that changes the root node out of the `TryToReduce` lambda and out of the traversal loop. 2. Since that code moved, there isn't much left in `TryToReduce` so the code was inlined. 3. The phi node variable `P` was also being used as a flag that turns on/off the exploration of operands as new seeds. This patch uses a new variable `TryOperandsAsNewSeeds` for this. 4. Simplifies the code executed when vectorization fails. The logic of the code should be identical to the original, but I may be missing something not caught by tests. Differential Revision: https://reviews.llvm.org/D149627	2023-05-03 12:48:01 -07:00
Alexey Bataev	d62734800c	[SLP][NFC]Add ShuffleCostBuilder and generalize BaseShuffleAnalysis::createShuffle function, NFC. Added basic implementation of ShuffleCostBuilder class in ShuffleCostEstimator and generalized BaseShuffleAnalysis::createShuffle function to support emission of Value */InstructionCost for the vectorization/cost estimation. Differential Revision: https://reviews.llvm.org/D149171	2023-05-03 12:30:54 -07:00
Florian Hahn	b9efffa7e9	[VPlan] Add assignSlot(const VPBasicBlock *) (NFC). Factor out utility to simplify D147964 as sugested.	2023-05-03 19:51:09 +01:00
Vasileios Porpodas	07484d249b	[NFC][SLP] Fix typo Differential Revision: https://reviews.llvm.org/D149670	2023-05-02 11:14:08 -07:00
Nikita Popov	5362a0d859	[LCSSA] Remove unused ScalarEvolution argument (NFC) After D149435, LCSSA formation no longer needs access to ScalarEvolution, so remove the argument from the utilities.	2023-05-02 12:17:05 +02:00
Florian Hahn	6303fa369c	[VPlan] Remove DeadInsts arg from VPInstructionsToVPRecipes (NFC) The argument isn't used. VPlan-based dead recipe removal can be used instead.	2023-05-01 15:03:29 +01:00
Florian Hahn	0b24436591	[LV] Clarify comment for selectVectorizationFactor (NFC). The comment is stale, as UserVF is handled before selectVectorizationFactor is called. Clarify the comment by remove the mention of UserVF. Suggested as independent improvement in D143938.	2023-04-30 21:12:15 +01:00
Florian Hahn	a431402fd2	[LV] Remove loop arg from CM::isCandidateForEpilogueVectorization (NFC) LVP operates on the loop it stores in TheLoop. Use it instead of the argument, to be in line with other member functions. Suggested as independent improvement in D143938.	2023-04-30 21:11:12 +01:00
Florian Hahn	6fa07a87ab	[LV] Document selectEpilogueVectorizationFactor (NFC). Add missing documentation for selectEpilogueVectorizationFactor. Suggested as independent improvement in D143938.	2023-04-30 21:09:24 +01:00
Florian Hahn	9fce1fc6f8	[LVP] Fix comment for hasPlanWithVF (NFC). The function checks if there's a plan with the specified VF. Update the comment to match the implementation. Pointed out as independent improvement in D143938.	2023-04-30 19:13:53 +01:00
Florian Hahn	8d3ff24e11	[LV] Sink collect* calls to LVP::plan() (NFC). Move calls of collect* helpers closer to where the cost-model is used. Should help simplifying D142669 & D142670. Differential Revision: https://reviews.llvm.org/D142674	2023-04-30 11:41:22 +01:00
Florian Hahn	4583d7ef7c	[LV] Rename Preheader -> VecPreheader (NFC). Clarify variable name as suggested in D147964 to reduce diff.	2023-04-28 22:15:47 +01:00
Vasileios Porpodas	fe1e50cd15	[NFC][SLP] Cleanup: Replace Value* operand with Instruction* in `vectorizeRootInstruction()` and `vectorizeHorReduction()` This makes it explicit that these functions work with instructions, and avoids calling them if the operand is not an instruction. Differential Revision: https://reviews.llvm.org/D149465	2023-04-28 12:35:09 -07:00
Vasileios Porpodas	92b2a266e9	[NFC][SLP] Cleanup: Moves code that changes the reduction root into a separate function. This makes `matchAssociativeReduction()` a bit simpler. Differential Revision: https://reviews.llvm.org/D149452	2023-04-28 10:05:32 -07:00
Florian Hahn	2c9d21a2a3	[VPlan] Turn Plan entry node into VPBasicBlock (NFCI). The entry to the plan is the preheader of the vector loop and guaranteed to be a VPBasicBlock. Make sure this is the case by adjusting the type. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D149005	2023-04-28 12:29:06 +01:00
Alexey Bataev	8bacd75125	[SLP][NFC]Fix a warning because of the missing parens, NFC.	2023-04-27 16:59:37 -07:00
Alexey Bataev	1604a100f1	[SLP][NFC]Avoid extra useless ConstantVector creation, use PointerUnion instead, NFC. Better to use PointerUnion<Value , const TreeEntry > instead of extra attempts of creating null vector values, where possible.	2023-04-27 10:48:14 -07:00
ManuelJBrito	d22edb9794	[IR][NFC] Change UndefMaskElem to PoisonMaskElem Following the change in shufflevector semantics, poison will be used to represent undefined elements in shufflevector masks. Differential Revision: https://reviews.llvm.org/D149256	2023-04-27 18:01:54 +01:00
Alexey Bataev	cf792f664a	[SLP]Fix a crash for the replaced vectorized value. If two nodes share the same value, which is replaced in one of the nodes, need to automatically replace same value in all nodes. Btter to use WeakTrackingVH for this to fix compiler crash.	2023-04-27 09:32:00 -07:00
Nikita Popov	1745341296	[LoopVectorize] Preserve SCEV As far as I can tell, LoopVectorize preserves SCEV, mainly by dint of forgetting the loop being vectorized. We should mark it as preserved in the pass manager. This is a very small compile-time improvement. Differential Revision: https://reviews.llvm.org/D149147	2023-04-26 09:43:54 +02:00
Philip Reames	09d879d060	[SCEV] Common code for computing trip count in a fixed type [NFC-ish] This is a follow on to D147117 and D147355. In both cases, we were adding special cases to compute zext(BTC+1) instead of zext(BTC)+1 when the BTC+1 computation was known not to overflow. Differential Revision: https://reviews.llvm.org/D148661	2023-04-25 12:04:42 -07:00
Luke Lau	5a4b7e1f2e	[SLP] Deduplicate loads subkey generator. NFCI Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D148923	2023-04-25 09:34:40 +01:00
David Green	1869a9c225	[LV] Use the known trip count when costing non-tail folded VFs Now that we store the ScalarCost in the VectorizationFactor it is possible to use it to get a slightly more accurate cost in isMoreProfitable between two vector factors. This extends the logic added in D101726 to non-tail-folded cases, using the costs of `VecCost * (TripCount / VF) + ScalarCost * (TripCount % VF)` to compare VFs where the TripCount is known and we are not folding the tail. This shouldn't alter very much as small trip counts are usually not vectorized, but does seem to help in the testcase where 4 * VF4 is chosen as profitable compared to 2 * VF8 + 4 * scalar. Differential Revision: https://reviews.llvm.org/D147720	2023-04-24 22:02:30 +01:00
Florian Hahn	3157f03a34	[VPlan] Add VPValue::isLiveIn() (NFC). This helps to clarify checks in multiple places. Suggested as cleanup in D147892.	2023-04-24 17:51:12 +01:00
Florian Hahn	6f999769b9	[VPlan] Remove unnecessary includes from VPlan.h (NFC). Clean up some unnecessary includes from VPlan.h, which is imported in multiple files.	2023-04-24 16:10:46 +01:00
Alexey Bataev	b1abc2beaf	[SLP]Fix PR58616: assert for gep nodes with different basic blocks. Need to relax the assertion check in the FindFirstInst lambda for GEP nodes with non-GEP instruction to avoid compiler crash.	2023-04-24 07:41:06 -07:00
Jay Foad	593e25ffae	[Vectorize] Fix vectorization, scalarization and folding of llvm.is.fpclass llvm.is.fpclass is different from other vectorizable intrinsics in that it is overloaded on an argument type, not on the return type. Differential Revision: https://reviews.llvm.org/D148905	2023-04-24 13:42:08 +01:00
Alexey Bataev	851a12138a	[SLP]Fix the cost for the extractelements, used in several nodes. Currently the compiler calculates the compensation cost for the extractelements, removed during vectorization. But if the extractelement instruction is used in several nodes, we can calculate the compensation for them several times. Differential Revision: https://reviews.llvm.org/D148806	2023-04-21 09:05:03 -07:00
Alexey Bataev	403bd583a8	[SLP]Fix a crash on scalarized vectors. Need to register in-vector for scalarized types to avoid crash in further analysis.	2023-04-21 08:13:48 -07:00
Alexey Bataev	3b7d298322	[SLP][NFC]Make computeExtractCost a member of ShuffleCostEstimator, NFC. Moved computeExtractCost to ShuffleCostEstimator class as another step for unifying actual codegen/cost estimation for buildvectors. Differential Revision: https://reviews.llvm.org/D148864	2023-04-21 06:52:53 -07:00
Alexey Bataev	0e1312fbe0	[SLP][X86]Fix the cost of reused gathers/buildvectors and floats insert. There are 2 problems in the cost estimation for buildvector/gather. 1. If the buildvector/gather node is the same as another one node, need to estimate the cost of this node as 0. 2. The cost of inserting float point register to non-poison vector is not 0, it should not be considered free. Differential Revision: https://reviews.llvm.org/D148801	2023-04-20 09:34:46 -07:00
Florian Hahn	6b8d19d2b5	Recommit "[VPlan] Switch to checking sinking legality for recurrences in VPlan." This reverts the revert commit 3d8ed8b5192a59104bfbd5bf7ac84d035ee0a4a5. The new version of the patch adds a set to avoid duplicating work in isFixedOrderRecurrence, which was previously done through the removed SinkAfter map. Original commit message: Building on D142885 and D142589, retire the SinkAfter map from the recurrence handling code. It is replaced by checking whether it is possible to sink all users of a recurrence directly in VPlan. This results in simpler code overall and allows to handle additional cases (see the improvements in @test_crash). Depends on D142885. Depends on D142589. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D142886	2023-04-20 09:31:16 +01:00
Alexey Bataev	8cf0290c4a	[SLP]Fix cost estimation for buildvectors with extracts and/or constants. If the partial matching is found and some other scalars must be inserted, need to account the cost of the extractelements, transformed to shuffles, and/or reused entries and calculate the cost of inserting constants properly into the non-poison vectors. Also, fixed the cost calculation for final gather/buildvector sequence. Differential Revision: https://reviews.llvm.org/D148362	2023-04-19 05:54:58 -07:00
Alexey Bataev	1ce4b26a21	[SLP]Add final resize to ShuffleCostEstimator::finalize member function and basic add member functions. Implemented the reshuffling in finalize member function + add basic support for add member functions, used during vector build. Part of D110978 Differential Revision: https://reviews.llvm.org/D148279	2023-04-18 11:52:04 -07:00
Alexey Bataev	d7a40a447f	Revert "[SLP]Add final resize to ShuffleCostEstimator::finalize member function and basic add member functions." This reverts commit cd341f3f4878137d1c9e7a05c4c3a7bd8ff216dc to fix a crash revealed by buildbot https://lab.llvm.org/buildbot#builders/124/builds/7108.	2023-04-18 10:41:00 -07:00
Alexey Bataev	cd341f3f48	[SLP]Add final resize to ShuffleCostEstimator::finalize member function and basic add member functions. Implemented the reshuffling in finalize member function + add basic support for add member functions, used during vector build. Part of D110978 Differential Revision: https://reviews.llvm.org/D148279	2023-04-18 05:51:23 -07:00
Florian Hahn	ff0ec4f42e	Recommit "[VPlan] Unify Value2VPValue and VPExternalDefs maps (NFCI)." This reverts the revert commit 8c2276f89887d0a27298a1bbbd2181fa54bbb509. The updated patch re-orders the getDefiningRecipe check in getVPalue to avoid a use-after-free. Original commit message: Before this patch, a VPlan contained 2 mappings for Values -> VPValue: 1) Value2VPValue and 2) VPExternalDefs. This duplication is unnecessary and there are already cases where external defs are added to Value2VPValue. This patch replaces all uses of VPExternalDefs with Value2VPValue. It clarifies the naming of getOrAddVPValue (to getOrAddExternalVPValue) and addVPValue (to addExternalVPValue). At the moment, this is NFC, but will enable additional simplifications in D147783. Depends on D147891. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D147892	2023-04-18 10:29:31 +01:00

1 2 3 4 5 ...

3757 Commits