llvm-project

Author	SHA1	Message	Date
vporpo	083369fd99	[SandboxVec][Legality] Per opcode checks (#114145 ) This patch adds more opcode-specific legality checks.	2024-11-01 15:04:03 -07:00
Florian Hahn	17bad1a9da	[LV] Bail out on header phis in shouldConsiderInvariant. This fixes an infinite recursion in rare cases. Fixes https://github.com/llvm/llvm-project/issues/113794.	2024-11-01 20:51:25 +00:00
Han-Kuan Chen	a795a18bba	[SLP][REVEC] VF should be scaled when ScalarTy is FixedVectorType. (#114551 )	2024-11-02 03:03:52 +08:00
Simon Pilgrim	718d50d6d0	[VectorCombine] foldPermuteOfBinops - prefer the new fold for matching costs. Minor tweak to #114101 - as we're reducing the instruction count, we should prefer the fold if the old/new costs are the same.	2024-11-01 17:28:37 +00:00
Han-Kuan Chen	e4aeeba84c	[SLP][REVEC] When ScalarTy is FixedVectorType, the insertion index should consider the number of elements of ScalarTy. (#114526 )	2024-11-01 21:17:57 +08:00
David Sherwood	4ed7bcb4a6	[VPlan][NFC] Add new getMiddleBlock interface to VPlan (#113558 ) This work is in preparation for PRs #112138 and #88385 where the middle block is not guaranteed to be the immediate successor to the region block. I've simply add new getMiddleBlock() interfaces to VPlan that for now just return cast<VPBasicBlock>(VectorRegion->getSingleSuccessor()) Once PR #112138 lands we'll need to do more work to discover the middle block.	2024-11-01 10:50:52 +00:00
Florian Hahn	3b4c45e4e5	[VPlan] Fix long comment added in b021464d35ca (NFC). Fix formatting of comment added in b021464d35ca.	2024-10-31 21:05:00 +00:00
Florian Hahn	b021464d35	[VPlan] Introduce scalar loop header in plan, remove VPLiveOut. (#109975 ) Update VPlan to include the scalar loop header. This allows retiring VPLiveOut, as the remaining live-outs can now be handled by adding operands to the wrapped phis in the scalar loop header. Note that the current version only includes the scalar loop header, no other loop blocks and also does not wrap it in a region block. PR: https://github.com/llvm/llvm-project/pull/109975	2024-10-31 21:36:44 +01:00
Alexey Bataev	e05def081e	[SLP]Do not vectorize code in EH and non-returning blocks The code in EH and non-returning blocks can be skipped by the vectorizer, since it does not add to the perfromance, just consumes compile/link time. Reviewers: RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/112221	2024-10-31 13:50:02 -04:00
Alexey Bataev	19a34dded7	[SLP]Do not account external uses in EH block and in non-returning blocks No need to account the cost of the external uses in EH and non-returning basic blocks. Reviewers: RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/112045	2024-10-31 13:23:43 -04:00
Alexey Bataev	e7080fd735	[SLP]Extra check if the intruction matked for removal, must be replaced in reduction ops If the instruction is vectorized and it is a part of the reduced values gather/buildvector node, it should replaced in reduced operation instructions before removal properly, to avoid compiler crash. Fixes #114371	2024-10-31 09:59:35 -07:00
Simon Pilgrim	92af82a48d	[VectorCombine] Fold "shuffle (binop (shuffle, shuffle)), undef" --> "binop (shuffle), (shuffle)" (#114101 ) Add foldPermuteOfBinops - to fold a permute (single source shuffle) through a binary op that is being fed by other shuffles. Fixes #94546 Fixes #49736	2024-10-31 10:58:09 +00:00
Florian Hahn	5bd1af5abc	[LV] Directly store VPlan in InnerLoopVectorizer (NFC). The current VPlan is already passed to multiple functions and more in the future. Store it once directly in InnerLoopVectorizer.	2024-10-30 18:39:50 +00:00
Mel Chen	8420dbf2b9	[VPlan] Refine the constructor of VPWidenIntrinsicRecipe. nfc (#113890 ) Infers member MayReadFromMemory, MayWriteToMemory, and MayHaveSideEffects based on intrinsic attributes. --------- Co-authored-by: Florian Hahn <flo@fhahn.com>	2024-10-30 12:22:28 +08:00
Piotr Fusik	3c02fea737	[LV][NFC] Remove stray semicolons (#114057 )	2024-10-30 04:07:14 +01:00
vporpo	ca998b071e	[SandboxVec][Legality] Check wrap flags (#113975 )	2024-10-29 15:37:03 -07:00
Florian Hahn	680901ed80	[VPlan] Implement VPHeaderPHIRecipe::computeCost. Fill out computeCost implementations for various header PHI recipes, matching the legacy cost model for now.	2024-10-29 21:04:31 +00:00
vporpo	a461869db3	[SandboxIR][Pass] Implement Analyses class (#113962 ) The Analyses class provides a way to pass around commonly used Analyses to SandboxIR passes throught `runOnFunction()` and `runOnRegion()` functions.	2024-10-28 18:00:52 -07:00
vporpo	bf4b31ad54	[SandboxVec][Legality] Check Fastmath flags (#113967 )	2024-10-28 15:32:20 -07:00
vporpo	5ea694816b	[SandboxVec][Legality] Check opcodes and types (#113741 )	2024-10-28 14:05:58 -07:00
Florian Hahn	0d0abb351b	[VPlan] Use ResumePhi to create reduction resume phis. (#110004 ) Use VPInstruction::ResumePhi to create phi nodes for reduction resume values in the scalar preheader, similar to how ResumePhis are used for first-order recurrence resume values after 9a5a8731e77. This allows simplifying createAndCollectMergePhiForReduction to only collect reduction resume phis when vectorizing epilogue loops and adding extra incoming edges from the main vector loop. Updating phis for the epilogue vector loops requires special attention, because additional incoming values from the bypass blocks need to be added. PR: https://github.com/llvm/llvm-project/pull/110004	2024-10-28 20:14:08 +01:00
Ellis Hoag	6ab26eab4f	Check hasOptSize() in shouldOptimizeForSize() (#112626 )	2024-10-28 09:45:03 -07:00
Alexey Bataev	7152bf3bc8	[SLP]Do not create new vector node if scalars fully overlap with the existing one If the list of scalars vectorized as the part of the same vector node, no need to generate vector node again, it will be handled as part of overlapping matching. Fixes #113810	2024-10-28 06:59:41 -07:00
Florian Hahn	7fe149cdf0	[VPlan] Replace getIRBasicBlock with IRBB in VPIRBB::execute (NFC). Suggested in https://github.com/llvm/llvm-project/pull/109975. This makes the function consistent throughout.	2024-10-27 16:22:18 +01:00
Shih-Po Hung	266ff98cba	[LV][VPlan] Use VF VPValue in VPVectorPointerRecipe (#110974 ) Refactors VPVectorPointerRecipe to use the VF VPValue to obtain the runtime VF, similar to #95305. Since only reverse vector pointers require the runtime VF, the patch sets VPUnrollPart::PartOpIndex to 1 for vector pointers and 2 for reverse vector pointers. As a result, the generation of reverse vector pointers is moved into a separate recipe.	2024-10-26 23:18:50 +08:00
Vasileios Porpodas	1540f772c7	Reapply "[SandboxVec][Legality] Reject non-instructions (#113190 )" This reverts commit eb9f4756bc3daaa4b19f4f46521dc05180814de4.	2024-10-25 12:55:58 -07:00
Vasileios Porpodas	eb9f4756bc	Revert "[SandboxVec][Legality] Reject non-instructions (#113190 )" This reverts commit 6c9bbbc818ae8a0d2849dbc1ebd84a220cc27d20.	2024-10-25 12:52:31 -07:00
vporpo	6c9bbbc818	[SandboxVec][Legality] Reject non-instructions (#113190 )	2024-10-25 12:47:19 -07:00
Florian Hahn	e724226da7	[VPlan] Return cost of 0 for VPWidenCastRecipe without underlying value. In some cases, VPWidenCastRecipes are created but not considered in the legacy cost model, including truncates/extends when evaluating a reduction in a smaller type. Return 0 for such casts for now, to avoid divergences between VPlan and legacy cost models. Fixes https://github.com/llvm/llvm-project/issues/113526.	2024-10-25 21:25:44 +02:00
Florian Hahn	9648271a3c	[LV] Pass flag indicating epilogue is vectorized to executePlan (NFC) This clarifies the flag, which is now only passed if the epilogue loop is being vectorized.	2024-10-25 20:39:47 +02:00
Florian Hahn	7b9f988a53	[VPlan] Limit stride replacement to vector region and middle VPBB (NFC). At the moment this in NFC, but ensures we only replace uses that are dominated by runtime checks as we model more of the skeleton in VPlan.	2024-10-24 15:08:36 -07:00
Alexey Bataev	e914421d7f	[SLP]Do correct signedness analysis for externally used scalars If the scalars is used externally is in the root node, it may have incorrect signedness info because of the conflict with the demanded bits analysis. Need to perform exact signedness analysis and compute it rather than rely on the precomputed value, which might be incorrect for alternate zext/sext nodes. Fixes #113520	2024-10-24 08:59:24 -07:00
Alexey Bataev	d2e7ee77d3	[SLP]Do not check for clustered loads only Since SLP support "clusterization" of the non-load instructions, the restriction for reduced values for loads only should be removed to avoid compiler crash. Fixes #113516	2024-10-24 08:16:42 -07:00
Alexey Bataev	cb5046da26	[SLP]Do not ignore undefs when trying to replace with "poisonous" shuffles Need to consider undefs correctly, when trying to replace them with potentially poisonous values in shuffles. Such elements should not be silently replaced by poison values, instead complex analysis should be implemented to see if it is safe to do it. Fixes #113425	2024-10-24 07:47:23 -07:00
Florian Hahn	ef217a0f6b	[VPlan] Introduce and use getVectorPreheader (NFC). Introduce a dedicated function to retrieve the vector preheader. This ensures the correct block is used, even if the skeleton is exetended.	2024-10-23 21:01:52 -07:00
Florian Hahn	2dfb1c664c	[VPlan] Try to hoist Previous (and operands), if sinking fails for FORs. (#108945 ) In some cases, Previous (and its operands) can be hoisted. This allows supporting additional cases where sinking of all users of to FOR fails, e.g. due having to sink recipes with side-effects. This fixes a crash where we fail to create a scalar VPlan for a first-order recurrence, but can create a vector VPlan, because the trunc instruction of an IV which generates the previous value of the recurrence has been optimized to a truncated induction recipe, thus hoisting it to the beginning. Fixes https://github.com/llvm/llvm-project/issues/106523. PR: https://github.com/llvm/llvm-project/pull/108945	2024-10-23 13:12:03 -07:00
Alexey Bataev	b65b2b4ab6	[SLP]Expand vector to the whole register size in extracts adjustment Need to expand the number of elements to the whole register to correctly process estimation and avoid compiler crash. Fixes #113462	2024-10-23 12:04:40 -07:00
Alexey Bataev	a3508e0246	[SLP]Small buidlvector only graph should contains scalars from same block If the graph is small and has single buildvector node, all scalars instructions must be from the same basic block to prevent compiler crash. Fixes #113451	2024-10-23 10:46:38 -07:00
Sterling-Augustine	f1be516223	[SandboxVectorizer] New class to actually collect and manage seeds (#113386 ) This relands d91318b643188bb855f115b02af9532f87c787b7, with test-only changes to make gcc-10 happy.	2024-10-23 10:26:18 -07:00
Florian Hahn	2437784a17	[LV] Replace unreachable by folding into else with assert (NFC). Simplify code as suggested post-commit in https://github.com/llvm/llvm-project/pull/110576/.	2024-10-21 21:46:48 -07:00
Kazu Hirata	38fca7b7db	[Vectorize] Simplify code with DenseMap::operator[] (NFC) (#113246 )	2024-10-21 21:35:38 -07:00
Elvis Wang	b3edc764f7	[VPlan] Implement VPWidenCastRecipe::computeCost(). (NFCI) (#111339 ) This patch implement `VPWidenCastRecipe::computeCost()` and skip cast recipies in the in-loop reduction.	2024-10-22 12:23:49 +08:00
Florian Hahn	a4819bd46d	[VPlan] Restore case accidentially dropped in 34cdd67c85. 34cdd67c85 accidentially dropped the case for VPWidenIntrinsicSC. Add it back again. Thanks @Mel-Chen for spotting this.	2024-10-21 20:33:58 -07:00
Florian Hahn	1d9b3222f3	[VPlan] Implement VPWidenSelectRecipe::computeCost. Implement VPlan-based cost computation for VPWidenSelectRecipe.	2024-10-22 03:10:04 +01:00
Florian Hahn	dac0f7e83e	[VPlan] Add general recipe matcher, replace handwritten ones (NFC) The new matcher is more flexible and can be used to build matchers for additional recipe types without unnecessary duplication.	2024-10-21 16:46:45 -07:00
Sterling-Augustine	0de8de1b84	[SandboxVectorizer] revert New class to actually collect and manage s… (#113231 ) …eeds (#112979) This reverts commit d91318b643188bb855f115b02af9532f87c787b7.	2024-10-21 16:00:35 -07:00
Sterling-Augustine	d91318b643	[SandboxVectorizer] New class to actually collect and manage seeds (#112979 ) There are many more tests to add, but I would like to get this reviewed and the details sorted out before it grows too big.	2024-10-21 15:39:51 -07:00
Alexey Bataev	4b1b51ac52	[SLP]Initial non-power-of-2 support (but still whole register) for reductions Enables initial non-power-of-2 support (but still requires number of elements, forming whole registers) for reductions. Enables extra vectorization for MultiSource/Benchmarks/7zip/7zip-benchmark, CINT2006/464.h264ref and CFP2017rate/526.blender_r (checked for SSE2) Reviewers: RKSimon Reviewed By: RKSimon Pull Request: https://github.com/llvm/llvm-project/pull/112361	2024-10-21 12:25:39 -07:00
vporpo	54c93aabec	[SandboxVec][Legality] Scaffolding for Legality (#112623 ) This patch adds a LegalityResultWithReason class for describing the reason why legality decided not to vectorize the code.	2024-10-21 09:17:46 -07:00
Alexey Bataev	9e03920cbf	[SLP]Ignore root gather node, when searching for reuses Root gather/buildvector node should be ignored when SLP vectorizer tries to find matching gather nodes, vectorized earlier. This node is definitely the last one in the pipeline and it does not have users. It may cause the compiler crash Fixes #113143	2024-10-21 09:16:16 -07:00

1 2 3 4 5 ...

5119 Commits