llvm-project

Author	SHA1	Message	Date
Hassnaa Hamdi	9491f75e1d	Reland: [LV]: Teach LV to recursively (de)interleave. (#122989 ) This commit relands the changes from "[LV]: Teach LV to recursively (de)interleave. #89018" Reason for revert: - The patch exposed a bug in the IA pass, the bug is now fixed and landed by commit: #122643	2025-01-17 10:34:57 +00:00
Mel Chen	9720be95d6	[LV][EVL] Disable fixed-order recurrence idiom with EVL tail folding. (#122458 ) The currently llvm.splice may occurs unexpected behavior if the evl of the second-to-last iteration is not VF*UF. Issue #122461	2025-01-17 16:55:35 +08:00
Luke Lau	e83e0c300d	[LV] Add test case for #119173 . NFC This showcases a miscompile involving a widened reduction-phi.	2025-01-17 09:45:42 +08:00
Florian Hahn	4481030a03	[Loads] Use use-dereferenceable-at-point-semantics=1 in test. Update the test to use use-dereferenceable-at-point-semantics=1. Existing tests are updated with the nofree attribute and a new one has been added showing that the dereferenceable assumption is used after the pointer may be freed.	2025-01-16 12:49:50 +00:00
Nikita Popov	b0c4aed4f1	[LoopVectorize] Regenerate test checks (NFC) Add a prefix to avoid conflicts, otherwise the test becomes invalid on regeneration.	2025-01-16 10:21:24 +01:00
David Sherwood	a00938eedd	Revert "[LoopVectorize] Add support for reverse loops in isDereferenceableAndAlignedInLoop (#96752 )" (#123057 ) This reverts commit bfedf6460c2cad6e6f966b457d8d27084579dcd8.	2025-01-15 13:56:42 +00:00
David Sherwood	bfedf6460c	[LoopVectorize] Add support for reverse loops in isDereferenceableAndAlignedInLoop (#96752 ) Currently when we encounter a negative step in the induction variable isDereferenceableAndAlignedInLoop bails out because the element size is signed greater than the step. This patch adds support for negative steps in cases where we detect the start address for the load is of the form base + offset. In this case the address decrements in each iteration so we need to calculate the access size differently. I have done this by caling getStartAndEndForAccess from LoopAccessAnalysis.cpp. The motivation for this patch comes from PR #88385 where a reviewer requested reusing isDereferenceableAndAlignedInLoop, but that PR itself does support reverse loops. The changed test in LoopVectorize/X86/load-deref-pred.ll now passes because previously we were calculating the total access size incorrectly, whereas now it is 412 bytes and fits perfectly into the alloca.	2025-01-15 12:47:43 +00:00
LiqinWeng	0294dab79e	[LV][VPlan] Add fast flags for selectRecipe (#121023 ) Change the inheritance of class VPWidenSelectRecipe to class VPRecipeWithIRFlags, which allows recipe of the select to pass the fastmath flags.The patch of #119847 will add the fastmath flag to for recipe	2025-01-15 10:10:11 +08:00
Florian Hahn	1de3dc7d23	[LV] Bail out early if BTC+1 wraps. Currently we fail to detect the case where BTC + 1 wraps, i.e. the vector trip count is 0, In those cases, the minimum iteration count check will fail, and the vector code will never be executed. Explicitly check for this condition in computeMaxVF and avoid trying to vectorize alltogether. Note that a number of tests needed to be updated, because the vector loop would never be executed given the input IR. Fixes https://github.com/llvm/llvm-project/issues/122558.	2025-01-14 22:07:38 +00:00
Luke Lau	f925e54554	[VPlan] Fix mutating whilst iterating over users in EVL transform (#122885 ) This fixes a miscompilation extracted from 525.x264_r, where we were failing to update the runtime VF of a VPReverseVectorPointerRecipe. We were removing a use of VF whilst iterating over the users() iterator, which messed up the iterator in-flight and caused us to miss some recipes. This fixes it by copying the users into a SmallVector first. Fixes #122681 Fixes #122682	2025-01-14 22:17:51 +08:00
Jay Foad	e87f94a6a8	[llvm-project] Fix typos mutli and mutliple. NFC. (#122880 )	2025-01-14 11:59:41 +00:00
Mel Chen	418f5cd6c2	[LV][EVL] Pre-commit test case for fixed-order recurrence with EVL tail folding. (NFC) (#122456 ) This test case is from SingleSource/UnitTests/Vectorizer/recurrences.test. Pre-commit for #122458	2025-01-13 21:15:03 +08:00
Mel Chen	3397950f2d	[LV] Fix FindLastIV reduction for epilogue vectorization. (#120395 ) Following 0e528ac404e13ed2d952a2d83aaf8383293c851e, this patch adjusts the resume value of VPReductionPHIRecipe for FindLastIV reductions. Replacing the resume value with: ResumeValue = ResumeValue == StartValue ? SentinelValue : ResumeValue; This addressed the correctness issue when the start value might not be less than the minimum value of a monotonically increasing induction variable. Thanks Florian Hahn for the help. --------- Co-authored-by: Florian Hahn <flo@fhahn.com>	2025-01-13 20:58:38 +08:00
Sam Tebbs	795e35a653	Reland "[LoopVectorizer] Add support for partial reductions" with non-phi operand fix. (#121744 ) This relands the reverted #120721 with a fix for cases where neither reduction operand are the reduction phi. Only 63114239cc8d26225a0ef9920baacfc7cc00fc58 and 63114239cc8d26225a0ef9920baacfc7cc00fc58 are new on top of the reverted PR. --------- Co-authored-by: Nicholas Guy <nicholas.guy@arm.com>	2025-01-13 11:20:35 +00:00
Florian Hahn	8df64ed777	[LV] Don't consider IV increments uniform if exit value is used outside. In some cases, there might be a chain of uniform instructions producing the exit value. To generate correct code in all cases, consider the IV increment not uniform, if there are users outside the loop. Instead, let VPlan narrow the IV, if possible using the logic from 3ff1d01985752. Test case from #122602 verified with Alive2: https://alive2.llvm.org/ce/z/bA4EGj Fixes https://github.com/llvm/llvm-project/issues/122496. Fixes https://github.com/llvm/llvm-project/issues/122602.	2025-01-12 22:03:21 +00:00
Florian Hahn	f5a35a31bf	[LV] Add test cases with incorrect IV live-outs. Add test cases for https://github.com/llvm/llvm-project/issues/122496 and https://github.com/llvm/llvm-project/issues/122602.	2025-01-12 20:55:20 +00:00
Florian Hahn	3ff1d01985	Recommit "[VPlan] Try to narrow wide and replicating recipes to uniform recipes." This reverts commit 0ebb3ac7c92c4c1c44e7f3d17832d75ec5a42a67. Re-applies commit with typos fixed.	2025-01-12 20:10:28 +00:00
Florian Hahn	0ebb3ac7c9	Revert "[VPlan] Try to narrow wide and replicating recipes to uniform recipes." This reverts commit 1afba19913253dda865a8e57b37b9f4dabead1ac. Typo breaking the build	2025-01-12 19:37:45 +00:00
Florian Hahn	1afba19913	[VPlan] Try to narrow wide and replicating recipes to uniform recipes. Use the existing VPlan-based analysis to identify recipes that only have their first lane demanded and transform them to uniform recpliate recipes. This simplifies the generated code in some places and prepares for fixing https://github.com/llvm/llvm-project/issues/122496.	2025-01-12 19:32:01 +00:00
Florian Hahn	44058e5b5f	[LV] Precommit tests for #106441 . Tests for https://github.com/llvm/llvm-project/pull/106441 from https://github.com/llvm/llvm-project/issues/82936.	2025-01-10 18:49:44 +00:00
Florian Hahn	b6cda338ab	[Loads] Also consider getPointerAlignment when checking assumptions. (#120916 ) Also use getPointerAlignment when trying to use alignment and dereferenceable assumptions. This catches cases where dereferencable is known via the assumption but alignment is known via getPointerAlignment (e.g. via argument attribute or align of 1) PR: https://github.com/llvm/llvm-project/pull/120916	2025-01-09 18:19:39 +00:00
Florian Hahn	b0697dc1de	[LV] Only check isVectorizableEarlyExitLoop with multiple exits. (#121994 ) Currently we emit early-exit related debug messages/remarks even when there is a single exit. Update to only check isVectorizableEarlyExitLoop if there isn't a single exit block. PR: https://github.com/llvm/llvm-project/pull/121994	2025-01-09 12:05:19 +00:00
Benjamin Maxwell	f88ef1bd1b	[LV] Teach LoopVectorizationLegality about struct vector calls (#119221 ) This is a split-off from #109833 and only adds code relating to checking if a struct-returning call can be vectorized. This initial patch only allows the case where all users of the struct return are `extractvalue` operations that can be widened. ``` %call = tail call { float, float } @foo(float %in_val) %extract_a = extractvalue { float, float } %call, 0 %extract_b = extractvalue { float, float } %call, 1 ``` Note: The tests require the VFABI changes from #119000 to pass.	2025-01-09 09:27:29 +00:00
Luke Lau	f0d5104c94	[VPlan] Handle some VPInstructions in may{Read,Write}FromMemory (#120058 ) This just copies the same conservative definition from mayWriteToMemory, and enables more VPInstructions to be hoisted out in LICM. I think this should give more accurate costs, and I was able to build llvm-test-suite without the legacy-vplan cost model assertion going off.	2025-01-08 15:17:26 +08:00
Florian Hahn	0eaa69eb23	[VPlan] Handle VPExpandSCEVRecipe in isUniformAfterVectorization. VPExpandSCEVRecipes must be placed in the entry and are alway uniform. This fixes a crash by always identifying them as uniform, even if the main vector loop region has been removed. Fixes https://github.com/llvm/llvm-project/issues/121897.	2025-01-07 21:35:09 +00:00
Florian Hahn	ea14bdb035	[LV] Add test showing debug output for loops with uncountable BTCs. Currently we print an early-exit related related debug message, even though there's no early exit.	2025-01-07 20:27:30 +00:00
Florian Mayer	ef391dbc29	[LV] Drop incorrect inbounds for reverse vector pointer when folding tail (#120730 ) When folding the tail, we may compute an address that we don't in the original scalar loop and it may not be inbounds. Drop Inbounds in that case.	2025-01-07 06:14:01 -08:00
Florian Hahn	f9369cc602	[VPlan] Make sure last IV increment value is available if needed. Legalize extract-from-ends using uniform VPReplicateRecipe of wide inductions to use regular VPReplicateRecipe, so the correct end value is available. Fixes https://github.com/llvm/llvm-project/issues/121745.	2025-01-06 22:40:41 +00:00
Florian Hahn	d0c00cf078	[LV] Add test case for #121745 . Test for https://github.com/llvm/llvm-project/issues/121745.	2025-01-06 22:28:44 +00:00
David Sherwood	a3fff3a14d	[LoopVectorize][NFC] Fix arith-fp-frem-costs.ll test to use new vplan cost model (#120742 )	2025-01-06 10:26:51 +00:00
Florian Hahn	f4230b4332	[VPlan] Add and use debug location for VPScalarCastRecipe. Update the recipe it always take a debug location and set it.	2025-01-05 20:08:51 +00:00
Florian Hahn	747f7f38bd	[LV] Add test with conditional load from invariant addr and assumption. This adds missing test coverage for isDereferenceableAndAlignedInLoop, related to https://github.com/llvm/llvm-project/pull/96752.	2025-01-05 19:31:54 +00:00
Florian Hahn	f48884ded8	[VPlan] Remove loop region in optimizeForVFAndUF. (#108378 ) Update optimizeForVFAndUF to completely remove the vector loop region when possible. At the moment, we cannot remove the region if it contains * widened IVs: the recipe is needed to generate the step vector * reductions: ComputeReductionResults requires the reduction phi recipe for codegen. Both cases can be addressed by more explicit modeling. The patch also includes a number of updates to allow executing VPlans without a vector loop region. Depends on https://github.com/llvm/llvm-project/pull/110004	2025-01-05 15:50:42 +00:00
Florian Hahn	df4a615c98	[VPlan] Convert induction increment check to be VPlan-based. Check the VPlan directly to determine if a VPValue is an optimiziable IV or IV use instead of checking the underlying IR instructions. Split off from https://github.com/llvm/llvm-project/pull/112147. This refactoring enables moving IV end value creation from the legacy fixupIVUsers to a VPlan-based transform. There is one case we now won't optimize, that is IVs with subtracts and non-constant steps. But as this is a minor optimization and doesn't impact correctness, the benefits of performing the check in VPlan should outweigh the missed case.	2025-01-05 11:16:01 +00:00
Luke Lau	7700695739	[VPlan] Fix crash with EVL tail folding intrinsic with no corresponding VP (#121542 ) This fixes a crash when building SPEC CPU 2017 with EVL tail folding when widening @llvm.log10 intrinsics. @llvm.log10 and some other intrinsics don't have a corresponding VP intrinsic, so this fixes the crash by removing the assert and bailing instead.	2025-01-05 11:41:56 +08:00
Florian Hahn	b95cce9904	[VPlan] Update wide induction inc recipes to use same step as Wide IV. Update wide induction increments to use the same step as the corresponding wide induction. This enables detecting induction increments directly in VPlan and removes redundant splats.	2025-01-04 20:04:59 +00:00
Florian Hahn	4a7c0b8afe	[LV] Add X86-specific induction step tests. Adds additional test coverage for induction codegen.	2025-01-04 15:09:04 +00:00
Florian Hahn	47ac7fa861	[LV] Add tests with wide inductions and live-in step. Also regenerate check lines and simplify existing tests and names.	2025-01-04 14:50:04 +00:00
Florian Mayer	62b5cf0410	[Vectorizer] precommit test for miscompilation (#120731 ) we generate GEPs that are out of bounds but mark them as "inbound"	2025-01-03 06:37:45 -08:00
John Brawn	073e65a8e5	[LoopVectorize] Make needsExtract notice scalarized instructions (#119720 ) LoopVectorizationCostModel::needsExtract should recognise instructions that have been widened by scalarizing as scalar instructions, and thus not needing an extract when used by later scalarized instructions. This fixes an incorrect cost calculation in computePredInstDiscount, where we are adding a scalarization overhead cost when we shouldn't, though I haven't come up with a test case where it makes a difference. It will make a difference when the cost model switches to using the cost kind TCK_CodeSize for optsize, as not doing this causes the test LoopVectorize/X86/small-size.ll to get worse.	2025-01-02 14:31:36 +00:00
Florian Hahn	3026ecaff5	[LV] Also verify loops in vector loop removal tests. Also verify loop info in tests added in 7d6ec3b9680.	2024-12-31 20:11:23 +00:00
Florian Hahn	7d6ec3b968	[LV] Add more tests for vector loop removal. Add missing test coverage of loops where the vector loop region can be removed that include replicate recipes as well as nested loops. Extra test coverage for https://github.com/llvm/llvm-project/pull/108378.	2024-12-31 20:08:54 +00:00
Muhammad Omair Javaid	332d2647ff	Revert "[LV]: Teach LV to recursively (de)interleave. (#89018 )" This reverts commit ccfe0de0e1e37ed369c9bf89dd0188ba0afb2e9a. This breaks LLVM build on AArch64 SVE Linux buildbots https://lab.llvm.org/buildbot/#/builders/143/builds/4462 https://lab.llvm.org/buildbot/#/builders/17/builds/4902 https://lab.llvm.org/buildbot/#/builders/4/builds/4399 https://lab.llvm.org/buildbot/#/builders/41/builds/4299	2024-12-31 03:12:24 +05:00
Florian Hahn	b20b6e9ea9	[LV] Check IR generated for both interleaving and vectorizing in test. Currently the tests would in some cases would only check the vectorized IR, but not the interleaved IR, if they are different.	2024-12-30 19:57:26 +00:00
Florian Hahn	c2be48a6ce	[LV] Add additional tests with induction users. Adds test coverage of post-inc IV users with different opcodes.	2024-12-30 17:36:48 +00:00
Florian Hahn	7f3428d3ed	[VPlan] Compute induction end values in VPlan. (#112145 ) Use createDerivedIV to compute IV end values directly in VPlan, instead of creating them up-front. This allows updating IV users outside the loop as follow-up. Depends on https://github.com/llvm/llvm-project/pull/110004 and https://github.com/llvm/llvm-project/pull/109975. PR: https://github.com/llvm/llvm-project/pull/112145	2024-12-29 19:05:08 +00:00
Zequan Wu	4d8f9594b2	Revert "Reland "[LoopVectorizer] Add support for partial reductions" (#120721 )" This reverts commit c858bf620c3ab2a4db53e84b9365b553c3ad1aa6 as it casuse optimization crash on -O2, see https://github.com/llvm/llvm-project/pull/120721#issuecomment-2563192057	2024-12-27 11:51:54 -08:00
Hassnaa Hamdi	ccfe0de0e1	[LV]: Teach LV to recursively (de)interleave. (#89018 ) Currently available intrinsics are only ld2/st2, which don't support interleaving factor > 2. This patch teaches the LV to use ld2/st2 recursively to support high interleaving factors.	2024-12-27 12:42:07 +00:00
Elvis Wang	47e1c87a61	[VPlan] Set debug location for VPReduction/VPWidenIntrinsicRecipe. (#120054 ) This patch add missing debug location for VPReduction/VPWidenIntrinsicRecipe.	2024-12-27 10:37:21 +08:00
Florian Hahn	2d038caeeb	[VPlan] Remove stray space when printing VPWidenCastRecipe. printFlags() already takes care of printing a single space if there are no flags. Remove the extra space when printing a recipe without flags.	2024-12-24 20:23:48 +00:00

1 2 3 4 5 ...

2860 Commits