llvm-project

Author	SHA1	Message	Date
Nikita Popov	6f7e5c0f1a	Reapply [SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating This exposed a miscompile in GVN, which was fixed by D148129. ----- After D141386, violation of nonnull, range and align metadata results in poison rather than immediate undefined behavior, which means that these are now safe to retain when speculating. We only need to remove UB-implying metadata like noundef. This is done by adding a dropUBImplyingAttrsAndMetadata() helper, which lists the metadata which is known safe to retain on speculation. Differential Revision: https://reviews.llvm.org/D146629	2023-04-17 14:15:14 +02:00
Bjorn Pettersson	a20f7efbc5	Remove several no longer needed includes. NFCI Mostly removing includes of InitializePasses.h and Pass.h in passes that no longer have support for the legacy PM.	2023-04-17 13:54:19 +02:00
Nikita Popov	8cdca96690	[GVN] Adjust metadata for coerced load CSE When reusing a load in a way that requires coercion (i.e. casts or bit extraction) we currently fail to adjust metadata. Unfortunately, none of our existing tooling for this is really suitable, because combineMetadataForCSE() expects both loads to have the same type. In this case we may work on loads of different types and possibly offset memory location. As such, what this patch does is to simply drop all metadata, with the following exceptions: * Metadata for which violation is known to always cause UB. * If the load is !noundef, keep all metadata, as this will turn poison-generating metadata into UB as well. This fixes the miscompile that was exposed by D146629. Differential Revision: https://reviews.llvm.org/D148129	2023-04-17 12:52:31 +02:00
Zain Jaffal	721ecc9d41	[ConstraintElimination] Transfer info from sgt %a, %b to ugt %a, %b if %b > 0 Differential Revision: https://reviews.llvm.org/D148326	2023-04-17 09:27:33 +01:00
Kazu Hirata	7b014a0732	[Scalar] Use range-based for loops (NFC)	2023-04-16 09:05:20 -07:00
Kazu Hirata	c83c4b58d1	[Transforms] Apply fixes from performance-for-range-copy (NFC)	2023-04-16 08:25:28 -07:00
DianQK	2832d7941f	[SROA] Remove UB-implying metadata when promoting speculative instruction. After D138238 introduced the then/else blocks, we should remove UB-implying metadata for the promoted speculative instruction. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D148456	2023-04-16 22:35:52 +08:00
Kazu Hirata	1ca496bd61	Remove redundant initialization of std::optional (NFC)	2023-04-16 00:40:05 -07:00
Bjorn Pettersson	0b911a3dc3	[passes] Remove the legacy PM version of IRCE Differential Revision: https://reviews.llvm.org/D148338	2023-04-14 18:56:20 +02:00
Bjorn Pettersson	b74e89c0d4	[passes] Remove the legacy PM version of AlignmentFromAssumptions Differential Revision: https://reviews.llvm.org/D148337	2023-04-14 18:56:20 +02:00
Bjorn Pettersson	fb93f98ffa	[Passes] Remove legacy PM version of BDCE (aka BitTrackingDCEPass) BDCE is not used by the codegen pipeline so we should not need the legacy PM version of the pass any longer. Differential Revision: https://reviews.llvm.org/D148335	2023-04-14 18:56:20 +02:00
Florian Hahn	98e50881e9	[Matrix] Refine cost estimate for dot-product. Adjust lowerDotProduct cost estimate to include the cost benefits of: * emitting a wide load * emitting a wide multiply. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D147330	2023-04-14 11:35:01 +01:00
Nikita Popov	62ef97e063	[llvm-c] Remove PassRegistry and initialization APIs Remove C APIs for interacting with PassRegistry and pass initialization. These are legacy PM concepts, and are no longer relevant for the new pass manager. Calls to these initialization functions can simply be dropped. Differential Revision: https://reviews.llvm.org/D145043	2023-04-14 12:12:48 +02:00
Nikita Popov	c508e93327	[InstSimplify] Remove unused ORE argument (NFC)	2023-04-14 10:38:32 +02:00
Max Kazantsev	a39b807d41	[IRCE][NFC] Refactor parseRangeCheckICmp to compute SCEVs instead of Values The motivation is to make an opportunity to compute and return expressions after parsing ICmp into a range check (e.g. Length + 1). Patch by Aleksandr Popov! Differential Revision: https://reviews.llvm.org/D148205	2023-04-14 12:58:51 +07:00
Craig Topper	8bba57b1f1	[LoopIdiomRecognize] Remove NUW flag from SCEV in getTripCount. Based on the conversation in D147355. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D148170	2023-04-13 11:58:10 -07:00
Florian Hahn	e6ab86a887	[Matrix] Fix IsSupported check in lowerDotProduct. The check incorrectly checks the RHS while LHS is transformed later. Update to check LHS, which fixes a crash in the newly added test cases.	2023-04-13 19:00:30 +01:00
Max Kazantsev	2124505fe4	[IRCE] Relax restrictions on IRCE's latch exit count It seems that existing logic is too strict about latch block exit count. It is required to be computable, however it is not used in any computations, and effectively the only thing it is used for is to get the type of computed exit count. Sometimes the exit count for latch block is not known, but the loop is still finite because of other exits, and safe bounds are still computable. In this case, we miss an opportunity to apply IRCE. We could instead use a more relaxed version - max symbolic exit count, which, if it exists, is enough to say that the loop is finite, and its type should be good enough. There is a subtlety with type: we do not support latch count type wider than range check type. Because of that, we want to have the narrowest type available. So if it can be computed from latch block immediately, take it. Otherwise, take whatever whole loop provides and hope that its type isn't too wide. Differential Revision: https://reviews.llvm.org/D147910 Reviewed By: danilaml	2023-04-13 16:00:19 +07:00
Bjorn Pettersson	410775ecfd	[Transforms][LTO] Remove some redundant includes. NFC No need to include CallGraphSCCPass.h from the IPO/Inliner. Also removed the include of LegacyPassManager.h in a couple of files that do not really depend on that header file. Differential Revision: https://reviews.llvm.org/D148083	2023-04-13 10:12:00 +02:00
Max Kazantsev	246f8d4be5	[NFC][IRCE] Remove meaningless local variable	2023-04-13 13:04:45 +07:00
Max Kazantsev	d093d34c33	[IRCE][NFC] Remove unused variable IsSigned Patch by Aleksandr Popov! Differential Revision: https://reviews.llvm.org/D148113	2023-04-13 12:08:46 +07:00
Yashwant Singh	aea2a14736	[LoopUnroll] Prevent LoopFullUnrollPass to perform partial/runtime unrolling FullLoopUnroll was performing runtime unrolling in certain cases when '#pragma unroll' was specified. Patch to fix this by introducing new parameter to tryToUnrollLoop() to differentiate between LoopUnrollPass and FullLoopUnrollPass. Based on the discussion here (https://discourse.llvm.org/t/loop-unroller-fails-to-unroll-loop/69834) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D148071	2023-04-13 10:21:24 +05:30
Craig Topper	d66e42ca41	[LoopIdiomRecognize] Replace getNegativeSCEV(getOne()) with getMinusOne. NFC	2023-04-12 13:42:35 -07:00
Anna Thomas	76e4070843	Revert "[GuardUtils] Add asserts about loop varying widenable conditions" This reverts commit 5675757f5fc6e27ce01b3b12bdfd04044df53aa3. The assert may be too strict. Revert and investigate why the assert fires.	2023-04-12 10:58:45 -04:00
Nikita Popov	ae28e016d3	[VNCoercion] Drop some redundant functions (NFC) These load and store APIs now do the same thing, so combine them into one.	2023-04-12 16:46:54 +02:00
Nikita Popov	6e78fd58cd	[GVN][VNCoercion] Remove load widening leftovers (NFCI) GVN load widening was disabled in D24096. This removes various support code that is no longer relevant. The way this works nowadays is that we return PartialAlias with an offset from BasicAA and this gets passed on as a clobber by MDA. However, PartialAlias will only be returned if the load is properly nested inside the other load. This just removes the bulk of the code, but some additional cleanup can be done here now that we don't need to distinguish between load and store cases.	2023-04-12 16:32:46 +02:00
Florian Hahn	78148eba49	[Matrix] Fix crash during dot product lowering. Perform dot-product lowering before instruction fusion to avoid crash in newly added test. Also update lowerDotProduct to properly mark optimized matmul as fused.	2023-04-12 15:08:39 +01:00
Max Kazantsev	3b73892b43	[SimpleLoopUnswitch] Do not try to inject pointer conditions. PR62058 As shown in https://github.com/llvm/llvm-project/issues/62058, canonicalization may fail with pointer types (and basically this transform is not expected to work with pointers).	2023-04-12 20:38:17 +07:00
OCHyams	9106960724	[Assignment Tracking][SROA] Don't un-poison dbg.assigns using multiple loc ops Some dbg.assigns using poison become un-poisoned in SROA. The reason this happens at all is because dbg.assigns linked to memory intrinsics use poison to indicate they can't describe the stored value, but the value becomes available after some optimisations. This needs reworking eventually, but for now we need to ensure that when it does occur we don't create invalid expressions. D147312 prevented this occurring when the dbg.assign uses DIArgLists, but that wasn't a complete fix. We also need to ensure we avoid un-poisoning when the existing expression uses more than one location operand (DW_OP_arg, n). Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D148020	2023-04-11 18:18:11 +01:00
Anna Thomas	5675757f5f	[GuardUtils] Add asserts about loop varying widenable conditions We have now seen two miscompiles because of widening widenable conditions at incorrect IR points and thereby changing a branch's loop invariant condition to a loop-varying one (see PR60234 and PR61963). This patch adds asserts in common guard utilities that we use for widening to proactively catch these bugs in future. Note that these asserts will not fire if we were to sink a widenable condition from out of a loop into a loop (that's also incorrect for the same reason as above). Tested this without the fix for PR60234 (guard widening miscompile) and confirmed the assert fires. WARNING: Sometimes, the assert can fire if we failed to hoist the invariant condition out of the loop. This is a pass-ordering issue or a limitation in LICM, which would need an investigation. See details in review. Differential Revision: https://reviews.llvm.org/D147752	2023-04-11 10:54:07 -04:00
Max Kazantsev	a42f589197	[LICM][NFC] Unify arithmetic statistics collection Avoid divergence b/w different kinds of hoisting with reassociation. Make them all collect general stat NumHoisted and also specific stats for each particular transform.	2023-04-11 17:20:02 +07:00
Max Kazantsev	7b8692a55c	[LICM][NFC] Do not forward declare hoistMinMax They are all now handled by hoistArithmetics, and only it should be forward declared.	2023-04-11 17:06:20 +07:00
Nikita Popov	243df834c6	[LICM] Fix assert failure in no-allowspeculation mode In this case the source GEP might not be hoisted even though it has invariant operands. For now just bail out, but we might need additional checks for AllowSpeculation in these special-case reassociation folds.	2023-04-11 11:55:54 +02:00
Nikita Popov	b8917ac62a	[LICM] Reassociate GEPs to allow hoisting Reassociate gep (gep ptr, idx1), idx2 to gep (gep ptr, idx2), idx1 if this would make the inner GEP loop invariant and thus hoistable. This is intended to replace an InstCombine fold that does this (in `04f61fb73d/llvm/lib/Transforms/InstCombine/InstructionCombining.cpp (L2006)`). The problem with the InstCombine fold is that LoopInfo is an optional dependency, so it is not performed reliably. Differential Revision: https://reviews.llvm.org/D146813	2023-04-11 10:34:04 +02:00
Max Kazantsev	cd24665f13	[NFC] Fix typo in statistic description	2023-04-11 14:18:53 +07:00
Max Kazantsev	e5dc4dbe87	[LICM][NFC] Restructure code to have one entry point for reassociation-based hoistings We already hoist min/max functions and want to do more of this kind. Some refactoring to make growth points for it.	2023-04-11 14:18:53 +07:00
Anna Thomas	27f8a62a54	[LoopPredication] Fix where we generate widened condition. PR61963 Loop predication's predicateLoopExit pass does two incorrect things: (1) It sinks the widenable call into the loop, thereby converting an invariant condition to a variant one. (2) It widens the widenable call at a branch, thereby converting the branch into a loop-varying one. The latter is problematic when the branch may have been loop-invariant and prior optimizations (such as indvars) may have relied on this fact, and updated the deopt state accordingly. Now, when we widen this with a loop-varying condition, the deopt state is no longer correct. Fixes https://github.com/llvm/llvm-project/issues/61963. Differential Revision: https://reviews.llvm.org/D147662	2023-04-10 10:37:05 -04:00
Max Kazantsev	d0950d05a6	[NFC][IRCE] Do not store latch exit count It is not actually used for any computations. Its only purpose is to check that the loop is finite and find out the type of computed exit count. Refactor code so that we only store this type.	2023-04-10 14:00:14 +07:00
Zhongyunde	0e739ddd17	[MergeICmps] Attach metadata to new created loads Use clone to keep the metadata, the issue is reported by aeubanks on D141188. Reviewed By: nikic, paulwalker-arm Differential Revision: https://reviews.llvm.org/D146702	2023-04-08 10:45:58 +08:00
OCHyams	086635d6b9	[Assignment Tracking][SROA] Fix fragment when slice size equals variable size Correctly handle the case of splitting an alloca which backs contiguous distinct variables, where a slice's size equals the size of a backed variable. We need to ensure that we don't generate fragment expressions with fragments of the same size as the variable as this is a verifier error. Prior to this patch a fragment expression would be created in this situation. e.g. splitting an alloca i64 with two adjacent 32-bit variables into two 32-bit allocas, the new dbg.assign expressions would contain (DW_OP_LLVM_fragment, 0, 32) and (DW_OP_LLVM_fragment, 32, 32) even though those fragments cover each variable entirely. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D147696	2023-04-06 15:29:18 +01:00
Dmitry Makogon	3d7242f05e	Reapply "[LSR] Preserve LCSSA when rewriting instruction with PHI user" This reverts commit efd34ba60f3839b0a68b2e32ff9011b6823bc16f. Reapplies 8ff4832679e1. Missed a failing test. Needed to just update test checks.	2023-04-06 17:31:27 +07:00
Serguei Katkov	6bda53c591	[GuardWidening] Re-factor freezeAndPush. Re-write the code to avoid iteration over users of constants and global values. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D147450	2023-04-06 16:46:47 +07:00
Bjorn Pettersson	44773b798a	[SimpleLoopUnswitch] Fix SCEV invalidation issue This patch is making sure that we use getTopMostExitingLoop when finding out which loops to forget, when dealing with unswitchNontrivialInvariants and unswitchTrivialSwitch. It seems to at least be needed for unswitchNontrivialInvariants as detected by the included test case. Note that unswitchTrivialBranch already used getTopMostExitingLoop. This was done in commit 4a9cde5a791cd49b96993e6. The commit message in that commit says "If the patch makes sense, I will also update those places to a similar approach ...", referring to these functions mentioned above. As far as I can tell that never happened, but this is an attempt to finally fix that. Fixes https://github.com/llvm/llvm-project/issues/61080 Differential Revision: https://reviews.llvm.org/D147058	2023-04-06 09:46:42 +02:00
Nikita Popov	7c78cb4b1f	Revert "[SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating" This reverts commit 78b1fbc63f78660ef10e3ccf0e527c667a563bc8. This causes or exposes miscompiles in Rust, revert until they have been investigated.	2023-04-05 17:05:39 +02:00
Florian Hahn	04681243b4	[Matrix] Limit dot lowering to column major matrixes. Limit dot product lowering to column major matrixes for now. This simplifies the code and reasoning for upcoming planned improvements. Support for row-major matrixes can be added later as an extension.	2023-04-05 15:49:06 +01:00
OCHyams	76740fb40e	[Assignment Tracking][SROA] Handle createFragmentExpression failure createFragmentExpression will fail if it determines that the expression cannot be split over fragments. Handle this case in SROA. Similarly to D147312 this should be a rare occurrence as the `dbg.assign` will usually reference the `Value` being stored without modifying it with a `DIExpression`. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D147431	2023-04-05 11:20:32 +01:00
Nikita Popov	7553bad1ac	[LICM] Don't require optimized uses LICM currently requests optimized use MSSA form. This is wasteful, because LICM doesn't actually care about most uses, only those of invariant pointers in loops. Everything else doesn't need to be optimized. LICM already uses the clobber walker in most places. This patch adjusts one place that was using getDefiningAccess() to use it as well, so we no longer have a dependence on pre-optimized uses. This change is not NFC in that the fallback on the defining access when there are too many clobber calls may now fall back to an unoptimized use. In practice, I've not seen any problems with this though. If desired, we could also increase licm-mssa-optimization-cap to a higher value (increasing this from 100 to 200 has no impact on average compile-time -- but also doesn't appear to have any impact on LICM quality either). This makes for a 0.9% geomean compile-time improvement on CTMark. Differential Revision: https://reviews.llvm.org/D147437	2023-04-05 11:20:25 +02:00
Jeff Byrnes	9b79d0b610	[MergedLoadStoreMotion] Merge stores with conflicting value types Since memory does not have an intrinsic type, we do not need to require value type matching on stores in order to sink them. To facilitate that, this patch finds stores which are sinkable, but have conflicting types, and bitcasts the ValueOperand so they are easily sinkable into a PHINode. Rather than doing fancy analysis to optimally insert the bitcast, we always insert right before the relevant store in the diamond branch. The assumption is that later passes (e.g. GVN, SimplifyCFG) will clean up bitcasts as needed. Differential Revision: https://reviews.llvm.org/D147348	2023-04-04 12:01:29 -07:00
Nikita Popov	78b1fbc63f	[SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating After D141386, violation of nonnull, range and align metadata results in poison rather than immediate undefined behavior, which means that these are now safe to retain when speculating. We only need to remove UB-implying metadata like noundef. This is done by adding a dropUBImplyingAttrsAndMetadata() helper, which lists the metadata which is known safe to retain on speculation. Differential Revision: https://reviews.llvm.org/D146629	2023-04-04 10:03:45 +02:00
Nikita Popov	9b5ff4436e	[EarlyCSE] Call combineMetadataForCSE() when CSEing loads We may have to adjust metadata on the replacement load if the metadata is poison-generating.	2023-04-03 16:10:19 +02:00
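
The ConstraintElimination entry above (721ecc9d41) transfers a signed fact `sgt %a, %b` to the unsigned fact `ugt %a, %b` when `%b > 0`. The following is a minimal sketch, not code from the patch, that brute-forces the underlying arithmetic claim over all 8-bit values. The bit width and all function names here are assumptions chosen for illustration.

```python
# Brute-force check of the signed-to-unsigned transfer on 8-bit integers.
# BITS and every helper name below are hypothetical, chosen for this sketch.
BITS = 8


def to_unsigned(x: int, bits: int = BITS) -> int:
    """Reinterpret a signed two's-complement value as unsigned."""
    return x & ((1 << bits) - 1)


def signed_values(bits: int = BITS) -> range:
    """All values representable as a signed integer of the given width."""
    return range(-(1 << (bits - 1)), 1 << (bits - 1))


def transfer_holds(bits: int = BITS) -> bool:
    """True if (a >s b and b >s 0) implies (a >u b) for every pair."""
    for a in signed_values(bits):
        for b in signed_values(bits):
            if b > 0 and a > b:
                if not to_unsigned(a, bits) > to_unsigned(b, bits):
                    return False
    return True


def counterexamples_without_guard(bits: int = BITS) -> list[tuple[int, int]]:
    """Pairs where a >s b holds but a >u b does not (no guard on b)."""
    return [
        (a, b)
        for a in signed_values(bits)
        for b in signed_values(bits)
        if a > b and not to_unsigned(a, bits) > to_unsigned(b, bits)
    ]


if __name__ == "__main__":
    print("transfer holds with guard:", transfer_holds())
    print("sample failure without guard:", counterexamples_without_guard()[:1])
```

Without the guard on `%b`, a pair such as `a = 1`, `b = -1` satisfies the signed comparison but not the unsigned one, because `-1` reinterprets as the largest unsigned value. Every such failure has a negative `b`, which is why a condition on the sign of `%b` is needed.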


12402 Commits