llvm-project

Author	SHA1	Message	Date
Sudharsan Veeravalli	f55e12a76d	[DebugInfo][SimplifyIndVar] Fix missing debug locations for div/rem instructions (#97284 ) Fixes #97282	2024-07-02 15:01:48 +01:00
Stephen Tozer	d75f9dd1d2	Revert "[IR][NFC] Update IRBuilder to use InsertPosition (#96497 )" Reverts the above commit, as it updates a common header function and did not update all callsites: https://lab.llvm.org/buildbot/#/builders/29/builds/382 This reverts commit 6481dc57612671ebe77fe9c34214fba94e1b3b27.	2024-06-24 18:00:22 +01:00
Stephen Tozer	6481dc5761	[IR][NFC] Update IRBuilder to use InsertPosition (#96497 ) Uses the new InsertPosition class (added in #94226) to simplify some of the IRBuilder interface, and removes the need to pass a BasicBlock alongside a BasicBlock::iterator, using the fact that we can now get the parent basic block from the iterator even if it points to the sentinel. This patch removes the BasicBlock argument from each constructor or call to setInsertPoint. This has no functional effect, but later on as we look to remove the `Instruction *InsertBefore` argument from instruction-creation (discussed [here](https://discourse.llvm.org/t/psa-instruction-constructors-changing-to-iterator-only-insertion/77845)), this will simplify the process by allowing us to deprecate the InsertPosition constructor directly and catch all the cases where we use instructions rather than iterators.	2024-06-24 17:27:43 +01:00
Nikita Popov	6e3725d7f2	[IndVars] Make pushIVUsers() a member function (NFC) Make it easier to access additional state from it.	2024-06-24 14:39:05 +02:00
Andreas Jonson	ff3523f67b	[IR] Drop poison-generating return attributes when necessary (#89138 ) Rename has/dropPoisonGeneratingFlagsOrMetadata to has/dropPoisonGeneratingAnnotations and make it also handle nonnull, align and range return attributes on calls, similar to the existing handling for !nonnull, !align and !range metadata.	2024-04-18 15:27:36 +09:00
Nikita Popov	4b22a923c4	[IndVars] Mark truncs as nuw/nsw (#88686 ) When inserting truncs during IV widening, mark the trunc as either nuw or nsw depending on whether zext or sext widening was used. For non-negative IVs both nuw and nsw apply.	2024-04-16 10:42:32 +09:00
Florian Hahn	0f82469314	[Passes] Run SimpleLoopUnswitch after introducing invariant branches. (#81271 ) IndVars may be able to replace a loop dependent condition with a loop invariant one, but loop-unswitch runs before IndVars, so the invariant check remains in the loop. For an example, consider a read-only loop with a bounds check: https://godbolt.org/z/8cdj4qhbG This patch uses a approach similar to the way extra cleanup passes are run on demand after vectorization (added in acea6e9cfa4c4a0e8678c7). It introduces a new ShouldRunExtraSimpleLoopUnswitch analysis marker, which IndVars can use to indicate that extra unswitching is beneficial. ExtraSimpleLoopUnswitchPassManager uses this analysis to determine whether to run its passes on a loop. Compile-time impact (geomean) ranges from +0.0% to 0.02% https://llvm-compile-time-tracker.com/compare.php?from=138c0beb109ffe47f75a0fe8c4dc2cdabe8a6532&to=19e6e99eeb280d426907ea73a21b139ba7225627&stat=instructions%3Au Compile-time impact (geomean) of unconditionally running SimpleLoopUnswitch ranges from +0.05% - +0.16% https://llvm-compile-time-tracker.com/compare.php?from=138c0beb109ffe47f75a0fe8c4dc2cdabe8a6532&to=2930dfd5accdce2e6f8d5146ae4d626add2065a2&stat=instructions:u Unconditionally running SimpleLoopUnswitch seems to indicate that there are multiple other scenarios where we fail to run unswitching when opportunities remain. Fixes https://github.com/llvm/llvm-project/issues/85551. PR: https://github.com/llvm/llvm-project/pull/81271	2024-04-12 22:07:29 +01:00
Craig Topper	25959310a5	[IndVars] Support shl by constant and or disjoint in getExtendedOperandRecurrence. (#84282 ) We can treat a shift by constant as a multiply by a power of 2 and we can treat an or disjoint as a 'add nsw nuw'. I've added a helper struct similar to a struct used in ScalarEvolution.cpp to represent the opcode, operands, and NSW/NUW flags for normal add/sub/mul and shl/or that are being treated as mul/add. I don't think we need to teach cloneIVUser about this. It will continue to clone them using cloneBitwiseIVUser. After the cloning we will ask for the SCEV expression for the cloned IV user and verify that it matches the AddRec returned by getExtendedOperandRecurrence. Since SCEV also knows how to convert shl to mul and or disjoint to add nsw nuw, this should usually match. If it doesn't match, the cloned IV user will be deleted.	2024-03-14 12:52:12 -07:00
Jeremy Morse	6b62a9135a	[RemoveDIs] Reapply 3fda50d3915, insert instructions using iterators I'd reverted this in 6c7805d5d1 after a bad stage. Original commit messsage follows: [NFC][RemoveDIs] Bulk update utilities to insert with iterators As part of the RemoveDIs project we need LLVM to insert instructions using iterators wherever possible, so that the iterators can carry a bit of debug-info. This commit implements some of that by updating the contents of llvm/lib/Transforms/Utils to always use iterator-versions of instruction constructors. There are two general flavours of update: * Almost all call-sites just call getIterator on an instruction * Several make use of an existing iterator (scenarios where the code is actually significant for debug-info) The underlying logic is that any call to getFirstInsertionPt or similar APIs that identify the start of a block need to have that iterator passed directly to the insertion function, without being converted to a bare Instruction pointer along the way. I've also switched DemotePHIToStack to take an optional iterator: it needs to take an iterator, and having a no-insert-location behaviour appears to be important. The constructors for ICmpInst and FCmpInst have been updated too. They're the only instructions that take block _references_ rather than pointers for certain calls, and a future patch is going to make use of default-null block insertion locations. All of this should be NFC.	2024-03-04 13:14:39 +00:00
Jeremy Morse	6c7805d5d1	Revert "[NFC][RemoveDIs] Bulk update utilities to insert with iterators" This reverts commit 3fda50d3915b2163a54a37b602be7783a89dd808. Apparently I've missed a hunk while staging this; will back out for now. Picked up here: https://lab.llvm.org/buildbot/#/builders/139/builds/60429/steps/6/logs/stdio	2024-02-29 16:50:22 +00:00
Jeremy Morse	3fda50d391	[NFC][RemoveDIs] Bulk update utilities to insert with iterators As part of the RemoveDIs project we need LLVM to insert instructions using iterators wherever possible, so that the iterators can carry a bit of debug-info. This commit implements some of that by updating the contents of llvm/lib/Transforms/Utils to always use iterator-versions of instruction constructors. There are two general flavours of update: * Almost all call-sites just call getIterator on an instruction * Several make use of an existing iterator (scenarios where the code is actually significant for debug-info) The underlying logic is that any call to getFirstInsertionPt or similar APIs that identify the start of a block need to have that iterator passed directly to the insertion function, without being converted to a bare Instruction pointer along the way. I've also switched DemotePHIToStack to take an optional iterator: it needs to take an iterator, and having a no-insert-location behaviour appears to be important. The constructors for ICmpInst and FCmpInst have been updated too. They're the only instructions that take block _references_ rather than pointers for certain calls, and a future patch is going to make use of default-null block insertion locations. All of this should be NFC.	2024-02-29 16:39:09 +00:00
Vedant Paranjape	e209178d64	[SimplifyIndVar] LCSSA form is destroyed by simplifyLoopIVs, preserve it (#78696 ) In LoopUnroll, peelLoop is called on the loop. After the loop is peeled it calls simplifyLoopAfterUnroll on the loop. This call to simplifyLoopAfterUnroll doesn't preserve the LCSSA form of the parent loop and thus during the next call to peelLoop the LCSSA form is already broken. LoopPeel util takes in the PreserveLCSSA argument and it passes on the same argument to simplifyLoop which checks if the loop is in a valid LCSSA form, when (PreserveLCSSA = true). This causes an assert in simplifyLoop when (PreserveLCSSA = true), as during the last call LCSSA for the loop wasn't preserved, and thus crashes at the following assert. assert(L->isRecursivelyLCSSAForm(DT, LI) && "Requested to preserve LCSSA, but it's already broken."); Upon debugging, it is evident that simplifyLoopIVs call inside simplifyLoopAfterUnroll breaks the LCSSA form. This patch fixes llvm#77118, it checks if the replacement of IV Users with Loop Invariant preserves the LCSSA form. If it does not, it emits the required LCSSA Phi instructions.	2024-02-21 17:51:56 +05:30
Florian Hahn	4db93e5d56	[IndVars] Recompute flags if needed in widenIVUse of IV increment. (#82352 ) widenIVUse may hoist a wide induction increment and introduce new uses, but does not recompute the wrap flags. In some cases this can make the new uses of the wide IV inc more poisonous. Update the code to recompute flags if needed when hoisting an IV. If both the narrow and wide IV increment's flags match and we can re-use the flags from the increments, there's no need to recompute the flags, as the replacement won't make the new uses of the wide IV's increment more poisonous. Note that this also updates a stale comment which claimed that the widen increment is only used if it dominates the new use. The helper should also be used to guard the code added in da437330be, which I am planning on doing separately once the helper lands. Fixes https://github.com/llvm/llvm-project/issues/82243.	2024-02-20 14:03:25 +00:00
Florian Hahn	dce77a3579	[IndVars] Preserve flags of narrow IV inc if replacing with wider inc. (#80446 ) We are replacing a narrow IV increment with a wider one. If the original (narrow) increment did not wrap, the wider one should not wrap either. Set the flags to be the union of both wide increment and original increment; this ensures we preserve flags SCEV could infer for the wider increment. Fixes https://github.com/llvm/llvm-project/issues/71517.	2024-02-10 18:11:17 +00:00
Nikita Popov	7d2b6f0b35	[IndVarSimplify] Fix poison-safety when reusing instructions (#80458 ) IndVars may replace an instruction with one of its operands, if they have the same SCEV expression. However, such a replacement may be more poisonous. First, check whether the operand being poison implies that the instruction is also poison, in which case the replacement is always safe. If this fails, check whether SCEV can determine that reusing the instruction is safe, using the same check as SCEVExpander. Fixes https://github.com/llvm/llvm-project/issues/79861.	2024-02-05 10:11:39 +01:00
Jay Foad	7017efa1a1	Fix typo "widended"	2024-01-19 13:50:26 +00:00
Nikita Popov	c16559137c	[IndVars] Avoid unnecessary truncate for zext nneg use When performing sext IV widening, if one of the narrow uses is in a zext nneg, we can treat it like an sext and avoid the insertion of a trunc.	2023-12-22 11:30:17 +01:00
Nikita Popov	24e80d4cc5	[IndVars] Move "using namespace" to top-level scope (NFC)	2023-12-22 11:28:54 +01:00
Wenju He	2441e237a2	[NFC][indvars] Remove unused code in WidenIV::widenLoopCompare (#73506 )	2023-12-01 09:46:24 +08:00
Markos Horro	9d2903c8e5	[IndVars] Add check of loop invariant for trunc instructions (#71072 ) The same idea as in 34d380e1f63a7e2cdb9ab1e6498f727fcd710a14, but considering truncation instructions. Improvement for #59633.	2023-11-08 11:16:23 +00:00
Philip Reames	551c280cfd	[indvars] Always fallback to truncation if AddRec widening fails (#70967 ) The current code structure results in cases where if a) we can't clone the IV user (because it's not in our whitelist) or b) can't prove the SCEV expressions are identical, we'd sometimes leave both the original unwiddened IV and the partially widdened IV in code. Instead, just truncate thw wide IV to the use - same as what we'd do if we couldn't find an addrec to start with. Noticed this while playing with changing how we produce addrecs. The current structure results in a very tight interlock between SCEVs internal capabilities and indvars code.	2023-11-07 07:49:39 -08:00
Philip Reames	5adf6ab7ff	Revert "[IndVars] Generate zext nneg when locally obvious" This reverts commit a6c8e27b3a052913a15a13ee0d4ac466c5ab3f92. It appears likely to have caused https://lab.llvm.org/buildbot/#/builders/57/builds/30988.	2023-11-03 11:19:14 -07:00
Philip Reames	7c93452e17	[indvars] Restructure getExtendedOperandRecurrence [nfc] As suggested during review of https://github.com/llvm/llvm-project/pull/70990.	2023-11-03 10:50:57 -07:00
Philip Reames	1ffea97ffd	[indvars] Support known positive extends in getExtendedOperandRecurrence (#70990 ) IndVars has the existing notion of a narrow definition which is known to positive and thus both sign and zero extension kinds are actually the same operations. There's existing logic for forming a SCEV based on the extension kind and the no-wrap flags. This change extends that logic to form the opposite extension kind for a positive def if doing so is allowed by the flags. Note that we already do something analogous for the getWideRecurrence case as well.	2023-11-03 10:21:30 -07:00
Philip Reames	a6c8e27b3a	[IndVars] Generate zext nneg when locally obvious zext nneg was recently added to the IR in #67982. This patch teaches SimplifyIndVars to prefer zext nneg over both sext and plain zext, when a local SCEV query indicates the source is non-negative. The choice to prefer zext nneg over sext looks slightly aggressive here, but probably isn't so much in practice. For cases where we'd "remember" the range fact, instcombine would convert the sext into a zext nneg anyways. The only cases where this produces a different result overall are when SCEV knows a non-local fact, and it doesn't get materialized into the IR. Those are exactly the cases where using zext nneg are most useful. We do run the risk of e.g. a missing combine - since we haven't updated most of them yet - but that seems like a manageable risk. Note that there are much deeper algorithmic changes we could make to this code to exploit zext nneg, but this seemed like a reasonable and low risk starting point.	2023-11-03 09:20:59 -07:00
Philip Reames	a78f5c0649	[IndVars] Use IRBuilder in eliminateTrunc [nfc-ish] (#70836 ) Mostly a cleanup so that we don't need to manually emit instructions, and can eagerly constant fold where relevant.	2023-10-31 14:37:57 -07:00
Andy Kaylor	4e0c6d3057	Fix build warning caused by mixed signed/unsigned compare (#69797 )	2023-10-20 16:42:41 -07:00
Andy Kaylor	cb472fbd47	Update SimplifyIndVar.cpp (#69760 ) In SimplifyIndvar::replaceFloatIVWithIntegerIV() the return value of getFPMantissaWidth() was being cast as an unsigned integer and then compared with the number of bits needed to represent an integer that was cast to and from a floating-point type. This is a problem because getFPMantissaWidth() returns -1 if the type does not have a stable mantissa. Currently the only type that returns -1 is ppc_fp128, so you'd need a pretty big induction variable to cause a problem. However, this problem will be more likely to be exposed when we implement support for decimal floating-point types. Strictly speaking, what we want to know here is the size of the biggest integer that can be represented exactly. We could get that information even with an unstable mantissa width, but getFPMantissaWidth() won't do it.	2023-10-20 15:23:45 -07:00
Jeremy Morse	e54277fa10	[NFC][RemoveDIs] Use iterators over inst-pointers when using IRBuilder This patch adds a two-argument SetInsertPoint method to IRBuilder that takes a block/iterator instead of an instruction, and updates many call sites to use it. The motivating reason for doing this is given here [0], we'd like to pass around more information about the position of debug-info in the iterator object. That necessitates passing iterators around most of the time. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152468	2023-09-11 20:01:19 +01:00
Nikita Popov	d82f0b74de	[IndVars] Don't assume backedge value is instruction (PR64891) In degenerate cases, the backedge value can be folded to poison. Fixes https://github.com/llvm/llvm-project/issues/64891.	2023-08-22 10:33:33 +02:00
zhongyunde	34d380e1f6	[IndVars] Add check of loop invariant for indirect use We usually only check direct use instruction of IV, while the bitcast of 'ptrtoint ptr to i64' doesn't affect the result, so go a step further. Fix https://github.com/llvm/llvm-project/issues/59633. Reviewed By: markoshorro Differential Revision: https://reviews.llvm.org/D151877	2023-06-03 22:29:09 +08:00
Nikita Popov	dc81e69eb1	[IndVars] Check expansion safety in makeIVComparisonInvariant() (PR62992) Make sure the invariant expressions are safe to expand. In particular, we should not speculative a trapping division into the preheader. Fixes https://github.com/llvm/llvm-project/issues/62992.	2023-05-31 11:21:35 +02:00
Max Kazantsev	79d2c26f89	[SimplifyIndVar][NFC] Refactor Binary Operator's flag strengthening Extract complex logic of Binary Operator's flag strengthening to a separate method in order to reuse it. Patch by Aleksandr Popov! Differential Revision: https://reviews.llvm.org/D143562 Reviewed By: mkazantsev	2023-02-10 15:23:16 +07:00
Nikita Popov	094ccee2c8	Reapply [Dominators] Add findNearestCommonDominator() for Instructions (NFC) Reapply with checks for instructions in unreachable blocks. A test case for this was added in 1ee4a93b15bb. ----- This is a recurring pattern: We want to find the nearest common dominator (instruction) for two instructions, but currently only provide an API for the nearest common dominator of two basic blocks. Add an overload that accepts and return instructions.	2023-01-10 12:16:31 +01:00
Nikita Popov	c60149b49e	Revert "[Dominator] Add findNearestCommonDominator() for Instructions (NFC)" This reverts commit 7f0de9573f758f5f9108795850337a5acbd17eef. This is missing handling for !isReachableFromEntry() blocks, which may be relevant for some callers. Revert for now.	2023-01-06 17:36:01 +01:00
Nikita Popov	7f0de9573f	[Dominator] Add findNearestCommonDominator() for Instructions (NFC) This is a recurring pattern: We want to find the nearest common dominator (instruction) for two instructions, but currently only provide an API for the nearest common dominator of two basic blocks. Add an overload that accepts and return instructions.	2023-01-06 17:06:25 +01:00
Fangrui Song	c178ed33bd	Transforms/Utils: llvm::Optional => std::optional	2022-12-12 08:29:05 +00:00
Max Kazantsev	6dac170140	[SCEVExpander] Support cost evaluation of several SCEVs with same budget This is a follow-up from discussion in D138412. Sometimes we want to evaluate the cost of expansion of several SCEVs together with same budget. For example, if one of them is a bit above cheap limit, and the second one is free, then we still want to expand. Checking each of them with "cheap" limit is a bit more pessimistic. Differential Revision: https://reviews.llvm.org/D138475 Reviewed By: lebedev.ri	2022-12-06 17:02:26 +07:00
Kazu Hirata	343de6856e	[Transforms] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:37 -08:00
Max Kazantsev	57fd7ffeff	[IndVarSimplify] Lift limitations on IV being a Phi for turn-to-invariant These limitations are too strict, and their only purpose is to avoid code size explosion. These restrictions seem obsolete, and the size problem is solved in other places through cheap expansion limits. The motivation is that the old code cannot deal with comparisons against induction variant's increment. Differential Revision: https://reviews.llvm.org/D138412 Reviewed By: lebedev.ri, reames	2022-11-22 12:53:37 +07:00
Florian Hahn	fd236772f5	[IndVars] Forget SCEV for value after simplifying condition. Additional SCEV verification highlighted a case where the cached loop dispositions where incorrect after simplifying a condition in IndVars and moving the user in LoopDeletion. Fix it by invalidating ICmp and all its users. Fixes #58515.	2022-10-21 11:18:01 +01:00
Florian Hahn	c65513444b	[IndVars] Forget SCEV for instruction and users before replacing it. Extra invalidation is needed here to clear stale values to fix a verification failure. Fixes #58440.	2022-10-18 17:38:14 +01:00
Max Kazantsev	e587199a50	[SCEV] Prove condition invariance via context, try 2 Initial implementation had too weak requirements to positive/negative range crossings. Not crossing zero with nuw is not enough for two reasons: - If ArLHS has negative step, it may turn from positive to negative without crossing 0 boundary from left to right (and crossing right to left doesn't count for unsigned); - If ArLHS crosses SINT_MAX boundary, it still turns from positive to negative; In fact we require that ArLHS always stays non-negative or negative, which an be enforced by the following set of preconditions: - both nuw and nsw; - positive step (looks liftable); Because of positive step, boundary crossing is only possible from left part to the right part. And because of no-wrap flags, it is guaranteed to never happen.	2022-08-22 14:31:19 +07:00
Max Kazantsev	f798c042f4	Revert "[SCEV] Prove condition invariance via context" This reverts commit a3d1fb3b59b473e4f262a05f187de6474b7721e7. Reverting until investigation of https://github.com/llvm/llvm-project/issues/57247 has concluded.	2022-08-19 21:02:06 +07:00
Max Kazantsev	a3d1fb3b59	[SCEV] Prove condition invariance via context Contextual knowledge may be used to prove invariance of some conditions. For example, in this case: ``` ; %len >= 0 guard(%iv = {start,+,1}<nuw> <s %len) guard(%iv = {start,+,1}<nuw> <u %len) ``` the 2nd check always fails if `start` is negative and always passes otherwise. It looks like there are more opportunities of this kind that are still to be implemented in the future. Differential Revision: https://reviews.llvm.org/D129753 Reviewed By: apilipenko	2022-08-12 14:23:35 +07:00
zhongyunde	c2ab65ddaf	[IndVars] Eliminate redundant type cast with different sizes Deal with different sizes between the itofp and fptoi with trunc or sext/zext, depend on D129756. Fixes https://github.com/llvm/llvm-project/issues/55505. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D129958	2022-08-09 23:59:42 +08:00
Nuno Lopes	9df0b254d2	[NFC] Switch a few uses of undef to poison as placeholders for unreachable code	2022-07-23 21:50:11 +01:00
zhongyunde	3a6b766b1b	[IndVars] Directly use unsigned integer induction for FPToUI/FPToSI of float induction Depend on D129358 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D129756	2022-07-17 10:48:35 +08:00
Craig Topper	0e718443c7	[SimplifyIndVar] Use enum class for ExtendKind. NFC I happened to notice a two places where the enum was being pass directly to the bool IsSigned argument of createExtendInst. This was functionally ok since SignExtended in the enum has value of 1, but the code shouldn't rely on that. Using an enum class prevents the enum from being convertible to bool, but does make writing the enum values more verbose. Since we now have to write ExtendKind:: in front of them, I've shortened the names of ZeroExtended and SignExtended. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D129733	2022-07-14 10:03:58 -07:00
Nikita Popov	dcf4b733ef	[SCEVExpander] Make CanonicalMode handing in isSafeToExpand() more robust (PR50506) isSafeToExpand() for addrecs depends on whether the SCEVExpander will be used in CanonicalMode. At least one caller currently gets this wrong, resulting in PR50506. Fix this by a) making the CanonicalMode argument on the freestanding functions required and b) adding member functions on SCEVExpander that automatically take the SCEVExpander mode into account. We can use the latter variant nearly everywhere, and thus make sure that there is no chance of CanonicalMode mismatch. Fixes https://github.com/llvm/llvm-project/issues/50506. Differential Revision: https://reviews.llvm.org/D129630	2022-07-14 14:41:51 +02:00

1 2 3 4

187 Commits