llvm-project

Author	SHA1	Message	Date
Tejas Joshi	0609b65aaf	[SCEV] Fix potentially empty set for unsigned ranges The following commit enabled the analysis of ranges for heap allocations: 22ca38da25e19a7c5fcfeb3f22159aba92ec381e The range turns out to be empty in cases such as the one in test (which is [1,1)), leading to an assertion failure. This patch fixes for the same case. Fixes https://github.com/llvm/llvm-project/issues/63856 Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D159160	2023-09-04 10:46:53 +01:00
Vedant Paranjape	5a9a02f67b	[SCEV] Compute SCEV for ashr(add(shl(x, n), c), m) instr triplet %x = shl i64 %w, n %y = add i64 %x, c %z = ashr i64 %y, m The above given instruction triplet is seen many times in the generated LLVM IR, but SCEV model is not able to compute the SCEV value of AShr instruction in this case. This patch models the two cases of the above instruction pattern using the following expression: => sext(add(mul(trunc(w), 2^(n-m)), c >> m)) 1) when n = m the expression reduces to sext(add(trunc(w), c >> n)) as n-m=0, and multiplying with 2^0 gives the same result. 2) when n > m the expression works as given above. It also adds several unittest to verify that SCEV is able to compute the value. $ opt sext-add-inreg.ll -passes="print<scalar-evolution>" Comparing the snippets of the result of SCEV analysis: * SCEV of ashr before change ---------------------------- %idxprom = ashr exact i64 %sext, 32 --> %idxprom U: [-2147483648,2147483648) S: [-2147483648,2147483648) Exits: 8 LoopDispositions: { %for.body: Variant } * SCEV of ashr after change --------------------------- %idxprom = ashr exact i64 %sext, 32 --> {0,+,1}<nuw><nsw><%for.body> U: [0,9) S: [0,9) Exits: 8 LoopDispositions: { %for.body: Computable } LoopDisposition of the given SCEV was LoopVariant before, after adding the new way to model the instruction, the LoopDisposition becomes LoopComputable as it is able to compute the SCEV of the instruction. Differential Revision: https://reviews.llvm.org/D152278	2023-08-25 05:42:08 +00:00
Nikita Popov	1c6e6432ca	[SCEVExpander] Fix incorrect reuse of more poisonous instructions (PR63763) SCEVExpander tries to reuse existing instruction with the same SCEV expression. However, doing this replacement blindly is not safe, because the instruction might be more poisonous. What we were already doing is to drop poison-generating flags on the reused instruction. But this is not the only way that more poison can be introduced. The poison-generating flag might not be directly on the reused instruction, or the poison contribution might come from something like 0 * %var, which folds to 0 but can still introduce poison. This patch fixes the issue in a principled way, by determining which values can contribute poison to the SCEV expression, and then checking whether any additional values can contribute poison to the instruction being reused. Poison-generating flags are dropped if doing that enables reuse. This is a pretty big hammer and does cause some regressions in tests, but less than I would have expected. I wasn't able to come up with a less intrusive fix that still satisfies the correctness requirements. Fixes https://github.com/llvm/llvm-project/issues/63763. Fixes https://github.com/llvm/llvm-project/issues/63926. Fixes https://github.com/llvm/llvm-project/issues/64333. Fixes https://github.com/llvm/llvm-project/issues/63727. Differential Revision: https://reviews.llvm.org/D158181	2023-08-22 09:27:07 +02:00
Nikita Popov	b108c11e46	[SCEV] Move SCEVPoisonCollector outside function (NFC) To allow reusing it. Also switch it to store SCEVUnknown rather than SCEV, as only these can produce poison.	2023-08-11 16:41:31 +02:00
Nikita Popov	bbc8079906	[SCEV] Remove unnecessary bitcast (NFC)	2023-08-09 16:21:54 +02:00
Maksim Kita	96cae819cb	[ScalarEvolution][NFC] Typo fix Fix typo in ScalarEvolution public method. Differential Revision: https://reviews.llvm.org/D156621	2023-07-31 20:16:15 +03:00
Ivan Kosarev	e9df4c9892	[ADT] Support iterating size-based integer ranges. It seems the ranges start with 0 in most cases. Reviewed By: dblaikie, gchatelet Differential Revision: https://reviews.llvm.org/D156135	2023-07-26 16:28:41 +01:00
Arthur Eubanks	b4d9cd2d61	[NFC] Update stale comment after D154001	2023-06-29 09:36:10 -07:00
Arthur Eubanks	22ca38da25	[ScalarEvolution] Analyze ranges for heap allocations Followup to D153624. Allows for better exit count calculations for loops checking heap allocations against null. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D154001	2023-06-29 09:35:20 -07:00
Liren Peng	75a73c983f	Revert "[ScalarEvolution] Infer loop max trip count from array accesses" This reverts commit 57e093162e27334730d8ed8f7b25b1b6f65ec8c8.	2023-06-29 17:27:38 +08:00
Nikita Popov	3cd4571405	[SCEV] Make use of non-null pointers for range calculation We know that certain pointers (e.g. non-extern-weak globals or allocas in default address space) are not null, in which case the lowest address they can be allocated at is their alignment. This allows us to calculate better exit counts for loops that have an additional null check in the guarding condition (see alloca_icmp_null_exit_count). Differential Revision: https://reviews.llvm.org/D153624	2023-06-29 09:09:17 +02:00
Vitaly Buka	1764924f39	[SCEV] Optimize FoldID Improve compile time https://llvm-compile-time-tracker.com/compare.php?from=773e5dfbc6bf4d4c5be568a039661e9baad80d15&to=7ba15f3a4b59181110e73dc397a9fe56165a2b73&stat=instructions:u Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D144335	2023-06-27 16:59:34 -07:00
Nikita Popov	dac973226a	[SCEV] Print block dispositions on mismatch (NFC)	2023-06-26 15:26:44 +02:00
Elliot Goodrich	b0abd4893f	[llvm] Add missing StringExtras.h includes In preparation for removing the `#include "llvm/ADT/StringExtras.h"` from the header to source file of `llvm/Support/Error.h`, first add in all the missing includes that were previously included transitively through this header.	2023-06-25 15:42:22 +01:00
Nikita Popov	406e9c9372	[SCEV] Use object size for allocas as well The object size and alignment based restriction on the possible allocation range also applies to allocas, not just globals, so handle them as well. We shouldn't really need any type restriction here at all, but for now stay conservative.	2023-06-23 12:38:12 +02:00
Nikita Popov	6555b5dc99	[SCEV] Store getValue() result in variable (NFC)	2023-06-23 12:31:36 +02:00
Dmitry Makogon	ce1ac1cf18	[SCEV] Don't store AddRec loop when simplifying multiplication of AddRecs When multiplying several AddRecs, we do the following simplification: {A1,+,A2,+,...,+,An}<L> * {B1,+,B2,+,...,+,Bn}<L> = {x=1 in [ sum y=x..2x [ sum z=max(y-x, y-n)..min(x,n) [ choose(x, 2x)choose(2x-y, x-z)A_{y-z}B_z]] ],+,...up to x=2n} This is done iteratively, pair by pair. So if we try to multiply three AddRecs A1, A2, A3, then we'd try to simplify A1 A2 to A1' and then try to simplify A1' * A3 if A1' is also an AddRec. The transform is only legal if the loops of the two AddRecs are the same. It is checked in the code, but the loop of one of the AddRecs is stored in a local variable and doesn't get updated when we simplify a pair to a new AddRec. In the motivating test the new AddRec A1' was created for a different loop and, as the loop variable didn't get updated, the check for different loops passed and the transform worked for two AddRecs from different loops. So it created a wrong SCEV. And it caused LSR to replace an instruction with another one that had the same SCEV as the incorrectly computed one. Differential Revision: https://reviews.llvm.org/D153254	2023-06-22 15:49:15 +07:00
Florian Hahn	2ba78229e4	[SCEV] Try smaller ZExts when using loop guard info. If we didn't find the extact ZExt expr in the rewrite map, check if there's an entry for a smaller ZExt we can use instead. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D149786	2023-06-09 20:05:50 +01:00
Nikita Popov	c1aa0dce48	[SCEV] Remove -verify-scev-maps flag This is now checked as part of the usual SCEV verification. There is little value in checking this on each lookup. These two maps are strictly synchronized nowadays, which was not the case historically.	2023-06-09 11:51:53 +02:00
Joshua Cao	6ed152aff4	[SCEV] Compute AddRec range computations using different type BECount Before this patch, we can only use the MaxBECount for an AddRec's range computation if the MaxBECount has <= bit width of the AddRec. This patch reasons that if a MaxBECount has > bit width, and is <= the max value of AddRec's bit width, we can still use the MaxBECount. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D151698	2023-05-31 21:05:17 -07:00
Joshua Cao	46c59a55e7	[SCEV][NFC] Refactor range computation for AddRec to pass around APInt	2023-05-31 21:03:20 -07:00
Joshua Cao	ff471dcf76	[SCEV] Fix verification of SCEV multiples.	2023-05-31 21:00:22 -07:00
Nikita Popov	0c23dc20bc	Reapply [SCEV] Replace IsAvailableOnEntry with block disposition This exposed an issue in SCEVExpander/LCSSA, which has been fixed in D150681. ----- As far as I understand, the IsAvailableOnEntry() function basically implements the same functionality as the properlyDominates() block disposition. The primary difference (apart from a weaker implementation) seems to be in this comment at the top: // Checks if the SCEV S is available at BB. S is considered available at BB // if S can be materialized at BB without introducing a fault. However, I don't really understand why there would be such a requirement. It's my understanding that SCEV explicitly does not care about trapping udiv instructions itself, and it's the job of SCEVExpander's isSafeToExpand() to make sure these don't get expanded if they may trap. Differential Revision: https://reviews.llvm.org/D149344	2023-05-25 10:02:18 +02:00
Craig Topper	6006d43e2d	LLVM_FALLTHROUGH => [[fallthrough]]. NFC Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D150996	2023-05-24 12:40:10 -07:00
Dmitry Makogon	2bb3515152	[SCEV] Replace NumTripCountsComputed stat with NumExitCountsComputed This fixes assertion crash in https://github.com/llvm/llvm-project/issues/62380. In the beginning of ScalarEvolution::getBackedgeTakenInfo we make sure that BackedgeTakenCounts contains an entry for the given loop. Then we call computeBackedgeTakenCount which computes the result, and in the end we insert it in the map like so: return BackedgeTakenCounts.find(L)->second = std::move(Result); So we expect that the entry for L still exists in the cache. However, it can get deleted. When it has computed the result, getBackedgeTakenInfo clears all the cached SCEVs that use the AddRecs in the loop. In the crashing example, getBackedgeTakenInfo first gets called on an inner loop, and during this call it gets called again on its parent loop. This recursion happens after the call to computeBackedgeTakenCount. And it happens so that some SCEV from the BTI of the child loop uses an AddRec of the parent loop. So when we successfully compute BTI for the parent loop, we erase already computed result for the child one. The recursion happens in some debug only code that updates statistics. The algorithm itself is non-recursive. Namely the recursive call happens in BackedgeTakenInfo::getExact function and its return value is only used to compare it against SCEVCouldNotCompute. As suggested by nikic I replaced the NumTripCountsComputed and NumTripCountsNotComputed with NumExitCountsComputed and NumExitCountsNotComputed respectively. They are updated during computations made for single exits. It relieves us of the need to compute exact exit count for the loop just to update the named statistic and thus the recursion cannot happen anymore. Differential Revision: https://reviews.llvm.org/D149251	2023-05-22 20:10:51 +07:00
eopXD	c8eb535aed	[1/11][IR] Permit load/store/alloca for struct of the same scalable vector type This patch-set aims to simplify the existing RVV segment load/store intrinsics to use a type that represents a tuple of vectors instead. To achieve this, first we need to relax the current limitation for an aggregate type to be a target of load/store/alloca when the aggregate type contains homogeneous scalable vector types. Then to adjust the prolog of an LLVM function during lowering to clang. Finally we re-define the RVV segment load/store intrinsics to use the tuple types. The pull request under the RVV intrinsic specification is riscv-non-isa/rvv-intrinsic-doc#198 --- This is the 1st patch of the patch-set. This patch is originated from D98169. This patch allows aggregate type (StructType) that contains homogeneous scalable vector types to be a target of load/store/alloca. The RFC of this patch was posted in LLVM Discourse. https://discourse.llvm.org/t/rfc-ir-permit-load-store-alloca-for-struct-of-the-same-scalable-vector-type/69527 The main changes in this patch are: Extend `StructLayout::StructSize` from `uint64_t` to `TypeSize` to accommodate an expression of scalable size. Allow `StructType:isSized` to also return true for homogeneous scalable vector types. Let `Type::isScalableTy` return true when `Type` is `StructType` and contains scalable vectors Extra description is added in the LLVM Language Reference Manual on the relaxation of this patch. Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Co-Authored-by: eop Chen <eop.chen@sifive.com> Reviewed By: craig.topper, nikic Differential Revision: https://reviews.llvm.org/D146872	2023-05-19 09:39:36 -07:00
Simon Pilgrim	79cbedaf05	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFC.	2023-05-17 12:31:18 +01:00
Joshua Cao	b27f14d920	[SCEV][NFC-mostly] Remove constant handling in TripMultiple computation After landing more precise trip multiples in https://reviews.llvm.org/D149529, the SCEV multiple computation handles constants, so there is no longer any need for special constant handling in getSmallConstantTripMultiple. This patch can improve the multiple of a non-constant SCEV that is huge (>=2**32). This is very rare in practice. Differential Revision: https://reviews.llvm.org/D150541	2023-05-16 20:24:31 -07:00
Manoj Gupta	9fb9c7776e	Revert "[SCEV] Replace IsAvailableOnEntry with block disposition" This reverts commit 103fc0f629aa6218783f65dff0197f257137cade. Causes a clang crash in ChromeOS builds. Testcase provided at D149344.	2023-05-10 09:57:48 -07:00
Joshua Cao	9c1d5e4ae3	[SCEV][reland] More precise trip multiples We currently have getMinTrailingZeros(), from which we can get a SCEV's multiple by computing 1 << MinTrailingZeroes. However, this only gets us multiples that are a power of 2. This patch introduces a way to get max constant multiples that are not just a power of 2. The logic is similar to that of getMinTrailingZeros. getMinTrailingZerosImpl is replaced by computing the max constant multiple, and counting the number of trailing bits. I have so far found this useful in two places: 1) Computing unsigned constant ranges. For example, if we have i8 {10,+,10}<nuw>, we know the max constant it can be is 250. 2) My original intent was to use this in getSmallConstantTripMultiples, but it has no effect right now due to change from D110587. For example, if we have backedge count `(6 * %N) - 1`, the trip count becomes `1 + zext((6 * %N) - 1)`, and we cannot say that 6 is a multiple of the SCEV. I plan to look further into this separately. The implementation assumes the value is unsigned. It can probably be extended to handle signed values as well. If the code sees that a SCEV does not have <nuw>, it will fall back to finding the max multiple that is a power of 2. Multiples that are a power of 2 will still be a multiple even after the SCEV overflows. This does not apply to other values. This is the 1st commit message: --- This relands https://reviews.llvm.org/D141823. The verification fails when expensive checks are turned on. This can occur when: 1. SCEV S's multiple is cached 2. SCEV S's no wrap flags are strengthened, and the multiple changes 3. SCEV verifier finds that S's cached and recomputed multiple are different We eliminate most cases by forgetting SCEVAddRecExpr's cached values when the flags are modified, but there are still cases for other SCEV types. We relax the check by making sure the cached multiple divides the recomputed multiple, ensuring the cached multiple is correct, conservative multiple. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D149529	2023-05-07 22:01:04 -07:00
Nikita Popov	371612500a	[SCEV] Reuse Accum variable when handling GEP flags The GEP minus the base pointer (which is the pre-inc addrec) is exactly the Accum value that was already calculated.	2023-05-02 11:03:38 +02:00
Florian Hahn	b14be1e7c0	[SCEV] Use object size for globals to sharpen ranges. The highest address the object can start is ObjSize bytes before the end (unsigned max value). If this value is not a multiple of the alignment, the last possible start value is the next lowest multiple of the alignment. Note: The computations cannot overflow, because if they would there's no possible start address for the object. At the moment, this is limited to GlobalVariables, because I could not find a API similar to getObjectSize to also get the alignment of the object. With such an API, this can be generalized to general addresses. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D149483	2023-04-29 21:33:30 +01:00
Florian Hahn	4c2d29f2fc	[SCEV] Skip instrs with non-scevable types in visitAndClearUsers. No SCEVs are formed for instructions with non-scevable types, so no other SCEV expressions can depend on them. Skip those instructions and their users when invalidating SCEV expressions. Depends on D144847. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D144848	2023-04-28 15:37:35 +01:00
Nikita Popov	3ddd1ffb72	[SCEV] Don't invalidate past dependency-breaking instructions When invalidating a value, we walk all users of that value and invalidate them as well. This can be very expensive for large use graphs. However, we only need to invalidate a user U of instruction I if SCEV(U) can depend on SCEV(I). This is not the case if U is an instruction that always produces a SCEVUnknown, such as a load. If the load pointer operand is invalidated, there is no need to invalidate the load result, which is completely unrelated from a SCEV perspective. Differential Revision: https://reviews.llvm.org/D149323	2023-04-28 14:42:08 +02:00
Nikita Popov	103fc0f629	[SCEV] Replace IsAvailableOnEntry with block disposition As far as I understand, the IsAvailableOnEntry() function basically implements the same functionality as the properlyDominates() block disposition. The primary difference (apart from a weaker implementation) seems to be in this comment at the top: // Checks if the SCEV S is available at BB. S is considered available at BB // if S can be materialized at BB without introducing a fault. However, I don't really understand why there would be such a requirement. It's my understanding that SCEV explicitly does not care about trapping udiv instructions itself, and it's the job of SCEVExpander's isSafeToExpand() to make sure these don't get expanded if they may trap. Differential Revision: https://reviews.llvm.org/D149344	2023-04-28 11:02:03 +02:00
Nikita Popov	fa0014a68b	[SCEV] Drop LCSSA check in createNodeFromSelectLikePHI() SCEV expressions no longer try to preserve LCSSA form. SCEV construction will try to look through LCSSA phi nodes. As such, we also no longer need to limit this special-case fold.	2023-04-27 15:18:07 +02:00
Nikita Popov	079c525f20	[SCEV] Try simplifying phi before createNodeFromSelectLikePHI() Sometimes a phi can both be trivial and match the createNodeFromSelectLikePHI() fold. In that case it is generally more profitable to look through the phi node.	2023-04-27 15:07:19 +02:00
Nikita Popov	c27a96607c	[SCEV] Remove LCSSA special case in getSCEVAtScope() (NFCI) We no longer try to preserve LCSSA form in SCEV representation: Nowadays, we look through LCSSA PHI nodes directly during SCEV construction. As such, this separate special case in getSCEVAtScope() is no longer needed.	2023-04-27 12:53:03 +02:00
Nikita Popov	19732a3eaa	[SCEV] Check correct binary operator for nowrap flags We should be checking the current BO here, not the nested one. If the current BO has nowrap flags (and is UB on poison), then we'll fetch both operand SCEVs of that BO. We'll check the nested BO on the next iteration of the do/while loop.	2023-04-27 11:25:40 +02:00
Nikita Popov	3690e1f8a7	[SCEV] Check MatchBinaryOp opcode instead of original opcode These are not necessarily the same (e.g. or can become add) and this is what we're switching over in the first place.	2023-04-27 11:13:35 +02:00
Nikita Popov	4fcb006fb6	[SCEV] Fix getOperandsToCreate() for and/or We can create expressions either for constant operand or i1 and/or. The implementation was inverting the latter check.	2023-04-27 10:50:57 +02:00
Philip Reames	09d879d060	[SCEV] Common code for computing trip count in a fixed type [NFC-ish] This is a follow on to D147117 and D147355. In both cases, we were adding special cases to compute zext(BTC+1) instead of zext(BTC)+1 when the BTC+1 computation was known not to overflow. Differential Revision: https://reviews.llvm.org/D148661	2023-04-25 12:04:42 -07:00
Max Kazantsev	ab07cbe437	[SCEV] Support sub in and negative constants willNotOverflow This lifts two TODOs from this function, allowing us to prove no-overflow whether it happens through max int (up) or through min int (down) for both and and sub. Differential Revision: https://reviews.llvm.org/D148618 Reviewed By: dmakogon	2023-04-25 16:40:37 +07:00
Joshua Cao	a4e420ea64	Revert "[SCEV] Precise trip multiples" This reverts commit 027a4c8b96c7f97df8e98b1dac069b956810ab94.	2023-04-24 01:41:53 -07:00
Joshua Cao	027a4c8b96	[SCEV] Precise trip multiples We currently have getMinTrailingZeros(), from which we can get a SCEV's multiple by computing 1 << MinTrailingZeroes. However, this only gets us multiples that are a power of 2. This patch introduces a way to get max constant multiples that are not just a power of 2. The logic is similar to that of getMinTrailingZeros. getMinTrailingZeros is replaced by computing the max constant multiple, and counting the number of trailing bits. This is applied in two places: 1) Computing unsigned constant ranges. For example, if we have i8 {10,+,10}<nuw>, we know the max constant it can be is 250. 2) Computing trip multiples as shown in SCEV output. This is useful if for example, we are unrolling a loop by a factor of 5, and we know the trip multiple is 5, then we don't need a loop epilog. If the code sees that a SCEV does not have <nuw>, it will fall back to finding the max multiple that is a power of 2. Multiples that are a power of 2 will still be a multiple even after the SCEV overflows. Differential Revision: https://reviews.llvm.org/D141823	2023-04-24 00:21:59 -07:00
Nikita Popov	4cdb91f9e7	[SCEV] Clarify inference in isAddRecNeverPoison() The justification in isAddRecNeverPoison() no longer applies, as it dates back to a time where LLVM had an unconditional forward progress guarantee. However, we also no longer need it, because we can exploit branch on poison UB instead. For a single exit loop (without abnormal exits) we know that all instructions dominating the exit will be executed, so if any of them trigger UB on poison that means that addrec is not poison. This is slightly stronger than the previous code, because a) we don't need the exit to also be the latch and b) we don't need the value to be used in the exit branch in particular, any UB-producing instruction is fine. I don't expect much practical impact from this change, this is mainly to clarify the reasoning behind this logic. Differential Revision: https://reviews.llvm.org/D148633	2023-04-21 15:31:00 +02:00
Dmitry Makogon	e08f9894ec	[SCEV] Preserve NSW for AddRec multiplied by -1 if it cannot be signed minimum This preserves NSW flag for AddRecs multiplied by -1 if we can prove via constant ranges that the AddRec cannot be signed minimum. An explanation: Let M be signed minimum. If AddRec's range contains M, then M * (-1) will stay M and (M + 1) * (-1) will be signed maximum, so we get a signed overflow. In all other cases if an AddRec didn't signed overflow, then AddRec * (-1) wouldn't too. Differential Revision: https://reviews.llvm.org/D148084	2023-04-14 19:36:56 +07:00
Joshua Cao	921b8f40e8	[SCEV][NFC] GetMinTrailingZeros switch case and naming cleanup * combine zext and sext into the one switch case * combine vscale and udiv into one switch case * renames according to LLVM style	2023-04-10 22:56:29 -07:00
Joshua Cao	898a9ca5e9	[SCEV] Strengthen huge constant trip multiples. SCEV determines that loops with trip count >=2^32 have a trip multiple of 1 to guard against huge multiples. This patch stregthens this to instead find the greatest power of 2 divisor that is less than the threshold. Differential Revision: https://reviews.llvm.org/D147868	2023-04-10 20:00:46 -07:00
Joshua Cao	569f7e547d	[SCEV][NFC] Convert check to assert getSmallConstantTripMultiple()	2023-04-10 19:59:01 -07:00

1 2 3 4 5 ...

1977 Commits