llvm-project

Author	SHA1	Message	Date
Aleksandr Popov	cd20600767	[LoopConstrainer] Apply loop guards to check that loop bounds are safe (#71531) Loop guards that apply to loop SCEV bounds allow IRCE for cases with compound loop bounds such as: if (K > 0 && M > 0) for (i = 0; i < min(K, M); i++) {...} if (K > 0 && M > 0) for (i = min(K, M); i >= 0; i--) {...} Otherwise SCEV couldn't prove that loops have safe bounds in these cases. Co-authored-by: Aleksander Popov <apopov@azul.com>	2024-03-13 18:30:03 +01:00
Philip Reames	ffb2af3ed6	[SCEVExpander] Attempt to reinfer flags dropped due to CSE (#72431) LSR uses SCEVExpander to generate induction formulas. The expander internally tries to reuse existing IR expressions. To do that, it needs to strip any poison generating flags (nsw, nuw, exact, nneg, etc.) which may not be valid for the newly added users. This is conservatively correct, but has the effect that LSR will strip nneg flags on zext instructions involved in trip counts in loop preheaders. To avoid this, this patch adjusts the expander to reinfer the flags on the CSE candidate if legal for all possible users. This should fix the regression reported in https://github.com/llvm/llvm-project/issues/71200. This should arguably be done inside canReuseInstruction instead, but doing it outside is more conservative compile time wise. Both canReuseInstruction and isGuaranteedNotToBePoison walk operand lists, so right now we are performing work which is roughly O(N^2) in the size of the operand graph. We should fix that before making the per operand step more expensive. My tentative plan is to land this, and then rework the code to sink the logic into more core interfaces.	2023-12-07 13:20:36 -08:00
Nikita Popov	eecb99c5f6	[Tests] Add disjoint flag to some tests (NFC) These tests rely on SCEV recognizing an "or" with no common bits as an "add". Add the disjoint flag to relevant or instructions in preparation for switching SCEV to use the flag instead of the ValueTracking query. The IR with disjoint flag matches what InstCombine would produce.	2023-12-05 14:09:36 +01:00
Philip Reames	a7f35d54ee	[SCEV] Extend isImpliedCondOperandsViaRanges to independent predicates (#71110) As far as I can tell, there's nothing in this code which actually assumes the two predicates in (FoundLHS FoundPred FoundRHS) => (LHS Pred RHS) are the same. Noticed while investigating something else, this is purely an opportunistic optimization while I'm looking at the code. Unfortunately, this doesn't solve my original problem. :)	2023-11-07 07:25:47 -08:00
Aleksandr Popov	011f25a4e0	[NFC][IRCE] Add unit test to show room for improvement (#71506) Add tests for compound loop bounds where IRCE is possible: if (K > 0 && M > 0) for (i = 0; i < min(K, M); i++) {...} if (K > 0 && M > 0) for (i = min(K, M); i >= 0; i--) {...} Co-authored-by: Aleksander Popov <apopov@azul.com>	2023-11-07 13:06:17 +01:00
Philip Reames	f6f769203d	[tests] Autogenerate a couple of tests As usual, making it easier for an upcoming test delta to be seen. Note that several of these are examples of extremely bad testing practice. Checking internal debug output (for no real purpose), and checking the result of a fully O2 + llc run instead of reducing the specific problematic pass.	2023-11-03 08:42:23 -07:00
Aleksandr Popov	483e92468e	[NFC] Extract LoopConstrainer from IRCE to reuse it outside the pass (#70508) Co-authored-by: Aleksandr Popov <apopov@azul.com>	2023-10-31 18:16:59 +01:00
Philip Reames	f8742b8d6a	[SCEV] Teach SCEVExpander to use zext nneg when possible (#70815) zext nneg was recently added to the IR in #67982. Teaching SCEVExpander to emit nneg when possible is valuable since SCEV may have proved non-trivial facts about loop bounds which would otherwise be lost when materializing the value.	2023-10-31 09:33:07 -07:00
Philip Reames	6485978120	Refresh a couple of auto-gen tests [nfc] Reducing spurious diff in an upcoming review.	2023-10-31 07:46:01 -07:00
Aleksandr Popov	236e6787de	[IRCE] Add NSW to OverflowingBinaryOperator but not BinaryOperator Fix incorrect setting of the NSW flag on a non-overflowing indvar base (D154954) Reviewed By: danilaml Differential Revision: https://reviews.llvm.org/D156577	2023-07-30 11:14:26 +02:00
Aleksandr Popov	bca5501869	[IRCE] Add NSW flag to main loop's indvar base We have guarantees that the induction variable will not overflow in the main loop after the loop is constrained. Therefore we can add no wrap flags on its base in order not to miss info that the loop is countable. Add NSW flag now, since adding NUW flag requires a bit more complicated analysis. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D154954	2023-07-17 01:03:52 +02:00
Aleksandr Popov	cdcefd2f9a	[IRCE] Implement runtime overflow check for computed range's end This activates elimination of the range checks that were parsed previously in https://reviews.llvm.org/D154069 * Added runtime check that computed range's boundary doesn't overflow in terms of range type. * From the statement INT_MIN <= END <= INT_MAX is inferred check: isNonNegative(INT_MAX - END) * isNonNegative(END - INT_MIN). * If overflow happens, check will return 0 and the safe interval will be empty. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D154188	2023-07-12 11:19:25 +02:00
Aleksandr Popov	e935878910	[NFC][IRCE] Regenerate test checks Differential Revision: https://reviews.llvm.org/D154964	2023-07-11 23:37:37 +02:00
Aleksandr Popov	8b19cbfd77	Reland "[IRCE] Parse range checks in the form of 'LHS - RHS vs Limit'" This reverts commit 4c6f95be29c6ce0f89663a5103c58ee63d76cda3 and relands e16c5c092205f68825466c25a1dd30783c4820f3 https://reviews.llvm.org/D154069	2023-07-11 18:48:14 +02:00
Aleksandr Popov	4c6f95be29	Revert "[IRCE] Parse range checks in the form of 'LHS - RHS vs Limit'" This reverts commit e16c5c092205f68825466c25a1dd30783c4820f3. Revert due to Buildbot failure https://lab.llvm.org/buildbot/#/builders/193	2023-07-10 13:59:22 +02:00
Aleksandr Popov	e16c5c0922	[IRCE] Parse range checks in the form of 'LHS - RHS vs Limit' Introduced parsing of the following range check forms: * IV - Offset vs Limit * Offset - IV vs Limit Range's end boundary is computed as (Offset +/- Limit). If it's not possible to prove at compile time that computed upper bound will not overflow, then scale boundary computation to a wider type to perform overflow check at runtime. Runtime overflow check will be implemented in the next patch. In the meantime safe range for such kind of checks isn't computed. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D154069	2023-07-10 13:00:38 +02:00
Aleksandr Popov	ec11216071	[IRCE][Tests] Add more tests with range checks in the form of 'iv + offset vs limit' Added tests on range checks with non-strict predicate: * N - IV > limit * IV - N < limit * IV + N < limit Also added tests with known to be non-negative N Differential Revision: https://reviews.llvm.org/D154593	2023-07-06 15:31:26 +02:00
Aleksandr Popov	64e289cf51	[IRCE] Support inverted range check's predicate IRCE expects true edge of range check's branch comes to loop. If it meets reverse case - invert the branch. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D148244	2023-07-03 23:20:17 +02:00
aleks-tmb	fc5a5794ab	[IRCE][Tests] Add tests with range checks in the form of 'iv + offset vs limit' Added tests on 3 types of range checks which are not supported by IRCE now: * N - IV >= limit * IV - N <= limit * IV + N <= limit Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D154062	2023-07-02 08:23:44 +02:00
Max Kazantsev	b86b468731	[IRCE] Support non-strict range check's predicate Patch by Aleksandr Popov! Differential Revision: https://reviews.llvm.org/D148227	2023-04-21 18:33:30 +07:00
Max Kazantsev	4f6fc7a30b	[Test] Add IRCE tests with non-canonical range check Patch by Aleksandr Popov! Differential Revision: https://reviews.llvm.org/D148224	2023-04-18 20:10:22 +07:00
Max Kazantsev	2124505fe4	[IRCE] Relax restrictions on IRCE's latch exit count It seems that existing logic is too strict about latch block exit count. It is required to be computable, however it is not used in any computations, and effectively the only thing it is used for is to get the type of computed exit count. Sometimes the exit count for latch block is not known, but the loop is still finite because of other exits, and safe bounds are still computable. In this case, we miss an opportunity to apply IRCE. We could instead use a more relaxed version - max symbolic exit count, which, if exists, is enough to say that the loop is finite, and its type should be good enough. There is a subtlety with type: we do not support latch count type wider than range check type. Because of that, we want to have the narrowest type available. So if it can be computed from latch block immediately, take it. Otherwise, take whatever whole loop provides and hope that its type isn't too wide. Differential Revision: https://reviews.llvm.org/D147910 Reviewed By: danilaml	2023-04-13 16:00:19 +07:00
Max Kazantsev	7d7b178d75	[IRCE][Test] Add test showing that fake wide exit does not inhibit the transform	2023-04-13 13:33:21 +07:00
Max Kazantsev	05e4c0142d	[Test] Regenerate test checks using auto-updater	2023-04-13 13:33:14 +07:00
Max Kazantsev	aebe7ce984	[Test] Add one more test on IRCE & regenerate checks	2023-04-07 14:58:49 +07:00
Bjorn Pettersson	3528e63d89	[test] Remove duplicate RUN lines in Transform tests	2022-12-08 11:47:16 +01:00
Roman Lebedev	0ec421d024	[NFC] Port all IRCE tests to `-passes=` syntax	2022-12-08 02:38:44 +03:00
Matt Arsenault	631432a3f5	IRCE: Convert tests to opaque pointers	2022-11-28 09:35:16 -05:00
Dmitry Makogon	10ab29ec6e	[IRCE] Bail out if AddRec in icmp is for another loop (PR58912) When IRCE runs on outer loop and sees a check of an AddRec of inner loop, it crashes with an assert in SCEV that the AddRec must be loop invariant. This adds a bail out if the AddRec which is checked in icmp is for another loop. Fixes https://github.com/llvm/llvm-project/issues/58912. Differential Revision: https://reviews.llvm.org/D137822	2022-11-14 15:06:13 +07:00
Dmitry Makogon	b1e9c4334e	[Test] Add test for crash in IRCE when IV is AddRec for another loop This adds a test for https://github.com/llvm/llvm-project/issues/58912. IRCE crashes when it tries to check whether it is possible to safely calculate the bounds of a loop with IV AddRec which is in another loop.	2022-11-11 16:49:55 +07:00
Max Kazantsev	0e465c0c2f	[IRCE] Bail in case of pointer types. PR40539 We should not unconditionally expect that SCEVable types are all integers because SCEV can also be computed for pointers. Bail in this case.	2022-09-12 16:01:25 +07:00
Max Kazantsev	ccf788a565	[IRCE] Drop SCEV of a Phi after adding a new input. PR57335 Since SCEV learned to look through single value phis with 20d798bd47ec5191de1b2a8a031da06a04e612e1, whenever we add a new input to a Phi, we should make sure that the old cached value is dropped. Otherwise, it may lead to various miscompiles, such as breach of dominance as shown in the bug https://github.com/llvm/llvm-project/issues/57335	2022-08-25 18:14:29 +07:00
Nikita Popov	e4d1d0cc2c	[SCEV] Fix isImpliedViaMerge() with values from previous iteration (PR56242) When trying to prove an implied condition on a phi by proving it for all incoming values, we need to be careful about values coming from a backedge, as these may refer to a previous loop iteration. A variant of this issue was fixed in D101829, but the dominance condition used there isn't quite right: It checks that the value dominates the incoming block, which doesn't exclude backedges (values defined in a loop will usually dominate the loop latch, which is the incoming block of the backedge). Instead, we should be checking for domination of the phi block. Any values defined inside the loop will not dominate the loop header phi. Fixes https://github.com/llvm/llvm-project/issues/56242. Differential Revision: https://reviews.llvm.org/D128640	2022-07-05 15:31:23 +02:00
Philip Reames	8906a0fe64	[SCEVExpander] Drop poison generating flags when reusing instructions The basic problem we have is that we're trying to reuse an instruction which is mapped to some SCEV. Since we can have multiple such instructions (potentially with different flags), this is analogous to our need to drop flags when performing CSE. A trivial implementation would simply drop flags on any instruction we decided to reuse, and that would be correct. This patch is almost that trivial patch except that we preserve flags on the reused instruction when existing users would imply UB on overflow already. Adding new users can, at most, refine this program to one which doesn't execute UB which is valid. In practice, this fixes two conceptual problems with the previous code: 1) a binop could have been canonicalized into a form with different opcode or operands, or 2) the inbounds GEP case which was simply unhandled. On the test changes, most are pretty straightforward. We lose some flags (in some cases, they'd have been dropped on the next CSE pass anyways). The one that took me the longest to understand was the ashr-expansion test. What's happening there is that we're considering reuse of the mul, previously we disallowed it entirely, now we allow it with no flags. The surrounding diffs are all effects of generating the same mul with a different operand order, and then doing simple DCE. The loss of the inbounds is unfortunate, but even there, we can recover most of those once we actually treat branch-on-poison as immediate UB. Differential Revision: https://reviews.llvm.org/D112734	2021-11-29 15:23:34 -08:00
Philip Reames	e69f6476a8	Autogen tests for ease of future update	2021-11-05 12:46:07 -07:00
Philip Reames	6caff716da	Regen some autogen tests to account for format change	2021-10-28 09:22:20 -07:00
Florian Hahn	6c99e63120	[SCEV] Be more careful when traversing phis in isImpliedViaMerge. I think currently isImpliedViaMerge can incorrectly return true for phis in a loop/cycle, if the found condition involves the previous value of Consider the case in exit_cond_depends_on_inner_loop. At some point, we call (modulo simplifications) isImpliedViaMerge(<=, %x.lcssa, -1, %call, -1). The existing code tries to prove IncV <= -1 for all incoming values IncV using the found condition (%call <= -1). At the moment this succeeds, but only because it does not compare the same runtime value. The found condition checks the value of the last iteration, but the incoming value is from the previous iteration. Hence we incorrectly determine that the previous value was <= -1, which may not be true. I think we need to be more careful when looking at the incoming values here. In particular, we need to rule out that a found condition refers to any value that may refer to one of the previous iterations. I'm not sure there's a reliable way to do so (that also works for irreducible control flow). So for now this patch adds an additional requirement that the incoming value must properly dominate the phi block. This should ensure the values do not change in a cycle. I am not entirely sure if it will catch all cases and I appreciate a thorough second look in that regard. Alternatively we could also unconditionally bail out in this case, instead of checking the incoming values Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D101829	2021-05-07 19:52:29 +01:00
Jingu Kang	3ea4bc7842	[IRCE] Add tests for conservative bound check Prevent cases in which the start value of IV is bigger than bound for increasing. Prevent cases in which the start value of IV is smaller than bound for decreasing. Differential Revision: https://reviews.llvm.org/D101174	2021-04-28 11:14:21 +01:00
Juneyoung Lee	05884d3b52	Make FoldBranchToCommonDest poison-safe by default This is a small patch to make FoldBranchToCommonDest poison-safe by default. After fc3f0c9c, only two syntactic changes are needed to fix unit tests. This does not cause any assembly difference in testsuite as well (-O3, X86-64 Manjaro). Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99452	2021-03-27 19:05:12 +09:00
Roman Lebedev	b46c085d2b	[NFCI] SCEVExpander: emit intrinsics for integral {u,s}{min,max} SCEV expressions These intrinsics, not the icmp+select are the canonical form nowadays, so we might as well directly emit them. This should not cause any regressions, but if it does, then they would need to be fixed regardless. Note that this doesn't deal with `SCEVExpander::isHighCostExpansion()`, but that is a pessimization, not a correctness issue. Additionally, the non-intrinsic form has issues with undef, see https://reviews.llvm.org/D88287#2587863	2021-03-06 21:52:46 +03:00
Simon Pilgrim	5a02bf4f95	[IRCE] Add test case for PR48051	2020-12-14 12:01:19 +00:00
David Green	c100d7ba36	[NFC] Chec[^k] -> Check Some test updates all appearing to use the wrong spelling of CHECK.	2020-12-08 11:54:39 +00:00
Serguei Katkov	400f6edce7	[IRCE] Use the same min runtime iteration threshold for BPI and BFI checks In the last change to IRCE the BPI is ignored if BFI is present, however BFI and BPI have different thresholds. Specifically BPI approach checks only latch exit probability so it is expected if the loop has only one exit block (latch) the behavior with BFI and BPI should be the same, BPI approach by default uses threshold 10, so it considers the loop with estimated number of iterations less than 10 should not be considered for IRCE optimization. BFI approach uses the default value 3 and this is inconsistent. The CL modifies the code to use the same threshold for both approaches. The test is updated because it has two side-exits (except latch) and each of them has a probability 1/16, so BFI estimates the number of runtime iterations is about 7 (1/16 + 1/16 + some for latch) and test fails. Reviewers: mkazantsev, ebrevnov Reviewed By: mkazantsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D91230	2020-11-16 09:21:50 +07:00
Serguei Katkov	38799975ce	[IRCE] Do not transform if loop has small number of iterations IRCE has some overhead for runtime checks and in case number of iterations is small the overhead can kill the benefit from optimizations. This CL bases on BlockFrequencyInfo of pre-header and header to estimate the number of loop iterations. If it is less than irce-min-estimated-iters we do not transform the loop. Probably it is better to make more complex cost model but for simplicity it seems to be enough. The usage of BFI is added only for new pass manager and tries to use it efficiently. Reviewers: ebrevnov, dantrushin, asbirlea, mkazantsev Reviewed By: mkazantsev Subscribers: llvm-commits, fhahn Differential Revision: https://reviews.llvm.org/D89541	2020-10-20 10:33:59 +07:00
Arthur Eubanks	9c56e94a9f	[NPM] Bail out when -foo and --passes=foo are both specified Summary: Currently when --passes is used, any passes specified via -foo are ignored. Explicitly bail out when that happens. This requires changing some tests. Most were straightforward, but codegenprepare-produced-address-math.ll is tricky. One of its RUNs runs CodeGenPrepare. I tried porting CodeGenPrepare to the NPM, but ended up getting stuck when I needed a TargetMachine. NPM doesn't have support for MachineFunctions yet. So I just deleted that RUN line, since it was mass-added in https://reviews.llvm.org/D54848 and is likely not that useful. Reviewers: echristo, hans Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82271	2020-06-22 08:27:13 -07:00
Eli Friedman	4532a50899	Infer alignment of unmarked loads in IR/bitcode parsing. For IR generated by a compiler, this is really simple: you just take the datalayout from the beginning of the file, and apply it to all the IR later in the file. For optimization testcases that don't care about the datalayout, this is also really simple: we just use the default datalayout. The complexity here comes from the fact that some LLVM tools allow overriding the datalayout: some tools have an explicit flag for this, some tools will infer a datalayout based on the code generation target. Supporting this properly required plumbing through a bunch of new machinery: we want to allow overriding the datalayout after the datalayout is parsed from the file, but before we use any information from it. Therefore, IR/bitcode parsing now has a callback to allow tools to compute the datalayout at the appropriate time. Not sure if I covered all the LLVM tools that want to use the callback. (clang? lli? Misc IR manipulation tools like llvm-link?). But this is at least enough for all the LLVM regression tests, and IR without a datalayout is not something frontends should generate. This change had some sort of weird effects for certain CodeGen regression tests: if the datalayout is overridden with a datalayout with a different program or stack address space, we now parse IR based on the overridden datalayout, instead of the one written in the file (or the default one, if none is specified). This broke a few AVR tests, and one AMDGPU test. Outside the CodeGen tests I mentioned, the test changes are all just fixing CHECK lines and moving around datalayout lines in weird places. Differential Revision: https://reviews.llvm.org/D78403	2020-05-14 13:03:50 -07:00
Denis Antrushin	99a6e405ed	[IRCE] Use SCEVExpander to modify loop bound IRCE pass checks that it can calculate loop bounds by checking SCEV availability at loop entry. However it is possible that loop bound SCEV is loop invariant, but instruction used to compute it resides within loop. In such case adjusting loop bound in preheader using IRBuilder leads to malformed SSA. Use SCEVExpander instead to generate proper instructions. Reviewed-by: mkazantsev Differential Revision: https://reviews.llvm.org/D73496	2020-02-06 12:44:43 +03:00
Alina Sbirlea	67904db23c	[IRCE] Make IRCE a Function pass. Summary: Make InductiveRangeCheckElimination a FunctionPass. Reviewers: reames, mkazantsev Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73592	2020-02-05 09:22:41 -08:00
czhengsz	8b8ba44047	[SCEV] get more accurate range for AddExpr with wrap flag. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D64869	2020-01-07 20:58:04 -05:00
Philip Reames	bdf608477e	[SCEV] Add smin support to getRangeRef We were failing to compute trip counts (both exact and maximum) for any loop which involved a comparison against either an umin or smin. It looks like this simply got missed when we added smin/umin to SCEV. (Note: umin was submitted separately earlier today. Turned out two folks hit this at the same time.) Differential Revision: https://reviews.llvm.org/D67514 llvm-svn: 371776	2019-09-12 21:32:27 +00:00

124 Commits