llvm-project

Author	SHA1	Message	Date
Andrei Elovikov	ffa8ba8ce2	[NFC][LAA] Minor stylistic/comments improvements (#185510 ) I tried to split into individual tiny commits to ease review but uploading them as separate PRs would definitely be an overkill. --------- Co-authored-by: Ramkumar Ramachandra <r@artagnon.com> Co-authored-by: Florian Hahn <flo@fhahn.com>	2026-03-19 17:05:14 +00:00
Alexis Engelke	94da4039cb	[Analysis][NFC] Drop use of BranchInst (#186374 ) Largely straight-forward replacement.	2026-03-13 13:42:19 +00:00
Kshitij Paranjape	d432263551	[LAA] Fix type mismatch in getStartAndEndForAccess. (#183116 ) `SE.getUMaxExpr` causes assertion failure due to type mismatch here: https://github.com/llvm/llvm-project/blob/main/llvm/lib/Analysis/LoopAccessAnalysis.cpp#L253 Running `opt -S -p loop-vectorize -debug-only=loop-vectorize llvm/test/Analysis/LoopAccessAnalysis/type-mismatch-in-scalar-evolution.ll ` without the changes made in LoopAccessAnalysis.cpp causes assertion failure. Attaching the stack dump for reference: ``` LV: Checking a loop in 'loop_contains_store_assumed_bounds' from input.ll LV: Loop hints: force=? width=4 interleave=0 LV: Found a loop: for.body LV: Found an induction variable. opt: /home/kshitij/llvm-project/llvm/lib/Analysis/ScalarEvolution.cpp:3918: const llvm::SCEV* llvm::ScalarEvolution::getMinMaxExpr(llvm::SCEVTypes, llvm::SmallVectorImpl<const llvm::SCEV>&): Assertion `getEffectiveSCEVType(Ops[i]->getType()) == ETy && "Operand types don't match!"' failed. PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace and instructions to reproduce the bug. Stack dump: 0. Program arguments: opt -S -passes=loop-vectorize -debug-only=loop-vectorize -force-vector-width=4 -disable-output input.ll 1. Running pass "function(loop-vectorize<no-interleave-forced-only;no-vectorize-forced-only;>)" on module "input.ll" 2. Running pass "loop-vectorize<no-interleave-forced-only;no-vectorize-forced-only;>" on function "loop_contains_store_assumed_bounds" #0 0x000058ee97c5e652 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/usr/local/bin/opt+0x4f44652) #1 0x000058ee97c5af0f llvm::sys::RunSignalHandlers() (/usr/local/bin/opt+0x4f40f0f) #2 0x000058ee97c5b05c SignalHandler(int, siginfo_t, void) Signals.cpp:0:0 #3 0x00007c49d4c45330 (/lib/x86_64-linux-gnu/libc.so.6+0x45330) #4 0x00007c49d4c9eb2c __pthread_kill_implementation ./nptl/pthread_kill.c:44:76 #5 0x00007c49d4c9eb2c __pthread_kill_internal ./nptl/pthread_kill.c:78:10 #6 0x00007c49d4c9eb2c pthread_kill ./nptl/pthread_kill.c:89:10 #7 0x00007c49d4c4527e raise ./signal/../sysdeps/posix/raise.c:27:6 #8 0x00007c49d4c288ff abort ./stdlib/abort.c:81:7 #9 0x00007c49d4c2881b _nl_load_domain ./intl/loadmsgcat.c:1177:9 #10 0x00007c49d4c3b517 (/lib/x86_64-linux-gnu/libc.so.6+0x3b517) #11 0x000058ee98003fdb llvm::ScalarEvolution::getMinMaxExpr(llvm::SCEVTypes, llvm::SmallVectorImpl<llvm::SCEV const>&) (/usr/local/bin/opt+0x52e9fdb) #12 0x000058ee98004507 llvm::ScalarEvolution::getUMaxExpr(llvm::SCEV const, llvm::SCEV const) (/usr/local/bin/opt+0x52ea507) #13 0x000058ee980dc728 llvm::getStartAndEndForAccess(llvm::Loop const, llvm::SCEV const, llvm::Type, llvm::SCEV const, llvm::SCEV const, llvm::ScalarEvolution, llvm::DenseMap<std::pair<llvm::SCEV const, llvm::Type>, std::pair<llvm::SCEV const, llvm::SCEV const>, llvm::DenseMapInfo<std::pair<llvm::SCEV const, llvm::Type>, void>, llvm::detail::DenseMapPair<std::pair<llvm::SCEV const, llvm::Type>, std::pair<llvm::SCEV const, llvm::SCEV const>>>, llvm::DominatorTree, llvm::AssumptionCache, std::optional<llvm::ScalarEvolution::LoopGuards>&) (/usr/local/bin/opt+0x53c2728) #14 0x000058ee9814008b llvm::isDereferenceableAndAlignedInLoop(llvm::LoadInst, llvm::Loop, llvm::ScalarEvolution&, llvm::DominatorTree&, llvm::AssumptionCache, llvm::SmallVectorImpl<llvm::SCEVPredicate const>) (/usr/local/bin/opt+0x542608b) #15 0x000058ee9a0fa1ca llvm::LoopVectorizationLegality::canUncountableExitConditionLoadBeMoved(llvm::BasicBlock) (/usr/local/bin/opt+0x73e01ca) #16 0x000058ee9a0faee0 llvm::LoopVectorizationLegality::isVectorizableEarlyExitLoop() (/usr/local/bin/opt+0x73e0ee0) #17 0x000058ee9a104678 llvm::LoopVectorizationLegality::canVectorize(bool) (/usr/local/bin/opt+0x73ea678) #18 0x000058ee9a08c953 llvm::LoopVectorizePass::processLoop(llvm::Loop) (/usr/local/bin/opt+0x7372953) #19 0x000058ee9a090e21 llvm::LoopVectorizePass::runImpl(llvm::Function&) (/usr/local/bin/opt+0x7376e21) #20 0x000058ee9a0914e0 llvm::LoopVectorizePass::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/usr/local/bin/opt+0x73774e0) #21 0x000058ee99e419a5 llvm::detail::PassModel<llvm::Function, llvm::LoopVectorizePass, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) PassBuilderPipelines.cpp:0:0 #22 0x000058ee97f18905 llvm::PassManager<llvm::Function, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/usr/local/bin/opt+0x51fe905) #23 0x000058ee995d70d5 llvm::detail::PassModel<llvm::Function, llvm::PassManager<llvm::Function, llvm::AnalysisManager<llvm::Function>>, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) AMDGPUTargetMachine.cpp:0:0 #24 0x000058ee97f17051 llvm::ModuleToFunctionPassAdaptor::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/usr/local/bin/opt+0x51fd051) #25 0x000058ee995d7775 llvm::detail::PassModel<llvm::Module, llvm::ModuleToFunctionPassAdaptor, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) AMDGPUTargetMachine.cpp:0:0 #26 0x000058ee97f1783d llvm::PassManager<llvm::Module, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/usr/local/bin/opt+0x51fd83d) #27 0x000058ee9c153909 llvm::runPassPipeline(llvm::StringRef, llvm::Module&, llvm::TargetMachine, llvm::TargetLibraryInfoImpl, llvm::ToolOutputFile, llvm::ToolOutputFile, llvm::ToolOutputFile*, llvm::StringRef, llvm::ArrayRef<llvm::PassPlugin>, llvm::ArrayRef<std::function<void (llvm::PassBuilder&)>>, llvm::opt_tool::OutputKind, llvm::opt_tool::VerifierKind, bool, bool, bool, bool, bool, bool, bool, bool) (/usr/local/bin/opt+0x9439909) #28 0x000058ee97c3f380 optMain (/usr/local/bin/opt+0x4f25380) #29 0x00007c49d4c2a1ca __libc_start_call_main ./csu/../sysdeps/nptl/libc_start_call_main.h:74:3 #30 0x00007c49d4c2a28b call_init ./csu/../csu/libc-start.c:128:20 #31 0x00007c49d4c2a28b __libc_start_main ./csu/../csu/libc-start.c:347:5 #32 0x000058ee97c309a5 _start (/usr/local/bin/opt+0x4f169a5) ``` This is caused by a type mismatch between `SE.getSCEV(DerefRK.IRArgValue)` and `DerefBytesSCEV`. Fixing this by extending them to the wider type.	2026-03-11 19:05:47 +05:30
Florian Hahn	483fc738ff	[Loads] Add overload for isDerefAndAlignedInLoop that takes SCEVs.(NFC) Add an overload of isDereferenceableAndAlignedInLoop that directly takes the pointer and element sizes as SCEVs. This allows using it from contexts without relying on an underlying load instruction in follow-up patches.	2026-03-08 21:38:53 +00:00
Florian Hahn	526a4d4d8a	[LAA] Always use DepCands when grouping runtime checks. (#91196 ) Update groupChecks to always use DepCands to try and merge runtime checks. DepCands contains the dependency partition, grouping together all accessed pointers to he same underlying objects. If we computed the dependencies, We only need to check accesses to the same underlying object, if there is an unknown dependency for this underlying object; otherwise we already proved that all accesses withing the underlying object are safe w.r.t. vectorization and we only need to check that accesses to the underlying object don't overlap with accesses to other underlying objects. To ensure runtime checks are generated for the case with unknown dependencies, remove equivalence classes containing accesses involved in unknown dependencies. This reduces the number of runtime checks needed in case non-constant dependence distances are found, and is in preparation for removing the restriction that the accesses need to have the same stride which was added in https://github.com/llvm/llvm-project/pull/88039. PR: https://github.com/llvm/llvm-project/pull/91196	2026-03-02 21:30:15 +00:00
Igor Kirillov	41fc9b9845	[LAA] Fix recordAnalysis receiving null Instruction pointer (#183512 ) When a memory-reading or memory-writing instruction is not a LoadInst/StoreInst, the dyn_cast to Ld/St returns nullptr, which is then passed to recordAnalysis. This causes the optimization remark to fall back to the loop header location instead of pointing at the actual problematic instruction. Pass &I (the actual Instruction) instead.	2026-03-02 17:00:29 +00:00
Igor Kirillov	c1b2477a55	[LAA] NFC: Rename mulSCEVOverflow to mulSCEVNoOverflow (#183096 ) The function returns nullptr when the multiplication WOULD overflow, matching the semantics of its sibling addSCEVNoOverflow. The old name reads as if the function multiplies with overflow, which is the opposite of what it does.	2026-02-25 13:51:13 +00:00
Nashe Mncube	6e5cc82532	[LAA][LV]Allow recognition of strided pointers with constant stride (#171151 ) This patch fixes an issue found during LoopAccessAnalysis with respect to recognizing strided pointers that make use of runtime constants. Loop accesses of the form `p[base + offset * const]` , where `const` is a runtime constant should be considered for vectorization. However, it was found that there were cases that these access patterns weren't recognized. This patch resolves this by adding an explicit pattern match within LAA. --------- Co-authored-by: Florian Hahn <flo@fhahn.com>	2026-02-24 15:55:58 +00:00
Florian Hahn	2dcf858ba0	[LAA] Use SCEVPtrToAddr in tryToCreateDiffChecks. (#178861 ) The checks created by LAA only compute a pointer difference and do not need to capture provenance. Use SCEVPtrToAddr instead of SCEVPtrToInt for computations. To avoid regressions while parts of SCEV are migrated to use PtrToAddr this adds logic to rewrite all PtrToInt to PtrToAddr if possible in the created expressions. This is needed to avoid regressions. Similarly, if in the original IR we have a PtrToInt, SCEVExpander tries to re-use it if possible when expanding PtrToAddr. Depends on https://github.com/llvm/llvm-project/pull/178727. Fixes https://github.com/llvm/llvm-project/issues/156978. PR: https://github.com/llvm/llvm-project/pull/178861	2026-02-11 11:51:51 +00:00
Florian Hahn	9aadbb716d	[LAA] Check if access is part of loop in isNoWrap. blockNeedsPredication does not support blocks outside of the loop. Bail out for users outside the loop. Fixes https://github.com/llvm/llvm-project/issues/174760.	2026-01-19 20:21:09 +00:00
Nikita Popov	86755dd0bf	[llvm] Use ConstantInt::getAllOnesValue() Prefer getAllOnesValue() over get(-1). This is good practice to avoid issues with sign extension for large types.	2025-12-09 12:02:39 +01:00
Florian Hahn	113f058d73	[SCEV] Add m_scev_UndefOrPoison (NFC). (#170740 ) Add matcher for SCEVUnknown wrapping undef or poison. PR: https://github.com/llvm/llvm-project/pull/170740	2025-12-05 11:20:46 +00:00
Florian Hahn	af9a4263a1	[LAA] Only use inbounds/nusw in isNoWrap if the GEP is dereferenced. (#161445 ) Update isNoWrap to only use the inbounds/nusw flags from GEPs that are guaranteed to be dereferenced on every iteration. This fixes a case where we incorrectly determine no dependence. I think the issue is isolated to code that evaluates the resulting AddRec at BTC, just using it to compute the distance between accesses should still be fine; if the access does not execute in a given iteration, there's no dependence in that iteration. But isolating the code is not straight-forward, so be conservative for now. The practical impact should be very minor (only one loop changed across a corpus with 27k modules from large C/C++ workloads. Fixes https://github.com/llvm/llvm-project/issues/160912. PR: https://github.com/llvm/llvm-project/pull/161445	2025-11-04 17:08:12 +00:00
Florian Hahn	0e28c9bc9d	[LAA] Skip undef/poison strides in collectStridedAccess. The map returned by collectStridedAccess is used to replace strides with their versioned values. This does not work for Undef/Poison, which don't have use-lists. Don't try to version them, as versioning won't be useful in practice. Fixes https://github.com/llvm/llvm-project/issues/162922.	2025-10-27 05:01:17 +00:00
Florian Hahn	7ceef762c8	[LAA] Check if Ptr can be freed between Assume and CtxI. (#161725 ) When using information from dereferenceable assumptions, we need to make sure that the memory is not freed between the assume and the specified context instruction. Instead of just checking canBeFreed, check if there any calls that may free between the assume and the context instruction. This patch introduces a willNotFreeBetween to check for calls that may free between an assume and a context instructions, to also be used in https://github.com/llvm/llvm-project/pull/161255. PR: https://github.com/llvm/llvm-project/pull/161725	2025-10-03 13:44:58 +00:00
Florian Hahn	9d42c75256	[LAA] Fix picking context instr in evaluatePtrAddRec for multiple preds. A loop may have more than one predecessor out of the loop. In that case, just pick the first non-phi instruction in the loop header.	2025-09-30 20:04:29 +01:00
Florian Hahn	0898348abd	[LAA] Make blockNeedsPredication arguments const (NFC). The arguments aren't modified, mark them as const. This prepares for new users in a follow-up, which only have access to const versions of the arguments.	2025-09-30 17:05:04 +01:00
Ramkumar Ramachandra	08c1e9e80a	[LAA] Revert 56a1cbb and 1aded51, due to crash (#160993 ) This reverts commits 56a1cbb ([LAA] Fix non-NFC parts of 1aded51), 1aded51 ([LAA] Prepare to handle diff type sizes (NFC)). The original NFC patch caused some regressions, which the later patch tried to fix. However, the later patch is the cause of some crashes, and it would be best to revert both for now, and re-land after thorough testing.	2025-09-27 10:42:20 +01:00
Ramkumar Ramachandra	56a1cbbd1c	[LAA] Fix non-NFC parts of 1aded51 (#160701 ) 1aded51 ([LAA] Prepare to handle diff type sizes (NFC)) was supposed to be a non-functional patch, but introduced functional changes as known-non-negative and known-non-positive is not equivalent to !known-non-zero. Fix this.	2025-09-25 15:52:02 +01:00
Ramkumar Ramachandra	1aded51d74	[LAA] Prepare to handle diff type sizes (NFC) (#122318 ) As depend_diff_types shows, there are several places where the HasSameSize check can be relaxed for higher analysis precision. As a first step, return both the source size and the sink size from getDependenceDistanceStrideAndSize, along with a HasSameSize boolean for the moment.	2025-09-18 09:30:20 +01:00
Ramkumar Ramachandra	b7e31e7462	[LAA] Strip findForkedPointer (NFC) (#140298 ) Remove a level of indirection due to findForkedPointer, in an effort to improve code.	2025-09-10 14:06:15 +01:00
Florian Hahn	b400fd1151	[LAA] Support assumptions with non-constant deref sizes. (#156758 ) Update evaluatePtrAddrecAtMaxBTCWillNotWrap to support non-constant sizes in dereferenceable assumptions. Apply loop-guards in a few places needed to reason about expressions involving trip counts of the from (BTC - 1). PR: https://github.com/llvm/llvm-project/pull/156758	2025-09-04 11:32:33 +01:00
Florian Hahn	a434a7a4f1	Reapply "[LAA,Loads] Use loop guards and max BTC if needed when checking deref. (#155672 )" This reverts commit f0df1e3dd4ec064821f673ced7d83e5a2cf6afa1. Recommit with extra check for SCEVCouldNotCompute. Test has been added in b16930204b. Original message: Remove the fall-back to constant max BTC if the backedge-taken-count cannot be computed. The constant max backedge-taken count is computed considering loop guards, so to avoid regressions we need to apply loop guards as needed. Also remove the special handling for Mul in willNotOverflow, as this should not longer be needed after 914374624f (https://github.com/llvm/llvm-project/pull/155300). PR: https://github.com/llvm/llvm-project/pull/155672	2025-09-03 12:45:28 +01:00
Florian Hahn	f0df1e3dd4	Revert "[LAA,Loads] Use loop guards and max BTC if needed when checking deref. (#155672 )" This reverts commit 08001cf340185877665ee381513bf22a0fca3533. This triggers an assertion in some build configs, e.g. https://lab.llvm.org/buildbot/#/builders/24/builds/12211	2025-09-02 21:44:30 +01:00
Florian Hahn	08001cf340	[LAA,Loads] Use loop guards and max BTC if needed when checking deref. (#155672 ) Remove the fall-back to constant max BTC if the backedge-taken-count cannot be computed. The constant max backedge-taken count is computed considering loop guards, so to avoid regressions we need to apply loop guards as needed. Also remove the special handling for Mul in willNotOverflow, as this should not longer be needed after 914374624f (https://github.com/llvm/llvm-project/pull/155300). PR: https://github.com/llvm/llvm-project/pull/155672	2025-09-02 18:58:33 +01:00
annamthomas	00926a6db6	[SCEV][LAA] Support multiplication overflow computation (#155236 ) Add support for identifying multiplication overflow in SCEV. This is needed in LoopAccessAnalysis and that limitation was worked around by 484417a. This allows early-exit vectorization to work as expected in vect.stats.ll test without needing the workaround.	2025-08-27 12:11:32 +00:00
Benjamin Maxwell	bb3066d42b	[LAA] Move scalable vector check into `getStrideFromAddRec()` (#154013 ) This moves the check closer to the `.getFixedValue()` call and fixes #153797 (which is a regression from #126971).	2025-08-19 06:40:07 +01:00
Michael Berg	334a046a3c	[LoopDist] Consider reads and writes together for runtime checks (#145623 ) Emit safety guards for ptr accesses when cross partition loads exist which have a corresponding store to the same address in a different partition. This will emit the necessary ptr checks for these accesses. The test case was obtained from SuperTest, which SiFive runs regularly. We enabled LoopDistribution by default in our downstream compiler, this change was part of that enablement.	2025-08-14 12:50:17 -07:00
Florian Hahn	2ae996cbbe	[LAA] Support assumptions in evaluatePtrAddRecAtMaxBTCWillNotWrap (#147047 ) This patch extends the logic added in https://github.com/llvm/llvm-project/pull/128061 to support dereferenceability information from assumptions as well. Unfortunately both assumption cache and the dominator tree need to be threaded through multiple layers to make them available where needed. PR: https://github.com/llvm/llvm-project/pull/147047	2025-08-01 14:18:07 +01:00
Ramkumar Ramachandra	b692b239f0	[LAA] Rename var used to retry with RT-checks (NFC) (#147307 ) FoundNonConstantDistanceDependence is a misleading name for a variable that determines whether we retry with runtime checks. Rename it.	2025-07-22 13:36:33 +01:00
Ramkumar Ramachandra	584158f9ae	[LAA] Hoist check for SCEV-uncomputable dist (NFC) (#148841 ) Hoist the check for SCEVCouldNotCompute distance into getDependenceDistanceAndSize.	2025-07-16 15:30:53 +01:00
Florian Hahn	5a4586f468	Reapply "[LAA] Remove loop-invariant check added in 234cc40adc61." This reverts commit d43a80936d437d217d5a6dbbaa5fb131c27e7085. With the correctness issue blocking the recommit finally fixed (5d01697ec6cb), again unconditionally check if accesses are completely before or after each other.	2025-07-14 21:21:22 +01:00
Florian Hahn	9693056aac	[LAA] Move code to check if access are completely before/after (NFC). Factor out code to check if access are completely before/after each other. This reduces the diff for an upcoming re-commit and moving to a function also helps to reduce the nesting level via early exits.	2025-07-11 19:53:57 +01:00
Ramkumar Ramachandra	20864c4379	[LAA] Strip outdated comment in isDependent (NFC) (#146367 ) The comment has been outdated since 87ddd3a1 ([LAA] Rename and fix semantics of MaxSafeDepDistBytes to MinDepDistBytes).	2025-07-07 13:54:37 +01:00
Ramkumar Ramachandra	fb845f93c0	[LAA] Hoist setting condition for RT-checks (#128045 ) Strip ShouldRetyWithRuntimeCheck from the DepedenceDistanceStrideAndSizeInfo struct, and free isDependent from the responsibility of setting the condition for when runtime-checks are needed, transferring this responsibility to getDependenceDistanceStrideAndSize. We can have multiple DepType::Unknown dependences that, by themselves, do not trigger the retrying with runtime memory checks, and therefore block vectorization. But once a single FoundNonConstantDistanceDependence is found, the analysis seems to switch to the "LAA: Retrying with memory checks" path and allows all these dependences to be handled via runtime checks. There is hence no rationale for predicating FoundNonConstantDependenceDistance on DepType::Unknown, and removing this predication is one of the side-effects of this patch.	2025-07-07 12:02:41 +01:00
Ramkumar Ramachandra	619f7afd71	[LAA] Clean up APInt-overflow related code (#140048 ) Co-authored-by: Florian Hahn <flo@fhahn.com>	2025-06-30 14:48:56 +01:00
Florian Hahn	b8769104f1	[LAA] Address follow-up suggestions for #128061 . Adjust naming and add argument comments as suggested.	2025-06-24 12:00:17 +01:00
Florian Hahn	5d01697ec6	[LAA] Be more careful when evaluating AddRecs at symbolic max BTC. (#128061 ) Evaluating AR at the symbolic max BTC may wrap and create an expression that is less than the start of the AddRec due to wrapping (for example consider MaxBTC = -2). If that's the case, set ScEnd to -(EltSize + 1). ScEnd will get incremented by EltSize before returning, so this effectively sets ScEnd to unsigned max. Note that LAA separately checks that accesses cannot not wrap (52ded672492, https://github.com/llvm/llvm-project/pull/127543), so unsigned max represents an upper bound. When there is a computable backedge-taken count, we are guaranteed to execute the number of iterations, and if any pointer would wrap it would be UB (or the access will never be executed, so cannot alias). It includes new tests from the previous discussion that show a case we wrap with a BTC, but it is UB due to the pointer after the object wrapping (in `evaluate-at-backedge-taken-count-wrapping.ll`) When we have only a maximum backedge taken count, we instead try to use dereferenceability information to determine if the pointer access must be in bounds for the maximum backedge taken count. PR: https://github.com/llvm/llvm-project/pull/128061	2025-06-23 20:23:40 +01:00
Ramkumar Ramachandra	c8c4bd1ebc	[LV] Stengthen loop-invariance checks in isPredicatedInst (#140744 ) Check loop-invariance against SCEV as well.	2025-06-20 14:01:48 +01:00
Kazu Hirata	03f616eb3a	[llvm] Compare std::optional<T> to values directly (NFC) (#143340 ) This patch transforms: X && *X == Y to: X == Y where X is of std::optional<T>, and Y is of T or similar.	2025-06-08 22:37:59 -07:00
John Brawn	81d3189891	[LAA] Keep pointer checks on partial analysis (#139719 ) Currently if there's any memory access that AccessAnalysis couldn't analyze then all of the runtime pointer check results are discarded. This patch makes this able to be controlled with the AllowPartial option, which makes it so we generate the runtime check information for those pointers that we could analyze, as transformations may still be able to make use of the partial information. Of the transformations that use LoopAccessAnalysis, only LoopVersioningLICM changes behaviour as a result of this change. This is because the others either: * Check canVectorizeMemory, which will return false when we have partial pointer information as analyzeLoop() will return false. * Examine the dependencies returned by getDepChecker(), which will be empty as we exit analyzeLoop if we have partial pointer information before calling areDepsSafe(), which is what fills in the dependency information.	2025-06-04 16:47:20 +01:00
Ramkumar Ramachandra	ba57ff66a3	[LAA] Improve code in findForkedSCEVs (NFC) (#140384 )	2025-06-03 11:00:37 +01:00
Jon Roelofs	798058fca5	[Remarks] Remove an upcast footgun. NFC (#142191 ) CodeRegion's were previously passed as Value*, but then immediately upcast to BasicBlock. Let's keep the type information around until the use cases for non-BasicBlock code regions actually materialize.	2025-05-31 11:07:54 -07:00
Kazu Hirata	89308de4b0	[llvm] Value-initialize values with *Map::try_emplace (NFC) (#141522 ) try_emplace value-initializes values, so we do not need to pass nullptr to try_emplace when the value types are raw pointers or std::unique_ptr<T>.	2025-05-26 15:13:02 -07:00
Florian Hahn	c554fc9245	[LAA] Use m_scev_AffineAddRec in LAA (NFC).	2025-05-26 19:58:22 +01:00
Kazu Hirata	0918361d8b	[Analysis] Remove unused includes (NFC) (#141319 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-05-23 23:59:56 -07:00
Ramkumar Ramachandra	5a1311d516	[LAA] Strip isNoWrapGEP: dead code (NFC) (#140308 ) isNoWrap is the only caller of isNoWrapGEP, and it has subsuming check on the GEP immediately after.	2025-05-22 22:47:17 +01:00
Florian Hahn	4a6b1fb9da	[LAA] Remove dead SE arg from canCheckPtrAtRT (NFC).	2025-05-22 20:05:35 +01:00
Ramkumar Ramachandra	bb2791609d	[LAA] Tweak debug output for UTC stability (#140764 ) UpdateTestChecks has a make_analyzer_generalizer to replace pointer addressess from the debug output of LAA with a pattern, which is an acceptable solution when there is one RUN line. However, when there are multiple RUN lines with a common pattern, UTC fails to recognize common output due to mismatched pointer addresses. Instead of hacking UTC scrub the output before comparing the outputs from the different RUN lines, fix the issue once and for all by making LAA not output unstable pointer addresses in the first place. The removal of the now-dead make_analyzer_generalizer is left as a non-trivial exercise for a follow-up.	2025-05-21 12:01:49 +01:00
Florian Hahn	35ee462fef	[LAA] Add assert check CanDoRTIFNeeded can be computed w/o RT.Need (NFC) Add assert to ensure that CanDoRTIfNeeded can be computed w/o RtCheck.Need, to prepare for adjusting the condition.	2025-05-18 22:12:28 +01:00

1 2 3 4 5 ...

519 Commits