llvm-project

Author	SHA1	Message	Date
Nikita Popov	5753ee2434	[LICM] Avoid assertion failure on stale MemoryDef It can happen that the call is originally created as a MemoryDef, and then later transforms show it is actually read-only and could be a MemoryUse -- however, this is not guaranteed to be reflected in MSSA.	2025-08-19 10:25:45 +02:00
Mircea Trofin	c971c25544	[licm] don't drop `MD_prof` when dropping other metadata (#152420 ) Part of Issue #147390	2025-08-16 07:26:13 -07:00
Theodoros Theodoridis	d15b7a83a7	[llvm][LICM] Limit multi-use BOAssociation to FP and Vector (#149829 ) Limit the re-association of BOps with multiple users to FP and Vector arithmetic.	2025-08-14 11:56:55 +01:00
Mircea Trofin	374cbfd327	[licm] clone `MD_prof` when hoisting conditional branch (#152232 ) The profiling - related metadata information for the hoisted conditional branch should be copied from the original branch, not from the current terminator of the block it's hoisted to. The patch adds a way to disable the fix just so we can do an ablation test, after which the flag will be removed. The same flag will be reused for other similar fixes. (This was identified through `profcheck` (see Issue #147390), and this PR addresses most of the test failures (when running under profcheck) under `Transforms/LICM`.)	2025-08-13 02:01:00 +02:00
weiguozhi	5e87792200	[LoopInfo] Pointer to stack object may not be loop invariant in a coroutine function (#149936 ) A coroutine function may be split to ramp function and resume function, and they have different stack frames, so a pointer to stack objects may have different addresses depending on where it is used, so it's not a loop invariant. It temporarily fixes https://github.com/llvm/llvm-project/issues/149604.	2025-08-09 14:20:19 -07:00
Nikita Popov	0a41e7c87e	[LICM] Do not reassociate constant offset GEP (#151492 ) LICM tries to reassociate GEPs in order to hoist an invariant GEP. Currently, it also does this in the case where the GEP has a constant offset. This is usually undesirable. From a back-end perspective, constant GEPs are usually free because they can be folded into addressing modes, so this just increases register pressume. From a middle-end perspective, keeping constant offsets last in the chain makes it easier to analyze the relationship between multiple GEPs on the same base, especially after CSE. The worst that can happen here is if we start with something like ``` loop { p + 4x p + 4x + 1 p + 4x + 2 p + 4x + 3 } ``` And LICM converts it into: ``` p.1 = p + 1 p.2 = p + 2 p.3 = p + 3 loop { p + 4x p.1 + 4x p.2 + 4x p.3 + 4x } ``` Which is much worse than leaving it for CSE to convert to: ``` loop { p2 = p + 4*x p2 + 1 p2 + 2 p2 + 3 } ```	2025-08-01 09:43:15 +02:00
Shan Huang	1b136824bb	[DebugInfo][LICM] Fix debug value loss caused by hoisting the cmp instructions (#146640 ) fix #146621	2025-07-07 13:46:56 +08:00
Jiachen (Yangyang) Wang	1ab0e7dd60	[LICM] Hoisting writeonly calls (#143799 ) Adds support for hoisting `writeonly` calls in LICM. This patch adds a missing optimization that allows hoisting of `writeonly` function calls out of loops when it is safe to do so. Previously, such calls were conservatively retained inside the loop body, and the redundant calls were only reduced through unrolling, relying on target-dependent heuristics. Closes #143267 Testing: - Modified previously negative tests for hoisting writeonly calls to be instead positive - Added test cases for hoisting of two writeonly calls where the pointers do/do not alias - Added a test case for not argmemonly writeonly calls.	2025-06-19 10:09:19 +02:00
Nikita Popov	a13b7cc00c	[LICM] Support hoisting of non-argmemonly readonly calls (#144497 ) The code checking whether a readonly call is safe to hoist is currently limited to only argmemonly calls. However, the actual implementation does not depend on this in any way. It either does an MSSA clobber walk on the memory access (which will take all locations accessed by the call into account), or it will look at all MemoryDefs in an entirely location-independent manner. The current restriction dates back to the time when LICM still supported AST, in which case this code did reason about the individual pointer arguments.	2025-06-18 12:24:55 +02:00
Jeremy Morse	9eb0020555	[DebugInfo][RemoveDIs] Remove a swathe of debug-intrinsic code (#144389 ) Seeing how we can't generate any debug intrinsics any more: delete a variety of codepaths where they're handled. For the most part these are plain deletions, in others I've tweaked comments to remain coherent, or added a type to (what was) type-generic-lambdas. This isn't all the DbgInfoIntrinsic call sites but it's most of the simple scenarios. Co-authored-by: Nikita Popov <github@npopov.com>	2025-06-17 15:55:14 +01:00
Stephen Tozer	a08a831515	[DLCov][NFC] Propagate annotated DebugLocs through transformations (#138047 ) Part of the coverage-tracking feature, following #107279. In order for DebugLoc coverage testing to work, we firstly have to set annotations for intentionally-empty DebugLocs, and secondly we have to ensure that we do not drop these annotations as we propagate DebugLocs throughout compilation. As the annotations exist as part of the DebugLoc class, and not the underlying DILocation, they will not survive a DebugLoc->DILocation->DebugLoc roundtrip. Therefore this patch modifies a number of places in the compiler to propagate DebugLocs directly rather than via the underlying DILocation. This has no effect on the output of normal builds; it only ensures that during coverage builds, we do not drop incorrectly annotations and therefore create false positives. The bulk of these changes are in replacing DILocation::getMergedLocation(s) with a DebugLoc equivalent, and in changing the IRBuilder to store a DebugLoc directly rather than storing DILocations in its general Metadata array. We also use a new function, `DebugLoc::orElse`, which selects the "best" DebugLoc out of a pair (valid location > annotated > empty), preferring the current DebugLoc on a tie - this encapsulates the existing behaviour at a few sites where we _may_ assign a DebugLoc to an existing instruction, while extending the logic to handle annotation DebugLocs at the same time.	2025-06-12 14:06:27 +01:00
Stephen Tozer	aa8a1fa6f5	[DLCov][NFC] Annotate intentionally-blank DebugLocs in existing code (#136192 ) Following the work in PR #107279, this patch applies the annotative DebugLocs, which indicate that a particular instruction is intentionally missing a location for a given reason, to existing sites in the compiler where their conditions apply. This is NFC in ordinary LLVM builds (each function `DebugLoc::getFoo()` is inlined as `DebugLoc()`), but marks the instruction in coverage-tracking builds so that it will be ignored by Debugify, allowing only real errors to be reported. From a developer standpoint, it also communicates the intentionality and reason for a missing DebugLoc. Some notes for reviewers: - The difference between `I->dropLocation()` and `I->setDebugLoc(DebugLoc::getDropped())` is that the former _may_ decide to keep some debug info alive, while the latter will always be empty; in this patch, I always used the latter (even if the former could technically be correct), because the former could result in some (barely) different output, and I'd prefer to keep this patch purely NFC. - I've generally documented the uses of `DebugLoc::getUnknown()`, with the exception of the vectorizers - in summary, they are a huge cause of dropped source locations, and I don't have the time or the domain knowledge currently to solve that, so I've plastered it all over them as a form of "fixme".	2025-06-11 17:42:10 +01:00
Kazu Hirata	3e1931d7fe	[Scalar] Use std::none_of (NFC) (#143282 ) While I am at it, this patch replaces It with std::next(It) for clarity. Note that It is not used after this point.	2025-06-08 01:34:32 -07:00
Nikita Popov	eea6969190	[LICM] Only check for provenance captures (#141731 ) When performing scalar promotions, only consider provenance captures, which may lead to non-thread-safe accesses. Address captures can be ignored.	2025-05-28 11:57:30 +02:00
Florian Hahn	34813d9d38	[Reassociate] Move Disjoint flag handling to OverflowTracking. (#140406 ) Move disjoint flag tracking to OverflowTracking. This enables preserving disjoint flags in Reassociate. Depends on https://github.com/llvm/llvm-project/pull/140404 PR: https://github.com/llvm/llvm-project/pull/140406	2025-05-23 14:59:18 +01:00
Florian Hahn	0c82e06bb6	[LICM] Use OverflowTracking to preserve NUW/NSW when reassociating. (#140404 ) This enables preserving NSW when both adds have NSW and NUW. For now, set AllKnownNonNegative/AllKnownNonZero to false when using in LICM. https://alive2.llvm.org/ce/z/uu79Xc Depends on https://github.com/llvm/llvm-project/pull/140403. PR: https://github.com/llvm/llvm-project/pull/140404	2025-05-20 11:18:56 +01:00
Florian Hahn	bf1d4a0710	[LICM] Preserve Disjoint flag on OR when hoisting. (#140266 ) Update hoistBOAssociation to preserve Disjoint flags on the newly created instructions if both ORs are disjoint. Fixes https://github.com/llvm/llvm-project/issues/139625. PR: https://github.com/llvm/llvm-project/pull/140266	2025-05-17 14:48:39 +01:00
Shan Huang	5971b41919	[DebugInfo][LICM] Salvage dbg_values for the dead instructions to erase (#138796 ) fix #138684 .	2025-05-12 09:07:30 +08:00
Nikita Popov	b492ec5899	[ErrorHandling] Add reportFatalInternalError + reportFatalUsageError (NFC) (#138251 ) This implements the result of the discussion at: https://discourse.llvm.org/t/rfc-report-fatal-error-and-the-default-value-of-gencrashdialog/73587 There are two different use cases for report_fatal_error, so replace it with two functions reportFatalInternalError() and reportFatalUsageError(). The former indicates a bug in LLVM and generates a crash dialog. The latter does not. The names have been suggested by rnk and people seemed to like them. This replaces a lot of the usages that passed an explicit value for GenCrashDiag. I did not bulk replace remaining report_fatal_error usage -- they probably require case by case review for which function to use.	2025-05-05 12:10:03 +02:00
Matt Arsenault	393c783a10	LICM: Avoid looking at use list of constant data (#134690 ) The codegen test changes seem incidental. Either way, sms-grp-order.ll seems to already not hit the original issue.	2025-04-13 17:06:38 +02:00
Kazu Hirata	cde58bfc16	[Transforms] Use range constructors of *Set (NFC) (#133203 )	2025-03-27 07:51:58 -07:00
Kazu Hirata	73dc2afd2c	[Transforms] Use Set::insert_range (NFC) (#132652 ) We can use Set::insert_range to collapse: for (auto Elem : Range) Set.insert(E); down to: Set.insert_range(Range); In some cases, we can further fold that into the set declaration.	2025-03-23 19:42:53 -07:00
Kazu Hirata	41b76119ec	[llvm] Use range constructors for *Set (NFC) (#132636 )	2025-03-23 15:50:34 -07:00
Kazu Hirata	1b189cab5e	[llvm] Use *Set::insert_range (NFC) (#132509 ) DenseSet, SmallPtrSet, SmallSet, SetVector, and StringSet recently gained C++23-style insert_range. This patch uses insert_range in conjunction with llvm::{predecessors,successors} and MachineBasicBlock::{predecessors,successors}.	2025-03-22 08:07:33 -07:00
Kazu Hirata	efc2f6912d	[Scalar] Avoid repeated hash lookups (NFC) (#129825 )	2025-03-05 00:44:50 -08:00
chrisPyr	71f4c7dabe	[NFC]Make file-local cl::opt global variables static (#126486 ) #125983	2025-03-03 13:46:33 +07:00
Nikita Popov	9cbdcfcafd	[CaptureTracking] Remove StoreCaptures parameter (NFC) The implementation doesn't use it, and is unlikely to use it in the future. The places that do set StoreCaptures=false, do so incorrectly and would be broken if the parameter actually did anything.	2025-02-24 12:00:57 +01:00
Jeremy Morse	6292a808b3	[NFC][DebugInfo] Use iterator-flavour getFirstNonPHI at many call-sites (#123737 ) As part of the "RemoveDIs" project, BasicBlock::iterator now carries a debug-info bit that's needed when getFirstNonPHI and similar feed into instruction insertion positions. Call-sites where that's necessary were updated a year ago; but to ensure some type safety however, we'd like to have all calls to getFirstNonPHI use the iterator-returning version. This patch changes a bunch of call-sites calling getFirstNonPHI to use getFirstNonPHIIt, which returns an iterator. All these call sites are where it's obviously safe to fetch the iterator then dereference it. A follow-up patch will contain less-obviously-safe changes. We'll eventually deprecate and remove the instruction-pointer getFirstNonPHI, but not before adding concise documentation of what considerations are needed (very few). --------- Co-authored-by: Stephen Tozer <Melamoto@gmail.com>	2025-01-24 13:27:56 +00:00
Jeremy Morse	8e70273509	[NFC][DebugInfo] Use iterator moveBefore at many call-sites (#123583 ) As part of the "RemoveDIs" project, BasicBlock::iterator now carries a debug-info bit that's needed when getFirstNonPHI and similar feed into instruction insertion positions. Call-sites where that's necessary were updated a year ago; but to ensure some type safety however, we'd like to have all calls to moveBefore use iterators. This patch adds a (guaranteed dereferenceable) iterator-taking moveBefore, and changes a bunch of call-sites where it's obviously safe to change to use it by just calling getIterator() on an instruction pointer. A follow-up patch will contain less-obviously-safe changes. We'll eventually deprecate and remove the instruction-pointer insertBefore, but not before adding concise documentation of what considerations are needed (very few).	2025-01-24 10:53:11 +00:00
Ramkumar Ramachandra	22a280d392	LICM: teach hoistMinMax about samesign (#122730 ) Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of one of the FIXMEs it introduced by replacing a predicate comparison with CmpPredicate::getMatching.	2025-01-13 20:03:58 +00:00
Ramkumar Ramachandra	4a0d53a0b0	PatternMatch: migrate to CmpPredicate (#118534 ) With the introduction of CmpPredicate in 51a895a (IR: introduce struct with CmpInst::Predicate and samesign), PatternMatch is one of the first key pieces of infrastructure that must be updated to match a CmpInst respecting samesign information. Implement this change to Cmp-matchers. This is a preparatory step in migrating the codebase over to CmpPredicate. Since we no functional changes are desired at this stage, we have chosen not to migrate CmpPredicate::operator==(CmpPredicate) calls to use CmpPredicate::getMatching(), as that would have visible impact on tests that are not yet written: instead, we call CmpPredicate::operator==(Predicate), preserving the old behavior, while also inserting a few FIXME comments for follow-ups.	2024-12-13 14:18:33 +00:00
Florian Hahn	ab6677e7d6	[LICM] Only set AA metadata on hoisted load if it executes. (#117204 ) https://github.com/llvm/llvm-project/pull/116220 clarified that violations of aliasing metadata are UB. Only set the AA metadata after hoisting a log, if it is guaranteed to execute in the original loop. PR: https://github.com/llvm/llvm-project/pull/117204	2024-11-26 14:16:16 +00:00
DianQK	18b02bbf44	[LICM] allow MemoryAccess creation failure (#116813 ) Fixes #116809. After running some passes (SimpleLoopUnswitch, LoopInstSimplify, etc.), MemorySSA might be outdated, and the instruction `I` may have become a non-memory touching instruction. LICM has already handled this, but it does not pass `CreationMustSucceed=false` to `createDefinedAccess`.	2024-11-20 19:52:51 +08:00
Kazu Hirata	94f9cbbe49	[Scalar] Remove unused includes (NFC) (#114645 ) Identified with misc-include-cleaner.	2024-11-02 08:32:26 -07:00
Ramkumar Ramachandra	45817aa726	LICM: hoist BO assoc for and, or, xor (#111146 ) Trivially lift the Opcode limitation on hoistBOAssociation to also hoist and, or, and xor. Alive2 proofs: https://alive2.llvm.org/ce/z/rVNP2X	2024-10-04 19:13:51 +01:00
Joshua Cao	0bc98349c8	[LICM] Use DomTreeUpdater version of SplitBlockPredecessors, nfc (#107190 ) The DominatorTree version is marked for deprecation, so we use the DomTreeUpdater version. We also update sinkRegion() to iterate over basic blocks instead of DomTreeNodes. The loop body calls SplitBlockPredecessors. The DTU version calls DomTreeUpdater::apply_updates(), which may call DominatorTree::reset(). This invalidates the worklist of DomTreeNodes to iterate over.	2024-09-29 21:28:45 -07:00
Ramkumar Ramachandra	6fe723441b	LICM: hoist BO assoc for FAdd and FMul (#108415 ) Extend hoistBOAssociation to the FAdd and FMul cases, noting that we copy an intersection of the fast-math flags present in both instructions.	2024-09-27 11:05:30 +01:00
Ramkumar Ramachandra	16900d3b98	LICM: hoist BO assoc when BinOp is in RHS (#107072 ) Extend hoistBOAssociation smoothly to handle the case when the inner BinaryOperator is in the RHS of the outer BinaryOperator. This completes the generalization of hoistBOAssociation, and the only limitation after this patch is the fact that only Add and Mul are hoisted.	2024-09-04 22:01:04 +01:00
Ramkumar Ramachandra	5818337765	LICM: hoist BO assoc when (C1 op LV) op C2 (#106999 ) Extend hoistBOAssociation to handle the "(C1 op LV) op C2" case, when op is a commutative operand.	2024-09-04 11:47:37 +01:00
Ramkumar Ramachandra	f1ef67ded5	LICM: extend hoist BO assoc to mul case (#106991 ) Trivially extend hoistBOAssociation to also handle the BinaryOperator Mul. Alive2 proofs: https://alive2.llvm.org/ce/z/zjtR5g	2024-09-03 17:08:11 +01:00
Ramkumar Ramachandra	05f5a91d00	LICM: use IRBuilder in hoist BO assoc (#106978 ) Use IRBuilder when creating the new invariant instruction, so that the constant-folder has an opportunity to constant-fold the new Instruction that we desire to create.	2024-09-03 15:27:03 +01:00
Ramkumar Ramachandra	2a8fda443e	LICM: extend hoistAddSub to unsigned case (#106373 ) Trivially extend dd0cf23 ([LICM] Reassociate & hoist sub expressions) to handle unsigned predicates as well. Alive2 proofs: https://alive2.llvm.org/ce/z/GdDBtT.	2024-08-30 14:12:52 +01:00
Jeremy Morse	fd7d7882e7	[DebugInfo][RemoveDIs] Use iterators to insert everywhere (#102003 ) These are the final few places in LLVM where we use instruction pointers to identify the position that we're inserting something. We're trying to get away from that with a view to deprecating those methods, thus use iterators in all these places. I believe they're all debug-info safe. The sketchiest part is the ExtractValueInst copy constructor, where we cast nullptr to a BasicBlock pointer, so that we take the non-default insert-into-no-block path for instruction insertion, instead of the default nullptr-instruction path for UnaryInstruction. Such a hack is necessary until we get rid of the instruction constructor entirely.	2024-08-08 14:25:06 +01:00
Ricardo Jesus	fc157522c5	[LICM] Prevent fold and hoist of binary ops with over 2 uses (#102114 ) This limits folding and hoisting associative binary ops to cases where the intermediate op has at most two uses. The more uses the intermediate op has, the more new ops we have to create to potentially reduce the loop's critical path. We keep the limit to two uses to minimise undesirable increases in code size.	2024-08-07 09:52:30 +01:00
Ricardo Jesus	25da8e5a97	Reapply "[LICM] Fold associative binary ops to promote code hoisting (#81608 )" (#100377 ) This reapplies a more strict version of `f2ccf80136`. Perform the transformation "(LV op C1) op C2" ==> "LV op (C1 op C2)" where op is an associative binary op, LV is a loop variant, and C1 and C2 are loop invariants, and hoist (C1 op C2) into the preheader. For now this fold is restricted to ADDs.	2024-07-26 10:12:25 +01:00
Nikita Popov	b48819dbcd	Revert " [LICM] Fold associative binary ops to promote code hoisting (#81608 )" This reverts commit f2ccf80136a01ca69f766becafb329db6c54c0c8. The flag propagation code is incorrect.	2024-07-23 12:01:22 +02:00
Ricardo Jesus	f2ccf80136	[LICM] Fold associative binary ops to promote code hoisting (#81608 ) Perform the transformation "(LV op C1) op C2" ==> "LV op (C1 op C2)" where op is an associative binary op, LV is a loop variant, and C1 and C2 are loop invariants to hoist. Similar patterns could be folded (left in comment) but this one seems to be the most impactful.	2024-07-23 10:03:26 +01:00
Nikita Popov	2d209d964a	[IR] Add getDataLayout() helpers to BasicBlock and Instruction (#96902 ) This is a helper to avoid writing `getModule()->getDataLayout()`. I regularly try to use this method only to remember it doesn't exist... `getModule()->getDataLayout()` is also a common (the most common?) reason why code has to include the Module.h header.	2024-06-27 16:38:15 +02:00
Tim Gymnich	0dd43774a6	[LICM] Fix dropped metadata (#95221 ) LICM drops metadata for call instructions when cloning instructions. This patch just adds the missing `copyMetadata`. Fixes #91919.	2024-06-19 10:22:52 +02:00
Antonio Frighetto	70091dc943	[LICM] Invalidate cached SCEV results in `hoistMulAddAssociation` While reassociating expressions, LICM is required to invalidate SCEV results, as otherwise subsequent passes in the pipeline that leverage LICM foldings (e.g. IndVars), may reason on invalid expressions; thus miscompiling. This is achieved by rewriting the reassociable instruction from scratch. Fixes: https://github.com/llvm/llvm-project/issues/91957.	2024-05-29 08:44:45 +02:00

1 2 3 4 5 ...

725 Commits