llvm-project

Author	SHA1	Message	Date
Ramkumar Ramachandra	45817aa726	LICM: hoist BO assoc for and, or, xor (#111146 ) Trivially lift the Opcode limitation on hoistBOAssociation to also hoist and, or, and xor. Alive2 proofs: https://alive2.llvm.org/ce/z/rVNP2X	2024-10-04 19:13:51 +01:00
Joshua Cao	0bc98349c8	[LICM] Use DomTreeUpdater version of SplitBlockPredecessors, nfc (#107190 ) The DominatorTree version is marked for deprecation, so we use the DomTreeUpdater version. We also update sinkRegion() to iterate over basic blocks instead of DomTreeNodes. The loop body calls SplitBlockPredecessors. The DTU version calls DomTreeUpdater::apply_updates(), which may call DominatorTree::reset(). This invalidates the worklist of DomTreeNodes to iterate over.	2024-09-29 21:28:45 -07:00
Ramkumar Ramachandra	6fe723441b	LICM: hoist BO assoc for FAdd and FMul (#108415 ) Extend hoistBOAssociation to the FAdd and FMul cases, noting that we copy an intersection of the fast-math flags present in both instructions.	2024-09-27 11:05:30 +01:00
Ramkumar Ramachandra	16900d3b98	LICM: hoist BO assoc when BinOp is in RHS (#107072 ) Extend hoistBOAssociation smoothly to handle the case when the inner BinaryOperator is in the RHS of the outer BinaryOperator. This completes the generalization of hoistBOAssociation, and the only limitation after this patch is the fact that only Add and Mul are hoisted.	2024-09-04 22:01:04 +01:00
Ramkumar Ramachandra	5818337765	LICM: hoist BO assoc when (C1 op LV) op C2 (#106999 ) Extend hoistBOAssociation to handle the "(C1 op LV) op C2" case, when op is a commutative operand.	2024-09-04 11:47:37 +01:00
Ramkumar Ramachandra	f1ef67ded5	LICM: extend hoist BO assoc to mul case (#106991 ) Trivially extend hoistBOAssociation to also handle the BinaryOperator Mul. Alive2 proofs: https://alive2.llvm.org/ce/z/zjtR5g	2024-09-03 17:08:11 +01:00
Ramkumar Ramachandra	05f5a91d00	LICM: use IRBuilder in hoist BO assoc (#106978 ) Use IRBuilder when creating the new invariant instruction, so that the constant-folder has an opportunity to constant-fold the new Instruction that we desire to create.	2024-09-03 15:27:03 +01:00
Ramkumar Ramachandra	2a8fda443e	LICM: extend hoistAddSub to unsigned case (#106373 ) Trivially extend dd0cf23 ([LICM] Reassociate & hoist sub expressions) to handle unsigned predicates as well. Alive2 proofs: https://alive2.llvm.org/ce/z/GdDBtT.	2024-08-30 14:12:52 +01:00
Jeremy Morse	fd7d7882e7	[DebugInfo][RemoveDIs] Use iterators to insert everywhere (#102003 ) These are the final few places in LLVM where we use instruction pointers to identify the position that we're inserting something. We're trying to get away from that with a view to deprecating those methods, thus use iterators in all these places. I believe they're all debug-info safe. The sketchiest part is the ExtractValueInst copy constructor, where we cast nullptr to a BasicBlock pointer, so that we take the non-default insert-into-no-block path for instruction insertion, instead of the default nullptr-instruction path for UnaryInstruction. Such a hack is necessary until we get rid of the instruction constructor entirely.	2024-08-08 14:25:06 +01:00
Ricardo Jesus	fc157522c5	[LICM] Prevent fold and hoist of binary ops with over 2 uses (#102114 ) This limits folding and hoisting associative binary ops to cases where the intermediate op has at most two uses. The more uses the intermediate op has, the more new ops we have to create to potentially reduce the loop's critical path. We keep the limit to two uses to minimise undesirable increases in code size.	2024-08-07 09:52:30 +01:00
Ricardo Jesus	25da8e5a97	Reapply "[LICM] Fold associative binary ops to promote code hoisting (#81608 )" (#100377 ) This reapplies a more strict version of `f2ccf80136`. Perform the transformation "(LV op C1) op C2" ==> "LV op (C1 op C2)" where op is an associative binary op, LV is a loop variant, and C1 and C2 are loop invariants, and hoist (C1 op C2) into the preheader. For now this fold is restricted to ADDs.	2024-07-26 10:12:25 +01:00
Nikita Popov	b48819dbcd	Revert " [LICM] Fold associative binary ops to promote code hoisting (#81608 )" This reverts commit f2ccf80136a01ca69f766becafb329db6c54c0c8. The flag propagation code is incorrect.	2024-07-23 12:01:22 +02:00
Ricardo Jesus	f2ccf80136	[LICM] Fold associative binary ops to promote code hoisting (#81608 ) Perform the transformation "(LV op C1) op C2" ==> "LV op (C1 op C2)" where op is an associative binary op, LV is a loop variant, and C1 and C2 are loop invariants to hoist. Similar patterns could be folded (left in comment) but this one seems to be the most impactful.	2024-07-23 10:03:26 +01:00
Nikita Popov	2d209d964a	[IR] Add getDataLayout() helpers to BasicBlock and Instruction (#96902 ) This is a helper to avoid writing `getModule()->getDataLayout()`. I regularly try to use this method only to remember it doesn't exist... `getModule()->getDataLayout()` is also a common (the most common?) reason why code has to include the Module.h header.	2024-06-27 16:38:15 +02:00
Tim Gymnich	0dd43774a6	[LICM] Fix dropped metadata (#95221 ) LICM drops metadata for call instructions when cloning instructions. This patch just adds the missing `copyMetadata`. Fixes #91919.	2024-06-19 10:22:52 +02:00
Antonio Frighetto	70091dc943	[LICM] Invalidate cached SCEV results in `hoistMulAddAssociation` While reassociating expressions, LICM is required to invalidate SCEV results, as otherwise subsequent passes in the pipeline that leverage LICM foldings (e.g. IndVars), may reason on invalid expressions; thus miscompiling. This is achieved by rewriting the reassociable instruction from scratch. Fixes: https://github.com/llvm/llvm-project/issues/91957.	2024-05-29 08:44:45 +02:00
Shan Huang	cdd782183d	[DebugInfo][LICM] Fix missing debug location updates (#91729 )	2024-05-11 16:26:04 +01:00
Craig Topper	1261c02be4	[LICM] Drop nsw/nuw flags on affected instructions in hoistMulAddAssociation. (#85486 ) Since we are introducing new multiplies earlier in the arithmetic, the nsw/nuw flags on later instructions are no longer accurate. Fixes #85457.	2024-03-18 11:46:25 -07:00
Jeremy Morse	2fe81edef6	[NFC][RemoveDIs] Insert instruction using iterators in Transforms/ As part of the RemoveDIs project we need LLVM to insert instructions using iterators wherever possible, so that the iterators can carry a bit of debug-info. This commit implements some of that by updating the contents of llvm/lib/Transforms/Utils to always use iterator-versions of instruction constructors. There are two general flavours of update: * Almost all call-sites just call getIterator on an instruction * Several make use of an existing iterator (scenarios where the code is actually significant for debug-info) The underlying logic is that any call to getFirstInsertionPt or similar APIs that identify the start of a block need to have that iterator passed directly to the insertion function, without being converted to a bare Instruction pointer along the way. Noteworthy changes: * FindInsertedValue now takes an optional iterator rather than an instruction pointer, as we need to always insert with iterators, * I've added a few iterator-taking versions of some value-tracking and DomTree methods -- they just unwrap the iterator. These are purely convenience methods to avoid extra syntax in some passes. * A few calls to getNextNode become std::next instead (to keep in the theme of using iterators for positions), * SeparateConstOffsetFromGEP has it's insertion-position field changed. Noteworthy because it's not a purely localised spelling change. All this should be NFC.	2024-03-05 15:12:22 +00:00
Arthur Eubanks	36e73e4edc	[LICM][NFCish] Consider all calls in a presplit coroutine unsinkable/unhoistable (#81951 ) NFCish since previously we'd return false for all presplit coroutines anyway. This clarifies things a bit.	2024-02-16 10:08:00 -08:00
Craig Topper	2dd5204681	Recommit "[LICM] Support integer mul/add in hoistFPAssociation. (#67736 )" With a fix for build bot failure. I was accessing the type of a deleted Instruction. Original message: The reassociation this is trying to repair can happen for integer types too. This patch adds support for integer mul/add to hoistFPAssociation. The function has been renamed to hoistMulAddAssociation. I've used separate statistics and limits for integer to allow tuning flexibility.	2024-02-12 20:33:28 -08:00
Craig Topper	ecd63afafd	Revert "[LICM] Support integer mul/add in hoistFPAssociation. (#67736 )" This reverts commit 7ff5dfbaa0c971048da0f37ec6f05f5395562c21. Causing crashes on Mac build bots.	2024-02-12 16:41:29 -08:00
Craig Topper	7ff5dfbaa0	[LICM] Support integer mul/add in hoistFPAssociation. (#67736 ) The reassociation this is trying to repair can happen for integer types too. This patch adds support for integer mul/add to hoistFPAssociation. The function has been renamed to hoistMulAddAssociation. I've used separate statistics and limits for integer to allow tuning flexibility.	2024-02-12 14:59:49 -08:00
Bruno De Fraine	656bf13004	[AST] Don't merge memory locations in AliasSetTracker (#65731 ) This changes the AliasSetTracker to track memory locations instead of pointers in its alias sets. The motivation for this is outlined in an RFC posted on LLVM discourse: https://discourse.llvm.org/t/rfc-dont-merge-memory-locations-in-aliassettracker/73336 In the data structures of the AST implementation, I made the choice to replace the linked list of `PointerRec` entries (that had to go anyway) with a simple flat vector of `MemoryLocation` objects, but for the `AliasSet` objects referenced from a lookup table, I retained the mechanism of a linked list, reference counting, forwarding, etc. The data structures could be revised in a follow-up change.	2024-01-17 15:59:13 +01:00
Jeremy Morse	4b64138ba4	[DebugInfo][RemoveDIs] Switch some insertion routines to use iterators (#75330 ) As part of RemoveDIs, we need instruction insertion to be done with iterators rather than instruction pointers, so that we can communicate some debug-info facts about the position. This patch is an entirely mechanical replacement of Instruction * with BasicBlock::iterator, plus using insertBefore to insert some instructions because we don't have iterator-taking constructors yet. Sadly it's not NFC because it causes dbg.value intrinsics / their DPValue equivalents to shift location.	2023-12-13 14:04:35 +00:00
Jeremy Morse	5ba5211a47	[DebugInfo][RemoveDIs] Have LICM insert at iterator positions (#73671 ) Because we're storing some extra debug-info information in the iterator class, we need to insert new LICM-created stores using such iterators. Switch LICM to storing iterators instead of pointers when it promotes variables in loops, add a test for the desired behaviour, and enable RemoveDIs instrumentation on a variety of other LICM tests for good measure. (This would appear to be the only pass in LLVM that needs to store iterators on the heap).	2023-11-30 13:00:26 +00:00
Nikita Popov	4b3ea337ad	[ValueTracking] Convert isKnownNonNegative() to use SimplifyQuery (NFC)	2023-11-29 10:52:52 +01:00
Nikita Popov	6b8ed78719	[IR] Add writable attribute This adds a writable attribute, which in conjunction with dereferenceable(N) states that a spurious store of N bytes is introduced on function entry. This implies that this many bytes are writable without trapping or introducing data races. See https://llvm.org/docs/Atomics.html#optimization-outside-atomic for why the second point is important. This attribute can be added to sret arguments. I believe Rust will also be able to use it for by-value (moved) arguments. Rust likely won't be able to use it for &mut arguments (tree borrows does not appear to allow spurious stores). In this patch the new attribute is only used by LICM scalar promotion. However, the actual motivation for this is to fix a correctness issue in call slot optimization, which needs this attribute to avoid optimization regressions. Followup to the discussion on D157499. Differential Revision: https://reviews.llvm.org/D158081	2023-11-01 10:46:31 +01:00
Fangrui Song	2d854dd3e7	Move global namespace cl::opt inside llvm:: or internalize them	2023-10-10 19:58:03 -07:00
Nikita Popov	1b3cc4e715	[ValueTracking] Use SimplifyQuery for the overflow APIs (NFC) Accept a SimplifyQuery instead of an unpacked list of arguments.	2023-10-10 10:57:49 +02:00
Björn Pettersson	a0ce4384a6	[LICM] Simplify isLoadInvariantInLoop given opaque pointers (#65597 ) Since we no longer support typed pointers in LLVM IR, the PtrASXTy in isLoadInvariantInLoop was set to be equal to Addr->getType() (an opaque ptr in the same address space). That made the loop looking through bitcasts redundant.	2023-09-14 16:53:34 +02:00
Jeremy Morse	6942c64e81	[NFC][RemoveDIs] Prefer iterator-insertion over instructions Continuing the patch series to get rid of debug intrinsics [0], instruction insertion needs to be done with iterators rather than instruction pointers, so that we can communicate information in the iterator class. This patch adds an iterator-taking insertBefore method and converts various call sites to take iterators. These are all sites where such debug-info needs to be preserved so that a stage2 clang can be built identically; it's likely that many more will need to be changed in the future. At this stage, this is just changing the spelling of a few operations, which will eventually become signifiant once the debug-info bearing iterator is used. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152537	2023-09-11 11:48:45 +01:00
Fangrui Song	111fcb0df0	[llvm] Fix duplicate word typos. NFC Those fixes were taken from https://reviews.llvm.org/D137338	2023-09-01 18:25:16 -07:00
Craig Topper	dc02070d69	[LICM] Check hasNoSignedZeros in hoistFPAssociation. This matches the check done by the Reassociate pass that we're trying to reverse. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D158042	2023-08-23 14:05:34 -07:00
Nikita Popov	3670ec2897	[LICM][AA] Move isWritableObject() to AA (NFC) Move this helper from LICM to AA, so it can be reused.	2023-08-16 14:43:01 +02:00
Bjorn Pettersson	4ce7c4a92a	[llvm] Drop some typed pointer handling/bitcasts Differential Revision: https://reviews.llvm.org/D157016	2023-08-03 22:54:33 +02:00
Paul Osmialowski	8698d56d99	[Transforms][LICM] Add the ability to undo unprofitable reassociation Consider the following piece of code: ``` void innermost_loop(int i, double d1, double d2, double delta, int n, double cells[n]) { int j; const double d1d = d1 * delta; const double d2d = d2 * delta; for (j = 0; j <= i; j++) cells[j] = d1d * cells[j + 1] + d2d * cells[j]; } ``` When compiling at -Ofast level, after the "Reassociate expressions" pass, this code is transformed into an equivalent of: ``` int j; for (j = 0; j <= i; j++) cells[j] = (d1 * cells[j + 1] + d2 * cells[j]) * delta; ``` Effectively, the computation of those loop invariants isn't done before the loop anymore, we have one extra multiplication on each loop iteration instead. Sadly, this results in a significant performance hit. Similarly, specifically crafted user code will also experience inability to hoist those invariants. This patch is solving this issue by adding the ability to undo such reassociation into the LICM pass. Note that for doing such transformation this pass requires the same conditions as the "Reassociate expressions" pass, namely, the involved binary operators must have the reassociations allowed (e.g. by specifying the `fast` attribute) and they must have single use only. Some parts of this patch were suggested by Nikita Popov. Reviewed By: huntergr, nikic, paulwalker-arm Differential Revision: https://reviews.llvm.org/D152281	2023-08-01 16:42:01 +01:00
Carlos Alberto Enciso	c0a986a60f	[LICM] Sunk instructions with invalid source location. Building the given test case with 'clang -O2 -g' the call to 'getInOrder' is sunk out of the loop by LICM, but the source location is not dropped. Reviewed By: aprantl, fdeazeve Differential Revision: https://reviews.llvm.org/D152691	2023-06-16 06:25:27 +01:00
Kazu Hirata	c7cf942de3	[Scalar] Remove unused function createLICMPass The last use was removed by: commit d623b2f95fd559901f008a0588dddd0949a8db01 Author: Arthur Eubanks <aeubanks@google.com> Date: Fri Mar 10 17:24:19 2023 -0800	2023-06-10 21:52:50 -07:00
Chuanqi Xu	84c033d9ba	[LICM] [Coroutines] Don't hoist threadlocals within presplit coroutines Close https://github.com/llvm/llvm-project/issues/63022 This is the following of https://reviews.llvm.org/D135550, which is discussed in https://discourse.llvm.org/t/rfc-unify-memory-effect-attributes/65579. In my imagination, we could fix the issue fundamentally after we introduces new memory kind thread id. But I am not very sure if we can fix the issue fundamentally in time. Besides that, I think the correctness is the most important. So it should not be bad to land this given it is innocent. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D151774	2023-06-07 10:25:47 +08:00
Nikita Popov	143ed21b26	Revert "[LCSSA] Remove unused ScalarEvolution argument (NFC)" This reverts commit 5362a0d859d8e96b3f7c0437b7866e17a818a4f7. In preparation for reverting a dependent revision.	2023-06-05 16:45:38 +02:00
Max Kazantsev	dd0cf23e4a	[LICM] Reassociate & hoist sub expressions LICM could reassociate mixed variant/invariant comparison/arithmetic operations and hoist invariant parts out of loop if it can prove that they can be computed without overflow. Motivating example here: ``` INV1 - VAR1 < INV2 ``` can be turned into ``` VAR > INV1 - INV2 ``` if we can prove no-signed-overflow here. Then `INV1 - INV2` can be computed out of loop, so we save one arithmetic operation in-loop. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D148001	2023-05-29 12:52:43 +07:00
Max Kazantsev	0d95b20b63	[LICM] Reassociate & hoist add expressions This patch allows LICM to reassociate and hoist following expressions: ``` loop: %sum = add nsw %iv, %C1 %cmp = icmp <signed pred> %sum, C2 ``` where `C1` and `C2` are loop invariants. The reassociated version looks like ``` preheader: %inv_sum = C2 - C1 ... loop: %cmp = icmp <signed pred> %iv, %inv_sum ``` In order to prove legality, we need both initial addition and the newly created subtraction to happen without overflow. Differential Revision: https://reviews.llvm.org/D149132 Reviewed By: skatkov	2023-05-22 13:22:22 +07:00
Christian Ulmann	794b58b467	[IR] Drop const in DILocation::getMergedLocation This commit removes constness from DILocation::getMergedLocation and fixes all its users accordingly. Having constness on the parameters forced the return type to be const as well, which does force usage of `const_cast` when the location needs to be used in metadata nodes. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D149942	2023-05-15 07:21:43 +00:00
Nikita Popov	5362a0d859	[LCSSA] Remove unused ScalarEvolution argument (NFC) After D149435, LCSSA formation no longer needs access to ScalarEvolution, so remove the argument from the utilities.	2023-05-02 12:17:05 +02:00
Nikita Popov	0659000ff7	[LICM] Don't duplicate instructions just because they're free D37076 makes LICM duplicate instructions into exit blocks if the instruction is free. For GEPs, the motivation appears to be that this allows the GEP to be folded into addressing modes, while non-foldable users outside the loop might prevent this. TBH I don't think LICM is the place to do this (why doesn't CGP apply this heuristic itself?) but at least I understand the motivation. However, the transform is also applied to all other "free" instructions, which are just that (removed during lowering and not "folded" in some way). For such instructions, this transform seems somewhere between useless, counter-productive (undoing CSE/GVN) and actively incorrect. For example, this transform can duplicate freeze instructions, which is illegal. This patch limits the transform to just foldable GEPs, though we might want to drop it from LICM entirely as a followup. This is a small compile-time improvement, because querying TTI cost model for every single instruction is expensive. Differential Revision: https://reviews.llvm.org/D149136	2023-04-28 14:31:23 +02:00
Nikita Popov	43436993f4	[LICM] Don't try to constant fold instructions This was introduced in 030f02021b6359ec5641622cf1aa63d873ecf55a as an alleged compile-time optimization. In reality, trying to constant fold instructions is more expensive than just hoisting them. In a standard pipeline, LICM tends to run either after a run of LoopInstSimplify or InstCombine, so LICM doesn't really see constant foldable instructions in the first place, and the attempted fold is futile. This makes for a very minor compile-time improvement. Differential Revision: https://reviews.llvm.org/D149134	2023-04-26 09:26:47 +02:00
Nikita Popov	a1ddfb60da	[LICM] Only forget loop/block dispositions As we are moving the instruction without changing its value, it is sufficient to only invalidate the loop/block dispositions. This is the same we do in LoopSink.	2023-04-25 09:58:31 +02:00
Nikita Popov	ebd6b5dc64	[LICM] Minor optimization (NFC) Simplify the match in hoistMinMax and only fetch the preheader once.	2023-04-24 17:05:58 +02:00
Nikita Popov	53500e333d	Reapply [SimplifyCFG][LICM] Preserve nonnull, range and align metadata when speculating This exposed another miscompile in GVN, which was fixed by 20e9b31f88149a1d5ef78c0be50051e345098e41. ----- After D141386, violation of nonnull, range and align metadata results in poison rather than immediate undefined behavior, which means that these are now safe to retain when speculating. We only need to remove UB-implying metadata like noundef. This is done by adding a dropUBImplyingAttrsAndMetadata() helper, which lists the metadata which is known safe to retain on speculation. Differential Revision: https://reviews.llvm.org/D146629	2023-04-20 14:17:15 +02:00

1 2 3 4 5 ...

691 Commits