llvm-project

Author	SHA1	Message	Date
Stephen Tozer	aa8a1fa6f5	[DLCov][NFC] Annotate intentionally-blank DebugLocs in existing code (#136192 ) Following the work in PR #107279, this patch applies the annotative DebugLocs, which indicate that a particular instruction is intentionally missing a location for a given reason, to existing sites in the compiler where their conditions apply. This is NFC in ordinary LLVM builds (each function `DebugLoc::getFoo()` is inlined as `DebugLoc()`), but marks the instruction in coverage-tracking builds so that it will be ignored by Debugify, allowing only real errors to be reported. From a developer standpoint, it also communicates the intentionality and reason for a missing DebugLoc. Some notes for reviewers: - The difference between `I->dropLocation()` and `I->setDebugLoc(DebugLoc::getDropped())` is that the former _may_ decide to keep some debug info alive, while the latter will always be empty; in this patch, I always used the latter (even if the former could technically be correct), because the former could result in some (barely) different output, and I'd prefer to keep this patch purely NFC. - I've generally documented the uses of `DebugLoc::getUnknown()`, with the exception of the vectorizers - in summary, they are a huge cause of dropped source locations, and I don't have the time or the domain knowledge currently to solve that, so I've plastered it all over them as a form of "fixme".	2025-06-11 17:42:10 +01:00
haonan	5b8664fcb2	[InstCombine][foldPHIArgGEPIntoPHI] Early return for const vector index for gep inst (#138661 )	2025-05-08 10:00:49 +02:00
Nikita Popov	bfa711a970	[InstCombine] Use combineMetadataForCSE in phi of loads fold Use combineMetadataForCSE instead of manually enumerating known metadata kinds. This is a typical sinking transform for which combineMetadataForCSE is safe to use (with DoesKMove=true). Part of https://github.com/llvm/llvm-project/issues/121495.	2025-01-08 11:29:04 +01:00
Teresa Johnson	3a423a10ff	[MemProf][PGO] Prevent dropping of profile metadata during optimization (#121359 ) This patch fixes a couple of places where memprof-related metadata (!memprof and !callsite) were being dropped, and one place where PGO metadata (!prof) was being dropped. All were due to instances of combineMetadata() being invoked. That function drops all metadata not in the list provided by the client, and also drops any not in its switch statement. Memprof metadata needed a case in the combineMetadata switch statement. For now we simply keep the metadata of the instruction being kept, which doesn't retain all the profile information when two calls with memprof metadata are being combined, but at least retains some. For the memprof metadata being dropped during call CSE, add memprof and callsite metadata to the list of known ids in combineMetadataForCSE. Neither memprof nor regular prof metadata were in the list of known ids for the callsite in MemCpyOptimizer, which was added to combine AA metadata after optimization of byval arguments fed by memcpy instructions, and similar types of optimizations of memcpy uses. There is one other callsite of combineMetadata, but it is only invoked on load instructions, which do not carry these types of metadata.	2025-01-02 12:11:59 -08:00
Antonio Frighetto	929cbe7f59	[InstCombine] Intersect nowrap flags between geps while folding into phi A miscompilation issue has been addressed with refined checking. Fixes: https://github.com/llvm/llvm-project/issues/115149.	2024-11-13 19:42:06 +01:00
Chengjun	94a98cf5dc	[InstCombine] Remove dead phi web (#108876 ) In current visitPHINode function during InstCombine, it can remove dead phi cycles (all phis have one use, which is another phi). However, it cannot deal with the case when the phis form a web (all phis have one or more uses, and all the uses are phi). This change extends the algorithm so that it can also deal with the dead phi web.	2024-09-18 10:04:49 +02:00
Alexis Engelke	5b40a05d8f	[InstCombine] Don't look at ConstantData users When looking at PHI operand for combining, only look at instructions and arguments. The loop later iteraters over Arg's users, which is not useful if Arg is a constant -- it's users are not meaningful and might be in different functions, which causes problems for the dominates() query. Pull Request: https://github.com/llvm/llvm-project/pull/103302	2024-08-13 20:44:45 +02:00
Stephen Tozer	d75f9dd1d2	Revert "[IR][NFC] Update IRBuilder to use InsertPosition (#96497 )" Reverts the above commit, as it updates a common header function and did not update all callsites: https://lab.llvm.org/buildbot/#/builders/29/builds/382 This reverts commit 6481dc57612671ebe77fe9c34214fba94e1b3b27.	2024-06-24 18:00:22 +01:00
Stephen Tozer	6481dc5761	[IR][NFC] Update IRBuilder to use InsertPosition (#96497 ) Uses the new InsertPosition class (added in #94226) to simplify some of the IRBuilder interface, and removes the need to pass a BasicBlock alongside a BasicBlock::iterator, using the fact that we can now get the parent basic block from the iterator even if it points to the sentinel. This patch removes the BasicBlock argument from each constructor or call to setInsertPoint. This has no functional effect, but later on as we look to remove the `Instruction *InsertBefore` argument from instruction-creation (discussed [here](https://discourse.llvm.org/t/psa-instruction-constructors-changing-to-iterator-only-insertion/77845)), this will simplify the process by allowing us to deprecate the InsertPosition constructor directly and catch all the cases where we use instructions rather than iterators.	2024-06-24 17:27:43 +01:00
Nikita Popov	6efba06123	[InstCombine] Preserve all gep flags in dependent IV fold	2024-06-19 12:21:52 +02:00
Nikita Popov	6467b49480	[InstCombine] Preserve all flags in phi of gep fold Preserve the intersection of all flags. Add GEPNoWrapFlags::all() to serve as the initialization value for the intersection.	2024-06-19 12:13:54 +02:00
Jay Foad	1650f1b3d7	Fix typo "indicies" (#92232 )	2024-05-15 13:10:16 +01:00
Harald van Dijk	60de56c743	[ValueTracking] Restore isKnownNonZero parameter order. (#88873 ) Prior to #85863, the required parameters of llvm::isKnownNonZero were Value and DataLayout. After, they are Value, Depth, and SimplifyQuery, where SimplifyQuery is implicitly constructible from DataLayout. The change to move Depth before SimplifyQuery needed callers to be updated unnecessarily, and as commented in #85863, we actually want Depth to be after SimplifyQuery anyway so that it can be defaulted and the caller does not need to specify it.	2024-04-16 15:21:09 +01:00
Yingwei Zheng	e0a628715a	[ValueTracking] Convert `isKnownNonZero` to use SimplifyQuery (#85863 ) This patch converts `isKnownNonZero` to use SimplifyQuery. Then we can use the context information from `DomCondCache`. Fixes https://github.com/llvm/llvm-project/issues/85823. Alive2: https://alive2.llvm.org/ce/z/QUvHVj	2024-04-12 23:47:20 +08:00
Jeremy Morse	2fe81edef6	[NFC][RemoveDIs] Insert instruction using iterators in Transforms/ As part of the RemoveDIs project we need LLVM to insert instructions using iterators wherever possible, so that the iterators can carry a bit of debug-info. This commit implements some of that by updating the contents of llvm/lib/Transforms/Utils to always use iterator-versions of instruction constructors. There are two general flavours of update: * Almost all call-sites just call getIterator on an instruction * Several make use of an existing iterator (scenarios where the code is actually significant for debug-info) The underlying logic is that any call to getFirstInsertionPt or similar APIs that identify the start of a block need to have that iterator passed directly to the insertion function, without being converted to a bare Instruction pointer along the way. Noteworthy changes: * FindInsertedValue now takes an optional iterator rather than an instruction pointer, as we need to always insert with iterators, * I've added a few iterator-taking versions of some value-tracking and DomTree methods -- they just unwrap the iterator. These are purely convenience methods to avoid extra syntax in some passes. * A few calls to getNextNode become std::next instead (to keep in the theme of using iterators for positions), * SeparateConstOffsetFromGEP has it's insertion-position field changed. Noteworthy because it's not a purely localised spelling change. All this should be NFC.	2024-03-05 15:12:22 +00:00
Nikita Popov	7ce1a11f7f	[InstCombine] Fold dependent IVs (#81151 ) Fold `iv = phi(start, iv.next = iv2.next + start)` where `iv2 = phi(iv2.start, iv2.next = iv2 + iv2.step)` to `iv = iv2 + start` removing one induction variable from the loop. Proof: https://alive2.llvm.org/ce/z/hfmwgf Fixes https://github.com/llvm/llvm-project/issues/77108.	2024-02-21 09:14:48 +01:00
Nikita Popov	d8bc546533	[InstCombine] Fix phi or icmp fold with disjoint flag We're changing the operand of the or here, such that the disjoint flag may no longer hold. Clear it.	2023-11-30 16:23:00 +01:00
LiqinWeng	5d3d08463d	[InstCombinePHI] Remove dead PHI on UnaryOperator (#71386 ) This patch mainly solves the problem of dead PHI on UnaryOperator	2023-11-07 09:45:33 +08:00
bipmis	4a074f32a6	[InstCombine] Extend Phi-Icmp use to include or (#67682 ) In InstCombinePHI currently the only use of PHI as an Icmp is being checked as a requirement to reduce a value if isKnownNonZero. However this can be extended to include or(icmp) . This is always true as OR only adds bits and we are checking against 0.	2023-10-24 13:08:25 +01:00
David Green	186c9079d4	[InstCombine] Expand redundant phi cycle elimination (#67968 ) There is a combine in instcombine that will look for phi cycles that only have a single incoming value: ``` %0 = phi i64 [ %3, %exit ], [ %othervalue, %preheader ] %3 = phi i64 [ %0, %body ], [ %othervalue, %body2 ] ``` This currently doesn't handle if %othervalue is a phi though, as the algorithm will recurse into the phi and fail with multiple incoming values. This adjusts the algorithm, not requiring the initial value to be found immediately, allowing it to be set to the value of one of the phis that would otherwise fail due to having multiple input values.	2023-10-04 09:34:08 +01:00
Nikita Popov	6cd5eb1f54	[InstCombine] Avoid some uses of ConstantExpr::getZExt() (NFC) Add helpers getLosslessUnsignedTrunc/getLosslessSignedTrunc for this common pattern.	2023-09-28 17:02:33 +02:00
Nikita Popov	2fda200cd8	[InstCombine] Canonicalize phi order for newly inserted nodes When new phi nodes are inserted at the start of the block, the order of these does not get canonicalized, as we pick the first phi node to canonicalize towards (and the other phi nodes may have already been visited). This results in phi nodes not being deduplicated and thus a fix-point verification failure. Fix this by remembering the predecessors of the first phi node we encounter -- this will usually result in the same order we picked previously. I also considered using predecessors() order instead, but that causes substantial test fallout. Additionally the predecessors() order tends to be the reverse of the "natural" order. Fixes https://github.com/llvm/llvm-project/issues/46688.	2023-09-27 09:56:23 +02:00
Jeremy Morse	e54277fa10	[NFC][RemoveDIs] Use iterators over inst-pointers when using IRBuilder This patch adds a two-argument SetInsertPoint method to IRBuilder that takes a block/iterator instead of an instruction, and updates many call sites to use it. The motivating reason for doing this is given here [0], we'd like to pass around more information about the position of debug-info in the iterator object. That necessitates passing iterators around most of the time. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152468	2023-09-11 20:01:19 +01:00
Jeremy Morse	d529943a27	[NFC][RemoveDIs] Prefer iterators over inst-pointers in InstCombine As per my proposal for how to eliminate debug intrinsics [0], for various places in InstCombine prefer to insert using an instruction iterator rather than an instruction pointer. This is so that we can eventually pass more information in the iterator class. These call-sites where I've changed the spelling are those that necessary to build a stage2clang to produce an identical binary in the coming no-debug-intrinsics mode. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939 Differential Revision: https://reviews.llvm.org/D152543	2023-09-11 15:04:51 +01:00
Nikita Popov	cd888e6ffe	[InstCombine] Fix worklist management in foldPHIArgIntToPtrToPHI() Make sure the old operand is added back to the worklist for DCE.	2023-06-01 10:20:12 +02:00
Sanjay Patel	870e6b6e65	[InstCombine] use dyn_cast instead of isa+cast; NFC	2023-03-06 13:49:47 -05:00
luxufan	232698dc17	[Local][InstCombine][GVN] Handle !noundef metadata in combineMetadata Handle !noundef metadata in comhineMetadata. The behavior of violating !noundef metadata is undefined behavior. So if K dominates J, we can preserve it uncontionally, otherwise, we preserve it if both K and J have !noundef metadata been set. This patch also makes !noundef metadata added in KnownIDs when doing instruction combine for phi node or global value numbering for loads. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D142801	2023-02-02 14:25:13 +08:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Kazu Hirata	881076cde2	[InstCombine] Use std::optional in InstCombinePHI.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:46:38 -08:00
Nikita Popov	11897708c0	[InstCombine] Directly replace instr in foldIntegerTypedPHI() (NFCI) Rather than inserting a ptrtoint + inttoptr pair, directly replace the inttoptr with the new phi node. This ensures that no other transform can undo it before the pair gets folded away. This avoids the infinite loop when combined with D134954. This is NFCI in the sense that it shouldn't make a difference, but could due to different worklist order.	2022-10-05 13:28:23 +02:00
Heejin Ahn	b2f4112f25	[InstCombine] Improve check for catchswitch BBs (NFC) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D127810	2022-06-15 01:06:13 -07:00
Heejin Ahn	ac4006b0d6	[InstCombine] Don't slice up PHIs when pred BB has catchswitch If an integer PHI has an illegal type (according to the data layout) and it is only used by `trunc` or `trunc(lshr)` operations, we split the PHI into various instructions in its predecessors: `6d1543a167/llvm/lib/Transforms/InstCombine/InstCombinePHI.cpp (L1536-L1543)` So this can produce code like the following: Before: ``` pred: ... bb: %p = phi i8 [ %somevalue, %pred ], ... ... %tobool = trunc i8 %p to i1 use %tobool ... ``` In this code, `%p` has an illegal integer type, `i8`, and its only used in a `trunc` instruction later. In this case this pass puts extraction code in its predecessors: After: ``` pred: ... %t = and i8 %somevalue, 1 %extract = icmp ne i8 %t, 0 bb: %p.new = phi i1 [ %extract, %pred ], ... use %p.new instead of %tobool ``` But this doesn't work if `pred` is a `catchswitch` BB because it cannot have any non-PHI instructions. This CL ensures we bail out in that case. Fixes https://github.com/llvm/llvm-project/issues/55803. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D127699	2022-06-13 18:32:09 -07:00
Simon Moll	b8c2781ff6	[NFC] format InstructionSimplify & lowerCaseFunctionNames Clang-format InstructionSimplify and convert all "FunctionName"s to "functionName". This patch does touch a lot of files but gets done with the cleanup of InstructionSimplify in one commit. This is the alternative to the less invasive clang-format only patch: D126783 Reviewed By: spatel, rengolin Differential Revision: https://reviews.llvm.org/D126889	2022-06-09 16:10:08 +02:00
Danila Malyutin	4fb3fd7d82	[InstCombine] Fix const folding of switches with default case In case phi was in the default block it could lead to multi-edge. Fixes #55721. Differential Revision: https://reviews.llvm.org/D126650	2022-05-31 15:13:58 +03:00
Nikita Popov	4010a7a5d0	Reapply [InstCombine] Support switch in phi to cond fold Reapply with an explicit check for multi-edges, as the expected behavior of multi-edge dominance is unclear (D120811). ----- For conditional branches, we know the value is i1 0 or i1 1 along the outgoing edges. For switches we can apply exactly the same optimization, just with the known values determined by the switch cases.	2022-03-17 10:03:09 +01:00
Arnold Schwaighofer	dcdc1f29bb	InstCombine: Can't fold a phi arg load into the phi if the load is from a swifterror address `swifterror` addresses are only allowed as operands to load, store, and calls. The following transformation is not allowed. It would create a phi with a `swifterror` address operand. ``` %addr = alloca swifterror i8* br %cond, label %bb1, label %b22 bb1: %val1 = load i8, i8* %addr br exit bb2: %val2 = load i8, i8* %addr br exit exit: %val = phi [%val1, %bb1] [%val2, %bb2] ``` => ``` %addr = alloca swifterror i8* br %cond, label %bb1, label %b22 bb1: br exit bb2: br exit exit: %val_addr = phi [%addr, %bb1] [%addr, %bb2] %val2 = load i8, i8* %val_addr ``` rdar://89865485 Differential Revision: https://reviews.llvm.org/D121217	2022-03-08 09:09:51 -08:00
Nikita Popov	5cf06d10f8	Revert "[InstCombine] Support switch in phi to cond fold" This reverts commit 0817ce86b540f909eade6a8d7370e1b47e863a70. Seeing some ppc64le stage2 failures, reverting to investigate.	2022-03-02 12:49:47 +01:00
Nikita Popov	0817ce86b5	[InstCombine] Support switch in phi to cond fold For conditional branches, we know the value is i1 0 or i1 1 along the outgoing edges. For switches we can apply exactly the same optimization, just with the known values determined by the switch cases.	2022-03-02 12:16:32 +01:00
Nikita Popov	a1f442b278	[InstCombine] Support phi to cond fold with more than two preds This transform can still be applied if there are more than two phi inputs, as long as phi inputs with the same value are dominated by the same idom edge.	2022-03-01 16:31:49 +01:00
Nikita Popov	5450963085	[InstCombine] Check source element type in phi of gep fold Rather than checking that the type is the same (which is always the case, given how these are part of the same phi) check that the source element type is the same. With opaque pointers, this is no longer implied.	2022-02-11 14:26:18 +01:00
Ricky Zhou	30ac5f9e64	[InstCombine] Do not combine atomic and non-atomic loads Before this change, InstCombine was willing to fold atomic and non-atomic loads through a PHI node as long as the first PHI argument is not an atomic load. The combined load would be non-atomic, which is incorrect. Fix this by only combining the loads in a PHI node when all of the arguments are non-atomic loads. Thanks to Eli Friedman for pointing out the bug at https://github.com/llvm/llvm-project/issues/50777#issuecomment-981045342! Fixes #50777 Differential Revision: https://reviews.llvm.org/D115113	2022-01-30 10:05:11 -05:00
Ricky Zhou	de80b53d1a	[InstCombine] Use range for loops (NFC) Preliminary clean-up for D115113 Differential Revision: https://reviews.llvm.org/D116086	2022-01-30 09:10:39 -05:00
Ricky Zhou	4aabed05a8	[InstCombine] Uppercase some variable names (NFC) Uppercase some variable names, per LLVM coding standards. This change intentionally does not rename every miscased variable, as a follow-up change ( D116086 ) intends to eliminate many of those by switching loops to range for loops. Differential Revision: https://reviews.llvm.org/D118553	2022-01-30 09:10:39 -05:00
Nikita Popov	9be67289b3	[InstCombine] Drop outdated alignment comment (NFC) Loads always have an alignment now, so this is no longer relevant.	2021-12-21 08:58:48 +01:00
Krishna	d99260641b	[InstCombine] Fold phi ( inttoptr/ptrtoint x ) to phi (x) The inttoptr/ptrtoint roundtrip optimization is not always correct. We are working towards removing this optimization and adding support to specific cases where this optimization works. In this patch, we focus on phi-node operands with inttoptr casts. We know that ptrtoint( inttoptr( ptrtoint x) ) is same as ptrtoint (x). So, we want to remove this roundtrip cast which goes through phi-node. Reviewed By: aqjune Differential Revision: https://reviews.llvm.org/D106289	2021-08-03 17:52:59 +05:30
Heejin Ahn	51fecd17bb	[InstCombine] Don't combine PHI before catchswitch This tries to bail out if the PHI is in a `catchswitch` BB in InstCombine. A PHI cannot be combined into a non-PHI instruction if it is in a `catchswitch` BB, because `catchswitch` BB cannot have any non-PHI instruction other than `catchswitch` itself. The given test case started crashing after D98058. Reviewed By: lebedev.ri, rnk Differential Revision: https://reviews.llvm.org/D105309	2021-07-02 12:10:24 -07:00
Juneyoung Lee	ce192ced2b	[InstCombine] Use poison constant to represent the result of unreachable instrs This patch updates InstCombine to use poison constant to represent the resulting value of (either semantically or syntactically) unreachable instrs, or a don't-care value of an unreachable store instruction. This allows more aggressive folding of unused results, as shown in llvm/test/Transforms/InstCombine/getelementptr.ll . Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104602	2021-06-21 09:58:44 +09:00
Florian Hahn	92da5b7119	[InstCombine] Simplify phis with incoming pointer-casts. If the incoming values of a phi are pointer casts of the same original value, replace the phi with a single cast. Such redundant phis are somewhat common after loop-rotate and removing them can avoid some unnecessary code bloat, e.g. because an iteration of a loop is peeled off to make the phi invariant. It should also simplify further analysis on its own. InstCombine already uses stripPointerCasts in a couple of places and also simplifies phis based on the incoming values, so the patch should fit in the existing scope. The patch causes binary changes in 47 out of 237 benchmarks in MultiSource/SPEC2000/SPEC2006 with -O3 -flto on X86. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D98058	2021-03-09 11:40:18 +00:00
Hongtao Yu	1cb47a063e	[CSSPGO] Unblock optimizations with pseudo probe instrumentation. The IR/MIR pseudo probe intrinsics don't get materialized into real machine instructions and therefore they don't incur runtime cost directly. However, they come with indirect cost by blocking certain optimizations. Some of the blocking are intentional (such as blocking code merge) for better counts quality while the others are accidental. This change unblocks perf-critical optimizations that do not affect counts quality. They include: 1. IR InstCombine, sinking load operation to shorten lifetimes. 2. MIR LiveRangeShrink, similar to #1 3. MIR TwoAddressInstructionPass, i.e, opeq transform 4. MIR function argument copy elision 5. IR stack protection. (though not perf-critical but nice to have). Reviewed By: wmi Differential Revision: https://reviews.llvm.org/D95982	2021-02-10 12:43:17 -08:00
Simon Pilgrim	91589cf679	Add missing namespace closure comments. NFCI. Fixes some clang-tidy llvm-namespace-comment warnings.	2020-09-23 16:19:25 +01:00

1 2 3 4

179 Commits