At this stage I'm just opportunistically deleting any code using
debug-intrinsic types, largely adjacent to calls to findDbgUsers. I'll
get to deleting that in probably one or two more commits.
Try to optimize a call to the result of a ptrauth intrinsic, potentially
into the ptrauth call bundle:
call(ptrauth.resign(p)), ["ptrauth"()] -> call p, ["ptrauth"()]
call(ptrauth.sign(p)), ["ptrauth"()] -> call p
as long as the key/discriminator are the same in sign and auth-bundle,
and we don't change the key in the bundle (to a potentially-invalid
key).
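Purely as an illustration of that condition (not the actual InstCombine change, and glossing over the ptr/i64 casts around the intrinsic), the key/discriminator check for the sign case could look roughly like this:
```cpp
#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/LLVMContext.h"
#include <cassert>
using namespace llvm;

// call(ptrauth.sign(p)), ["ptrauth"(key, disc)] -> call p
// is only sound when the sign used the same key and discriminator that the
// call's "ptrauth" bundle would authenticate with.
static bool signMatchesCallBundle(const IntrinsicInst &Sign,
                                  const CallBase &Call) {
  assert(Sign.getIntrinsicID() == Intrinsic::ptrauth_sign);
  auto Bundle = Call.getOperandBundle(LLVMContext::OB_ptrauth);
  if (!Bundle || Bundle->Inputs.size() != 2)
    return false;
  Value *SignKey = Sign.getArgOperand(1);  // llvm.ptrauth.sign(value, key, disc)
  Value *SignDisc = Sign.getArgOperand(2);
  return SignKey == Bundle->Inputs[0].get() &&
         SignDisc == Bundle->Inputs[1].get();
}
```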
Generating a plain call to a raw unauthenticated pointer is generally
undesirable, but if we ended up seeing a naked ptrauth.sign in the first
place, we already have suspicious code. Unauthenticated calls are also
easier to spot than naked signs, so let the indirect call shine.
Note that there is an arguably unsafe extension to this, where we don't
bother checking that the key in bundle and intrinsic are the same (and
also allow folding away an auth into a bundle).
This can end up generating calls with a bundle that has an invalid key
(which an informed frontend wouldn't have otherwise done), which can be
problematic. The C code that generates this is straightforward but arguably
unreasonable. That wouldn't be an issue if we were to bite the bullet
and make these fully AArch64-specific, allowing key knowledge to be
embedded here.
Try to optimize a call to a ptrauth constant, into its ptrauth bundle:
call(ptrauth(f)), ["ptrauth"()] -> call f
as long as the key/discriminator are the same in constant and bundle.
This is the intrinsic version of #146349, and handles fabs as well as
other intrinsics.
It's largely a copy of InstCombinerImpl::foldShuffledIntrinsicOperands
but a bit simpler since we don't need to find a common mask.
Creating a separate function seems to be cleaner than trying to shoehorn
it into the existing one.
When folding `X Pred C2 ? X BOp C1 : C2 BOp C1` to `min/max(X, C2) BOp
C1`, if NUW/NSW flags are present on `X BOp C1` and could be safely
applied to `C2 BOp C1`, then they may be added on the BOp after the fold
is complete. https://alive2.llvm.org/ce/z/n_3aNJ
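As a self-contained illustration of that condition (made-up constants, not the ones from the proof link): for `X <u 8 ? (X +nuw 2) : 10` the fold yields `umin(X, 8) + 2`, and because the constant arm `8 + 2` does not wrap either, the `nuw` flag can stay on the new add.
```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Source pattern: X <u 8 ? (X + 2) : 10   (10 == 8 + 2, i.e. C2 BOp C1)
static uint8_t src(uint8_t X) { return X < 8 ? uint8_t(X + 2) : uint8_t(10); }
// Folded form: umin(X, 8) + 2; since 8 + 2 fits in i8, nuw may be preserved.
static uint8_t tgt(uint8_t X) { return uint8_t(std::min<uint8_t>(X, 8) + 2); }

int main() {
  for (int X = 0; X < 256; ++X)
    assert(src(uint8_t(X)) == tgt(uint8_t(X))); // exhaustive check over i8
}
```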
Preserving these flags can allow subsequent transforms to reorder the
min/max and the BOp, which in the case of NVPTX would enable potential
future transformations that improve instruction selection.
Having a finite Depth (or recursion limit) for computeKnownBits is very
limiting, but is currently a load-bearing necessity, as all KnownBits
are recomputed on each call and there is no caching. As a prerequisite
for an effort to remove the recursion limit altogether, either by using a
clever caching technique or by writing an easily invalidated KnownBits
analysis, make the Depth argument in APIs in ValueTracking uniformly the
last argument with a default value. This would aid in removing the
argument when the time comes, as many callers that currently pass 0
explicitly are now updated to omit the argument altogether.
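For illustration only (stub types and simplified signatures, not the real ValueTracking declarations), the shape of the change is:
```cpp
struct Value;
struct SimplifyQuery;
struct KnownBits {};

// Before: Depth sits in the middle, so every caller has to spell it out,
// usually as an explicit 0.
KnownBits computeKnownBitsOld(const Value *V, unsigned Depth,
                              const SimplifyQuery &Q);

// After: Depth is uniformly the last parameter and defaults to 0, so the
// common callers can simply omit it, which will ease deleting it later.
KnownBits computeKnownBitsNew(const Value *V, const SimplifyQuery &Q,
                              unsigned Depth = 0);
```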
We currently pull shuffles through binops and intrinsics, which is an
important canonical form for VectorCombine to be able to scalarize
vector sequences. But while binops can be folded with a constant
operand, intrinsics currently require all operands to be shufflevectors.
This extends intrinsic folding to be in line with regular binops by
reusing the constant "unshuffling" logic.
As far as I can tell, the currently folded intrinsics don't require any
special UB handling.
This change in combination with #138095 and #137823 fixes the following
C:
```c
void max(int *x, int *y, int n) {
  for (int i = 0; i < n; i++)
    x[i] += *y > 42 ? *y : 42;
}
```
Previously, it used the splatted vector form on RISC-V with `-O3 -march=rva23u64`:
```asm
vmv.s.x v8, a4
li a4, 42
vmax.vx v10, v8, a4
vrgather.vi v8, v10, 0
.LBB0_9: # %vector.body
# =>This Inner Loop Header: Depth=1
vl2re32.v v10, (a5)
vadd.vv v10, v10, v8
vs2r.v v10, (a5)
```
With this change it instead generates:
```asm
li a6, 42
max a6, a4, a6
.LBB0_9: # %vector.body
# =>This Inner Loop Header: Depth=1
vl2re32.v v8, (a5)
vadd.vx v8, v8, a6
vs2r.v v8, (a5)
```
Match icmps of binops where both operands are selects with constant arms,
i.e., `icmp pred (select A ? C1 : C2) binop (select B ? C3 : C4), C5`.
Fold such patterns by creating a truth table of the four possible
constant variants and materializing the optimal logic back from it via
the `createLogicFromTable` helper. This also generalizes an existing fold,
which has therefore been dropped.
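As a standalone illustration of the truth-table idea (hypothetical constants, not taken from the patch): for `icmp eq ((A ? 0 : 1) + (B ? 0 : 2)), 0`, enumerating the four (A, B) combinations shows the whole expression collapses to `and A, B`.
```cpp
#include <cstdio>

int main() {
  for (int A = 1; A >= 0; --A)
    for (int B = 1; B >= 0; --B) {
      int Sum = (A ? 0 : 1) + (B ? 0 : 2); // the two selects plus the binop
      bool Result = (Sum == 0);            // the icmp against C5 == 0
      std::printf("A=%d B=%d -> %d\n", A, B, Result);
    }
  // Only A=1, B=1 produces 1, so the optimal logic for this table is A & B.
}
```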
Proofs: https://alive2.llvm.org/ce/z/NS7Vzu.
Fixes: https://github.com/llvm/llvm-project/issues/138212.
Support the ptrtoint(gep null, x) -> x and ptrtoint(gep inttoptr(x), y)
-> x+y folds for the case where there is a chain of geps that ends in
null or inttoptr. This avoids some regressions from #137297.
While here, also be a bit more careful about edge cases like
pointer-to-vector splats and mismatched pointer and index sizes.
Also add a `tryGetLog2` helper that encapsulates the common pattern:
```
if (takeLog2(..., /*DoFold=*/false)) {
  Value *Log2 = takeLog2(..., /*DoFold=*/true);
  ...
}
```
Closes #122498
We would like to optimize situations of the following form, which arise
after loop vectorization + SROA:
```
loop:
  %phi = phi zeroinitializer, %interleaved
  %deinterleave_a = shufflevector %phi, poison ; pick half of the lanes
  %deinterleave_b = shufflevector %phi, poison ; pick remaining lanes
  ... %a = ... %b = ...
  %interleaved = shufflevector %a, %b ; interleave lanes of a+b
```
where the interleave and de-interleave shuffle operations cancel each
other out.
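A standalone sketch of why they cancel (plain C++ arrays standing in for the vector lanes; illustration only):
```cpp
#include <array>
#include <cassert>

int main() {
  std::array<int, 4> Phi = {10, 20, 30, 40};          // lanes a0, b0, a1, b1
  std::array<int, 2> A = {Phi[0], Phi[2]};            // deinterleave_a: even lanes
  std::array<int, 2> B = {Phi[1], Phi[3]};            // deinterleave_b: odd lanes
  std::array<int, 4> Interleaved = {A[0], B[0], A[1], B[1]}; // re-interleave
  assert(Interleaved == Phi); // the shuffle pair is a no-op on the phi value
}
```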
This could be handled by `foldOpPhi` but does not currently work because it
does not proceed when there are multiple uses of the `Phi` operation.
This extends `foldOpPhi` to allow multiple `shufflevector` uses when they
are shown to simplify for all `Phi` input values.
Introduce llvm::CmpPredicate, an abstraction over either a floating-point
predicate or an integer predicate packed with samesign information,
in order to ease extending large portions of the codebase that take a
CmpInst::Predicate to respect the samesign flag.
We have chosen to demonstrate the utility of this new abstraction by
migrating parts of ValueTracking, InstructionSimplify, and InstCombine
from CmpInst::Predicate to llvm::CmpPredicate. There should be no
functional changes, as we don't perform any extra optimizations with
samesign in this patch, or use CmpPredicate::getMatching.
The design approach taken by this patch allows unaudited callers of
APIs that take an llvm::CmpPredicate to silently drop the samesign
information; this does not pose a correctness issue, and allows us to
migrate the codebase piecewise.
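A minimal usage sketch (header path and member set as I understand them; simplified, not taken from this patch):
```cpp
#include "llvm/IR/CmpPredicate.h"
#include "llvm/IR/InstrTypes.h"
using namespace llvm;

static bool behavesAsUnsignedLess(CmpPredicate Pred) {
  // CmpPredicate converts implicitly to CmpInst::Predicate, so existing
  // switches and comparisons keep working unchanged.
  CmpInst::Predicate P = Pred;
  if (P == CmpInst::ICMP_ULT)
    return true;
  // With samesign, both operands share a sign bit, so a signed less-than
  // orders them exactly like an unsigned less-than.
  return P == CmpInst::ICMP_SLT && Pred.hasSameSign();
}
```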
Since cd16b07 (IR: introduce CmpInst::isEquivalence), there is now an
isEquivalence routine in CmpInst that we can use to determine
equivalence in foldSelectValueEquivalence. Implement this, extending it
to include floating-point equivalences as well.
Add a helper for shared folds between logical and bitwise and/or
and move the and/or of icmp and fcmp folds in there. This makes
it easier to extend to more folds.
A possible extension would be to base the current and/or of icmp
reassociation logic on this helper, so that it, for example, also
applies to fcmp.
This corresponds to the canonicalized form of some logic that was
seen in Swift-generated code for comparing optional pointers:
`(X==Z || Y==Z) ? (X==Z && Y==Z) : X==Y --> X==Y`
where `Z` was the constant `0`.
https://alive2.llvm.org/ce/z/J_3aa9
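A brute-force sanity check of that fold over a small integer domain (illustration only; the alive2 link above is the actual proof):
```cpp
#include <cassert>

int main() {
  for (int X = 0; X < 4; ++X)
    for (int Y = 0; Y < 4; ++Y)
      for (int Z = 0; Z < 4; ++Z) {
        bool Src = (X == Z || Y == Z) ? (X == Z && Y == Z) : (X == Y);
        assert(Src == (X == Y)); // the whole expression reduces to X == Y
      }
}
```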
The current visitPHINode function in InstCombine can remove dead phi
cycles (all phis have one use, which is another phi). However, it cannot
handle the case where the phis form a web (all phis have one or more
uses, and all of those uses are phis). This change extends the algorithm
so that it can also remove such dead phi webs.
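A standalone sketch of the extended check (illustration only, not the InstCombine implementation): starting from one phi, follow its users; the web is dead only if every user reached is itself a phi.
```cpp
#include <cstdio>
#include <set>
#include <vector>

struct Phi {
  std::vector<Phi *> PhiUsers; // users that are themselves phis
  bool HasNonPhiUse = false;   // any other use keeps the web alive
};

static bool isDeadPhiWeb(Phi *Root) {
  std::set<Phi *> Visited;
  std::vector<Phi *> Worklist{Root};
  while (!Worklist.empty()) {
    Phi *P = Worklist.back();
    Worklist.pop_back();
    if (!Visited.insert(P).second)
      continue; // already seen; webs may contain cycles
    if (P->HasNonPhiUse)
      return false; // a value escapes the web, so it is live
    for (Phi *U : P->PhiUsers)
      Worklist.push_back(U);
  }
  return true; // every phi only feeds other phis: the whole web is dead
}

int main() {
  Phi A, B, C;                // three phis that only feed one another
  A.PhiUsers = {&B, &C};
  B.PhiUsers = {&A};
  C.PhiUsers = {&A};
  std::printf("%d\n", isDeadPhiWeb(&A)); // 1: the web can be removed
}
```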
Added folds:
- `(add (sub X, Y), (sub Z, X))` -> `(sub Z, Y)`
- `(sub (add X, Y), (add X, Z))` -> `(sub Y, Z)`
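A hedged sketch of how the first of these folds could be matched with LLVM's PatternMatch machinery (not the actual patch; flag propagation is omitted):
```cpp
#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/PatternMatch.h"
using namespace llvm;
using namespace llvm::PatternMatch;

// (add (sub X, Y), (sub Z, X)) -> (sub Z, Y)
static Value *foldAddOfRelatedSubs(BinaryOperator &Add,
                                   IRBuilderBase &Builder) {
  Value *X, *Y, *Z;
  if (match(&Add, m_c_Add(m_Sub(m_Value(X), m_Value(Y)),
                          m_Sub(m_Value(Z), m_Deferred(X)))))
    return Builder.CreateSub(Z, Y); // the X terms cancel
  return nullptr;
}
```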
The fold is typically handled in the `Reassociate` pass, but it fails
if the inner `sub`/`add` are multi-use. Less importantly, Reassociate
doesn't propagate flags correctly.
This patch adds the fold explicitly to InstCombine.
Proofs: https://alive2.llvm.org/ce/z/p6JyRP
Closes #105866
This patch adds the aforementioned fold to InstCombine. This pattern is
produced after naive implementations of 3-way comparison in high-level
languages are transformed into LLVM IR and then optimized.
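For context, this is the kind of source-level idiom meant by a "naive" three-way comparison (an illustration, not the exact test case):
```cpp
// Front-ends lower this into a chain of compares and selects, which is the
// shape of IR such a fold targets.
int cmp3(int a, int b) {
  if (a < b)
    return -1;
  if (a > b)
    return 1;
  return 0;
}
```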
Proofs: https://alive2.llvm.org/ce/z/w4QLq_
Reapplied with a clang test fix.
-----
When simplifying a multi-use root value, the demanded bits were
reset to full, but we also need to reset the context instruction.
To make this convenient (without requiring by-value passing of
SimplifyQuery), move the logic that handles constants and
dispatches to SimplifyDemandedUseBits/SimplifyMultipleUseDemandedBits
into SimplifyDemandedBits. The SimplifyDemandedInstructionBits
caller starts with full demanded bits and an appropriate context
anyway.
The different context instruction does mean that the ephemeral
value protection no longer triggers in some cases, as the changes
to assume tests show.
An alternative, which I will explore in a followup, is to always
use SimplifyMultipleUseDemandedBits() -- the previous root special
case is only really intended for SimplifyDemandedInstructionBits(),
which now no longer shares this code path.
Fixes https://github.com/llvm/llvm-project/issues/97330.
This will enable calling SimplifyDemandedBits() with a SimplifyQuery
that has CondContext set in the future.
Additionally, this marginally strengthens the analysis by
retaining the original context instruction for one-use chains.
In #88217 a large set of matchers was changed to only accept poison
values in splats, but not undef values. This is because we now use
poison for non-demanded vector elements, and allowing undef can cause
correctness issues.
This patch covers the remaining matchers by changing the AllowUndef
parameter of getSplatValue() to AllowPoison instead. We also carry out
corresponding renames in matchers.
As a followup, we may want to change the default for things like m_APInt
to m_APIntAllowPoison (as this is much less risky when only allowing
poison), but this change doesn't do that.
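A small usage sketch of the distinction (matcher names as used above; simplified):
```cpp
#include "llvm/ADT/APInt.h"
#include "llvm/IR/PatternMatch.h"
using namespace llvm;
using namespace llvm::PatternMatch;

static bool matchSplatAllowingPoisonLanes(Value *V, const APInt *&C) {
  // m_APInt (the default) accepts no poison/undef lanes in a splat;
  // m_APIntAllowPoison tolerates poison lanes but, after this change,
  // no longer undef lanes.
  return match(V, m_APIntAllowPoison(C));
}
```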
There is one caveat here: We have a single place
(X86FixupVectorConstants) which does require handling of vector splats
with undefs. This is because this works on backend constant pool
entries, which currently still use undef instead of poison for
non-demanded elements (because SDAG as a whole does not have an explicit
poison representation). As it's just the single use, I've open-coded a
getSplatValueAllowUndef() helper there, to discourage use in any other
places.
This is the major rename patch that prior patches have built towards.
The DPValue class is being renamed to DbgVariableRecord, which reflects
the updated terminology for the "final" implementation of the RemoveDI
feature. This is a pure string substitution + clang-format patch. The
only manual component of this patch was determining where to perform
these string substitutions: `DPValue` and `DPV` are almost exclusively
used for DbgRecords, *except* for:
- llvm/lib/target, where 'DP' is used to mean double-precision, and so
appears as part of .td files and in variable names. NB: There is a
single existing use of `DPValue` here that refers to debug info, which
I've manually updated.
- llvm/tools/gold, where 'LDPV' is used as a prefix for symbol
visibility enums.
Outside of these places, I've applied several basic string
substitutions, with the intent that they only affect DbgRecord-related
identifiers; I've checked them as I went through to verify this, with
reasonable confidence that there are no unintended changes that slipped
through the cracks. The substitutions applied are all case-sensitive,
and are applied in the order shown:
```
DPValue -> DbgVariableRecord
DPVal -> DbgVarRec
DPV -> DVR
```
Following the previous rename patches, it should be the case that there
are no instances of any of these strings that are meant to refer to the
general case of DbgRecords, or anything other than the DPValue class.
The idea behind this patch is therefore that pure string substitution is
correct in all cases as long as these assumptions hold.
Instead of taking the sign of the cast operation as the required sign
for the transform, only force a sign if an operation may be negative.
This gives us more flexibility when checking whether the floats are
safely convertible to integers.
Closes #84389
The matchFunnelShift function did both the pattern matching and the
creation of the fshl/fshr instruction when needed. Move the pattern
matching code into a new function, convertOrOfShiftsToFunnelShift, so
that it can be reused by other optimizations.
This reverts commit ef388334ee5a3584255b9ef5b3fefdb244fa3fd7.
The referenced issue violates the spec for finite-only math only by
using a return value for a constant infinity. If the interpretation
is that results and arguments cannot violate nofpclass, then any
std::numeric_limits<T>::infinity() result is invalid under
-ffinite-math-only. Without this interpretation the utility of
nofpclass is slashed.