llvm-project

Author	SHA1	Message	Date
Valeriy Savchenko	9d3d0d7bb6	[InstCombine] Support disjoint or in add-sub reassociation fold (#186827 )	2026-03-16 17:43:20 +00:00
Wenju He	79ad09175e	[InstCombine] Preserve fdiv metadata on fneg folds (#179157 ) Copy metadata from the original fdiv when folding fneg into fdiv or hoisting fneg above fdiv. This keeps !fpmath (and other metadata) intact, preventing !fpmath loss seen in libclc tanpi function. fneg only flips sign bit, so it does not affect precision.	2026-02-03 08:27:18 +08:00
Gábor Spaits	58822a6933	[InstCombine] Fold `(select C, (x bin_op a), x) bin_op b` into `x bin_op select C, (a bin_op b), b` (#173511 ) Fixes #154246. The original pattern in the issue was the following: ```llvm define i8 @src(i8 %arg0, i8 %arg1) { %v0 = icmp eq i8 %arg1, -1 %v1 = or i8 %arg0, 4 %v2 = select i1 %v0, i8 %v1, i8 %arg0 %v3 = or i8 %v2, 1 ret i8 %v3 } ``` to ```llvm define i8 @tgt(i8 %arg0, i8 %arg1) { %v0 = icmp eq i8 %arg1, -1 %v3.v = select i1 %v0, i8 5, i8 1 %v3 = or i8 %arg0, %v3.v ret i8 %v3 } ``` (Alive2: https://alive2.llvm.org/ce/z/Nq6b4) This PR works with other binary operators too like `add` and `mul`.	2026-01-04 20:45:25 +01:00
Nikita Popov	f801b6f67e	[InstCombine] Add support for ptrtoaddr in pointer difference folds (#164428 ) This adds support for folding `ptrtoaddr(p2) - ptrtoaddr(p)` pointer subtractions. We can treat ptrtoaddr the same as ptrtoint as the transform is truncation safe anyway (and in fact supports explicit truncation as well). The only interesting case is the subtraction of zext of ptrtoaddr. For this transform it's important that the address bits are not truncated -- and if any pointer bits are truncated, that the truncation is consistent for both operands. I've explicitly spelled out the three different cases for this, which also fixes a miscompile in the existing ptrtoint fold.	2025-10-23 11:57:24 +02:00
Mircea Trofin	8aa49974df	[NFC][InstCombine] Make use of `unknown` profile info clear in the API name (#162766 ) Making the choice more clear from the API name, otherwise it'd be very easy for one to just "not bother" with the `MDFrom`, especially since it is optional and follows the optional `Name` - but this time we'd have a harder time detecting it's effectivelly dropped metadata.	2025-10-10 09:33:45 -07:00
Alan Zhao	045e09f22b	[InstCombine] Set !prof metadata on Selects identified by add.ll test (#158743 ) These select instructions are created from non-branching instructions, so their branch weights are unknown. Tracking issue: #147390	2025-09-29 20:37:06 +00:00
Ramkumar Ramachandra	7fb3a91418	[PatternMatch] Introduce match functor (NFC) (#159386 ) A common idiom is the usage of the PatternMatch match function within a functional algorithm like all_of. Introduce a match functor to shorten this idiom. Co-authored-by: Luke Lau <luke@igalia.com>	2025-09-17 21:04:33 +01:00
Vedant Paranjape	c45aa5c764	[InstCombine] Revert FSub optimization from #157757 (#158315 ) Since FSub X, 0 gets canoncialised to FAdd X, -0 the said optimization didn't make much sense for FSub. Remove it from IC and the adjoined testcase.	2025-09-12 12:16:31 -04:00
Vedant Paranjape	092de9bb90	[InstCombine] Enable FAdd simplifications when user can ignore sign bit (#157757 ) When FAdd result is used by fabs, we can safely ignore the sign bit of fp zero. This patch enables an instruction simplification optimization that folds fadd x, 0 ==> x, which would otherwise not work as the compiler cannot prove that the zero isn't -0. But if the result of the fadd is used by fabs we can simply ignore this and still do the optimization. Fixes #154238	2025-09-12 00:21:18 -04:00
benwu25	2508851798	[InstCombine] Fold `min(X+1, Y) - min(X, Y) --> zext X < Y` (#157782 ) This PR closes #157524. alive2: https://alive2.llvm.org/ce/z/xe_vb2 godbolt: https://alive2.llvm.org/ce/z/7A8PxK This fold is invalid for `@llvm.smin.i1` since `smin(-1, 0) == -1`. I also avoided i1 in general since this uses zext, but it seems like those checks for width might not be necessary, since other folds get to it first. The alive2 proof in #157524 used a select for the fold, but it seems like `select X < Y, 1, 0` should be canonicalized to `zext X < Y` if the bit width is correct.	2025-09-12 10:40:17 +08:00
Nikita Popov	dcdbb39786	[InstCombine] Use intersectForOffsetAdd() in CommonPointerBase (#157851 ) Transforms using this helper will add up all the offsets, so we should use intersectForOffsetAdd() instead of plain intersection. Annoyingly, this requires specially handling the first GEP to avoid losing flags in that case. Fixes https://github.com/llvm/llvm-project/issues/157714.	2025-09-10 15:57:27 +02:00
Nikita Popov	7e878aaf23	[PatternMatch] Add support for capture-and-match (NFC) (#149825 ) When using PatternMatch, there is a common problem where we want to both match something against a pattern, but also capture the value/instruction for various reasons (e.g. to access flags). Currently the two ways to do that is to either capture using m_Value/m_Instruction and do a separate match on the result, or to use the somewhat awkward `m_CombineAnd(m_XYZ, m_Value(V))` pattern. This PR introduces to add a variant of `m_Value`/`m_Instruction` which does both a capture and a match. `m_Value(V, m_XYZ)` is basically equivalent to `m_CombineAnd(m_XYZ, m_Value(V))`. I've ported two InstCombine files to this pattern as a sample.	2025-07-23 10:05:09 +02:00
Nikita Popov	1e24b53534	[InstCombine] Add limit for expansion of gep chains (#147065 ) When converting gep subtraction / comparison to offset subtraction / comparison, avoid expanding very long multi-use gep chains.	2025-07-23 09:47:53 +02:00
Nikita Popov	889854bef1	[InstCombine] Avoid unprofitable add with remainder transform (#147319 ) If C1 is 1 and we're working with a power of two divisor, this will end up replacing the `and` for the remainder with a multiply and a longer dependency chain. Fixes https://github.com/llvm/llvm-project/issues/147176.	2025-07-08 14:29:31 +02:00
Yingwei Zheng	2472cdcb69	[InstCombine] Refine nuw propagation in `OptimizePointerDifference` (#147059 ) After https://github.com/llvm/llvm-project/pull/146100, the offset may be generated by a previous call to `EmitGEPOffsets`, causing the nuw flag on shl to be lost. This patch handles the `shl+ptradd` case as well. It also fixes a miscompilation in the case of `mul + ptradd`. Alive2: https://alive2.llvm.org/ce/z/BeaNzE This patch removes many unnecessary masking operations in Rust programs with the `ptr_offset_from_unsigned` intrinsic : https://github.com/dtcxzyw/llvm-opt-benchmark/pull/2538/files	2025-07-04 23:34:33 +08:00
Luke Lau	6f7370ced6	[InstCombine] Pull vector reverse through fneg (#146349 ) This follows on from https://github.com/llvm/llvm-project/pull/144933#issuecomment-2992372627, and allows us to remove the reverse (fneg (reverse x)) combine. A separate patch will handle the case for fabs. I haven't checked if we perform this canonicalization for either unops or binops for vp.reverse	2025-06-30 16:15:04 +01:00
Luke Lau	d0c1ea928c	[InstCombine] Pull unary shuffles through fneg/fabs (#144933 ) This canonicalizes fneg/fabs (shuffle X, poison, mask) -> shuffle (fneg/fabs X), posion, mask This undoes part of b331a7ebc1e02f9939d1a4a1509e7eb6cdda3d38 and a8f13dbdeb31be37ee15b5febb7cc2137bbece67, but keeps the binary shuffle case i.e. shuffle fneg, fneg, mask. By pulling out the shuffle we bring it inline with the same canonicalisation we perform on binary ops and intrinsics, which the original commit acknowledges it goes in the opposite direction. However nowadays VectorCombine is more powerful and can do more optimisations when the shuffle is pulled out, so I think we should revisit this. In particular we get more shuffles folded and can perform scalarization.	2025-06-30 10:40:12 +01:00
Slava Zakharin	c0c71463f6	[InstCombine] Optimize sub(sext(add(x,y)),sext(add(x,z))). (#144174 ) This pattern can be often met in Flang generated LLVM IR, for example, for the counts of the loops generated for array expressions like: `a(x:x+y)` or `a(x+z:x+z)` or their variations. In order to compute the loop count, Flang needs to subtract the lower bound of the array slice from the upper bound of the array slice. To avoid the sign wraps, it sign extends the original values (that may be of any user data type) to `i64`. This peephole is really helpful in CPU2017/548.exchange2, where we have multiple following statements like this: ``` block(row+1:row+2, 7:9, i7) = block(row+1:row+2, 7:9, i7) - 10 ``` While this is just a 2x3 iterations loop nest, LLVM cannot figure it out, ending up vectorizing the inner loop really hard (with a vector epilog and scalar remainder). This, in turn, causes problems for LSR that ends up creating too many loop-carried values in the loop containing the above statement, which are then causing too many spills/reloads. Alive2: https://alive2.llvm.org/ce/z/gLgfYX Related to #143219.	2025-06-19 10:13:58 -07:00
gaynor-anthropic	632151fbee	InstCombine: improve optimizations for ceiling division with no overflow (#142869 ) Fixes #142497. Alive2: https://alive2.llvm.org/ce/z/CeaHaH The contents of this pull request were substantially written using claude-code. I've reviewed to the best of my ability (it's been years since I did any compilers work). --------- Co-authored-by: Yingwei Zheng <dtcxzyw@qq.com> Co-authored-by: Nikita Popov <github@npopov.com>	2025-06-17 09:52:18 +02:00
Nikita Popov	2019553a0b	[InstCombine] Extract EmitGEPOffsets() helper (NFC) Extract a reusable helper for emitting a sum of multiple GEP offsets.	2025-06-13 12:34:18 +02:00
Nikita Popov	6fc8ec720e	[InstCombine] Restore splat gep support in OptimizePointerDifference() (#143906 ) When looking for the common base pointer, support the case where the type changes because the GEP goes from pointer to vector of pointers. This was supported prior to #142958.	2025-06-13 12:29:50 +02:00
Nikita Popov	b8e3e0749f	[InstCombine] Export logic for common base pointer (NFC) Make this available to other parts of InstCombine, to be used for pointer comparison optimization.	2025-06-12 15:33:45 +02:00
SahilPatidar	971c49fbf3	[InstCombine] Ensure Safe Handling of Flags in foldFNegIntoConstant (#94148 ) Fix #93769 alive2: https://alive2.llvm.org/ce/z/MHShQY	2025-06-12 19:31:43 +09:00
Nikita Popov	b27ab06a3d	[InstCombine] Preserve flags for difference of gep chains (#143488 ) When expanding the offset of a GEP chain via a series of adds, try to preserve the nsw/nuw flags based on inbounds/nuw. This is a followup to https://github.com/llvm/llvm-project/pull/142958. Proof: https://alive2.llvm.org/ce/z/8HiFYY (note that preserving nsw in the nusw case is not valid)	2025-06-10 16:19:45 +02:00
Nikita Popov	20e8de9c8f	[InstCombine] Support nested GEPs in OptimizePointerDifference (#142958 ) Currently OptimizePointerDifference() only handles single GEPs with a common base, not GEP chains. This patch generalizes the support to nested GEPs with a common base. Finding the common base is a bit annoying because we want to stop as soon as possible and not recurse into common GEP prefixes. This helps avoids regressions from https://github.com/llvm/llvm-project/pull/137297.	2025-06-10 09:28:15 +02:00
Ramkumar Ramachandra	b40e4ceaa6	[ValueTracking] Make Depth last default arg (NFC) (#142384 ) Having a finite Depth (or recursion limit) for computeKnownBits is very limiting, but is currently a load-bearing necessity, as all KnownBits are recomputed on each call and there is no caching. As a prerequisite for an effort to remove the recursion limit altogether, either using a clever caching technique, or writing a easily-invalidable KnownBits analysis, make the Depth argument in APIs in ValueTracking uniformly the last argument with a default value. This would aid in removing the argument when the time comes, as many callers that currently pass 0 explicitly are now updated to omit the argument altogether.	2025-06-03 17:12:24 +01:00
Ruhung	4f7dc1b55a	[InstCombine] Fold (add (add A, 1), (sext (icmp ne A, 0))) to call umax(A, 1) (#122491 ) Transform (add (add A, 1), (sext (icmp ne A, 0))) into call umax(A, 1). Fixes #121853. Alive2: https://alive2.llvm.org/ce/z/TweTan	2025-01-12 16:51:58 +01:00
Yingwei Zheng	69ba565734	[InstCombine] Handle commuted pattern for `((X s/ C1) << C2) + X` (#121737 ) Closes https://github.com/llvm/llvm-project/issues/121700	2025-01-06 21:43:52 +08:00
Yingwei Zheng	a77346bad0	[IRBuilder] Refactor FMF interface (#121657 ) Up to now, the only way to set specified FMF flags in IRBuilder is to use `FastMathFlagGuard`. It makes the code ugly and hard to maintain. This patch introduces a helper class `FMFSource` to replace the original parameter `Instruction *FMFSource` in IRBuilder. To maximize the compatibility, it accepts an instruction or a specified FMF. This patch also removes the use of `FastMathFlagGuard` in some simple cases. Compile-time impact: https://llvm-compile-time-tracker.com/compare.php?from=f87a9db8322643ccbc324e317a75b55903129b55&to=9397e712f6010be15ccf62f12740e9b4a67de2f4&stat=instructions%3Au	2025-01-06 14:37:04 +08:00
Ramkumar Ramachandra	4a0d53a0b0	PatternMatch: migrate to CmpPredicate (#118534 ) With the introduction of CmpPredicate in 51a895a (IR: introduce struct with CmpInst::Predicate and samesign), PatternMatch is one of the first key pieces of infrastructure that must be updated to match a CmpInst respecting samesign information. Implement this change to Cmp-matchers. This is a preparatory step in migrating the codebase over to CmpPredicate. Since we no functional changes are desired at this stage, we have chosen not to migrate CmpPredicate::operator==(CmpPredicate) calls to use CmpPredicate::getMatching(), as that would have visible impact on tests that are not yet written: instead, we call CmpPredicate::operator==(Predicate), preserving the old behavior, while also inserting a few FIXME comments for follow-ups.	2024-12-13 14:18:33 +00:00
fengfeng	c3175c50a3	[InstCombine] Fold `(X & C1) - (X & C2) --> X & (C1 ^ C2)` if `(C1 & C2) == C2` (#119316 ) if (C1 & C2) == C2 then (X & C1) - (X & C2) --> X & (C1 ^ C2) Alive2: https://alive2.llvm.org/ce/z/JvQU8w	2024-12-10 11:06:34 +01:00
Nikita Popov	66ed8fb973	[InstCombine] Fix use after free Make sure we only access cached nowrap flags.	2024-12-04 17:20:04 +01:00
Nikita Popov	4a7abfe0a7	[InstCombine] Preserve nuw in OptimizePointerDifference If both the geps and the subs are nuw the new sub is also nuw. Proof: https://alive2.llvm.org/ce/z/mM8UvF	2024-12-04 16:58:35 +01:00
Yingwei Zheng	1a3eace82a	[InstCombine] Fold `umax(X, C) + -C` into `usub.sat(X, C)` (#118195 ) Alive2: https://alive2.llvm.org/ce/z/oSWe5S Closes https://github.com/llvm/llvm-project/issues/118155	2024-12-01 23:29:40 +08:00
David Green	18abc7e0c5	[PatternMatch] Introduce m_c_Select (#114328 ) This matches m_Select(m_Value(), L, R) or m_Select(m_Value(), R, L).	2024-11-25 13:47:23 +00:00
Nikita Popov	abac5be673	[InstCombine] Fix APInt ctor assertion The (extended) bit width might not fit into the (non-extended) type, resulting in an incorrect truncation of the compared value. Fix this by using m_SpecificInt(), which is both simpler and handles this correctly. Fixes the assertion failure reported in: https://github.com/llvm/llvm-project/pull/114539#issuecomment-2485799395	2024-11-19 15:17:59 +01:00
Yingwei Zheng	db90673d16	[InstCombine] Re-queue users of phi when nsw/nuw flags of add are inferred (#113933 ) This patch re-queue users of phi when one of its incoming add instructions is updated. If an add instruction is updated, the analysis results of phis may be improved. Thus we may further fold some users of this phi node. See the following case: ``` define i8 @trunc_in_loop_exit_block() { ; CHECK-LABEL: @trunc_in_loop_exit_block( ; CHECK-NEXT: entry: ; CHECK-NEXT: br label [[LOOP:%.]] ; CHECK: loop: ; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LOOP_LATCH:%.]] ] ; CHECK-NEXT: [[PHI:%.]] = phi i32 [ 1, [[ENTRY]] ], [ [[IV_NEXT]], [[LOOP_LATCH]] ] ; CHECK-NEXT: [[CMP:%.]] = icmp samesign ult i32 [[IV]], 100 ; CHECK-NEXT: br i1 [[CMP]], label [[LOOP_LATCH]], label [[EXIT:%.]] ; CHECK: loop.latch: ; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1 ; CHECK-NEXT: br label [[LOOP]] ; CHECK: exit: ; CHECK-NEXT: [[TRUNC:%.*]] = trunc i32 [[PHI]] to i8 ; CHECK-NEXT: ret i8 [[TRUNC]] ; entry: br label %loop loop: %iv = phi i32 [ 0, %entry ], [ %iv.next, %loop.latch ] %phi = phi i32 [ 1, %entry ], [ %iv.next, %loop.latch ] %cmp = icmp ult i32 %iv, 100 br i1 %cmp, label %loop.latch, label %exit loop.latch: %iv.next = add i32 %iv, 1 br label %loop exit: %trunc = trunc i32 %phi to i8 ret i8 %trunc } ``` `%iv u< 100` -> infer `nsw/nuw` for `%iv.next = add i32 %iv, 1` -> `%iv` is non-negative -> infer `samesign` for `%cmp = icmp ult i32 %iv, 100`. Without re-queuing users of phi nodes, we cannot improve `%cmp` in one iteration. Address review comment https://github.com/llvm/llvm-project/pull/112642#discussion_r1804712271. This patch also fixes some non-fixpoint issues in tests.	2024-11-18 17:15:46 +08:00
Nikolay Panchenko	6c1fc8213e	[InstCombine] fold `sub(zext(ptrtoint),zext(ptrtoint))` (#115369 ) On a 32-bit target if pointer arithmetic with `addrspace` is used in i64 computation, the missed folding in InstCombine results to suboptimal performance, unlike same code compiled for 64bit target.	2024-11-15 15:36:35 +01:00
Rahul Joshi	fa789dffb1	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752 ) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).	2024-10-11 05:26:03 -07:00
Noah Goldstein	a6edcea211	[InstCombine] Simplify `(add/sub (sub/add) (sub/add))` irrelivant of use-count Added folds: - `(add (sub X, Y), (sub Z, X))` -> `(sub Z, Y)` - `(sub (add X, Y), (add X, Z))` -> `(sub Y, Z)` The fold typically is handled in the `Reassosiate` pass, but it fails if the inner `sub`/`add` are multi-use. Less importantly, Reassosiate doesn't propagate flags correctly. This patch adds the fold explicitly the InstCombine Proofs: https://alive2.llvm.org/ce/z/p6JyRP Closes #105866	2024-08-27 11:43:17 -07:00
Volodymyr Vasylkun	be7d08cd59	[InstCombine] Fold `sext(A < B) + zext(A > B)` into `ucmp/scmp(A, B)` (#103833 ) This change also covers the fold of `zext(A > B) - zext(A < B)` since it is already being canonicalized into the aforementioned pattern. Proof: https://alive2.llvm.org/ce/z/AgnfMn	2024-08-21 23:15:24 +01:00
Nikita Popov	a105877646	[InstCombine] Remove some of the complexity-based canonicalization (#91185 ) The idea behind this canonicalization is that it allows us to handle less patterns, because we know that some will be canonicalized away. This is indeed very useful to e.g. know that constants are always on the right. However, this is only useful if the canonicalization is actually reliable. This is the case for constants, but not for arguments: Moving these to the right makes it look like the "more complex" expression is guaranteed to be on the left, but this is not actually the case in practice. It fails as soon as you replace the argument with another instruction. The end result is that it looks like things correctly work in tests, while they actually don't. We use the "thwart complexity-based canonicalization" trick to handle this in tests, but it's often a challenge for new contributors to get this right, and based on the regressions this PR originally exposed, we clearly don't get this right in many cases. For this reason, I think that it's better to remove this complexity canonicalization. It will make it much easier to write tests for commuted cases and make sure that they are handled.	2024-08-21 12:02:54 +02:00
Nikita Popov	dd9a99f2b6	[InstCombine] Preserve nsw in A + -B fold This was already done for -B + A, but not for A + -B. Proof: https://alive2.llvm.org/ce/z/F3V2yZ	2024-08-16 16:33:12 +02:00
Yingwei Zheng	62e9f40949	[PatternMatch] Use `m_SpecificCmp` matchers. NFC. (#100878 ) Compile-time improvement: http://llvm-compile-time-tracker.com/compare.php?from=13996378d81c8fa9a364aeaafd7382abbc1db83a&to=861ffa4ec5f7bde5a194a7715593a1b5359eb581&stat=instructions:u baseline: 803eaf29267c6aae9162d1a83a4a2ae508b440d3 ``` Top 5 improvements: stockfish/movegen.ll 2541620819 2538599412 -0.12% minetest/profiler.cpp.ll 431724935 431246500 -0.11% abc/luckySwap.c.ll 581173720 580581935 -0.10% abc/kitTruth.c.ll 2521936288 2519445570 -0.10% abc/extraUtilTruth.c.ll 1216674614 1215495502 -0.10% Top 5 regressions: openssl/libcrypto-shlib-sm4.ll 1155054721 1155943201 +0.08% openssl/libcrypto-lib-sm4.ll 1155054838 1155943063 +0.08% spike/vsm4r_vv.ll 1296430080 1297039258 +0.05% spike/vsm4r_vs.ll 1312496906 1313093460 +0.05% nuttx/lib_rand48.c.ll 126201233 126246692 +0.04% Overall: -0.02112308% ```	2024-07-29 10:04:06 +08:00
Craig Topper	01ceb9840a	[InstCombine] Fold (zext (X +nuw C)) + -C --> zext(X) when zext has additional use. (#98533 ) We have a general fold for (zext (X +nuw C2)) + C1 --> zext (X + (C2 + trunc(C1))) but this fold is disabled if the zext has an additional use. If the two constants cancel, we can fold the whole expression to zext(X) without increasing the number of instructions.	2024-07-12 07:40:29 -07:00
csstormq	96af114941	[InstCombine] Preserve the nsw/nuw flags for (X \| Op01C) + Op1C --> X + (Op01C + Op1C) (#94586 ) This patch simplifies `sdiv` to `udiv` by preserving the `nsw` flag for `(X \| Op01C) + Op1C --> X + (Op01C + Op1C)` if the sum of `Op01C` and `Op1C` will not overflow, and preserves the `nuw` flag unconditionally. Alive2 Proofs (provided by @nikic): https://alive2.llvm.org/ce/z/nrdCZT, https://alive2.llvm.org/ce/z/YnJHnH	2024-06-08 08:38:27 +08:00
Noah Goldstein	0310f7f2d0	[InstCombine] Fold `(add X, (sext/zext (icmp eq X, C)))` We can convert this to a select based on the `(icmp eq X, C)`, then constant fold the addition the true arm begin `(add C, (sext/zext 1))` and the false arm being `(add X, 0)` e.g - `(select (icmp eq X, C), (add C, (sext/zext 1)), (add X, 0))`. This is essentially a specialization of the only case that sees to actually show up from #89020 Closes #93840	2024-06-01 17:49:15 -05:00
Nikita Popov	0d335f78e4	[InstCombine] Handle more commuted cases in matchesSquareSum()	2024-05-09 12:35:16 +09:00
Nikita Popov	d26002ac38	[InstCombine] Fix use-after-free in OptimizePointerDifference() EmitGEPOffset() may remove the old GEP, so be sure to cache the inbounds flag beforehand.	2024-04-26 12:05:12 +09:00
Nikita Popov	cbe1760f02	[InstCombine] Allow multi-use OptimizePointerDifference() with two GEPs (#90017 ) Currently, the OptimizePointerDifference fold does not trigger when working on the sub of two geps where one of the geps has multiple uses, to avoid duplicating the offset arithmetic too much. However, there are cases where performing it would still be clearly profitable, e.g. test_sub_ptradd_multiuse. This patch drops the one-use restriction using the same strategy we use in GEP comparison folds: If there are multiple uses, we rewrite the GEP to use the expanded offset arithmetic instead (effectively canonicalizing it into ptradd representation). Fixes https://github.com/llvm/llvm-project/issues/88231.	2024-04-26 10:53:03 +09:00

1 2 3 4 5 ...

489 Commits