This patch replaces SmallSet<T *, N> with SmallPtrSet<T *, N>. Note
that SmallSet.h "redirects" SmallSet to SmallPtrSet for pointer
element types:
```
template <typename PointeeType, unsigned N>
class SmallSet<PointeeType *, N> : public SmallPtrSet<PointeeType *, N> {};
```
We only have 140 instances that rely on this "redirection", with the
vast majority of them under llvm/. Since relying on the redirection
doesn't improve readability, this patch replaces SmallSet with
SmallPtrSet for pointer element types.
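For illustration, a typical mechanical change looks like this (`Instruction`
is just an example pointee type, not taken from any specific call site):
```
#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/IR/Instruction.h"

// Before (relied on the SmallSet.h redirection):
//   llvm::SmallSet<llvm::Instruction *, 8> Visited;
// After (names the pointer-set implementation directly):
llvm::SmallPtrSet<llvm::Instruction *, 8> Visited;
```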
When InstTy is a type like IntrinsicInst, which can have a variable
number of arguments, we can encounter a case where Operation has fewer
than two arguments and the getOperand() calls fail.
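A minimal sketch of the guard, assuming the matcher sees the call as an
IntrinsicInst (the names here are illustrative, not the exact code):
```
// Bail out before touching the first two arguments when the intrinsic
// call carries fewer than two of them.
if (auto *II = dyn_cast<IntrinsicInst>(&Operation)) {
  if (II->arg_size() < 2)
    return false;
  Value *LHS = II->getArgOperand(0);
  Value *RHS = II->getArgOperand(1);
  // ... continue matching with LHS/RHS ...
}
```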
Fixes: https://github.com/llvm/llvm-project/issues/152725.
Fold trig function calls of poison to poison.
This includes sin, cos, asin, acos, atan, atan2, sinh, cosh, sincos,
sincospi.
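A hedged sketch of the fold at the constant-folding site (simplified;
`IID`, `Operands`, and `Ty` are assumed to come from the surrounding
intrinsic-folding code):
```
// Simplified: a poison argument to any of the trig intrinsics listed
// above folds the whole call to poison.
switch (IID) {
case Intrinsic::sin:
case Intrinsic::cos:
case Intrinsic::asin:
case Intrinsic::acos:
case Intrinsic::atan:
case Intrinsic::atan2:
case Intrinsic::sinh:
case Intrinsic::cosh:
case Intrinsic::sincos:
case Intrinsic::sincospi:
  if (isa<PoisonValue>(Operands[0]))
    return PoisonValue::get(Ty);
  break;
default:
  break;
}
```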
Test cases are fixed and also added to
llvm/test/Transforms/InstSimplify/fold-intrinsics.ll, just like in
https://github.com/llvm/llvm-project/pull/146750.
This consolidates the "fold poison arg to poison result" constant
folding logic for intrinsics, based on a common
intrinsicPropagatesPoison() helper, which is also used for poison
propagation reasoning in ValueTracking. This ensures that the set of
supported intrinsics is consistent.
This adds ucmp, scmp, smul.fix, smul.fix.sat, canonicalize and sqrt to
the intrinsicPropagatesPoison list, as these were handled by
ConstantFolding but not ValueTracking. The ctpop test is an example of
the converse, where it was handled by ValueTracking but not
ConstantFolding.
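A minimal sketch of the consolidated gate, built on the
intrinsicPropagatesPoison() helper named above (the folding site itself
is simplified):
```
// One shared predicate now decides poison propagation for both
// ConstantFolding and ValueTracking, keeping the intrinsic sets in sync.
if (intrinsicPropagatesPoison(IID) &&
    any_of(Operands, [](Constant *C) { return isa<PoisonValue>(C); }))
  return PoisonValue::get(Ty);
```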
Similarly to what is done to match simple recurrence cycles, attempt to
match value-accumulating recurrences of the kind:
```
%umax.acc = phi i8 [ %umax, %backedge ], [ %a, %entry ]
%umax = call i8 @llvm.umax.i8(i8 %umax.acc, i8 %b)
```
This is preliminary work to let InstCombine avoid folding such
recurrences, so that simple loop-invariant computations can be hoisted.
There is also a minor opportunity to refactor out common code.
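A sketch of how such a recurrence can be matched, as a hypothetical
helper in the spirit of matchSimpleRecurrence (not the exact upstream
API):
```
#include "llvm/IR/Instructions.h"
#include "llvm/IR/IntrinsicInst.h"
using namespace llvm;

// Match a two-incoming phi whose step is a two-argument intrinsic
// (e.g. umax) that uses the phi itself as one operand.
static bool matchBinaryIntrinsicRecurrence(const PHINode *P,
                                           const IntrinsicInst *&Step,
                                           Value *&Init, Value *&OtherOp) {
  if (P->getNumIncomingValues() != 2)
    return false;
  for (unsigned I = 0; I != 2; ++I) {
    auto *II = dyn_cast<IntrinsicInst>(P->getIncomingValue(I));
    if (!II || II->arg_size() != 2)
      continue;
    Value *L = II->getArgOperand(0), *R = II->getArgOperand(1);
    if (L != P && R != P)
      continue;
    Step = II;                          // the accumulating intrinsic
    Init = P->getIncomingValue(1 - I);  // value from the entry edge
    OtherOp = (L == P) ? R : L;         // the non-recurrent operand
    return true;
  }
  return false;
}
```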
Seeing how we can't generate any debug intrinsics any more: delete a
variety of codepaths where they're handled. For the most part these are
plain deletions; in others I've tweaked comments to remain coherent or
added a type to (what were) type-generic lambdas.
This isn't all the DbgInfoIntrinsic call sites but it's most of the
simple scenarios.
Co-authored-by: Nikita Popov <github@npopov.com>
When the original predicate is ordered and both operands are non-NaN,
`Ordered` should be set to true. This variable still matters even when
both operands are non-NaN, because FMF applies only to the select, not
to the fcmp.
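A sketch of the intended behavior, assuming the select-pattern matching
this patch touches (simplified):
```
// Derive Ordered from the predicate itself; knowing both operands are
// non-NaN does not make it irrelevant, since any FMF that justify
// ignoring NaNs live on the select instruction, not on the fcmp.
bool Ordered = CmpInst::isOrdered(Pred);
```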
Closes https://github.com/llvm/llvm-project/issues/143123.
Having a finite Depth (or recursion limit) for computeKnownBits is very
limiting, but is currently a load-bearing necessity, as all KnownBits
are recomputed on each call and there is no caching. As a prerequisite
for an effort to remove the recursion limit altogether, either using a
clever caching technique or writing an easily invalidated KnownBits
analysis, make the Depth argument in APIs in ValueTracking uniformly the
last argument with a default value. This would aid in removing the
argument when the time comes, as many callers that currently pass 0
explicitly are now updated to omit the argument altogether.
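Illustratively (signatures simplified, not the exact declarations):
```
// Before: Depth was an early positional parameter.
//   KnownBits computeKnownBits(const Value *V, unsigned Depth,
//                              const SimplifyQuery &Q);
// After: Depth is uniformly last and defaulted, so most callers drop it.
//   KnownBits computeKnownBits(const Value *V, const SimplifyQuery &Q,
//                              unsigned Depth = 0);
```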
Add `GenericFloatingPointPredicateUtils` in order to generalize
effects of floating point comparisons on `KnownFPClass` for both IR and
MIR.
---------
Co-authored-by: Matt Arsenault <arsenm2@gmail.com>
For now use the same treatment as minnum/maxnum, but these should
diverge. Alive2 seems happy with this, except for some preexisting bugs
with weird denormal modes.
Closes https://github.com/llvm/llvm-project/issues/137582.
In the original case, LVI uses the edge information in `%entry ->
%if.end` to get a more precise result. However, since the call to `smin`
has a `noundef` return attribute, immediate UB will be triggered after
optimization.
Currently, `isSafeToSpeculativelyExecuteWithOpcode(%min)` returns true
because 6a288c1e32 only checks whether the function is speculatable.
However, that is not enough in this case.
This patch takes UB-implying attributes into account if
`IgnoreUBImplyingAttrs` is set to false. If it is set to true, the
caller is responsible for correctly propagating UB-implying attributes.
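A hedged sketch of the new check (the parameter name comes from this
patch; `CB` stands for the call being inspected and the surrounding code
is simplified):
```
// A speculatable call may still trigger immediate UB through attributes
// such as a noundef return; only skip this check when the caller opts
// in and takes responsibility for propagating such attributes.
if (!IgnoreUBImplyingAttrs && CB->hasRetAttr(Attribute::NoUndef))
  return false;
```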
Reapply after d51b2785abf77978d9218a7b6fb5b8ec6c770c31, which should
fix optimization regressions.
After #135642 we have a range attribute on the intrinsic declaration,
so we should not need the special handling here.
The KnownBits passed to computeKnownBitsFromRangeMetadata must have the
same bit width as the range metadata's bit width; otherwise the
calculated results will be incorrect.
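A minimal sketch of the contract (both helpers are existing LLVM APIs;
the surrounding code is simplified):
```
// The KnownBits passed in must be sized to the range metadata's width;
// a mismatched width silently yields wrong known bits.
ConstantRange CR = getConstantRangeFromMetadata(*Ranges);
KnownBits Known(CR.getBitWidth());
computeKnownBitsFromRangeMetadata(*Ranges, Known);
```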
---------
Signed-off-by: John Lu <John.Lu@amd.com>
In https://github.com/llvm/llvm-project/pull/97762, we assumed that if
the minimum possible value of X is NaN, then X is NaN. But this doesn't
hold for the x86_fp80 format. If the knownbits of X are
`?'011111111111110'????????????????????????????????????????????????????????????????`,
the minimum possible value of X is NaN/unnormal, yet X can still be a
normal value.
Closes https://github.com/llvm/llvm-project/issues/130408.
Fixes (keep it open) #130110.
If the incoming value is the PHI itself, we can skip it. If we can
guarantee that the other incoming values are neither undef nor poison,
then we can also guarantee that the PHI's value isn't either. If we
cannot guarantee that, there is no point in computing it.
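A sketch of the resulting loop (simplified; the real code threads depth
and context information through the query):
```
// Self-references add no information; every other incoming value must
// itself be guaranteed neither undef nor poison.
for (const Use &U : PN->incoming_values()) {
  if (U.get() == PN)
    continue;
  if (!isGuaranteedNotToBeUndefOrPoison(U.get()))
    return false;
}
return true;
```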
When a wider scalar/vector type containing all sign bits is bitcast to a
narrower vector type, we can deduce that the resulting narrow elements
will also be all sign bits. This matches existing behavior in
SelectionDAG and helps optimize cases involving SSE intrinsics where
sign-extended values are bitcast between different vector types.
The current implementation fails to recognize that an arithmetic right
shift is redundant when applied to elements that are already known to be
all sign bits. This PR improves ComputeNumSignBitsImpl to track this
information through bitcasts, enabling the optimization of such cases.
```
%ext = sext <1 x i1> %cmp to <1 x i8>
%sub = bitcast <1 x i8> %ext to <4 x i2>
%sra = ashr <4 x i2> %sub, <i2 1, i2 1, i2 1, i2 1>
; Can be simplified to just:
%sub = bitcast <1 x i8> %ext to <4 x i2>
```
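A sketch of the core deduction (simplified from ComputeNumSignBitsImpl;
the variable names are illustrative):
```
// If every bit of the wider source is a sign bit, each narrower element
// produced by the bitcast consists entirely of sign bits as well, so a
// subsequent ashr of those elements is redundant.
unsigned SrcBits = Src->getType()->getScalarSizeInBits();
if (ComputeNumSignBits(Src, Depth + 1, Q) == SrcBits)
  return DstEltBits;
```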
Closes #87624.