llvm-project

Author	SHA1	Message	Date
Matt Arsenault	77e3fea02b	ValueTracking: Improve trunc handling in computeKnownFPClass	2023-04-26 05:20:56 -04:00
Matt Arsenault	2ff52ea4ad	ValueTracking: Handle powi in computeKnownFPClass Extract the handling from cannotBeOrderedLessThanZeroImpl and avoid the mentioned -0 bug.	2023-04-26 05:20:55 -04:00
Matt Arsenault	8e72219973	ValueTracking: Implement computeKnownFPClass for log	2023-04-26 05:20:55 -04:00
Matt Arsenault	f40d186d4a	ValueTracking: Add ordered negative handling for fmul to computeKnownFPClass Port from the existing handling in cannotBeOrderedLessThanZero	2023-04-24 22:31:20 -04:00
Matt Arsenault	7aeec64215	ValueTracking: Handle fptrunc_round in computeKnownFPClass	2023-04-24 22:31:20 -04:00
Matt Arsenault	a070dbfd14	ValueTracking: Implement computeKnownFPClass for fma/fmuladd Copy handling from CannotBeOrderedLessThanZero	2023-04-24 14:29:35 -04:00
Matt Arsenault	d46f8c6ec9	ValueTracking: Handle exp/exp2 in computeKnownFPClass	2023-04-24 14:25:06 -04:00
Matt Arsenault	b0aa6d76eb	ValueTracking: Fix computeKnownFPClass for fabs The fabs utility functions have the opposite purpose and probably should not be a general utility.	2023-04-24 14:25:06 -04:00
Matt Arsenault	c55fffecce	ValueTracking: Recognize >=, <= compares with 0 as is.fpclass masks Leave DAZ handling for a future change.	2023-04-21 08:15:04 -04:00
Matt Arsenault	6966859059	ValueTracking: Implement computeKnownFPClass for fpext	2023-04-21 07:02:55 -04:00
Matt Arsenault	83adfc91e8	ValueTracking: uitofp/sitofp cannot return denormal results	2023-04-19 20:11:34 -04:00
Matt Arsenault	02f647f892	ValueTracking: Handle sign bit of constrained sitofp/uitofp This is for parity with CannotBeNegativeZero which is close to droppable.	2023-04-19 20:11:33 -04:00
Matt Arsenault	f6d79ad9eb	ValueTracking: Implement computeKnownFPClass for fdiv for nan handling	2023-04-19 20:11:33 -04:00
Matt Arsenault	e7bcfea622	ValueTracking: Fix backwards handling of fpclass assumes This was a bit confused because nofpclass expresses the opposite from what an assume of class expresses. We need to assume the intersection of assumed classes, which also needs to be inverted to convert to nofpclass.	2023-04-19 20:11:32 -04:00
Matt Arsenault	dea4f37b7d	ValueTracking: Handle shufflevector in computeKnownFPClass	2023-04-19 08:18:37 -04:00
Matt Arsenault	8e70ed6efd	ValueTracking: Handle insertelement in computeKnownFPClass	2023-04-19 08:18:37 -04:00
Matt Arsenault	0d448783c3	ValueTracking: sitofp cannot return -0	2023-04-19 08:18:37 -04:00
OCHyams	ca10e73b53	[NFC] Rename isPointerOffset to getPointerOffsetFrom and move to Value.h Linking LLVMCore failed when building D148536 with shared libs enabled: https://lab.llvm.org/buildbot/#/builders/121/builds/29766 Make isPointerOffset a Value method and rename it to getPointerOffsetFrom. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D148698	2023-04-19 12:22:58 +01:00
Noah Goldstein	e846ec57cb	Recommit "[ValueTracking] Apply the isKnownNonZero techniques in `ashr`/`lshl` to `shl` and vice-versa" (2nd Try) Wasn't related to the bug it was originally thought to be causing.	2023-04-18 17:17:57 -05:00
Nikita Popov	294831688f	[ValueTracking] Use SmallPtrSetImpl (NFC) Don't hardcode set size in function signature.	2023-04-18 12:37:15 +02:00
Noah Goldstein	3c4d9cc273	Revert "[ValueTracking] Apply the isKnownNonZero techniques in `ashr`/`lshl` to `shl` and vice-versa" May be related to PR62175 This reverts commit 57590d1dd47bbe9aa4b79a0f93cc3ec62cc5d060.	2023-04-18 01:23:08 -05:00
Noah Goldstein	57590d1dd4	[ValueTracking] Apply the isKnownNonZero techniques in `ashr`/`lshl` to `shl` and vice-versa For all shifts we can apply the same two optimizations. 1) `ShiftOp(KnownVal.One, Max(KnownCnt)) != 0` -> result is non-zero 2) If already known `Val != 0` and we only shift out zeros (based on `Max(KnownCnt)`) -> result is non-zero The former exists for `shl` and the latter (for constant `Cnt`) exists for `ashr`/`lshr`. This patch improves the latter to use `Max(KnownCnt)` instead of relying on a constant shift `Cnt` and applies both techniques for all shift ops. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D148404	2023-04-17 22:39:06 -05:00
Nikita Popov	fd63a7d5c8	Revert "ValueTracking: Handle freeze in computeKnownFPClass" This reverts commit 2c8d0048f03d054f13909a26f959ef95b2a0a4de. This is incorrect: computeKnownFPClass() is only known up to poison, and freeze poison may have any FP class.	2023-04-17 12:59:23 +02:00
Noah Goldstein	f688d215e5	[ValueTracking] Add `shl nsw %val, %cnt != 0` if `%val != 0`. Alive2 Link: https://alive2.llvm.org/ce/z/mxZLJn Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D147898	2023-04-14 18:23:47 -05:00
Noah Goldstein	684963b86d	[ValueTracking] Use maximum shift count in `shl` when determining if `shl` can be zero. Previously only return `shl` non-zero if the shift value was `1`. We can expand this if we have some bounds on the shift count. For example: ``` %cnt = and %c, 16 ; Max cnt == 16 %val = or %v, 4 ; val[2] is known one %shl = shl %val, %cnt ; (val.known.one << cnt.maxval) != 0 ``` Differential Revision: https://reviews.llvm.org/D147897	2023-04-14 18:23:45 -05:00
Matt Arsenault	2c8d0048f0	ValueTracking: Handle freeze in computeKnownFPClass	2023-04-14 17:53:41 -04:00
Matt Arsenault	49b931bdc5	ValueTracking: Implement computeKnownFPClass for arithmetic.fence	2023-04-14 17:41:27 -04:00
Matt Arsenault	3dabcdc78b	ValueTracking: Implement computeKnownFPClass for llvm.trunc	2023-04-14 17:41:26 -04:00
Matt Arsenault	656b52a6c6	ValueTracking: Handle non-splat vectors in computeKnownFPClass Avoids some regressions when the implementation of isKnownNeverNaN is replaced with computeKnownFPClass.	2023-04-14 17:41:26 -04:00
Matt Arsenault	e2d68c2fa4	ValueTracking: Implement computeKnownFPClass for canonicalize	2023-04-14 16:17:55 -04:00
Matt Arsenault	cb022084f0	ValueTracking: Handle fptrunc in computeKnownFPClass Handle nan.	2023-04-14 14:36:56 -04:00
Matt Arsenault	409ef45000	ValueTracking: Handle extractelement and extractvalue in computeKnownFPClass	2023-04-14 14:36:56 -04:00
Matt Arsenault	c603fd2f39	ValueTracking: Implement computeKnownFPClass for sin/cos	2023-04-14 14:36:55 -04:00
Matt Arsenault	054cac104f	ValueTracking: Address todo for nan fmul handling in computeKnownFPClass If both operands can't be zero or nan, the result can't be nan.	2023-04-13 14:44:34 -04:00
Matt Arsenault	4d044bfb33	ValueTracking: Handle no-nan check for computeKnownFPClass for fmul Copy the logic from isKnownNeverNaN for fadd/fsub. Leave the extension to handle the zero case for a future change.	2023-04-13 14:44:34 -04:00
Matt Arsenault	6aca400986	ValueTracking: Handle no-nan check for computeKnownFPClass for fadd/fsub Copy the logic from isKnownNeverNaN for fadd/fsub.	2023-04-12 06:48:58 -04:00
Matt Arsenault	eb8e43a2a1	ValueTracking: Remove outdated todo	2023-04-12 06:48:58 -04:00
Nikita Popov	3f53a58597	[ValueTracking] Fix incorrect computeConstantRange() arguments The second argument is ForSigned, not UseInstrInfo.	2023-03-31 16:56:56 +02:00
Philip Reames	54539fa8b3	[LSR/LFTR] Move two utilities to common code for reuse [nfc] We're working on a transform in LSR which is essentially an inverse of LFTR (in certain sub-cases). Move utilities so that they can be reused.	2023-03-20 09:05:38 -07:00
Matt Arsenault	d2404ea6ce	Attributor: Assume handling for nofpclass	2023-03-17 07:39:40 -04:00
Nikita Popov	a5242483e4	[SCEV] Recognize vscale intrinsics Now that SCEV has a dedicated vscale node type, we should also map vscale intrinsics to it. To make sure this does not regress ranges (which were KnownBits based previously), add support for vscale to getRangeRef() as well. Differential Revision: https://reviews.llvm.org/D146226	2023-03-17 10:07:39 +01:00
Nikita Popov	402dfa389e	[ValueTracking] Support vscale in computeConstantRange() Add support for vscale in computeConstantRange(), based on vscale_range attributes. This allows simplifying based on the precise range, rather than a KnownBits approximation (which will be off by a factor of two for the usual case of a power of two upper bound). Differential Revision: https://reviews.llvm.org/D146217	2023-03-17 10:03:24 +01:00
Matt Arsenault	8a37512924	ValueTracking: Extract fcmpToClassTest out of InstCombine Also update unsigned to FPClassTest	2023-03-16 23:14:40 -04:00
Matt Arsenault	b39deda3e1	ValueTracking: Handle nofpclass in computeKnownFPClass	2023-03-16 23:14:40 -04:00
Matt Arsenault	931d4098a2	ValueTracking: Add start of computeKnownFPClass API Add a new compute-known-bits like function to compute all the interesting floating point properties at once. Eventually this should absorb all the various floating point queries we already have.	2023-03-16 23:14:40 -04:00
Nikita Popov	531e06668b	[ValueTracking] Return ConstantRange for intrinsic ranges (NFC) Instead of setting Lower and Upper, return a ConstantRange. Should do this for the others as well.	2023-03-16 14:25:28 +01:00
Matt Arsenault	5da674492a	IR: Add nofpclass parameter attribute This carries a bitmask indicating forbidden floating-point value kinds in the argument or return value. This will enable interprocedural -ffinite-math-only optimizations. This is primarily to cover the no-nans and no-infinities cases, but also covers the other floating point classes for free. Textually, this provides a number of names corresponding to bits in FPClassTest, e.g. call nofpclass(nan inf) @must_be_finite() call nofpclass(snan) @cannot_be_snan() This is more expressive than the existing nnan and ninf fast math flags. As an added bonus, you can represent fun things like nanf: declare nofpclass(inf zero sub norm) float @only_nans() Compared to nnan/ninf: - Can be applied to individual call operands as well as the return value - Can distinguish signaling and quiet nans - Distinguishes the sign of infinities - Can be safely propagated since it doesn't imply anything about other operands. - Does not apply to FP instructions; it's not a flag This is one step closer to being able to retire "no-nans-fp-math" and "no-infs-fp-math". The one remaining situation where we have no way to represent no-nans/infs is for loads (if we wanted to solve this we could introduce !nofpclass metadata, following along with noundef/!noundef). This is to help simplify the GPU builtin math library distribution. Currently the library code has explicit finite math only checks, read from global constants the compiler driver needs to set based on the compiler flags during linking. We end up having to internalize the library into each translation unit in case different linked modules have different math flags. By propagating known-not-nan and known-not-infinity information, we can automatically prune the edge case handling in most functions if the function is only reached from fast math uses.	2023-02-24 07:41:29 -04:00
Noah Goldstein	196d3e3965	Add logic for tracking lowbit of (and/xor/or X, (add/sub X, Odd)) Any case of logicop + add/sub(Odd) we can prove the low bit is either zero/non-zero. Alive2 Links: xor: sub x, C: https://alive2.llvm.org/ce/z/aaABdS sub C, x: https://alive2.llvm.org/ce/z/2W-ZJ7 add C, x: https://alive2.llvm.org/ce/z/pzDkte or: sub x, C: https://alive2.llvm.org/ce/z/xd-bcP sub C, x: https://alive2.llvm.org/ce/z/p8hXJF add C, x: https://alive2.llvm.org/ce/z/osmkB6 and: sub x, C: https://alive2.llvm.org/ce/z/D_NNxR sub C, x: https://alive2.llvm.org/ce/z/N_5C62 add C, x: https://alive2.llvm.org/ce/z/4cy7a4 Differential Revision: https://reviews.llvm.org/D142427	2023-02-23 19:52:17 -06:00
Noah Goldstein	6ad6f9c579	Add helper for handling `computeKnownBits` for and/xor/or; NFC This change just factors out the existing logic for and/xor/or and puts it in a publicly available helper. Functionality is the same. Differential Revision: https://reviews.llvm.org/D142849	2023-02-23 19:52:16 -06:00
Max Kazantsev	0cbb8ec030	Revert "[AssumptionCache] caches @llvm.experimental.guard's" This reverts commit f9599bbc7a3f831e1793a549d8a7a19265f3e504. For some reason it caused us a huge compile time regression in downstream workloads. Not sure whether the source of it is in upstream code or not. Temporarily reverting until investigated. Differential Revision: https://reviews.llvm.org/D142330	2023-02-20 18:38:07 +07:00
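
The `shl` rule described in 684963b86d and 57590d1dd4 (if the known-one bits shifted by the maximum count are still non-zero, the result is non-zero) can be sanity-checked on a small bit width. The following is a minimal sketch on 8-bit integers, not LLVM's implementation; `shlKnownNonZero` and `shlRuleHoldsExhaustively` are hypothetical helper names introduced only for illustration.

```cpp
#include <cstdint>

// Hypothetical helper (not an LLVM API): models the rule on an 8-bit `shl`.
// knownOne: bits of the shifted value that are known to be set.
// maxCnt:   upper bound on the shift count.
// Returns true when the rule proves the shift result is non-zero.
bool shlKnownNonZero(uint8_t knownOne, unsigned maxCnt) {
  if (maxCnt >= 8)
    return false; // an oversized shift gives no usable information
  return static_cast<uint8_t>(knownOne << maxCnt) != 0;
}

// Brute-force check: whenever the rule fires, every value containing the
// known-one bits and every count in [0, maxCnt] gives a non-zero result.
bool shlRuleHoldsExhaustively(uint8_t knownOne, unsigned maxCnt) {
  if (!shlKnownNonZero(knownOne, maxCnt))
    return true; // rule makes no claim, nothing to verify
  for (unsigned v = 0; v < 256; ++v) {
    uint8_t val = static_cast<uint8_t>(v | knownOne);
    for (unsigned cnt = 0; cnt <= maxCnt; ++cnt)
      if (static_cast<uint8_t>(val << cnt) == 0)
        return false;
  }
  return true;
}
```

With `knownOne = 4` (bit 2 set) the rule fires for `maxCnt = 4` but not for `maxCnt = 6`, because the known bit would then be shifted out of an 8-bit value.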

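The low-bit fact from 196d3e3965 follows from `X` and `X +/- Odd` always having opposite low bits, so `xor` and `or` give a low bit of one and `and` gives a low bit of zero. A small exhaustive check on 8-bit values, written as an illustrative sketch (the function name `lowBitFactsHold` is an assumption, not part of LLVM):

```cpp
#include <cstdint>

// Hypothetical check (not an LLVM API): for every 8-bit X and every odd C,
// verify the low bit of (X op Y) where Y is X + C, X - C, or C - X.
bool lowBitFactsHold() {
  for (unsigned x = 0; x < 256; ++x) {
    for (unsigned c = 1; c < 256; c += 2) {
      const uint8_t X = static_cast<uint8_t>(x);
      const uint8_t C = static_cast<uint8_t>(c);
      const uint8_t ys[3] = {static_cast<uint8_t>(X + C),
                             static_cast<uint8_t>(X - C),
                             static_cast<uint8_t>(C - X)};
      for (uint8_t Y : ys) {
        if (((X ^ Y) & 1) != 1) return false; // xor: low bit is one
        if (((X | Y) & 1) != 1) return false; // or:  low bit is one
        if (((X & Y) & 1) != 0) return false; // and: low bit is zero
      }
    }
  }
  return true;
}
```

The Alive2 links in the commit message remain the authoritative proofs; this only illustrates the parity argument at a small width.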

1403 Commits