llvm-project

Author	SHA1	Message	Date
Yingwei Zheng	2f1f6b704d	[LLVM] Use `std::move` for APInt. NFC. (#86257 ) This patch adjusts argument passing for `APInt` to improve the compile-time. Compile-time improvement: https://llvm-compile-time-tracker.com/compare.php?from=d1f182c895728d89c5c3d198b133e212a5d9d4a3&to=ba3e326def3a6e5cd6d72ff5a49c74fba18de1df&stat=instructions:u	2024-03-23 14:58:25 +08:00
Nikita Popov	0f46e31cfb	[IR] Change representation of getelementptr inrange (#84341 ) As part of the migration to ptradd (https://discourse.llvm.org/t/rfc-replacing-getelementptr-with-ptradd/68699), we need to change the representation of the `inrange` attribute, which is used for vtable splitting. Currently, inrange is specified as follows: ``` getelementptr inbounds ({ [4 x ptr], [4 x ptr] }, ptr @vt, i64 0, inrange i32 1, i64 2) ``` The `inrange` is placed on a GEP index, and all accesses must be "in range" of that index. The new representation is as follows: ``` getelementptr inbounds inrange(-16, 16) ({ [4 x ptr], [4 x ptr] }, ptr @vt, i64 0, i32 1, i64 2) ``` This specifies which offsets are "in range" of the GEP result. The new representation will continue working when canonicalizing to ptradd representation: ``` getelementptr inbounds inrange(-16, 16) (i8, ptr @vt, i64 48) ``` The inrange offsets are relative to the return value of the GEP. An alternative design could make them relative to the source pointer instead. The result-relative format was chosen on the off-chance that we want to extend support to non-constant GEPs in the future, in which case this variant is more expressive. This implementation "upgrades" the old inrange representation in bitcode by simply dropping it. This is a very niche feature, and I don't think trying to upgrade it is worthwhile. Let me know if you disagree.	2024-03-20 10:59:45 +01:00
Artem Tyurin	141145232f	[IRBuilder] Fold binary intrinsics (#80743 ) Fixes https://github.com/llvm/llvm-project/issues/61240.	2024-03-15 09:58:25 +01:00
Björn Pettersson	7677453886	[ConstantFolding] Do not consider padded-in-memory types as uniform (#81854 ) Teaching ConstantFoldLoadFromUniformValue that types that are padded in memory can't be considered as uniform. Using the big hammer to prevent optimizations when loading from a constant for which DataLayout::typeSizeEqualsStoreSize would return false. Main problem solved would be something like this: store i17 -1, ptr %p, align 4 %v = load i8, ptr %p, align 1 If for example the i17 occupies 32 bits in memory, then LLVM IR doesn't really tell where the padding goes. And even if we assume that the 15 most significant bits are padding, then they should be considered as undefined (even if LLVM backend typically would pad with zeroes). Anyway, for a big-endian target the load would read those most significant bits, which aren't guaranteed to be one's. So it would be wrong to constant fold the load as returning -1. If LLVM IR had been more explicit about the placement of padding, then we could allow the constant fold of the load in the example, but only for little-endian. Fixes: https://github.com/llvm/llvm-project/issues/81793	2024-02-15 15:40:21 +01:00
Jessica Del	f85e7ab035	[AMDGPU] - Add constant folding to s_wqm intrinsic (#72382 ) Fold any constant input to the `s_wqm` intrinsic.	2023-11-21 16:36:45 +01:00
Simon Pilgrim	19e745890b	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning.	2023-11-17 16:30:08 +00:00
Jessica Del	b1e039f3b7	[AMDGPU] - Add constant folding for s_quadmask (#72381 ) If the input is a constant we can constant fold the `s_quadmask` intrinsic.	2023-11-17 15:24:23 +01:00
Jessica Del	af05f9ff06	[AMDGPU] - Add constant folding for s_bitreplicate (#72366 ) If the input is a constant, we can constant fold the s_bitreplicate operation.	2023-11-16 09:08:00 +01:00
Nikita Popov	a1e1c24331	[ConstantFolding] Avoid use of ConstantExpr::getLShr() (NFC) Work on APInt instead.	2023-11-10 15:06:18 +01:00
Nikita Popov	7513650be6	[ConstantFolding] Remove unnecessary pointer handling in FoldBitCast (NFCI) The destination element type here cannot be a pointer type, as this would require an inttoptr rather than bitcast.	2023-11-10 14:54:35 +01:00
Nikita Popov	d9f36c45da	[ConstantFolding] Add ConstantFoldIntegerCast helper This is intended as the replacement for ConstantExpr::getIntegerCast(), which does not require availability of the corresponding constant expressions. It just forwards to ConstantFoldCastOperand with the correct opcode.	2023-11-01 11:13:10 +01:00
Nikita Popov	3b25407d97	[IR] Mark zext/sext constant expressions as undesirable Introduce isDesirableCastOp() which determines whether IR builder and constant folding should produce constant expressions for a given cast type. This mirrors what we do for binary operators. Mark zext/sext as undesirable, which prevents most creations of such constant expressions. This is still somewhat incomplete and there are a few more places that can create zext/sext expressions. This is part of the work for https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179. The reason for the odd result in the constantexpr-fneg.c test is that initially the "a[]" global is created with an [0 x i32] type, at which point the icmp expression cannot be folded. Later it is replaced with an [1 x i32] global and the icmp gets folded away. But at that point we no longer fold the zext.	2023-10-02 12:40:20 +02:00
Nikita Popov	893416051d	[ConstantFolding] Avoid some uses of ConstantExpr::getCast() Call the constant folding API instead.	2023-09-29 11:41:14 +02:00
Nikita Popov	739c86df80	[llvm] Use more explicit cast methods (NFC) Instead of ConstantExpr::getCast() with a fixed opcode, use the corresponding getXYZ methods instead. For the one place creating a pointer bitcast drop it entirely, as this is redundant with opaque pointers.	2023-09-29 11:21:13 +02:00
Nikita Popov	d6d44d6f19	[ConstantFolding] Avoid use of ConstantExpr::getZExt() (NFC) Use the constant folding API instead, which should always succeed in this case.	2023-09-28 17:13:49 +02:00
Matt Arsenault	edecb60481	Reapply "AMDGPU: Drop and auto-upgrade llvm.amdgcn.ldexp to llvm.ldexp" This reverts commit d9333e360a7c52587ab6e4328e7493b357fb2cf3.	2023-09-13 08:38:48 +03:00
Matt Arsenault	00061843bd	InstSimplify: Simplifications for ldexp Ported from old amdgcn intrinsic which will soon be deleted. https://reviews.llvm.org/D149587	2023-09-13 08:38:48 +03:00
Matt Arsenault	ee8d1d26e9	ConstantFolding: Handle exp10 intrinsic https://reviews.llvm.org/D157892	2023-09-02 09:20:51 -04:00
Nikita Popov	625113402f	[IR] Remove support for and/or constant expressions As part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179, this removes support for and and or constant expressions. Places creating such expressions have been migrated in advance, so this is mostly API removal and test updates. Differential Revision: https://reviews.llvm.org/D155924	2023-08-22 09:29:54 +02:00
Ramkumar Ramachandra	aabc71485a	ConstantFolding: remove function in context of opaque ptrs The function StripPtrCastKeepAS() no longer makes any sense, as we've migrated to using opaque pointers throughout the codebase. Hence, remove it. No changes to tests are required. Differential Revision: https://reviews.llvm.org/D156555	2023-07-28 17:46:15 +01:00
Pravin Jagtap	1462053608	[AMDGPU] Propagate constants for llvm.amdgcn.wave.reduce.umin/umax Reviewed By: arsenm, #amdgpu Differential Revision: https://reviews.llvm.org/D156077	2023-07-26 23:46:01 -04:00
Matt Arsenault	a09d9b42f1	ConstantFolding: Constant fold denormal inputs to canonicalize for IEEE This makes it possible to use canonicalize to perform a dynamic check for whether denormal flushing is enabled, which will fold out when the denormal mode is known. Previously it would only fold if denormal flushing were known enabled. https://reviews.llvm.org/D156107	2023-07-24 19:49:06 -04:00
Nikita Popov	a1403dc3d0	[ConstantFolding] Avoid use of ConstantExpr::getOr() (NFC) Constant folding cannot fail here, because we're really working on plain integers. It might be better to make all of this work on APInts instead of Constants.	2023-07-24 17:00:38 +02:00
Matt Arsenault	952fe94c72	ConstantFolding: Fix canonicalize folding for dynamic mode denormal inputs We have to assume the input could be positive-zero. Makes alive2 happy.	2023-07-22 08:07:49 -04:00
Nikita Popov	7be7f23269	[llvm] Remove uses of getWithSamePointeeType() (NFC)	2023-07-18 12:07:09 +02:00
Nikita Popov	e65cabbbb5	[ConstantFolding] Remove some typed pointer handling (NFC) No need to insert a bitcast.	2023-07-18 11:35:03 +02:00
Matt Arsenault	29d2a9bf9d	InstSimplify: Handle basic folds for frexp Handle constant folding and idempotent folding. Not sure this is an appropriate use of undef for the inf/nan case. The C version says the second result is "unspecified". The AMDGPU instruction returns 0.	2023-07-17 17:28:01 -04:00
Nikita Popov	3eae1bf4c2	[llvm] Remove uses of getNonOpaquePointerElementType() (NFC)	2023-07-14 11:52:13 +02:00
khei4	0d67d9aeda	[ConstantFolding] fold integer bitwidth is greater than 63, and not multiple of 8 variables Differential Revision: https://reviews.llvm.org/D150422	2023-05-17 11:09:18 +09:00
khei4	f5dbbf494f	[ConstantFold] use StoreSize for VectorType folding Differential Revision: https://reviews.llvm.org/D150515 Reviewed By: nikic	2023-05-15 23:04:47 +09:00
Matt Arsenault	bc37be1855	LangRef: Add "dynamic" option to "denormal-fp-math" This is stricter than the default "ieee", and should probably be the default. This patch leaves the default alone. I can change this in a future patch. There are non-reversible transforms I would like to perform which are legal under IEEE denormal handling, but illegal with flushing zero behavior. Namely, conversions between llvm.is.fpclass and fcmp with zeroes. Under "ieee" handling, it is legal to translate between llvm.is.fpclass(x, fcZero) and fcmp x, 0. Under "preserve-sign" handling, it is legal to translate between llvm.is.fpclass(x, fcSubnormal\|fcZero) and fcmp x, 0. I would like to compile and distribute some math library functions in a mode where it's callable from code with and without denormals enabled, which requires not changing the compares with denormals or zeroes. If an IEEE function transforms an llvm.is.fpclass call into an fcmp 0, it is no longer possible to call the function from code with denormals enabled, or write an optimization to move the function into a denormal flushing mode. For the original function, if x was a denormal, the class would evaluate to false. If the function compiled with denormal handling was converted to or called from a preserve-sign function, the fcmp now evaluates to true. This could also be of use for strictfp handling, where code may be changing the denormal mode. Alternative name could be "unknown". Replaces the old AMDGPU custom inlining logic with more conservative logic which tries to permit inlining for callees with dynamic handling and avoids inlining other mismatched modes.	2023-04-29 08:44:59 -04:00
Craig Topper	1f60c8d025	[IR] Replace calls to ConstantFP::getNullValue with ConstantFP::getZero. NFC There is no getNullValue in ConstantFP. Due to inheritance, we're calling Constant::getNullValue which handles any type including FP. Since we already know we want an FP constant we can use ConstantFP::getZero which might be faster and is a more readable name for an FP zero.	2023-04-03 23:14:02 -07:00
Matt Arsenault	bba6ca4bfb	ConstantFolding: Minor cleanups for is_fpclass	2023-03-15 18:06:36 -04:00
Paul Walker	62e46f2621	[LLVM] Remove support for constant scalable vector GEPs. This work has fallen out from D134648 as a requirement to loosen the "constness" of vscale. Differential Revision: https://reviews.llvm.org/D145404	2023-03-14 16:48:33 +00:00
Nikita Popov	475f30dfc8	[ConstantFolding] Preserve inbounds when casting GEP indices This canonicalization just makes the implicit sext/trunc explicit, and does not affect the inbounds-ness of the GEP.	2023-03-09 13:08:19 +01:00
Nikita Popov	5d6dfba1a8	[ConstExpr] Avoid creation of select constant expressions These expressions will now only be created if explicitly requested in IR/bitcode (and by LowerTypeTests, which has a tricky to remove use). This is in preparation for removing these expressions entirely, but also fixes #60983 in the meantime.	2023-02-27 17:10:05 +01:00
Kazu Hirata	a28b252d85	Use APInt::getSignificantBits instead of APInt::getMinSignedBits (NFC) Note that getMinSignedBits has been soft-deprecated in favor of getSignificantBits.	2023-02-19 23:56:52 -08:00
Kazu Hirata	f8f3db2756	Use APInt::count{l,r}_{zero,one} (NFC)	2023-02-19 22:04:47 -08:00
Kazu Hirata	cbde2124f1	Use APInt::popcount instead of APInt::countPopulation (NFC) This is for consistency with the C++20-style bit manipulation functions in <bit>.	2023-02-19 11:29:12 -08:00
Nikita Popov	07916cea2e	[ConstantFold] Check for constant global earlier (NFC) Check that the underlying object is a constant global with definitive initializer upfront, so we can skip the more expensive offset calculation logic if we can't perform the fold anyway.	2023-02-15 15:17:05 +01:00
Joe Loser	a288d7f937	[llvm][ADT] Replace uses of `makeMutableArrayRef` with deduction guides Similar to how `makeArrayRef` is deprecated in favor of deduction guides, do the same for `makeMutableArrayRef`. Once all of the places in-tree are using the deduction guides for `MutableArrayRef`, we can mark `makeMutableArrayRef` as deprecated. Differential Revision: https://reviews.llvm.org/D141814	2023-01-16 14:49:37 -07:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::geFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Guillaume Chatelet	87b6b347fc	Revert D141134 "[NFC] Only expose getXXXSize functions in TypeSize" The patch should be discussed further. This reverts commit dd56e1c92b0e6e6be249f2d2dd40894e0417223f.	2023-01-06 15:27:50 +00:00
Guillaume Chatelet	dd56e1c92b	[NFC] Only expose getXXXSize functions in TypeSize Currently 'TypeSize' exposes two functions that serve the same purpose: - getFixedSize / getFixedValue - getKnownMinSize / getKnownMinValue source : `bf82070ea4/llvm/include/llvm/Support/TypeSize.h (L337-L338)` This patch offers to remove one of the two and stick to a single function in the code base. Differential Revision: https://reviews.llvm.org/D141134	2023-01-06 15:24:52 +00:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Matt Arsenault	6acf6661dd	ConstantFolding: Ignore output denorm mode for canonicalize Alive2 opt plugin is now happy with the test. Fixes issue 59245	2022-12-13 08:44:21 -05:00
Matt Arsenault	4fa54f8b81	ConstantFolding: Fix handling of canonicalize for ppc_fp128 0s	2022-12-13 08:44:20 -05:00
Matt Arsenault	d647e252b8	InstSimplify: Add basic folding of llvm.is.fpclass intrinsic Copied from the existing llvm.amdgcn.class handling; eventually I will fold that to the generic intrinsic when legal. The tests should probably move into an instsimplify only test.	2022-12-12 21:54:04 -05:00
Fangrui Song	89fae41ef1	[IR] llvm::Optional => std::optional Many llvm/IR/* files have been migrated by other contributors. This migrates most remaining files.	2022-12-05 04:13:11 +00:00
Krzysztof Parzyszek	ab672e9173	FPEnv: convert Optional to std::optional	2022-12-03 13:55:56 -06:00

1 2 3 4 5 ...

578 Commits