llvm-project

Author	SHA1	Message	Date
Simon Moll	173d4b84f6	[VP] Add test to show optimization opportunities Add vp.add test cases that can are optimized with D92086 to show the potential of generalized pattern rewriting. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D129746	2022-07-14 12:36:46 +02:00
Nikita Popov	4bb7b6fae3	[IR] Remove support for float binop constant expressions As part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179, this removes support for the floating-point binop constant expressions fadd, fsub, fmul, fdiv and frem. As part of this change, the C APIs LLVMConstFAdd, LLVMConstFSub, LLVMConstFMul, LLVMConstFDiv and LLVMConstFRem are removed. The LLVMBuild APIs should be used instead. Differential Revision: https://reviews.llvm.org/D129478	2022-07-12 09:40:49 +02:00
Nikita Popov	11950efe06	[ConstExpr] Remove div/rem constant expressions D128820 stopped creating div/rem constant expressions by default; this patch removes support for them entirely. The getUDiv(), getExactUDiv(), getSDiv(), getExactSDiv(), getURem() and getSRem() on ConstantExpr are removed, and ConstantExpr::get() now only accepts binary operators for which ConstantExpr::isSupportedBinOp() returns true. Uses of these methods may be replaced either by corresponding IRBuilder methods, or ConstantFoldBinaryOpOperands (if a constant result is required). On the C API side, LLVMConstUDiv, LLVMConstExactUDiv, LLVMConstSDiv, LLVMConstExactSDiv, LLVMConstURem and LLVMConstSRem are removed and corresponding LLVMBuild methods should be used. Importantly, this also means that constant expressions can no longer trap! This patch still keeps the canTrap() method to minimize diff -- I plan to drop it in a separate NFC patch. Differential Revision: https://reviews.llvm.org/D129148	2022-07-06 10:11:34 +02:00
Nikita Popov	02b38ba8aa	[ConstFold] Salvage some div/rem folding test (NFC) The div/rem constant expressions are going away in D129148. Convert some tests to use InstSimplify instead, to show that the constant folding still happens.	2022-07-06 10:03:03 +02:00
Chen Zheng	758de0e931	[InstructionSimplify] handle denormal input for fcmp Handle denormal constant input for fcmp instructions based on the denormal handling mode. Reviewed By: spatel, dcandler Differential Revision: https://reviews.llvm.org/D128647	2022-07-01 03:51:28 -04:00
Chen Zheng	36ac436068	add testcases for D128647, NFC	2022-06-30 09:54:49 -04:00
Nikita Popov	5548e807b5	[IR] Remove support for extractvalue constant expression This removes the extractvalue constant expression, as part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179. extractvalue is already not supported in bitcode, so we do not need to worry about bitcode auto-upgrade. Uses of ConstantExpr::getExtractValue() should be replaced with IRBuilder::CreateExtractValue() (if the fact that the result is constant is not important) or ConstantFoldExtractValueInstruction() (if it is). Though for this particular case, it is also possible and usually preferable to use getAggregateElement() instead. The C API function LLVMConstExtractValue() is removed, as the underlying constant expression no longer exists. Instead, LLVMBuildExtractValue() should be used (which will constant fold or create an instruction). Depending on the use-case, LLVMGetAggregateElement() may also be used instead. Differential Revision: https://reviews.llvm.org/D125795	2022-06-28 10:40:17 +02:00
Bradley Smith	a83aa33d1b	[IR] Move vector.insert/vector.extract out of experimental namespace These intrinsics are now fundemental for SVE code generation and have been present for a year and a half, hence move them out of the experimental namespace. Differential Revision: https://reviews.llvm.org/D127976	2022-06-27 10:48:45 +00:00
Nikita Popov	60a32157a5	[Tests] Remove unnecessary bitcasts from opaque pointer tests (NFC) Previously left these behind due to the required instruction renumbering, drop them now. This more accurately represents opaque pointer input IR. Also drop duplicate opaque pointer check lines in one SROA test.	2022-06-22 14:15:46 +02:00
David Candler	d3919a8cc5	[ConstantFolding] Respect denormal handling mode attributes when folding instructions Depending on the environment, a floating point instruction should treat denormal inputs as zero, and/or flush a denormal output to zero. Denormals are not currently accounted for when an instruction gets folded to a constant, which can lead to differences in output between a folded and a unfolded instruction when running on the target. The denormal handling mode can be set by the function level attribute denormal-fp-math, which this patch uses to determine whether any denormal inputs to or outputs from folding should be zero, and that the sign is set appropriately. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D116952	2022-06-20 16:41:46 +01:00
David Candler	2e2fdcd0f9	[ConstantFolding] Pre-commit tests showing denormal handling during folding These tests demonstrate cases where the constant produced by folding a floating point instruction should differ based on the denormal handling mode set in function attributes. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D125807	2022-06-20 15:38:30 +01:00
Nikita Popov	7e64a29e58	[InstSimplify][IR] Handle trapping constant aggregate (PR49839) Unfortunately, it's not just constant expressions that can trap, we might also have a trapping constant expression nested inside a constant aggregate. Perform the check during phi folding on Constant rather than ConstantExpr, and extend the Constant::mayTrap() implementation to also recursive into ConstantAggregates, not just ConstantExprs. Fixes https://github.com/llvm/llvm-project/issues/49839.	2022-06-13 12:35:17 +02:00
Nikita Popov	c232a15df4	[InstSimplify] Add additional test for PR49839 (NFC) This is a variant involving an aggregate constant, which was not covered by the previous patch.	2022-06-13 12:23:30 +02:00
Nikita Popov	2a3288776c	[InstSimplify] Update GEP test to use opaque pointers (NFC) With opaque pointers, we end up merging these GEPs and dropping the inrange attribute (in the last two cases). This did not happen previously, because typed pointers use less powerful GEP folding logic. I'm a bit unsure whether this is something we need to be concerned about or not. I believe that generally our stance is that we should perform folds even if this requires losing poison-generating flags like inrange. We can either a) accept this as-is, b) try to inhibit folding if it requires dropping inrange or c) try to fold to poison if we know that inrange is going to be violated. For now, we accept it as-is. Differential Revision: https://reviews.llvm.org/D127503	2022-06-13 10:45:55 +02:00
Nikita Popov	04b944e230	[InstSimplify] Convert tests to opaque pointers (NFC) The only interesting test change is in @PR31262, where the following fold is now performed, while it previously was not: https://alive2.llvm.org/ce/z/a5Qmr6 llvm/test/Transforms/InstSimplify/ConstProp/gep.ll has not been updated, because there is a tradeoff between folding and inrange preservation there that we may want to discuss. Updates have been performed using: https://gist.github.com/nikic/98357b71fd67756b0f064c9517b62a34	2022-06-10 17:16:28 +02:00
Nikita Popov	addc12fb1f	[InstSimplify] Name variables/labels in test (NFC) Run the test through -instnamer. Also drop irrelevant uwtable attribute.	2022-06-10 16:56:32 +02:00
Nikita Popov	0a5ec1f034	[InstSimplify] Regenerate test checks (NFC)	2022-06-10 16:54:09 +02:00
Danila Malyutin	ed6c309d4b	[APFloat] Fix truncation of certain subnormal numbers Certain subnormals would be incorrectly rounded away from zero. Fixes #55838 Differential Revision: https://reviews.llvm.org/D127140	2022-06-08 21:54:35 +03:00
Sanjay Patel	abb21b54bc	[ConstProp] add tests for APFloat truncate miscompile; NFC issue #55838	2022-06-05 20:07:18 -04:00
Nikita Popov	356d47ccb9	[ValueTracking] Handle and/or on RHS of isImpliedCondition() isImpliedCondition() currently handles and/or on the LHS, but not on the RHS, resulting in asymmetric behavior. This patch adds two new implication rules: * LHS ==> (RHS1 \|\| RHS2) if LHS ==> RHS1 or LHS ==> RHS2 * LHS ==> !(RHS1 && RHS2) if LHS ==> !RHS1 or LHS ==> !RHS2 Differential Revision: https://reviews.llvm.org/D125551	2022-05-16 16:30:26 +02:00
Nikita Popov	f01c7583b5	[InstSimplify] Add additional implied condition tests (NFC)	2022-05-13 17:19:41 +02:00
Nikita Popov	ddfee07519	[InstSimplify] Fold and/or using implied conditions This adds two conjugated folds: * A \| B -> B if A implies B (https://alive2.llvm.org/ce/z/R6GU4j) * A & B -> A if A implies B (https://alive2.llvm.org/ce/z/EGMqyy) If A and B are icmps themselves, we will usually fold this through other logic already (though the tests show a couple additional cases we previously missed). However, isImpliedCond() also supports A being of the form X & Y, which allows us to handle cases like (X & Y) \| B where X implies B. This addresses the regression from D125398. Something that notably doesn't work yet is the (X \| Y) & B case. This is due to an asymmetry in the isImpliedCondition() implementation that will have to be addressed separately. Differential Revision: https://reviews.llvm.org/D125530	2022-05-13 15:09:14 +02:00
Nikita Popov	4de9a8ae3f	[InstSimplify] Add tests for and/or with implied conditions (NFC)	2022-05-13 12:08:16 +02:00
Benjamin Kramer	08b20f20d2	[ConstantFold] Use getFltSemantics instead of manually checking the type Simplifies the code and makes fpext/fptrunc constant folding not crash when the result is bf16.	2022-05-05 15:52:19 +02:00
Craig Topper	ac8c720d48	[IR] Allow constant folding (insertelement <vscale x 2 x i32> zeroinitializer, i32 0, i32 i32 0. Most of insertelement constant folding is blocked if the vector type is scalable. I believe we can make an exception for inserting null into an all zeros vector. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D123413	2022-04-15 17:44:32 -07:00
Kevin P. Neal	d43d9e1d5c	[FPEnv][InstSimplify] Fold fsub -0.0, -X ==> X Currently the fsub optimizations in InstSimplify don't know how to fold -0.0 - (-X) to X when the constrained intrinsics are used. This adds partial support. The rest of the support will come later with work on the IR matchers. This review is split out from D107285. Differential Revision: https://reviews.llvm.org/D123396	2022-04-14 11:48:54 -04:00
Nikita Popov	1d530b914e	[InstSimplify] Don't fold phi of poison and trapping const expr (PR49839) Folding this case would result in the constant expression being executed unconditionally, which may introduce a new trap. Fixes https://github.com/llvm/llvm-project/issues/49839.	2022-04-12 17:32:25 +02:00
Nikita Popov	bc6d7ed8a9	[InstSimplify] Add test for PR49839 (NFC)	2022-04-12 17:32:25 +02:00
Nikita Popov	659871cede	[ConstantFold] Add test for load of i8 from i1 (NFC) Semantics here are a bit unclear, but the store-to-load forwarding case at least should be a miscompile.	2022-04-08 16:32:51 +02:00
Hirochika Matsumoto	447a4485c5	[InstSimplify] Fold (ctpop(X) == N) \|\| (X != 0) into X != 0 where N > 0 (ctpop(X) == N) \|\| (X != 0) --> (X != 0) https://alive2.llvm.org/ce/z/udgUVV (ctpop(X) != N) && (X == 0) --> (X == 0) https://alive2.llvm.org/ce/z/9dq-cR Differential Revision: https://reviews.llvm.org/D122757	2022-04-04 23:23:34 +09:00
Nikita Popov	3c9f3f76f1	[ConstantFold] Fold zero-index GEPs with opaque pointers With opaque pointers, we can eliminate zero-index GEPs even if they have multiple indices, as this no longer impacts the result type of the GEP. This optimization is already done for instructions in InstSimplify, but we were missing the corresponding constant expression handling. The constexpr transform is a bit more powerful, because it can produce a vector splat constant and also handles undef values -- it is an extension of an existing single-index transform.	2022-04-04 13:04:27 +02:00
Nikita Popov	d092df42f3	[InstSimplify] Add tests for zero-offset opaque ptr constexpr GEP (NFC)	2022-04-04 13:04:26 +02:00
Dávid Bolvanský	11b41910dd	[NFCI] Regenerate instsimplify test checks	2022-04-03 20:55:15 +02:00
Hirochika Matsumoto	f138a9964b	Reapply "[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop" This change was previously reverted because I forgot rerunning update_test_checks.py and tests were not actually baseline. Extracted from: https://reviews.llvm.org/D122757	2022-04-03 22:07:04 +09:00
Hirochika Matsumoto	f65c78a094	Revert "[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop" This reverts commit b48abeea44ac3c7860b13b863210116e8db1d978. Accidentally added already optimized tests, not baseline tests.	2022-04-03 02:27:59 +09:00
Hirochika Matsumoto	b48abeea44	[InstSimplify][NFC] Add baseline tests for folds of icmp with ctpop Extracted from: https://reviews.llvm.org/D122757	2022-04-03 02:19:24 +09:00
Kevin P. Neal	bd050a34fe	[FPEnv][InstSimplify] Teach CannotBeNegativeZero() about constrained intrinsics. Currently some optimizations are disabled because llvm::CannotBeNegativeZero() does not know how to deal with the constrained intrinsics. This patch fixes that by extending the existing implementation. Differential Revision: https://reviews.llvm.org/D121483	2022-03-18 10:24:48 -04:00
Kevin P. Neal	ae6db2f3d8	Precommit test for D121483: [FPEnv][InstSimplify] Teach CannotBeNegativeZero() about constrained intrinsics.	2022-03-17 15:03:05 -04:00
Nikita Popov	02c2106002	[InstSimplify] Handle vector GEP when simplifying zero indices If the base is a scalar and the index is a vector, we can't simplify, as this is effectively a splat operation.	2022-03-11 10:56:44 +01:00
Serge Pavlov	6982c38cb1	[ConstantFolding] Fix folding of constrained compare intrinsics The change fixes treatment of constrained compare intrinsics if compared values are of vector type. Differential revision: https://reviews.llvm.org/D110322	2022-02-27 10:19:19 +07:00
Sanjay Patel	fc3b34c508	[InstSimplify] remove shift that is redundant with part of funnel shift In D111530, I suggested that we add some relatively basic pattern-matching folds for shifts and funnel shifts and avoid a more specialized solution if possible. We can start by implementing at least one of these in IR because it's easier to write the code and verify with Alive2: https://alive2.llvm.org/ce/z/qHpmNn This will need to be adapted/extended for SDAG to handle the motivating bug ( #49541 ) because the patterns only appear later with that example (added some tests: bb850d422b64) This can be extended within InstSimplify to handle cases where we 'and' with a shift too (in that case, kill the funnel shift). We could also handle patterns where the shift and funnel shift directions are inverted, but I think it's better to canonicalize that instead to avoid pattern-match case explosion. Differential Revision: https://reviews.llvm.org/D120253	2022-02-23 09:10:01 -05:00
Sanjay Patel	ee5580a8eb	[InstSimplify] add tests for funnel shift with redundant shift; NFC	2022-02-21 10:24:46 -05:00
Philip Reames	ff2e4c04c4	[instsimplify] Assume storage for byval args doesn't overlap allocas, globals, or other byval args This allows us to discharge many pointer comparisons based on byval arguments. Differential Revision: https://reviews.llvm.org/D120133	2022-02-18 11:08:01 -08:00
Philip Reames	a259e62bb6	[instsimplify] Add a couple more pointer compare folding tests [NFC]	2022-02-18 08:24:06 -08:00
Philip Reames	1cf790bd04	[instsimplify] Add pointer compare tests for byval args and globals	2022-02-18 07:50:57 -08:00
Philip Reames	9f7075de5c	{instsimplify] Precommit some tests for provable inequal pointers derived from allocas	2022-02-17 12:06:06 -08:00
Philip Reames	cf5e88864b	[instsimplify] When compare allocas, consider their minimal size The code was using exact sizing only, but since what we really need is just to make sure the offsets are in bounds, a minimum bound on the object size is sufficient. To demonstrate the difference, support computing minimum sizes from obects of scalable vector type.	2022-02-17 09:53:24 -08:00
Philip Reames	2404313d80	[instsimplify] Fix a miscompile with zero sized allocas Remove some code which tried to handle the case of comparing two allocas where an object size could not be precisely computed. This code had zero coverage in tree, and at least one nasty bug. The bug comes from the fact that the code uses the size of the result pointer as a proxy for whether the alloca can be of size zero. Since the result of an alloca is always a pointer type, and a pointer type can never be empty, this check was a nop. As a result, we blindly consider a zero offset from two allocas to never be equal. They can in fact be equal when one or more of the allocas is zero sized. This is particularly ugly because instcombine contains the exact opposite rule. If instcombine reaches the allocas first, it combines them into one (making them equal). If instsimplify reaches the compare first, it would consider them not equal. This creates all kinds of fun scenarios for order of optimization reaching different and contradictory conclusions.	2022-02-17 09:27:34 -08:00
Philip Reames	7eb3ce997a	[instsimplify] Precommit a test showing an alloca equality miscompile	2022-02-17 09:16:31 -08:00
Kevin P. Neal	c7400892ca	[FPEnv][InstSimplify] Fold fsub X, -0 ==> X, when we know X is not -0 Currently the fsub optimizations in InstSimplify don't know how to fold X - -0.0 to X when we know X is not zero and the constrained intrinsics are used. This adds the support. This review is split out from D107285. Differential Revision: https://reviews.llvm.org/D119746	2022-02-16 10:10:13 -05:00

1 2 3 4 5 ...

1107 Commits