llvm-project

Author	SHA1	Message	Date
Bjorn Pettersson	ac696ac453	Use opt -passes=<name> instead of opt -name Updated the RUN line in several test cases to use the new PM syntax opt -passes=<pipeline> instead of the deprecated syntax opt -pass1 -pass2	2022-11-08 12:15:42 +01:00
Arthur Eubanks	c384b20b55	[opt] Remove temporary legacy pass name translations And update corresponding tests.	2022-10-07 11:09:46 -07:00
Sanjay Patel	2981a94902	[EarlyCSE][ConstantFolding] do not constant fold atan2(+/-0.0, +/-0.0), part 2 Follow-up to 7f1262a322c0d80f3. That patch avoided removing the call, but it still allowed the constant-folded result. This makes the behavior consistent with 1-arg libm folding: if the call potentially raises an exception, then we just bail out. It seems likely that there are other corner-cases like this, but the tests are incomplete, so we have lived with these discrepancies for a long time. This was untested before the the constant folding was expanded in D127964.	2022-08-20 10:16:06 -04:00
Sanjay Patel	7f1262a322	[EarlyCSE][ConstantFolding] do not constant fold atan2(+/-0.0, +/-0.0) These may raise an error (set errno) as discussed in the post-commit comments for D127964, so we can't fold away the call and potentially alter that behavior.	2022-08-19 12:27:29 -04:00
Sanjay Patel	4bff1037bb	[EarlyCSE][ConstantFolding] add tests for atan2 with zero args; NFC	2022-08-19 12:18:53 -04:00
Kevin P. Neal	05ac82de40	[FPEnv][EarlyCSE] Support for CSE when exception behavior is "ignore" or "maytrap" and the rounding mode is known. Previously we would only CSE constrained FP intrinsics in the default floating point environment. Exception behavior of "strict" is still not allowed since we are not allowed to remove any traps in that case. There are no restrictions on CSE across function calls inside a function. Differential Revision: https://reviews.llvm.org/D112256	2022-08-16 08:31:42 -04:00
Sanjay Patel	43dd567443	[EarlyCSE] allow flexibility in atan(-0.0) test As discussed in the post-commit feedback for b53d44fe47413c87f619b, this test was failing on AIX because atan(-0.0) results in 0.0 (positive). Differential Revision: https://reviews.llvm.org/D131601	2022-08-10 15:02:01 -04:00
Mohammed Nurul Hoque	30abc1a6a1	[ConstantFolding] Eliminate atan and atan2 calls From the opengroup specifications, atan2 may fail if the result underflows and atan may fail if the argument is subnormal, but we assume that does not happen and eliminate the calls if we can constant fold the result at compile-time. Differential Revision: https://reviews.llvm.org/D127964	2022-08-10 11:01:50 -04:00
Jake Egan	c1226585b3	[AIX][tests] XFAIL for system-aix instead The Clang folding for floating-point sometimes calls out to the host.	2022-08-10 09:31:42 -04:00
Jake Egan	6da3f90195	[AIX][tests] XFAIL atan.ll test on AIX XFAIL this newly added test for now to get the AIX bot back to green.	2022-08-09 09:58:08 -04:00
Sanjay Patel	59f3b3d796	[EarlyCSE][ConstantFolding] move test files to dir of pass in RUN line; NFC	2022-08-08 10:08:55 -04:00
Mohammed Nurul Hoque	b53d44fe47	[EarlyCSE][ConstantFolding] add tests for atan/atan2; NFC Baseline coverage for D127964.	2022-08-08 09:24:58 -04:00
Denis Antrushin	36cc533471	[EarlyCSE][OpaquePointers]Replace assert with return for mask type check. When EarlyCSE tries to common vector masked loads/stores, it first checks that they have same base operand and then assumes that this is enough for mask types to be equal. This is true for typed pointers but false for opaque ones - two loads of different vector sizes from same base pointer '%b' are the same, `ptr %b`. (For typed pointers, `%b` was cast to vector pointer type so bases were different). Change assert to return from lambda `isSubmask` so this transformation properly works with opaque pointers. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D131251	2022-08-08 16:14:42 +03:00
Chris Bieneman	383e754072	NFC. Require DirectX backend for these tests Should have added this when I added the test directory. This just requires the DirectX target for running these tests.	2022-08-03 15:55:03 -05:00
Chris Bieneman	ee4d815008	[DX] Remove IntrNoMem from create handle intrinsic The create handle intrinsic calls can't be removed, so it was incorrect to mark them as IntrNoMem.	2022-08-02 16:57:22 -05:00
Kevin P. Neal	25a83005ef	Precommit tests for D112256 "[FPEnv][EarlyCSE] Add support for CSE of constrained FP intrinsics, take 2"	2022-07-28 08:59:27 -04:00
Nikita Popov	60a32157a5	[Tests] Remove unnecessary bitcasts from opaque pointer tests (NFC) Previously left these behind due to the required instruction renumbering, drop them now. This more accurately represents opaque pointer input IR. Also drop duplicate opaque pointer check lines in one SROA test.	2022-06-22 14:15:46 +02:00
Florian Hahn	b8d728a098	[SimplifyCFG,EarlyCSE] Update 2 tests to not branch on undef (NFC).	2022-06-12 18:03:26 +01:00
Nikita Popov	3c514d31d7	[EarlyCSE] Update tests to use opaque pointers (NFC) Update the EarlyCSE tests to use opaque pointers. Worth noting that this leaves some bitcast ptr to ptr instructions in the input IR behind which are no longer necessary. This is because these use numbered instructions, so it's hard to drop them in an automated fashion (as it would require renumbering all other instructions as well). I'm leaving that as a problem for another day. The test updates have been performed using https://gist.github.com/nikic/98357b71fd67756b0f064c9517b62a34. Differential Revision: https://reviews.llvm.org/D127278	2022-06-10 09:53:35 +02:00
Artur Pilipenko	5ee0123642	[EarlyCSE] Add tests demonstrating missed opportunitites Add tests demonstrating missed opportunitites around invariant.start intrinsic. NFC.	2022-04-26 11:58:16 -07:00
Arthur Eubanks	af6b9939aa	[EarlyCSE][OpaquePtr] Check access type when performing DSE This will bail out on target specific intrinsics. If those are deemed important enough for EarlyCSE to handle, we can augment MemIntrinsicInfo with an access type for TargetTransformInfo::getTgtMemIntrinsic() to handle. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D120077	2022-02-17 11:58:53 -08:00
Nikita Popov	46f9e45ef0	[Statepoint] Update gc.statepoint calls in tests with elementtype (NFC) This updates tests for the LangRef change in D117890.	2022-02-04 14:15:41 +01:00
Nikita Popov	60147c6034	[EarlyCSE] Regenerate test checks (NFC)	2022-01-20 14:49:26 +01:00
Nikita Popov	918015c9ba	[EarlyCSE] Support opaque pointers Explicitly check the load/store value type, because this is no longer implicitly checked through the pointer type.	2022-01-06 17:08:50 +01:00
Florian Hahn	361111906b	[EarlyCSE] Retain poison flags, if program is UB if poison. Poison-generating flags can be retained during CSE on the earlier instruction , if the earlier instruction being poison causes UB. For now, always take AND for floating point instructions. https://alive2.llvm.org/ce/z/4K3D7P Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D115247	2021-12-11 15:11:44 +00:00
Florian Hahn	22e6094b20	[EarlyCSE] Add test case with inbounds gep where flags can be retained.	2021-12-07 13:46:25 +00:00
Florian Hahn	aca7a19039	[EarlyCSE] Auto-generate check lines for flags.ll. The test already checks the full IR. To make updating easier, auto-generate the check lines.	2021-12-07 13:46:13 +00:00
Bjorn Pettersson	d52f506192	[NewPM] Use parameterized syntax for a couple of more passes A couple of passes that are parameterized in new-PM used different pass names (in cmd line interface) while using the same pass class name. This patch updates the PassRegistry to model pass parameters more properly using PASS_WITH_PARAMS. Reason for the change is to ensure that we have a 1-1 mapping between class name and pass name (when disregarding the params). With a 1-1 mapping it is more obvious which pass name to use in options such as -debug-only, -print-after etc. The opt -passes syntax is changed for the following passes: early-cse-memssa => early-cse<memssa> post-inline-ee-instrument => ee-instrument<post-inline> loop-extract-single => loop-extract<single> lower-matrix-intrinsics-minimal => lower-matrix-intrinsics<minimal> This patch is not updating pass names in docs/Passes.rst. Not quite sure what the status is for that document (e.g. when it comes to listing pass paramters). It is only loop-extract-single that is mentioned in Passes.rst today, out of the passes mentioned above. Differential Revision: https://reviews.llvm.org/D108362	2021-08-20 14:59:21 +02:00
Kevin P. Neal	f21f1eea05	[FPEnv] EarlyCSE support for constrained intrinsics, default FP environment edition EarlyCSE cannot distinguish between floating point instructions and constrained floating point intrinsics that are marked as running in the default FP environment. Said intrinsics are supposed to behave exactly the same as the regular FP instructions. Teach EarlyCSE to handle them in that case. Differential Revision: https://reviews.llvm.org/D99962	2021-05-20 14:40:51 -04:00
Philip Reames	6972e39d47	[gvn] CSE gc.relocates based on meaning, not spelling (try 2) This was (partially) reverted in cfe8f8e0 because the conversion from readonly to readnone in Intrinsics.td exposed a couple of problems. This change has been reworked to not need that change (via some explicit checks in client code). This is being done to address the original optimization issue and simplify the testing of the readonly changes. I'm working on that piece under 49607. Original commit message follows: The last two operands to a gc.relocate represent indices into the associated gc.statepoint's gc bundle list. (Effectively, gc.relocates are projections from the gc.statepoints multiple return values.) We can use this to recognize when two gc.relocates are equivalent (and can be CSEd), even when the indices are non-equal. This is particular useful when considering a chain of multiple statepoints as it lets us eliminate all duplicate gc.relocates in a single pass. Differential Revision: https://reviews.llvm.org/D97974	2021-03-16 10:59:31 -07:00
Serguei Katkov	cfe8f8e0f0	Revert "Mark gc.relocate and gc.result as readnone" As readnone function they become movable and LICM can hoist them out of a loop. As a result in LCSSA form phi node of type token is created. No one is ready that GCRelocate first operand is phi node but expects to be token. GVN test were also updated, it seems it does not do what is expected. Test for LICM is also added. This reverts commit f352463ade6e49c3b0275f296d9190d828b7630b.	2021-03-12 16:59:17 +07:00
Philip Reames	f352463ade	Mark gc.relocate and gc.result as readnone For some reason, we had been marking gc.relocates as reading memory. There's no known reason for this, and I suspect it to be a legacy of very early implementation conservatism. gc.relocate and gc.result are simply projections of the return values from the associated statepoint. Note that the LangRef has always declared them readnone. The EarlyCSE change is simply moving the special casing from readonly to readnone handling. As noted by the test diffs, this does allow some additional CSE when relocates are separated by stores, but since we generate gc.relocates in batches, this is unlikely to help anything in practice. This was reviewed as part of https://reviews.llvm.org/D97974, but split at reviewer request before landing. The motivation is to enable the GVN changes in that patch.	2021-03-05 10:07:17 -08:00
Philip Reames	9fe46d6487	[tests] precommit some additional tests for D97974	2021-03-05 10:04:07 -08:00
Philip Reames	8998b811c9	[tests] Expand coverage of gc.relocate CSE in early-cse	2021-03-04 12:12:55 -08:00
Jeroen Dobbelaere	121cac01e8	[noalias.decl] Look through llvm.experimental.noalias.scope.decl Just like llvm.assume, there are a lot of cases where we can just ignore llvm.experimental.noalias.scope.decl. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D93042	2021-01-19 20:09:42 +01:00
Juneyoung Lee	d3f1f7b6bc	[EarlyCSE] Use m_LogicalAnd/Or matchers to handle branch conditions EarlyCSE's handleBranchCondition says: ``` // If the condition is AND operation, we can propagate its operands into the // true branch. If it is OR operation, we can propagate them into the false // branch. ``` This holds for the corresponding select patterns as well. This is a part of an ongoing work for disabling buggy select->and/or transformations. See llvm.org/pr48353 and D93065 for more context Proof: and: https://alive2.llvm.org/ce/z/MQWodU or: https://alive2.llvm.org/ce/z/9GLbB_ Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D93842	2020-12-28 05:36:26 +09:00
Juneyoung Lee	0060f10134	[EarlyCSE] Add tests for select form of and/or (NFC)	2020-12-28 04:19:22 +09:00
Matt Arsenault	20c43d6bd5	OpaquePtr: Bulk update tests to use typed sret	2020-11-20 17:58:26 -05:00
Chen Zheng	4eb8359e74	[EarlyCSE] delete abs/nabs handling delete abs/nabs handling in earlycse pass to avoid bugs related to hashing values. After abs/nabs is canonicalized to intrinsics in D87188, we should get CSE ability for abs/nabs back. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D90734	2020-11-10 21:10:58 -05:00
Krzysztof Parzyszek	57f097420d	Clean up test file, NFC	2020-09-23 15:32:46 -05:00
Krzysztof Parzyszek	e976fb1e54	[EarlyCSE] Fix crash with expensive checks after D87691 D87691 reordered some checks, which turned out to be unsafe. More specifically, when examining a store instruction, the check against getOrCreateResult should be done before attempting to call isSameMemGeneration. Otherwise a crash in MSSA walker can occur. This patch restores the order of these calls to what it was originally.	2020-09-23 12:21:34 -05:00
Krzysztof Parzyszek	ae3f54c1e9	[EarlyCSE] Handle masked loads and stores Extend the handling of memory intrinsics to also include non- target-specific intrinsics, in particular masked loads and stores. Invent "isHandledNonTargetIntrinsic" to distinguish between intrin- sics that should be handled natively from intrinsics that can be passed to TTI. Add code that handles masked loads and stores and update the testcase to reflect the results. Differential Revision: https://reviews.llvm.org/D87340	2020-09-21 18:47:10 -05:00
Krzysztof Parzyszek	ae0ecb3c50	Pre-commit test for CSEing masked loads/stores	2020-09-18 14:30:53 -05:00
Michael Liao	41e68f7ee7	[EarlyCSE] Fix and recommit the revised c9826829d74e637163fdb0351870b8204e62d6e6 In addition to calculate hash consistently by swapping SELECT's operands, we also need to inverse the select pattern favor to match the original logic. [EarlyCSE] Equivalent SELECTs should hash equally DenseMap<SimpleValue> assumes that, if its isEqual method returns true for two elements, then its getHashValue method must return the same value for them. This invariant is broken when one SELECT node is a min/max operation, and the other can be transformed into an equivalent min/max by inverting its predicate and swapping its operands. This patch fixes an assertion failure that would occur intermittently while compiling the following IR: define i32 @t(i32 %i) { %cmp = icmp sle i32 0, %i %twin1 = select i1 %cmp, i32 %i, i32 0 %cmpinv = icmp sgt i32 0, %i %twin2 = select i1 %cmpinv, i32 0, i32 %i %sink = add i32 %twin1, %twin2 ret i32 %sink } Differential Revision: https://reviews.llvm.org/D86843	2020-09-10 23:30:56 -04:00
Michael Liao	39dc75f66c	Revert "[EarlyCSE] Equivalent SELECTs should hash equally" This reverts commit c9826829d74e637163fdb0351870b8204e62d6e6 as it breaks regression tests.	2020-09-10 22:37:35 -04:00
Bryan Chan	c9826829d7	[EarlyCSE] Equivalent SELECTs should hash equally DenseMap<SimpleValue> assumes that, if its isEqual method returns true for two elements, then its getHashValue method must return the same value for them. This invariant is broken when one SELECT node is a min/max operation, and the other can be transformed into an equivalent min/max by inverting its predicate and swapping its operands. This patch fixes an assertion failure that would occur intermittently while compiling the following IR: define i32 @t(i32 %i) { %cmp = icmp sle i32 0, %i %twin1 = select i1 %cmp, i32 %i, i32 0 %cmpinv = icmp sgt i32 0, %i %twin2 = select i1 %cmpinv, i32 0, i32 %i %sink = add i32 %twin1, %twin2 ret i32 %sink } Differential Revision: https://reviews.llvm.org/D86843	2020-09-10 16:59:24 -04:00
Florian Hahn	2bcc4db761	[EarlyCSE] Explicitly require AAResultsWrapperPass. The MemorySSAWrapperPass depends on AAResultsWrapperPass and if MemorySSA is preserved but AAResultsWrapperPass is not, this could lead to a crash when updating the last user of the MemorySSAWrapperPass. Alternatively AAResultsWrapperPass could be marked preserved by GVN, but I am not sure if that would be safe. I am not sure what is required in order to preserve AAResultsWrapperPass. At the moment, it seems like a couple of passes that do similar transforms to GVN are preserving it. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D87137	2020-09-09 09:14:50 +01:00
Krzysztof Parzyszek	889cf9bedf	[EarlyCSE] Add testcase for masked loads and stores, NFC	2020-09-08 19:52:04 -05:00
Bryan Chan	3404add468	[EarlyCSE] Verify hash code in regression tests As discussed in D86843, -earlycse-debug-hash should be used in more regression tests to catch inconsistency between the hashing and the equivalence check. Differential Revision: https://reviews.llvm.org/D86863	2020-09-04 10:40:35 -04:00
Sanjay Patel	e1a3038689	[EarlyCSE] add tests for fma/fmuladd; NFC	2020-09-03 09:11:54 -04:00

1 2 3 4

152 Commits