llvm-project

Author	SHA1	Message	Date
Florian Hahn	bba4a1daff	[ArgPromotion] Remove incorrect TranspBlocks set for loads. (#84835 ) The TranspBlocks set was used to cache aliasing decision for all processed loads in the parent loop. This is incorrect, because each load can access a different location, which means one load not being modified in a block doesn't translate to another load not being modified in the same block. All loads access the same underlying object, so we could perhaps use a location without size for all loads and retain the cache, but that would mean we loose precision. For now, just drop the cache. Fixes https://github.com/llvm/llvm-project/issues/84807 PR: https://github.com/llvm/llvm-project/pull/84835	2024-03-12 09:47:42 +00:00
Florian Hahn	31ffdb56b4	[ArgPromotion] Add test case for #84807 . Test case for https://github.com/llvm/llvm-project/issues/84807, showing a mis-compile in ArgPromotion.	2024-03-11 21:06:15 +00:00
Nikita Popov	2d69827c5c	[Transforms] Convert tests to opaque pointers (NFC)	2024-02-05 11:57:34 +01:00
Jeremy Morse	d2d9dc8eb4	[DebugInfo][RemoveDIs] Make debugify pass convert to/from RemoveDIs mode (#73251 ) Debugify is extremely useful as a testing and debugging tool, and a good number of LLVM-IR transform tests use it. We need it to support "new" non-instruction debug-info to get test coverage, but it's not important enough to completely convert right now (and it'd be a large undertaking). Thus: convert to/from dbg.value/DPValue mode on entry and exit of the pass, which gives us the functionality without any further work. The cost is compile-time, but again this is only happening during tests. Tested by: the large set of debugify tests enabled here. Note the InstCombine test (cast-mul-select.ll) that hasn't been fully enabled: this is because there's a debug-info sinking piece of code there that hasn't been instrumented.	2023-11-29 13:19:50 +00:00
Nikita Popov	c7aacbb5b6	[ArgPromotion] Update allocsize indices after promotion Promotion can add/remove arguments. We need to update the indices in the allocsize attribute accordingly. Fixes https://github.com/llvm/llvm-project/issues/66103.	2023-09-18 16:15:16 +02:00
Matt Arsenault	25bc999d1f	Intrinsics: Add type overload to stacksave and stackstore This allows use with non-0 address space stacks. llvm_ptr_ty should never be used. This could use some more percolation up through mlir, but this is enough to fix existing tests. https://reviews.llvm.org/D156666	2023-08-09 18:33:11 -04:00
Matt Arsenault	f3c9e5807f	Analysis: Fix assertion when load alignment exceeds address space size Apparently the maximum alignment no longer fits in 32-bits now, which overflows a 32-bit offset and would fail on the isPowerOf2 assert.	2023-06-30 12:31:32 -04:00
Tobias Hieta	f84bac329b	[NFC][Py Reformat] Reformat lit.local.cfg python files in llvm This is a follow-up to b71edfaa4ec3c998aadb35255ce2f60bba2940b0 since I forgot the lit.local.cfg files in that one. Reformatting is done with `black`. If you end up having problems merging this commit because you have made changes to a python file, the best way to handle that is to run git checkout --ours <yourfile> and then reformat it with black. If you run into any problems, post to discourse about it and we will try to help. RFC Thread below: https://discourse.llvm.org/t/rfc-document-and-standardize-python-code-style Reviewed By: barannikov88, kwk Differential Revision: https://reviews.llvm.org/D150762	2023-05-17 17:03:15 +02:00
Shoaib Meenai	0e2b4b2dba	Revert "[ArgumentPromotion] Bail if any callers are minsize" This reverts commit 8b8466fd31e5a194fd8ba7a73a0f23d32f164318. This is causing size regressions with -Oz and FullLTO. Revert while I come up with a repro.	2023-05-05 14:26:57 -07:00
Arthur Eubanks	8b8466fd31	[ArgumentPromotion] Bail if any callers are minsize Argument promotion mostly works on functions with more than one caller (otherwise the function would be inlined or is dead), so there's a good chance that performing this increases code size since we introduce loads at every call site. If any caller is marked minsize, bail. We could compare the number of loads/stores removed from the function with the number of loads introduced in callers, but that's TODO. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D149768	2023-05-03 11:29:15 -07:00
Yonghong Song	da816c2985	[TTI][BPF] Ensure ArgumentPromotion Not Exceeding Target MaxArgs With LLVM patch https://reviews.llvm.org/D148269, we hit a linux kernel bpf selftest compilation failure like below: ... progs/test_xdp_noinline.c:739:8: error: too many args to t8: i64 = GlobalAddress<ptr @encap_v4> 0, progs/test_xdp_noinline.c:739:8 if (!encap_v4(xdp, cval, &pckt, dst, pkt_bytes)) ^ ... progs/test_xdp_noinline.c:321:6: error: defined with too many args bool encap_v4(struct xdp_md xdp, struct ctl_value cval, ^ ... Note that bpf selftests are compiled with -O2 which is the recommended flag for bpf community. The bpf backend calling convention is only allowing 5 parameters in registers and does not allow pass arguments through stacks. In the above case, ArgumentPromotionPass replaced parameter '&pckt' as two parameters, so the total number of arguments after ArgumentPromotion pass becomes 6 and this caused later compilation failure during instruction selection phase. This patch added a TargetTransformInfo hook getMaxNumArgs() which returns 5 for BPF and UINT_MAX for other targets. Differential Revision: https://reviews.llvm.org/D148551	2023-04-19 09:09:20 -07:00
Nikita Popov	b066505d88	[ArgPromotion] Require noundef to copy poison-generating metadata For poison-generating (rather than IUB) metadata, only copy it from the dominating must-exec load if it is combined with !noundef. This could be further extended by additionall intersecting the metadata from all loads, which does not require !noundef.	2023-04-05 14:34:33 +02:00
Nikita Popov	4923b4dbac	[Local] Check for null VH in RecursivelyDeleteTriviallyDeadInstructionsPermissive() Peculiarly, the non-permissive variant handled this gracefully, but the permissive one did not.	2023-03-24 12:56:06 +01:00
Jeff Byrnes	7739be7c6b	[ArgPromotion] Remove dead code produced by removing dead arguments ArgPromotion currently produces phantom / dead loads. A good example of this is store-into-inself.ll. First, ArgPromo finds the promotable argument %p in @l. Then it inserts a load of %p in the caller, and passes instead the loaded value / transforms the function body. PromoteMem2Reg is able to optimize out the entire function body, resulting in an unused argument. In a subsequent ArgPromotion pass, it removes the dead argument, resulting in a dead load in the caller. These dead loads may reduce effectiveness of other transformations (e.g. SimplifyCFG, MergedLoadStoreMotion). This patch removes loads and geps that are made dead in the caller after removal of dead args. Differential Revision: https://reviews.llvm.org/D146327	2023-03-23 09:43:35 -07:00
Jeff Byrnes	08622314d2	Precommit tests for D146327	2023-03-22 12:23:28 -07:00
Nikita Popov	e6241cbdcb	[Mem2Reg] Only convert !nonnull to assume if !noundef present After D141386 !nonnull violation returns poison rather than resulting in immediate undefined behavior. However, converting it into an assume would result in IUB. As such, we can only perform this transform if !noundef is also present.	2023-01-20 16:38:26 +01:00
Nikita Popov	bcbc615164	[ArgPromotion] Convert tests to opaque pointers (NFC) update_test_checks was rerun for some of those, because we use a different GEP representation with opaque pointers.	2022-12-23 09:53:50 +01:00
Roman Lebedev	679eaeb2f6	[NFC] Port all ArgumentPromotion tests to `-passes=` syntax	2022-12-08 02:38:40 +03:00
Phoebe Wang	19c5638e4f	[ArgPromotion] Transfer metadata nontemporal to promoted loads Fixes #56703 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D130536	2022-07-26 16:30:08 +08:00
Pavel Samolysov	170c4d21bd	[ArgPromotion] Unify byval promotion with non-byval It makes sense to handle byval promotion in the same way as non-byval but also allowing `store` instructions. However, these should use the same checks as the `load` instructions do, i.e. be part of the `ArgsToPromote` collection. For these instructions, the check for interfering modifications can be disabled, though. The promotion algorithm itself has been modified a lot: all the accesses (i.e. loads and stores) are rewritten to the emitted `alloca` instructions. To optimize these new `alloca`s out, the `PromoteMemToReg` function from `Transforms/Utils/PromoteMemoryToRegister.cpp` file is invoked after promotion. In order to let the `PromoteMemToReg` promote as many `alloca`s as it is possible, there should be no `GEP`s from the `alloca`s. To eliminate the `GEP`s, its own `alloca` is generated for every argument part because a single `alloca` for the whole argument (that significantly simplifies the code of the pass though) unfortunately cannot be used. The idea comes from the following discussion: https://reviews.llvm.org/D124514#3479676 Differential Revision: https://reviews.llvm.org/D125485	2022-06-28 15:19:58 +03:00
Nikita Popov	217e85761c	[ArgPromotion] Remove legacy PM support Support for the legacy pass manager in ArgPromotion causes complications in D125485. As the legacy pass manager for middle-end optimizations is unsupported, drop ArgPromotion from the legacy pipeline, rather than introducing additional complexity to deal with it. Differential Revision: https://reviews.llvm.org/D128536	2022-06-27 09:42:17 +02:00
Pavel Samolysov	d81064949f	[ArgPromotion] Add unused-argument.ll test (NFC) If a pointer argument is unused within the callee, this argument should be removed from the function's signature while all used pointer arguments should be promoted as it is expected. The ArgumentPromotion pass doesn't touch unused non-pointer arguments at all.	2022-05-18 10:05:13 +03:00
Pavel Samolysov	d6852155b9	[ArgPromotion] Add tests for already seen offsets (NFC) If a load with the same offset has already been seen but the load had a lower alignment, the pass has to check whether the pointer is dereferenceable and is sufficiently aligned (so, the new alignment must be taken into account).	2022-05-13 13:29:38 +03:00
Pavel Samolysov	098afdb0a0	[ArgPromotion] Make a non-byval promotion attempt first It makes sense to make a non-byval promotion attempt first and then fall back to the byval one. The non-byval ('usual') promotion is generally better, for example it does promotion even when a structure has more elements than 'MaxElements' but not all of them are actually used in the function. Differential Revision: https://reviews.llvm.org/D124514	2022-05-12 16:44:52 +02:00
Phoebe Wang	7c04454227	[ArgPromotion][Attributor] Update min-legal-vector-width when do promotion X86 codegen uses function attribute `min-legal-vector-width` to select the proper ABI. The intention of the attribute is to reflect user's requirement when they passing or returning vector arguments. So Clang front-end will iterate the vector arguments and set `min-legal-vector-width` to the width of the maximum for both caller and callee. It is assumed any middle end optimizations won't care of the attribute expect inlining and argument promotion. - For inlining, we will propagate the attribute of inlined functions because the inlining functions become the newer caller. - For argument promotion, we check the `min-legal-vector-width` of the caller and callee and refuse to promote when they don't match. The problem comes from the optimizations' combination, as shown by https://godbolt.org/z/zo3hba8xW. The caller `foo` has two callees `bar` and `baz`. When doing argument promotion, both `foo` and `bar` has the same `min-legal-vector-width`. So the argument was promoted to vector. Then the inlining inlines `baz` to `foo` and updates `min-legal-vector-width`, which results in ABI mismatch between `foo` and `bar`. This patch fixes the problem by expanding the concept of `min-legal-vector-width` to indicator of functions arguments. That says, any passes touch functions arguments have to set `min-legal-vector-width` to the value reflects the width of vector arguments. It makes sense to me because any arguments modifications are ABI related and should response for the ABI compatibility. Differential Revision: https://reviews.llvm.org/D123284	2022-05-02 14:13:05 +08:00
Pavel Samolysov	6b825e50f7	[ArgPromotion] Change the condition to check the promotion limit The condition should be 'ArgParts.size() > MaxElements', so that if we have exactly 3 elements in the 'ArgParts' vector, the promotion should be allowed because the 'MaxElement' threshold is not exceeded yet. The default value for 'MaxElement' has been decreased to 2 in order to avoid an actual change in argument promoting behavior. However, this changes byval argument transformation behavior by allowing adding not more than 2 arguments to the function instead of 3 allowed before. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D124178	2022-04-28 09:42:58 -07:00
Arthur Eubanks	51561b5e80	[ArgPromo][OpaquePointer] Don't promote mismatched function types Mismatched call/callee function types is considered an indirect call. Fixes crash in https://reviews.llvm.org/D123300#3446023.	2022-04-12 15:17:45 -07:00
Valery Pykhtin	152325d2f3	[ArgPromotion] Regenerate test checks for crash.ll – restored ALL_OLDPM prefix, add –allow-unused-prefixes. This test has two runs that differ in what functions are left after the inliner, for example: barney exists on OLDPM path but don’t exist on NEWPM path. I restored prefixes this test had had after automatic checks were introduced for this test. For now there are no checks left for ALL_NEWPM path, but the behavior seem to change over time so I added –allow-unused-prefixes to ease following check updates. Renamed %tmp => %temp IR values to avoid update warning. Differential revision: https://reviews.llvm.org/D120207	2022-02-23 13:39:46 +03:00
Nico Weber	c31ef42530	Revert "[ArgPromotion] Regenerate test checks for crash.ll - removed ALL_NEWPM prefix." This reverts commit 52577cd26f26f6428c72395e7337af3fc84bc6f6. Breaks check-llvm, see comments on https://reviews.llvm.org/D120207	2022-02-21 13:29:37 -05:00
Valery Pykhtin	52577cd26f	[ArgPromotion] Regenerate test checks for crash.ll - removed ALL_NEWPM prefix. Rename %tmp => %temp IR values to avoid update warning. Reviewed by Nikita Popov Differential revision: https://reviews.llvm.org/D120207	2022-02-21 19:18:39 +03:00
Valery Pykhtin	29d2ae59e4	[ArgPromotion] Regenerate test checks for dead-gep-no-promotion.ll with --function-signature option (otherwise filecheck gets confused).	2022-02-20 15:00:18 +03:00
Valery Pykhtin	a2ce8df49b	[ArgPromotion] auto-update test checks. Rename %tmp => %temp IR values to avoid update warning.	2022-02-20 13:23:12 +03:00
Nikita Popov	e24067819f	[ArgPromotion] Protect harder against recursive promotion (PR42028) In addition to the self-recursion check, also check whether there is more than one node in the SCC, which implies that there is a larger cycle. I believe checking SCC structure (rather than something like norecurse) is the right thing to do here, because this is specifically about preventing infinite loops over the SCC. Fixes https://github.com/llvm/llvm-project/issues/42028. Differential Revision: https://reviews.llvm.org/D119418	2022-02-11 09:30:39 +01:00
Nikita Popov	8018d6be34	[ArgPromotion] Transfer metadata to promoted loads Also transfer selected non-AA metadata to the promoted load. Only metadata from guaranteed to execute loads is transferred.	2022-02-10 11:28:07 +01:00
Nikita Popov	e76c697106	[ArgPromotion] Add test for metadata on promoted loads (NFC)	2022-02-10 11:28:07 +01:00
Nikita Popov	68c1eeb4ba	[ArgPromotion] Make implementation offset based This rewrites ArgPromotion to be based on offsets rather than GEP structure. We inspect all loads at constant offsets and remember which types are loaded at which offsets. Then we promote based on those types. This generalizes ArgPromotion to work with bitcasted loads, and is compatible with opaque pointers. This patch also fixes incorrect handling of alignment during argument promotion. Previously, the implementation only checked that the pointer is dereferenceable, but was happy to speculate overaligned loads. (I would have fixed this separately in advance, but I found this hard to do with the previous implementation approach). Differential Revision: https://reviews.llvm.org/D118685	2022-02-09 09:35:01 +01:00
Nikita Popov	b896334834	[ArgPromotion] Check dereferenceability on argument as well Before walking all the callers, check whether we have a dereferenceable attribute directly on the argument. Also make it clearer that the code currently does not treat alignment correctly.	2022-02-08 10:29:51 +01:00
Nikita Popov	c2b476767e	[ArgPromotion] Test dereferenceable annotation on callee (NFC) While we check dereferenceability of all callers, we don't check dereferenceability annotations on the callee.	2022-02-08 10:27:17 +01:00
Nikita Popov	8af8119177	[ArgPromotion] Add test with bitcasts (NFC) Argument promotion currently doesn't handle these.	2022-02-02 14:46:27 +01:00
Nikita Popov	be20ee67e5	[ArgPromotion] Add test for volatile and atomic loads (NFC) Argument promotion does handle these correctly (by not promoting them), but there were no tests to ensure this.	2022-02-02 09:44:28 +01:00
Nikita Popov	a24cc48bc6	[ArgPromotion] Add alignment test (NFC) This shows a miscompile in the current argpromotion implementation: We may speculatively execute overaligned loads.	2022-02-01 10:45:14 +01:00
Nikita Popov	db04266bf6	[ArgPromotion] Regenerate test checks (NFC)	2022-02-01 10:34:14 +01:00
Nikita Popov	0ebbf3435f	[ArgPromotion] Don't assume all entry block instrs are executed We should abort this walk if we hit any instruction that is not guaranteed to transfer.	2022-01-28 16:08:42 +01:00
Nikita Popov	2dc45bf4de	[ArgPromotion] Add test for non-willreturn load hoisting (NFC) In this case, we have no guarantee that the load is unconditionally executed, so the argument promotion is not legal.	2022-01-28 16:08:42 +01:00
Bjorn Pettersson	3f8027fb67	[test] Update some test cases to use -passes when specifying the pipeline This updates transform test cases for ADCE AddDiscriminators AggressiveInstCombine AlignmentFromAssumptions ArgumentPromotion BDCE CalledValuePropagation DCE Reg2Mem WholeProgramDevirt to use the -passes syntax when specifying the pipeline. Given that LLVM_ENABLE_NEW_PASS_MANAGER isn't set to off (which is a deprecated feature) the updated test cases already used the new pass manager, but they were using the legacy syntax when specifying the passes to run. This patch can be seen as a step toward deprecating that interface. This patch also removes some redundant RUN lines. Here I am referring to test cases that had multiple RUN lines verifying both the legacy "-passname" syntax and the new "-passes=passname" syntax. Since we switched the default pass manager to "new PM" both RUN lines have verified the new PM version of the pass (more or less wasting time running the same test twice), unless LLVM_ENABLE_NEW_PASS_MANAGER is set to "off". It is assumed that it is enough to run these tests with the new pass manager now. Differential Revision: https://reviews.llvm.org/D108472	2021-09-29 21:51:08 +02:00
Arthur Eubanks	d350dd8ba2	[test] Properly match parameter/argument ABI attributes These were found with D103412.	2021-05-31 09:12:18 -07:00
Eli Friedman	61cbbba7a6	[ArgumentPromotion] Fix byval alignment handling. Make sure the alignment of the generated operations matches the alignment of the byval argument. Previously, we were just ignoring alignment and getting lucky. While I'm here, also delete the unnecessary "tail" handling. Passing a pointer to a byval argument to a "tail" call is UB, so rewriting to an alloca doesn't require any special handling. Differential Revision: https://reviews.llvm.org/D89819	2021-05-11 11:22:18 -07:00
Matt Arsenault	9a0c9402fa	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 07e46367baeca96d84b03fa215b41775f69d5989.	2021-03-29 08:55:30 -04:00
Oliver Stannard	07e46367ba	Revert "Reapply "OpaquePtr: Turn inalloca into a type attribute"" Reverting because test 'Bindings/Go/go.test' is failing on most buildbots. This reverts commit fc9df309917e57de704f3ce4372138a8d4a23d7a.	2021-03-29 11:32:22 +01:00
Matt Arsenault	fc9df30991	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit 20d5c42e0ef5d252b434bcb610b04f1cb79fe771.	2021-03-28 13:35:21 -04:00

1 2 3 4

166 Commits