llvm-project

Author	SHA1	Message	Date
Nikita Popov	f5c02dd06e	[MemCpyOpt] Use EarliestEscapeInfo (#110280 ) Pass EarliestEscapeInfo to BatchAA in MemCpyOpt. This allows memcpy elimination in cases where one of the involved pointers is captured after the relevant memcpy/call.	2024-09-30 09:35:54 +02:00
Nikita Popov	f1c2331a64	[MemCpyOpt] Add additional tests for earliest escape (NFC)	2024-09-27 16:58:40 +02:00
Ramkumar Ramachandra	f664d313cd	MemCpyOpt: replace an AA query with MSSA query (NFC) (#108535 ) Fix a long-standing TODO.	2024-09-24 11:18:37 +01:00
Nikita Popov	2afe678f0a	[MemCpyOpt] Allow memcpy elision for non-noalias arguments (#107860 ) We currently elide memcpys for readonly nocapture noalias arguments. noalias is checked to make sure that there are no other ways to write the memory, e.g. through a different argument or an escaped pointer. In addition to the current noalias check, also query alias analysis, in case it can prove that modification is not possible through other means. This fixes the problem reported in https://discourse.llvm.org/t/problem-about-memcpy-elimination/81121.	2024-09-11 10:04:37 +02:00
Nikita Popov	1199e5b9ce	[MemCpyOpt] Add more tests for memcpy passed to readonly arg (NFC)	2024-09-09 14:55:27 +02:00
Maciej Gabka	95d2d1cba0	Move stepvector intrinsic out of experimental namespace (#98043 ) This patch is moving out stepvector intrinsic from the experimental namespace. This intrinsic exists in LLVM for several years now, and is widely used.	2024-08-28 12:48:20 +01:00
Yingwei Zheng	378daa6c6f	[MemCpyOpt] Avoid infinite loops in `MemCpyOptPass::processMemCpyMemCpyDependence` (#103218 ) Closes https://github.com/llvm/llvm-project/issues/102994.	2024-08-22 17:20:47 +08:00
Nikita Popov	71051deff2	[MemCpyOpt] Fix infinite loop in memset+memcpy fold (#98638 ) For the case where the memcpy size is zero, this transform is a complex no-op. This can lead to an infinite loop when the size is zero in a way that BasicAA understands, because it can still understand that dst and dst + src_size are MustAlias. I've tried to mitigate this before using the isZeroSize() check, but we can hit cases where InstSimplify doesn't understand that the size is zero, but BasicAA does. As such, this bites the bullet and adds an explicit isKnownNonZero() check to guard against no-op transforms. Fixes https://github.com/llvm/llvm-project/issues/98610.	2024-07-15 09:41:11 +02:00
Yingwei Zheng	99685a54d1	[MemCpyOpt] Use `dyn_cast` to fix assertion failure in `processMemCpyMemCpyDependence` (#98686 ) Fixes https://github.com/llvm/llvm-project/issues/98675.	2024-07-13 04:27:07 +08:00
DianQK	fa24213928	[MemCpyOpt] Forward `memcpy` based on the actual copy memory location. (#87190 ) Fixes #85560. We can forward `memcpy` as long as the actual memory location being copied have not been altered. alive2: https://alive2.llvm.org/ce/z/q9JaHV	2024-07-12 22:58:28 +08:00
DianQK	117cc4abea	[MemCpyOpt] No need to create `memcpy(a <- a)` (#98321 ) When forwarding `memcpy`, we don't need to create `memcpy(a, a)`.	2024-07-11 19:54:28 +08:00
DianQK	62b3e68d33	[MemCpyOpt] Fixes `test6_memcpy` test (NFC) We should forward `src` to `dest`.	2024-07-10 21:10:28 +08:00
Stephen Tozer	094572701d	[RemoveDIs] Print IR with debug records by default (#91724 ) This patch makes the final major change of the RemoveDIs project, changing the default IR output from debug intrinsics to debug records. This is expected to break a large number of tests: every single one that tests for uses or declarations of debug intrinsics and does not explicitly disable writing records. If this patch has broken your downstream tests (or upstream tests on a configuration I wasn't able to run): 1. If you need to immediately unblock a build, pass `--write-experimental-debuginfo=false` to LLVM's option processing for all failing tests (remember to use `-mllvm` for clang/flang to forward arguments to LLVM). 2. For most test failures, the changes are trivial and mechanical, enough that they can be done by script; see the migration guide for a guide on how to do this: https://llvm.org/docs/RemoveDIsDebugInfo.html#test-updates 3. If any tests fail for reasons other than FileCheck check lines that need updating, such as assertion failures, that is most likely a real bug with this patch and should be reported as such. For more information, see the recent PSA: https://discourse.llvm.org/t/psa-ir-output-changing-from-debug-intrinsics-to-debug-records/79578	2024-06-14 15:07:27 +01:00
Paul Walker	900bea9b1c	[LLVM][test] Convert remaining instances of ConstantExpr based splats to use splat(). This is mostly NFC but some output does change due to consistently inserting into poison rather than undef and using i64 as the index type for inserts.	2024-02-27 13:37:23 +00:00
Nikita Popov	2d69827c5c	[Transforms] Convert tests to opaque pointers (NFC)	2024-02-05 11:57:34 +01:00
Philip Reames	42d6eb5475	[MemCpyOpt] Handle scalable aggregate types in memmove/memset formation (#80487 ) Without this change, the included test cases crash the compiler. I believe this is fallout from the homogenous scalable struct work from a while back; I think we just forgot to update this case. Likely to fix https://github.com/llvm/llvm-project/issues/80463.	2024-02-02 18:47:18 -08:00
Shilei Tian	7e956ca88a	[NFC][AMDGPU] Require `x86-registered-target` for `llvm/test/Transforms/MemCpyOpt/no-libcalls.ll` The test sets `-mtriple=x86_64` but doesn't require it. This can cause issue on non-x86 system.	2024-01-09 14:42:38 -05:00
Nikita Popov	bf5d96c96c	[IR] Add dead_on_unwind attribute (#74289 ) Add the `dead_on_unwind` attribute, which states that the caller will not read from this argument if the call unwinds. This allows eliding stores that could otherwise be visible on the unwind path, for example: ``` declare void @may_unwind() define void @src(ptr noalias dead_on_unwind %out) { store i32 0, ptr %out call void @may_unwind() store i32 1, ptr %out ret void } define void @tgt(ptr noalias dead_on_unwind %out) { call void @may_unwind() store i32 1, ptr %out ret void } ``` The optimization is not valid without `dead_on_unwind`, because the `i32 0` value might be read if `@may_unwind` unwinds. This attribute is primarily intended to be used on sret arguments. In fact, I previously wanted to change the semantics of sret to include this "no read after unwind" property (see D116998), but based on the feedback there it is better to keep these attributes orthogonal (sret is an ABI attribute, dead_on_unwind is an optimization attribute). This is a reboot of that change with a separate attribute.	2023-12-14 09:58:14 +01:00
Wang Pengcheng	6aa6ef73ec	[MemCpyOpt] Don't perform call slot opt if alloc type is scalable (#75027 ) This fixes #75010.	2023-12-11 19:45:13 +08:00
Jeremy Morse	d2d9dc8eb4	[DebugInfo][RemoveDIs] Make debugify pass convert to/from RemoveDIs mode (#73251 ) Debugify is extremely useful as a testing and debugging tool, and a good number of LLVM-IR transform tests use it. We need it to support "new" non-instruction debug-info to get test coverage, but it's not important enough to completely convert right now (and it'd be a large undertaking). Thus: convert to/from dbg.value/DPValue mode on entry and exit of the pass, which gives us the functionality without any further work. The cost is compile-time, but again this is only happening during tests. Tested by: the large set of debugify tests enabled here. Note the InstCombine test (cast-mul-select.ll) that hasn't been fully enabled: this is because there's a debug-info sinking piece of code there that hasn't been instrumented.	2023-11-29 13:19:50 +00:00
Nikita Popov	369c9b791b	[MemCpyOpt] Require writable object during call slot optimization (#71542 ) Call slot optimization may introduce writes to the destination object that occur earlier than in the original function. We currently already check that that the destination is dereferenceable and aligned, but we do not make sure that it is writable. As such, we might introduce a write to read-only memory, or introduce a data race. Fix this by checking that the object is writable. For arguments, this is indicated by the new writable attribute. Tests using sret/dereferenceable are updated to use it.	2023-11-09 15:55:44 +01:00
Nikita Popov	5c3beb7b1e	[MemCpyOpt] Handle memcpy marked as memory(none) Fixes #71183.	2023-11-03 15:20:21 +01:00
DianQK	0c4f326d8b	[MemCpyOpt] Combine alias metadatas when replacing byval arguments (#70580 ) Fixes #70578.	2023-10-29 16:07:55 +08:00
Alex Richardson	e39f6c1844	[opt] Infer DataLayout from triple if not specified There are many tests that specify a target triple/CPU flags but no DataLayout which can lead to IR being generated that has unusual behaviour. This commit attempts to use the default DataLayout based on the relevant flags if there is no explicit override on the command line or in the IR file. One thing that is not currently possible to differentiate from a missing datalayout `target datalayout = ""` in the IR file since the current APIs don't allow detecting this case. If it is considered useful to support this case (instead of passing "-data-layout=" on the command line), I can change IR parsers to track whether they have seen such a directive and change the callback type. Differential Revision: https://reviews.llvm.org/D141060	2023-10-26 12:07:37 -07:00
Kai Yan	df116d1dc4	[MemCpyOpt] Fix the invalid code modification for GEP (#68479 ) Relocate the GEP modification to a later stage of the function performCallSlotOptzn(), ensuring that the code remains unchanged if the optimization fails. Co-authored-by: aklkaiyan <aklkaiyan@tencent.com>	2023-10-09 12:54:16 +02:00
Craig Topper	689ace53a5	[MemCpyOptimizer] Support scalable vectors in performStackMoveO… (#67632 ) …ptzn. This changes performStackMoveOptzn to take a TypeSize instead of uint64_t to avoid an implicit conversion when called from processStoreOfLoad. performStackMoveOptzn has been updated to allow scalable types in the rest of its code.	2023-09-28 12:25:38 -07:00
DianQK	4e6e476329	[MemCpyOpt] Merge alias metadatas when replacing arguments (#67539 ) Alias metadata may no longer be valid after replacing the call argument. Fix this by merging it with the memcpy alias metadata. This fixes a miscompilation encountered in https://rust-lang.zulipchat.com/#narrow/stream/131828-t-compiler/topic/Failing.20tests.20when.20rustc.20is.20compiled.20with.201.20CGU.	2023-09-28 10:13:21 +02:00
Nikita Popov	d5c8b23b1e	[MemCpyOpt] Add test for #67539 (NFC)	2023-09-28 09:44:05 +02:00
Kohei Asano	2a207128a7	[MemCpyOpt] move SrcAlloca to the entry if transformation is performed (#67226 ) This is fixup for https://github.com/llvm/llvm-project/pull/66618#discussion_r1328523770 . This transformation checks whether allocas are static, if the transformation is performed. This patch moves the SrcAlloca to the entry of the BB when the optimization performed.	2023-09-26 16:27:34 +09:00
Kohei Asano	baf031a853	[MemCpyOpt] fix miscompile for non-dominated use of src alloca for stack-move optimization (#66618 ) Stack-move optimization, the optimization that merges src and dest alloca of the full-size copy, replaces all uses of the dest alloca with src alloca. For safety, we needed to check all uses of the dest alloca locations are dominated by src alloca, to be replaced. This PR adds the check for that. Fixes #65225	2023-09-18 21:29:10 +09:00
Nikita Popov	07460b6666	[MemCpyOpt] Avoid infinite loop in processMemSetMemCpyDependence (PR54983) This adds an additional transform to drop zero-size memcpys, also in the case where the size is only zero after instruction simplification. The motivation is the case from PR54983 where the size is non-trivially zero, and processMemSetMemCpyDependence() keeps trying to reduce the memset size by zero bytes. This fix it's not really principled. It only works on the premise that if InstSimplify doesn't realize the size is zero, then AA also won't. The principled approach would be to instead add a isKnownNonZero() guard to the processMemSetMemCpyDependence() transform, but I suspect that would render that optimization mostly useless (at least it breaks all the existing test coverage -- worth noting that the constant size case is also handled by DSE, so I think this transform is primarily about the dynamic size case). Fixes https://github.com/llvm/llvm-project/issues/54983. Fixes https://github.com/llvm/llvm-project/issues/64886. Differential Revision: https://reviews.llvm.org/D124078	2023-09-15 09:10:15 +02:00
khei4	7f3610ac69	Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization" This reverts commit efe8aa2e618122e8050af10cc5d6ad83f24ef557. Differential Revision: https://reviews.llvm.org/D155406	2023-09-14 19:42:36 +09:00
Vitaly Buka	efe8aa2e61	Revert "Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization"" Suspecting incorrect lifetime markers. This reverts commit 3a1409f93da32bf626f76257e0aac71716f2f67e.	2023-09-07 11:14:19 -07:00
khei4	c4d37c35e1	[MemCpyOpt] fix false negative case and add it as a true positive case(NFC)	2023-08-30 11:18:26 +09:00
khei4	3a1409f93d	Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization" This reverts commit e0f9cc71cb6f4eb2e1566177e05425c497759dc6. Differential Revision: https://reviews.llvm.org/D155406	2023-08-29 19:40:29 +09:00
khei4	c652987dd9	[MemCpyOpt] remove test noises (NFC)	2023-08-29 17:24:36 +09:00
khei4	5a9a7f5303	[MemCpyOpt] add tests for unreachable cycles for post dominators(NFC)	2023-08-29 14:07:45 +09:00
khei4	98d1b0eb64	[MemCpyOpt] add tests for unreachable block before calculating common dominator(NFC)	2023-08-28 14:44:41 +09:00
Vitaly Buka	e0f9cc71cb	Revert "Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization""" Breaks multiple bots. e.g. https://lab.llvm.org/buildbot/#/builders/19/builds/18856 This reverts commit ac0072602c9d01fc031a2d0acb418f7191480ef0.	2023-08-26 19:24:50 -07:00
khei4	ac0072602c	Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization"" This reverts commit 3bb32c61b2f1f5d14dd056dd198dc898dce5a44e. Use InsertionPt for DT to handle non-memory access dominators Differential Revision: https://reviews.llvm.org/D155406	2023-08-27 06:50:19 +09:00
khei4	fe285ae091	[NFC][MemCpyOpt] add test for MemoryAccess crash on D155406	2023-08-25 01:00:29 +09:00
khei4	3bb32c61b2	Revert "[MemCpyOpt] implement multi BB stack-move optimization" This reverts commit ef867d2ea10e8246be20c608160e07a54eb2ed14. crash on sanitizer build https://lab.llvm.org/buildbot/#/builders/70/builds/42861/steps/10/logs/stdio	2023-08-24 22:56:39 +09:00
khei4	ef867d2ea1	[MemCpyOpt] implement multi BB stack-move optimization Differential Revision: https://reviews.llvm.org/D155406	2023-08-24 22:19:01 +09:00
khei4	e0911b98d1	[MemCpyOpt] precommit test for D155406 (NFC) Differential Revision: https://reviews.llvm.org/D155422	2023-08-24 22:19:01 +09:00
khei4	ca68a7f956	Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas This reverts commit 207718029e1e62d82145b479f6349941b6384045.	2023-08-15 22:13:09 +09:00
khei4	0b4f8c9fc4	(NFC)[MemCpyOpt] add a test to avoid crash for last memory use	2023-08-15 20:20:57 +09:00
Vitaly Buka	207718029e	Revert "Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas" Fails on https://lab.llvm.org/buildbot/#/builders/85/builds/18296 This reverts commit 43698c1ddc179ccd97b3f3b2bb03f4a3fe9556f3.	2023-08-13 16:29:39 -07:00
khei4	43698c1ddc	Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas Differential Revision: https://reviews.llvm.org/D153453 This reverts commit 00653889883f2d818536efcb21c6c8b739f0888b.	2023-08-13 21:38:00 +09:00
Matt Arsenault	25bc999d1f	Intrinsics: Add type overload to stacksave and stackstore This allows use with non-0 address space stacks. llvm_ptr_ty should never be used. This could use some more percolation up through mlir, but this is enough to fix existing tests. https://reviews.llvm.org/D156666	2023-08-09 18:33:11 -04:00
khei4	90ecb9d5b0	[MemCpyOpt][test] add memssa verification on stack-move tests(NFC)	2023-08-08 18:56:59 +09:00

1 2 3 4 5 ...

325 Commits