This reverts commit 36a6eb7d12a9f827bf3d5d4e5fdc68b8a62807b2.
[MemCpyOpt] check that load/store and dest/src alloca are all in the same bb
Differential Revision: https://reviews.llvm.org/D153453
Co-authored-by: serge-sans-paille <sguelton@mozilla.com>
Make sure the code comments in processMemSetMemCpyDependence match
the actual transform. They indicated that the rewritten memset was
sunk to a point after the memcpy, while it is actually inserted
just before the memcpy.
Also make sure we use the debug location of the original memset
when creating the new simplified memset. In the past we've been
using the debug location of the memcpy, which could be a bit
confusing.
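For reference, the shape of the transform at the C level (sizes and
the fill value are made up for illustration; the pass itself operates
on the IR intrinsics):

  #include <cstring>

  void before(char *dst, const char *src) {
    std::memset(dst, 0, 128);  // memset covers the whole destination
    std::memcpy(dst, src, 32); // copy overwrites the first part of it
  }

  void after(char *dst, const char *src) {
    // The shrunk memset only covers the tail the copy does not touch,
    // and it is inserted immediately before the memcpy, not sunk after it.
    std::memset(dst + 32, 0, 128 - 32);
    std::memcpy(dst, src, 32);
  }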
Differential Revision: https://reviews.llvm.org/D135574
The llvm.memcpy.inline intrinsic must be expanded into code that
does not contain any function calls, because it is intended for
the implementation of low-level functions like memcpy. Currently
MemCpyOpt might convert llvm.memcpy.inline into llvm.memmove in
certain circumstances. This patch fixes the issue.
Fixes https://github.com/llvm/llvm-project/issues/61791.
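A minimal sketch of the kind of guard this needs (placement and the
variable name M are assumptions here, not the exact patch):

  #include "llvm/IR/IntrinsicInst.h"
  using namespace llvm;

  static bool canUseMemMove(const MemCpyInst *M) {
    // There is no "memmove.inline", and a plain llvm.memmove may be lowered
    // to a library call, so an inline memcpy must not be rewritten into one.
    return !isa<MemCpyInlineInst>(M);
  }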
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D147162
When fetching allocation sizes, we almost always want to have the
size in bytes, but we were only providing an InBits API. Also add
the corresponding byte-based counterpart to save some *8 and /8
juggling everywhere.
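A trivial sketch of the call-site effect (the helper names are made
up for illustration, not the actual API):

  #include <cstdint>

  // Bit-based query, as before.
  uint64_t allocationSizeInBits(uint64_t SizeInBytes) { return SizeInBytes * 8; }

  // Byte-based counterpart, so callers stop dividing by 8 themselves.
  uint64_t allocationSizeInBytes(uint64_t SizeInBytes) {
    return allocationSizeInBits(SizeInBytes) / 8;
  }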
Inspecting the downstream uses of cpyAlign, it is clear that
`performCallSlotOptzn` expects it to represent the alignment
of the copy destination, not the minimum of the src and dest
alignments. This patch renames the parameter to make this more
obvious.
I believe this change is NFC, because the downstream code has
alignment checks such that it all works out in the end. I have not
been able to construct a test case that actually triggers a change
in output.
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D140603
While we can't use a single BatchAA instance for the entire
MemCpyOpt run without further justification, we can use BatchAA
while performing the queries related to a single instruction
(these will first perform some AA-based checks, and then modify
the IR only afterwards).
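A sketch of the pattern (names are illustrative; the point is that a
BatchAAResults only lives for the queries about one instruction and
is discarded before the IR changes):

  #include "llvm/Analysis/AliasAnalysis.h"
  using namespace llvm;

  static bool mayOverlap(AAResults &AA, const MemoryLocation &DestLoc,
                         const MemoryLocation &SrcLoc) {
    BatchAAResults BAA(AA); // caches intermediate results for these queries
    return BAA.alias(DestLoc, SrcLoc) != AliasResult::NoAlias;
  }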
This patch mechanically replaces None with std::nullopt where the
compiler would warn if None were deprecated. The intent is to reduce
the amount of manual work required in migrating from Optional to
std::optional.
This is part of an effort to migrate from llvm::Optional to
std::optional:
https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716
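For example (a representative snippet, not taken from this pass):

  #include <optional>

  std::optional<unsigned> getCopyLength(bool Known, unsigned Len) {
    if (!Known)
      return std::nullopt; // previously: "return None;" with llvm::Optional
    return Len;
  }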
The Assignment Tracking debug-info feature is outlined in this RFC:
https://discourse.llvm.org/t/rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir
Maintain and propagate DIAssignID attachments in memcpyopt.
Reviewed By: jmorse
Differential Revision: https://reviews.llvm.org/D133312
Currently call slot optimization may be prevented because the
lifetime markers for the destination only start after the call.
In this case, rather than aborting the transform, we should move
the lifetime.start before the call to enable the transform.
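A minimal sketch of the adjustment, assuming the destination's
lifetime.start intrinsic has already been located between the call
and the copy (names are illustrative):

  #include "llvm/IR/InstrTypes.h"
  #include "llvm/IR/IntrinsicInst.h"
  using namespace llvm;

  static void hoistLifetimeStart(IntrinsicInst *LifetimeStart, CallBase *Call) {
    // With the marker above the call, the destination is already live when
    // the call writes into it, so the transform no longer has to bail out.
    LifetimeStart->moveBefore(Call);
  }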
Differential Revision: https://reviews.llvm.org/D135886
This was already handled correctly below, but not checked for the
original store pointer operand. Encountered when converting tests
to opaque pointers, where the intermediate bitcast goes away.
`commonAlignment` is a shortcut to pick the smallest of two `Align`
objects. As-is it doesn't bring much value compared to `std::min`.
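Concretely (a small standalone check; commonAlignment lives in
llvm/Support/Alignment.h):

  #include "llvm/Support/Alignment.h"
  #include <algorithm>
  #include <cassert>
  using llvm::Align;

  int main() {
    // commonAlignment keeps the smaller (weaker) of the two alignments,
    // which is exactly what std::min on Align gives.
    assert(llvm::commonAlignment(Align(4), Align(16)).value() == 4);
    assert(std::min(Align(4), Align(16)).value() == 4);
    return 0;
  }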
Differential Revision: https://reviews.llvm.org/D128345
As discussed on https://github.com/llvm/llvm-project/issues/54682,
MemorySSA currently has a bug when computing the clobber of calls
that access loop-varying locations. I think a "proper" fix for this
on the MemorySSA side might be non-trivial, but we can easily work
around this in MemCpyOpt:
Currently, MemCpyOpt uses a location-less getClobberingMemoryAccess()
call to find a clobber on either the src or dest location, and then
refines it for the src and dest clobber. This was intended as an
optimization, as the location-less API is cached, while the
location-affected APIs are not.
However, I don't think this really makes a difference in practice,
because I don't think anything will use the cached clobbers on
those calls later anyway. On CTMark, this patch actually seems to
be very mildly positive.
So I think this is a reasonable way to avoid the problem for now,
though MemorySSA should also get a fix.
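Roughly, the shape of the change (a sketch, not the exact code;
access and location names are illustrative):

  #include "llvm/Analysis/MemorySSA.h"
  using namespace llvm;

  static void findClobbers(MemorySSA &MSSA, MemoryUseOrDef *CallAccess,
                           const MemoryLocation &DestLoc,
                           const MemoryLocation &SrcLoc,
                           MemoryAccess *&DestClobber,
                           MemoryAccess *&SrcClobber) {
    MemorySSAWalker *Walker = MSSA.getWalker();
    // Before: one cached, location-less query, refined afterwards for the
    // source and the destination.  After: query each location directly.
    DestClobber = Walker->getClobberingMemoryAccess(CallAccess, DestLoc);
    SrcClobber = Walker->getClobberingMemoryAccess(CallAccess, SrcLoc);
  }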
Differential Revision: https://reviews.llvm.org/D122911
The search for the clobbering call is fairly expensive if uses are
not optimized at construction. Defer the clobber walk to the point
in the implementation where we actually need it; there are a bunch
of bailouts before that point (e.g. if the source pointer is not an
alloca, we can't do call slot optimization).
On a test case that involves a bunch of copies from argument
pointers, this switches memcpyopt from > 1/2 second to < 10 ms.
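Schematically (only the ordering matters here; the checks and names
are placeholders):

  #include "llvm/Analysis/MemorySSA.h"
  #include "llvm/IR/Instructions.h"
  using namespace llvm;

  static MemoryAccess *clobberIfWorthChecking(MemorySSA &MSSA, Value *SrcPtr,
                                              MemoryUseOrDef *CopyAccess) {
    // Cheap structural bailouts come first; e.g. call slot optimization only
    // applies when the source is an alloca.
    if (!isa<AllocaInst>(SrcPtr->stripPointerCasts()))
      return nullptr;
    // Only now pay for the potentially expensive MemorySSA clobber walk.
    return MSSA.getWalker()->getClobberingMemoryAccess(CopyAccess);
  }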
We have the same code repeated in both callers; sink it into the
callee.
The motivation here isn't just code style: we can also defer the
relatively expensive aliasing checks until the cheap structural
preconditions have been validated (e.g. don't bother with aliasing
checks if src is not an alloca). This helps compile time
significantly.
Currently writtenBetween can miss clobbers of Loc between End and Start,
if End is a MemoryUse.
To guarantee we see all write clobbers of Loc between Start and End
for MemoryUses, restrict to Start and End being in the same block
and check all accesses between them.
This fixes two miscompiles illustrated in
llvm/test/Transforms/MemCpyOpt/memcpy-byval-forwarding-clobbers.ll
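A simplified sketch of the same-block scan (it walks the instruction
list rather than the MemorySSA access list; names are illustrative):

  #include "llvm/Analysis/AliasAnalysis.h"
  #include "llvm/IR/Instruction.h"
  #include <cassert>
  #include <iterator>
  using namespace llvm;

  // Returns true if any instruction strictly between Start and End (which
  // must be in the same basic block) may write to Loc.
  static bool writtenBetweenLocal(BatchAAResults &AA, const MemoryLocation &Loc,
                                  Instruction *Start, Instruction *End) {
    assert(Start->getParent() == End->getParent() && "restricted to one block");
    for (auto It = std::next(Start->getIterator()), E = End->getIterator();
         It != E; ++It)
      if (isModSet(AA.getModRefInfo(&*It, Loc)))
        return true;
    return false;
  }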
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D119929
This is a recommit of the patch without changes. The reason for
the revert has been addressed in D117679.
-----
The user scanning loop above looks through pointer casts, so we
also need to strip pointer casts in the capture check. Previously
the source was incorrectly considered not captured if a bitcast
was passed to the call.
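A sketch of the corrected check (names and structure are
illustrative, not the exact code):

  #include "llvm/IR/InstrTypes.h"
  using namespace llvm;

  static bool mayCaptureSource(const CallBase *Call, const Value *Src) {
    for (const Use &U : Call->args()) {
      // Strip casts so a bitcast of the source still counts as the source
      // when deciding whether this argument may capture it.
      if (U->stripPointerCasts() == Src &&
          !Call->doesNotCapture(U.getOperandNo()))
        return true;
    }
    return false;
  }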
This is a recommit of the patch without changes. The reason for
the revert has been addressed in D117679.
-----
Call slot optimization is currently supposed to be prevented if
the call can capture the source pointer. Due to an implementation
bug, this check currently doesn't trigger if a bitcast of the source
pointer is passed instead. I'm somewhat afraid of the fallout of
fixing this bug (due to heavy reliance on call slot optimization
in rust), so I'd like to strengthen the capture reasoning a bit first.
In particular, I believe that the capture is fine as long as a)
the call itself cannot depend on the pointer identity, because
dest has not been captured before or at the call and src has not
been captured before the call, and b) there is no potential use of
the captured pointer before the lifetime of the source alloca ends,
either due to a lifetime.end or a return from the function. At that
point the potentially captured pointer becomes dangling.
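A C-level illustration of the situation being reasoned about (the
callee is hypothetical):

  #include <cstring>

  void callee(char *p); // may stash p somewhere, i.e. capture it

  void caller(char *dst) {
    char tmp[32];
    callee(tmp);               // may capture tmp, but neither dst nor tmp
                               // escaped earlier, so the call cannot tell
                               // which buffer it was handed (condition a)
    std::memcpy(dst, tmp, 32); // call slot opt wants callee to write dst
  }                            // tmp dies at the return; any stashed pointer
                               // to it is dangling from here on (condition b)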
Differential Revision: https://reviews.llvm.org/D115615
Call slot optimization currently merges the metadata between the
call and the load. However, we also need to merge in the metadata
of the store.
Part of the reason why we might have gotten away with this
previously is that usually the load and the store are the same
instruction (a memcpy); the problem can only show up if call slot
optimization occurs on an actual load/store pair.
This addresses the issue reported in
https://reviews.llvm.org/D115615#3251386.
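A sketch of the fix, assuming the existing combineMetadata call for
the load (variable names only loosely follow the surrounding code):

  #include "llvm/ADT/ArrayRef.h"
  #include "llvm/IR/Instruction.h"
  #include "llvm/Transforms/Utils/Local.h"
  using namespace llvm;

  static void mergeCallSlotMetadata(Instruction *Call, Instruction *CpyLoad,
                                    Instruction *CpyStore,
                                    ArrayRef<unsigned> KnownIDs) {
    // Previously only the load's metadata was merged into the rewritten
    // call; the store's metadata has to be merged in as well.
    combineMetadata(Call, CpyLoad, KnownIDs, /*DoesKMove=*/true);
    combineMetadata(Call, CpyStore, KnownIDs, /*DoesKMove=*/true);
  }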
Differential Revision: https://reviews.llvm.org/D117679
This caused a miscompile due to call slot optimization replacing a call
argument without considering the call's !noalias metadata; see the
discussion on the code review.
> Call slot optimization is currently supposed to be prevented if
> the call can capture the source pointer. Due to an implementation
> bug, this check currently doesn't trigger if a bitcast of the source
> pointer is passed instead. I'm somewhat afraid of the fallout of
> fixing this bug (due to heavy reliance on call slot optimization
> in rust), so I'd like to strengthen the capture reasoning a bit first.
>
> In particular, I believe that the capture is fine as long as a)
> the call itself cannot depend on the pointer identity, because
> neither dest has been captured before/at nor src before the
> call and b) there is no potential use of the captured pointer
> before the lifetime of the source alloca ends, either due to
> lifetime.end or a return from a function. At that point the
> potentially captured pointer becomes dangling.
>
> Differential Revision: https://reviews.llvm.org/D115615
Also reverting the dependent commit:
> [MemCpyOpt] Look through pointer casts when checking capture
>
> The user scanning loop above looks through pointer casts, so we
> also need to strip pointer casts in the capture check. Previously
> the source was incorrectly considered not captured if a bitcast
> was passed to the call.
This reverts commits 487a34ed9d7d24a7b1fb388c8856c784a459b22b
and 00e6869463ae6023d0d48f30de8511d6d748b14f.
The user scanning loop above looks through pointer casts, so we
also need to strip pointer casts in the capture check. Previously
the source was incorrectly considered not captured if a bitcast
was passed to the call.
Call slot optimization is currently supposed to be prevented if
the call can capture the source pointer. Due to an implementation
bug, this check currently doesn't trigger if a bitcast of the source
pointer is passed instead. I'm somewhat afraid of the fallout of
fixing this bug (due to heavy reliance on call slot optimization
in rust), so I'd like to strengthen the capture reasoning a bit first.
In particular, I believe that the capture is fine as long as a)
the call itself cannot depend on the pointer identity, because
dest has not been captured before or at the call and src has not
been captured before the call, and b) there is no potential use of
the captured pointer before the lifetime of the source alloca ends,
either due to a lifetime.end or a return from the function. At that
point the potentially captured pointer becomes dangling.
Differential Revision: https://reviews.llvm.org/D115615