llvm-project

Author	SHA1	Message	Date
khei4	3a1409f93d	Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization" This reverts commit e0f9cc71cb6f4eb2e1566177e05425c497759dc6. Differential Revision: https://reviews.llvm.org/D155406	2023-08-29 19:40:29 +09:00
Vitaly Buka	e0f9cc71cb	Revert "Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization""" Breaks multiple bots. e.g. https://lab.llvm.org/buildbot/#/builders/19/builds/18856 This reverts commit ac0072602c9d01fc031a2d0acb418f7191480ef0.	2023-08-26 19:24:50 -07:00
khei4	ac0072602c	Reapply "Revert "[MemCpyOpt] implement multi BB stack-move optimization"" This reverts commit 3bb32c61b2f1f5d14dd056dd198dc898dce5a44e. Use InsertionPt for DT to handle non-memory access dominators Differential Revision: https://reviews.llvm.org/D155406	2023-08-27 06:50:19 +09:00
khei4	3bb32c61b2	Revert "[MemCpyOpt] implement multi BB stack-move optimization" This reverts commit ef867d2ea10e8246be20c608160e07a54eb2ed14. crash on sanitizer build https://lab.llvm.org/buildbot/#/builders/70/builds/42861/steps/10/logs/stdio	2023-08-24 22:56:39 +09:00
khei4	ef867d2ea1	[MemCpyOpt] implement multi BB stack-move optimization Differential Revision: https://reviews.llvm.org/D155406	2023-08-24 22:19:01 +09:00
Nikita Popov	9d2f8ecac8	[MSSAU] Clarify that the defining access does not matter (NFC) New memory accesses are usually inserted by using one of the createMemoryAccessXYZ() methods followed by insertUse() or insertDef(). createMemoryAccessXYZ() accepts a defining access, however this defining access will always be overwritten by insertUse() / insertDef(). Update the documentation to clarify this, and stop passing Definition to createMemoryAccessXYZ() if it's followed by insertUse/insertDef. Alternatively, we could also make insertUse / insertDef keep the defining access if it is specified, and only recompute it if it's missing. Differential Revision: https://reviews.llvm.org/D157979	2023-08-16 09:00:32 +02:00
khei4	ca68a7f956	Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas This reverts commit 207718029e1e62d82145b479f6349941b6384045.	2023-08-15 22:13:09 +09:00
Vitaly Buka	207718029e	Revert "Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas" Fails on https://lab.llvm.org/buildbot/#/builders/85/builds/18296 This reverts commit 43698c1ddc179ccd97b3f3b2bb03f4a3fe9556f3.	2023-08-13 16:29:39 -07:00
khei4	43698c1ddc	Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas Differential Revision: https://reviews.llvm.org/D153453 This reverts commit 00653889883f2d818536efcb21c6c8b739f0888b.	2023-08-13 21:38:00 +09:00
Vitaly Buka	0065388988	Revert "Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas""" Breaks Asan and LTO. This reverts commit ea72b5137eb72391ad192dbb01084c21b9fe8b71.	2023-08-02 12:32:35 -07:00
khei4	ea72b5137e	Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas"" This reverts commit c9d419c1df72b0160e374f8d0b9f30508b3b98a7. Differential Revision: https://reviews.llvm.org/D153453	2023-07-24 10:16:49 +09:00
khei4	c9d419c1df	Revert "Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas" revert because crash on chrome windows https://reviews.llvm.org/D153453#4524256 This reverts commit 569769b64858fd38f41267db41b461d3163aa754.	2023-07-22 13:08:12 +09:00
khei4	569769b648	Reapply: [MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas This reverts commit 8f3864ba4323a253bcf29825d23cd325b52c4106. Differential Revision: https://reviews.llvm.org/D153453	2023-07-19 18:26:04 +09:00
khei4	8f3864ba43	Revert "Revert "Revert "[MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas""" This reverts commit b02d349cbfaa81c9bbc928c4de46b12d976c1882.	2023-07-18 18:42:36 +09:00
khei4	b02d349cbf	Revert "Revert "[MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas"" This reverts commit 36a6eb7d12a9f827bf3d5d4e5fdc68b8a62807b2. [MemCpyOpt] check that load/store and dest/src alloca are all in the same bb Differential Revision: https://reviews.llvm.org/D153453 Co-authored-by: serge-sans-paille <sguelton@mozilla.com>	2023-07-15 16:27:38 +09:00
khei4	36a6eb7d12	Revert "[MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas" This reverts commit 96ae0851c26237378fa1280b0a9ad713e1b72bdb.	2023-07-13 18:04:49 +09:00
khei4	96ae0851c2	[MemCpyOpt] implement single BB stack-move optimization which unify the static unescaped allocas Differential Revision: https://reviews.llvm.org/D153453	2023-07-13 14:52:30 +09:00
khei4	361464c027	[MemCpyOpt] Use memcpy source directly if dest is known to be immutable from attributes Differential Revision: https://reviews.llvm.org/D150970	2023-06-10 15:46:32 +09:00
khei4	f4eafba206	[MemCpyOpt] Remove unnecessary bitcast and reuse some variables for ByValOptimization (NFC) Differential Revision: https://reviews.llvm.org/D152056	2023-06-04 15:37:41 +09:00
Bjorn Pettersson	198e0a12f6	[MemCpyOpt] Fix up debug loc for simplified memset in processMemSetMemCpyDependence Make sure the code comments in processMemSetMemCpyDependence match with the actual transform. They indicated that the memset being rewritten was sunk to after a memcpy, while it actually is inserted just before the memcpy. Also make sure we use the debug location of the original memset when creating the new simplified memset. In the past we've been using the debug location for the memcpy which could be a bit confusing. Differential Revision: https://reviews.llvm.org/D135574	2023-05-16 16:54:51 +02:00
OCHyams	ca10e73b53	[NFC] Rename isPointerOffset to getPointerOffsetFrom and move to Value.h Linking LLVMCore failed when building D148536 with shared libs enabled: https://lab.llvm.org/buildbot/#/builders/121/builds/29766 Make isPointerOffset a Value method and rename it to getPointerOffsetFrom. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D148698	2023-04-19 12:22:58 +01:00
Bjorn Pettersson	a20f7efbc5	Remove several no longer needed includes. NFCI Mostly removing includes of InitializePasses.h and Pass.h in passes that no longer has support for the legacy PM.	2023-04-17 13:54:19 +02:00
Mikhail Maltsev	ab8150acc5	[MemCpyOpt] Don't fold memcpy.inline into memmove The llvm.memcpy.inline intrinsic must be expanded into code that does not contain any function calls because it is intended for the implementation of low-level functions like memcpy. Currently the MemCpyOpt might covert llvm.memcpy.inline into llvm.memmove in certain circumstances. This patch fixes the issue. Fixes https://github.com/llvm/llvm-project/issues/61791. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D147162	2023-03-30 13:14:59 +01:00
Arthur Eubanks	1a90faacf1	[Passes] Remove some legacy passes NewGVN GVNHoist GVNSink MemCpyOpt Float2Int These were only used for the optimization pipeline, of which the legacy version was removed.	2023-03-14 13:32:59 -07:00
Guillaume Chatelet	135f23d67b	Deprecate MemIntrinsicBase::getDestAlignment() and MemTransferBase::getSourceAlignment() Differential Revision: https://reviews.llvm.org/D141840	2023-01-16 14:22:03 +00:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::geFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Nikita Popov	07bf39df80	[MemCpyOpt] Extract processStoreOfLoad() method (NFC)	2023-01-06 16:11:10 +01:00
Nikita Popov	a6a526ec54	[IR] Add AllocaInst::getAllocationSize() (NFC) When fetching allocation sizes, we almost always want to have the size in bytes, but we were only providing an InBits API. Also add the corresponding byte-based conjugate to save some *8 and /8 juggling everywhere.	2023-01-06 15:36:16 +01:00
Owen Anderson	8256ddf78c	Resolve a long-standing FIXME in memcpyopt. Inspecting the downstream use of the cpyAlign, it is clear that `performCallSlotOptzn` is expecting it to represent the alignment of the copy destination, not the minimum of the src and dest alignments. This patch renames the parameter to make this more obvious. I believe this change is NFC, because the downstream code has alignment checks such that it all works out in the end. I have not been able to construct a test case that actually triggers a change in output. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D140603	2022-12-23 16:15:24 -07:00
Fangrui Song	d4b6fcb32e	[Analysis] llvm::Optional => std::optional	2022-12-14 07:32:24 +00:00
Kazu Hirata	405fc404bf	[ADT] Don't including None.h (NFC) These source files no longer use None, so they do not need to include None.h. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-06 20:14:51 -08:00
Nikita Popov	b7ede701d8	[MemCpyOpt] Use BatchAA when processing one instruction (NFCI) While we can't use a single BatchAA instance for the entire MemCpyOpt run without further justification, we can use BatchAA while performing the queries related to a single instruction (these will first perform some AA-based checks, and then modify the IR only afterwards).	2022-12-06 10:16:39 +01:00
Krzysztof Parzyszek	f3b6dbfda8	Instructions: convert Optional to std::optional	2022-12-04 14:25:11 -06:00
Kazu Hirata	343de6856e	[Transforms] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:37 -08:00
OCHyams	e292f91291	[Assignment Tracking][17/*] Account for assignment tracking in memcpyopt The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir Maintain and propagate DIAssignID attachments in memcpyopt. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133312	2022-11-15 11:51:10 +00:00
Nikita Popov	4d33cf4166	[MemCpyOpt] Avoid moving lifetime marker above def (PR58903) This is unlikely to happen with opaque pointers, so just bail out of the transform, rather than trying to move bitcasts/etc as well. Fixes https://github.com/llvm/llvm-project/issues/58903.	2022-11-11 15:06:34 +01:00
Nikita Popov	9a45e4beed	[MemCpyOpt] Move lifetime marker before call to enable call slot optimization Currently call slot optimization may be prevented because the lifetime markers for the destination only start after the call. In this case, rather than aborting the transform, we should move the lifetime.start before the call to enable the transform. Differential Revision: https://reviews.llvm.org/D135886	2022-11-07 15:26:00 +01:00
Nikita Popov	b54b84fde6	[MemCpyOpt] Add additional debug output (NFC)	2022-10-13 17:03:44 +02:00
Nikita Popov	5fa14ee835	[MemCpyOpt] Don't hoist above producer of pointer operand This was already handled correctly below, but not checked for the original store pointer operand. Encountered when converting tests to opaque pointers, where the intermediate bitcast goes away.	2022-10-05 14:52:33 +02:00
Matt Arsenault	c867401407	MemCpyOpt: Pass through AssumptionCache	2022-09-19 19:25:22 -04:00
Matt Arsenault	0d8ffcc532	Analysis: Add AssumptionCache argument to isDereferenceableAndAlignedPointer This does not try to pass it through from the end users.	2022-09-19 18:57:33 -04:00
Guillaume Chatelet	3c126d5fe4	[Alignment] Replace commonAlignment with std::min `commonAlignment` is a shortcut to pick the smallest of two `Align` objects. As-is it doesn't bring much value compared to `std::min`. Differential Revision: https://reviews.llvm.org/D128345	2022-06-28 07:15:02 +00:00
Guillaume Chatelet	57ffff6db0	Revert "[NFC] Remove dead code" This reverts commit 8ba2cbff70f2c49a8926451c59cc260d67b706cf.	2022-06-22 14:55:47 +00:00
Guillaume Chatelet	8ba2cbff70	[NFC] Remove dead code	2022-06-22 13:33:58 +00:00
Guillaume Chatelet	f9bb8c24ac	[NFC][Alignment] Convert MemCpyOptimizer.cpp	2022-06-13 10:07:09 +00:00
Fangrui Song	d86a206f06	Remove unneeded cl::ZeroOrMore for cl::opt/cl::list options	2022-06-05 00:31:44 -07:00
Nikita Popov	a5c3b5748c	[MemCpyOpt] Work around PR54682 As discussed on https://github.com/llvm/llvm-project/issues/54682, MemorySSA currently has a bug when computing the clobber of calls that access loop-varying locations. I think a "proper" fix for this on the MemorySSA side might be non-trivial, but we can easily work around this in MemCpyOpt: Currently, MemCpyOpt uses a location-less getClobberingMemoryAccess() call to find a clobber on either the src or dest location, and then refines it for the src and dest clobber. This was intended as an optimization, as the location-less API is cached, while the location-affected APIs are not. However, I don't think this really makes a difference in practice, because I don't think anything will use the cached clobbers on those calls later anyway. On CTMark, this patch seems to be very mildly positive actually. So I think this is a reasonable way to avoid the problem for now, though MemorySSA should also get a fix. Differential Revision: https://reviews.llvm.org/D122911	2022-04-04 10:19:51 +02:00
Philip Reames	7c51669c21	[memcpyopt] Restructure store(load src, dest) form of callslotopt for compile time The search for the clobbering call is fairly expensive if uses are not optimized at construction. Defer the clobber walk to the point in the implementation we need it; there are a bunch of bailouts before that point. (e.g. If the source pointer is not an alloca, we can't do callslotopt.) On a test case which involves a bunch of copies from argument pointers, this switches memcpyopt from > 1/2 second to < 10ms.	2022-04-03 20:16:20 -07:00
Philip Reames	33deaa13b8	[memcpyopt] Common code into performCallSlotOptzn [NFC] We have the same code repeated in both callers, sink it into callee. The motivation here isn't just code style, we can also defer the relatively expensive aliasing checks until the cheap structural preconditions have been validated. (e.g. Don't bother aliasing if src is not an alloca.) This helps compile time significantly.	2022-03-28 20:10:13 -07:00
serge-sans-paille	59630917d6	Cleanup includes: Transform/Scalar Estimated impact on preprocessor output line: before: 1062981579 after: 1062494547 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120817	2022-03-03 07:56:34 +01:00

1 2 3 4 5 ...

405 Commits