llvm-project

Author	SHA1	Message	Date
Florian Hahn	34e75240cb	[GlobalOpt] Use structured bindings to access TypesVector (NFC). This helps to reduce the diff of a follow-up change and improves readability of the existing code.	2023-03-07 13:59:37 +01:00
Florian Hahn	226778e1b2	[GlobalOpt] Extend logic in SRA heuristic to skip stores of initializer. If all stores only store the initializer value of a global, consider it as not stored in the heuristic. GlobalOpt will remove such stores later on. Depends on D129857. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D144476	2023-03-07 11:08:06 +01:00
Nikita Popov	82f2ce7eb9	[GlobalOpt] Drop bitcast handling in global to alloca fold Pointer bitcasts no longer occur with opaque pointers -- and in this case not handling them allows us to drop the code for promoting constant expressions to instructions as well.	2023-03-02 17:12:00 +01:00
Nikita Popov	0ecef88cb1	[GlobalOpt] Use more specific type (NFC) SRA always works on GlobalVariables.	2023-03-02 11:45:27 +01:00
Florian Hahn	d94e8479fc	[GlobalOpt] Extend CleanupPointerRootUsers to handle CE users. Extend CleanupPointerRootUsers to iterate over a worklist, add users of constant expressions to the worklist to enable additional cleanups. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D144468	2023-03-02 10:12:24 +01:00
Nikita Popov	49aa3777f8	[GlobalOpt] Ignore only loaded / only stored global parts in global SRA heuristic When limiting the number of parts we split a global into, ignore any parts that are either only loaded or only stored, because we expect these to be optimized away after SRA. Differential Revision: https://reviews.llvm.org/D129857	2023-02-27 14:58:08 +01:00
Nikita Popov	7d9d855a07	[GlobalOpt] Add struct for parts during GlobalSRA (NFC) This is in preparation for https://reviews.llvm.org/D129857.	2023-02-22 16:35:10 +01:00
Vasileios Porpodas	823186b14d	Recommit: [NFC][IR] Make Module::getGlobalList() private This reverts commit cb5f239363a3c94db5425c105fcd45e77d2a16a9.	2023-02-14 15:12:51 -08:00
Vasileios Porpodas	cb5f239363	Revert "[NFC][IR] Make Module::getGlobalList() private" This reverts commit ed3e3ee9e30dfbffd2170a770a49b36a7f444916.	2023-02-14 14:29:42 -08:00
Vasileios Porpodas	ed3e3ee9e3	[NFC][IR] Make Module::getGlobalList() private This patch adds several missing GlobalList modifier functions, like removeGlobalVariable(), eraseGlobalVariable() and insertGlobalVariable(). There is no longer a need to access the list directly, so it also makes getGlobalList() private. Differential Revision: https://reviews.llvm.org/D144027	2023-02-14 14:25:10 -08:00
Arthur Eubanks	aecb36c848	[GlobalOpt] Remove legacy pass This is part of the optimization pipeline, of which the legacy pass manager version is deprecated.	2023-02-14 12:51:45 -08:00
Vasileios Porpodas	afad153a08	Recommit: [NFC][IR] Make Module::getAliasList() private This reverts commit 6d4a674acbc56458bb084878d82d16e393d45a6b.	2023-02-13 20:07:56 -08:00
Vasileios Porpodas	6d4a674acb	Revert "[NFC][IR] Make Module::getAliasList() private" This reverts commit b64f7d028bdcaf679130afeed9518c09663f6dc8.	2023-02-13 19:12:30 -08:00
Vasileios Porpodas	b64f7d028b	[NFC][IR] Make Module::getAliasList() private This patch adds several missing AliasList modifier functions, like removeAlias(), eraseAlias() and insertAlias(). There is no longer a need to access the list directly, so it also makes getAliasList() private. Differential Revision: https://reviews.llvm.org/D143958	2023-02-13 18:45:12 -08:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::getFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Kazu Hirata	343de6856e	[Transforms] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:37 -08:00
Kazu Hirata	a5f8a36d02	[IPO] Use std::optional in GlobalOpt.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:38:32 -08:00
Arthur Eubanks	8c49b01a1e	[GlobalOpt] Don't remove inalloca from varargs functions Varargs and inalloca have a weird interaction where varargs are actually passed via the inalloca alloca. Removing inalloca breaks the varargs because they're still not passed as separate arguments. Fixes #58718 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D137182	2022-11-01 13:04:05 -07:00
Mikael Holmen	51d4c7ceea	[GlobalOpt] Fix debug variance problem in hasOnlyColdCalls hasOnlyColdCalls skipped over calls to intrinsics, but it did so after checking the linkage of the called function. This meant that the presence of a call to a debug intrinsic could affect the outcome of the optimization. In my original reproducer (for an out of tree target) it was particularly interesting, because the actual IR after GlobalOpt was not different with debug intrinsics present, so -print-after-all printouts didn't show anything there. However, without debuginfo, GlobalOpt went further and ran BlockFrequencyAnalysis and (more importantly) LoopAnalysis, and later on in the pipeline, instcombine behaved in different ways when LoopInfo was present. So a call to a dbg.declare prevented running LoopAnalysis in GlobalOpt, which later prevented InstCombine from doing an optimization. The dbg-intrinsic-loopanalysis.ll testcase tries to expose this. Then I also noted that adding a dbg.declare actually made the existing testcase colccc_coldsites.ll generate different code, so I modified that to now test it behaves the same way with and without the dbg.declare. Reviewed By: nikic, fhahn Differential Revision: https://reviews.llvm.org/D133193	2022-09-02 12:29:44 +02:00
Cameron McInally	38d58c1b37	[GlobalOpt] Bail out of GlobalOpt SROA if a Scalable Vector is seen The SROA algorithm won't work for Scalable Vectors, since we don't know how many bytes are loaded/stored. Bail out if a Scalable Vector is seen. Differential Revision: https://reviews.llvm.org/D132417	2022-08-24 13:17:59 -07:00
Kazu Hirata	50724716cd	[Transforms] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-08-14 12:51:58 -07:00
Kazu Hirata	109df7f9a4	[llvm] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-08-13 12:55:42 -07:00
Kazu Hirata	ba0407ba86	[llvm] Use range-based for loops (NFC)	2022-08-07 00:16:21 -07:00
Kazu Hirata	acf648b5e9	Use llvm::less_first and llvm::less_second (NFC)	2022-07-24 16:21:29 -07:00
Nikita Popov	f45ab43332	[MemoryBuiltins] Avoid isAllocationFn() call before checking removable alloc Allow directly checking whether a given call is a removable allocation, instead of first checking whether it is an allocation.	2022-07-21 09:39:19 +02:00
Nikita Popov	8ee913d83b	[IR] Remove Constant::canTrap() (NFC) As integer div/rem constant expressions are no longer supported, constants can no longer trap and are always safe to speculate. Remove the Constant::canTrap() method and its usages.	2022-07-06 10:36:47 +02:00
Arthur Eubanks	e422c0d3b2	[GlobalOpt] Perform store->dominated load forwarding for stored once globals The initial land incorrectly optimized forwarding non-Constants in non-nosync/norecurse functions. Bail on non-Constants since norecurse should cause global -> alloca promotion anyway. The initial land also incorrectly assumed that StoredOnceStore was the only store to the global, but it actually means that only one value other than the global initializer is stored. Add a check that there's only one store. Compile time tracker: https://llvm-compile-time-tracker.com/compare.php?from=c80b88ee29f34078d2149de94e27600093e6c7c0&to=ef2c2b7772424b6861a75e794f3c31b45167304a&stat=instructions Reviewed By: nikic, asbirlea, jdoerfert Differential Revision: https://reviews.llvm.org/D128128	2022-06-24 09:09:26 -07:00
Arthur Eubanks	b5db65e0da	Reland [GlobalOpt] Preserve CFG analyses The only place we modify the CFG is when calling removeUnreachableBlocks(), so insert a callback there which invalidates analyses for that function (or recomputes DT in the legacy PM). We may delete functions, make sure to clear analyses for those functions. (this was missed in the original revision) Small compile time wins across the board: https://llvm-compile-time-tracker.com/compare.php?from=f444ea8ce0aaaa5ec1a4129809389da15cc41396&to=698f41f4fc26cbf1006ed5d88e9d658edfc5b749&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D128145	2022-06-21 09:19:59 -07:00
Arthur Eubanks	13ff7d6f39	Revert "[GlobalOpt] Perform store->dominated load forwarding for stored once globals" This reverts commit 6f348b146b69a50d5fb1b9fbfd14bc1d204e45c4. Am seeing internal test failures plus a linux kernel breakage reported due to this.	2022-06-20 10:26:47 -07:00
Arthur Eubanks	1cd2c72bef	Revert "[GlobalOpt] Preserve CFG analyses" This reverts commit cc65f3e167144c39ef9ca3a69c3148b71dcab496. Causes crashes: https://github.com/llvm/llvm-project/issues/56131	2022-06-20 10:25:10 -07:00
Arthur Eubanks	cc65f3e167	[GlobalOpt] Preserve CFG analyses The only place we modify the CFG is when calling removeUnreachableBlocks(), so insert a callback there which invalidates analyses for that function (or recomputes DT in the legacy PM). Small compile time wins across the board: https://llvm-compile-time-tracker.com/compare.php?from=f444ea8ce0aaaa5ec1a4129809389da15cc41396&to=698f41f4fc26cbf1006ed5d88e9d658edfc5b749&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D128145	2022-06-19 16:13:02 -07:00
Arthur Eubanks	6f348b146b	[GlobalOpt] Perform store->dominated load forwarding for stored once globals Compile time tracker: https://llvm-compile-time-tracker.com/compare.php?from=1e556f459b44dd0ca4073e932f66ecb6f40fe31a&to=6d7bed4e1e72c6a8592748626091274209740a40&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D128128	2022-06-19 10:27:20 -07:00
Arthur Eubanks	4a5201f484	[NFC][GlobalOpt] Remove unused parameters	2022-06-18 21:23:39 -07:00
Chuanqi Xu	0e10f12844	[NFC] Remove commented cerr debugging loggings There is some unused, commented-out cerr debug logging in the code. It is odd to keep such commented-out debug helpers in the product.	2022-06-08 15:58:06 +08:00
Fangrui Song	557efc9a8b	[llvm] Remove unneeded cl::ZeroOrMore for cl::opt options. NFC Some cl::ZeroOrMore were added to avoid the `may only occur zero or one times!` error. More were added due to cargo cult. Since the error has been removed, cl::ZeroOrMore is unneeded. Also remove cl::init(false) while touching the lines.	2022-06-03 21:59:05 -07:00
Alexander Shaposhnikov	badd088c57	[GlobalOpt] Enable optimization of constructors with different priorities Adjust `optimizeGlobalCtorsList` to handle the case of different priorities. This addresses the issue https://github.com/llvm/llvm-project/issues/55083. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D125278	2022-05-13 22:19:29 +00:00
Arthur Eubanks	b07aab8fc1	[GlobalOpt] Iterate over replaced values deterministically to constprop If there are pre-existing dead instructions, the order we visit replaced values can cause us sometimes to not delete dead instructions. The added test non-deterministically failed without the change.	2022-05-02 09:43:20 -07:00
David Green	9727c77d58	[NFC] Rename Instrinsic to Intrinsic	2022-04-25 18:13:23 +01:00
Nikita Popov	db561064f6	[GlobalOpt] Handle non-instruction MTI source (PR54572) This was reusing a cast to GlobalVariable to check for an Instruction, which means we'll try to dereference a null pointer if it's not actually a GlobalVariable. We should be casting MTI->getSource() instead. I don't think this problem is really specific to opaque pointers, but it certainly makes it a lot easier to reproduce. Fixes https://github.com/llvm/llvm-project/issues/54572.	2022-03-28 14:28:47 +02:00
serge-sans-paille	f1985a3f85	Cleanup includes: Transforms/IPO Preprocessor output diff: -238205 lines Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D122183	2022-03-22 10:06:28 +01:00
Fangrui Song	c6692f819e	[GlobalOpt] Don't replace alias with aliasee if either alias/aliasee may be preemptible Generalize D99629 for ELF. A default visibility non-local symbol is preemptible in a -shared link. `isInterposable` is an insufficient condition. Moreover, a non-preemptible alias may be referenced in a sub constant expression which intends to lower to a PC-relative relocation. Replacing the alias with a preemptible aliasee may introduce a linker error. Respect dso_preemptable and suppress optimization to fix the above issues. With the change, `alias = 345` will not be rewritten to use aliasee in a `-fpic` compile. ``` int aliasee; extern int alias __attribute__((alias("aliasee"), visibility("hidden"))); void foo() { alias = 345; } // intended to access the local copy ``` While here, refine the condition for the alias as well. For some binary formats like COFF, `isInterposable` is a sufficient condition. But I think canonicalization for the changed case has little advantage, so I don't bother to add the `Triple(M.getTargetTriple()).isOSBinFormatELF()` or `getPICLevel/getPIELevel` complexity. For instrumentations, it's recommended not to create aliases that refer to globals that have weak linkage or are preemptible. However, the following is supported and the IR needs to handle such cases. ``` int aliasee __attribute__((weak)); extern int alias __attribute__((alias("aliasee"))); ``` There are other places where GlobalAlias isInterposable usage may need to be fixed. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D107249	2022-03-18 14:17:05 -07:00
Alok Kumar Sharma	94823500a7	[DebugInfo][SROA] Correct debug info for global variables in case of SROA The existing handling produced crash for test case (attached with patch). Now the function transferSRADebugInfo is modified to - Ignore the current variable if it starts after the current Fragment. - Ignore the current variable if it ends before the current Fragment. - Generate (!DIExpression()) if current variable completely fits the current Fragment. - Otherwise (as earlier), generate the DW_OP_LLVM_fragment in IR if current Fragment partially defines current variable. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D121107	2022-03-10 00:41:30 +05:30
Arthur Eubanks	f0b61f7957	Revert "[GlobalOpt] Don't replace alias with aliasee if either alias/aliasee may be preemptible" This reverts commit 30e8f83c84c5a302a559722fc0d2973dc3f425ee. Causes huge compile time regressions on certain large files. Will followup offline with author.	2022-03-03 11:04:14 -08:00
Fangrui Song	30e8f83c84	[GlobalOpt] Don't replace alias with aliasee if either alias/aliasee may be preemptible Generalize D99629 for ELF. A default visibility non-local symbol is preemptible in a -shared link. `isInterposable` is an insufficient condition. Moreover, a non-preemptible alias may be referenced in a sub constant expression which intends to lower to a PC-relative relocation. Replacing the alias with a preemptible aliasee may introduce a linker error. Respect dso_preemptable and suppress optimization to fix the above issues. With the change, `alias = 345` will not be rewritten to use aliasee in a `-fpic` compile. ``` int aliasee; extern int alias __attribute__((alias("aliasee"), visibility("hidden"))); void foo() { alias = 345; } // intended to access the local copy ``` While here, refine the condition for the alias as well. For some binary formats like COFF, `isInterposable` is a sufficient condition. But I think canonicalization for the changed case has little advantage, so I don't bother to add the `Triple(M.getTargetTriple()).isOSBinFormatELF()` or `getPICLevel/getPIELevel` complexity. For instrumentations, it's recommended not to create aliases that refer to globals that have weak linkage or are preemptible. However, the following is supported and the IR needs to handle such cases. ``` int aliasee __attribute__((weak)); extern int alias __attribute__((alias("aliasee"))); ``` There are other places where GlobalAlias isInterposable usage may need to be fixed. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D107249	2022-02-01 10:41:16 -08:00
Nikita Popov	1652c3b80c	[GlobalOpt] Avoid early exit before dead constant check In a similar vein to 236fbf571dc6cebcb81ac5187a170c8de6d5bc0e, make sure we don't early-exit before the dead constant check.	2022-02-01 15:57:19 +01:00
Philip Reames	26049b8ce3	[GlobalOpt] Generalize malloc-to-global for any allocation function We can generalize the malloc-to-global transform for other allocation functions which are both a) removable, and b) have a known initialization value. One subtlety that I want to point out - mostly because I hadn't realized it was true until I took a closer look - is that the existing code doesn't prove that initialization/malloc happens only once. The initialization function can be called multiple times. This is correct without special handling for malloc as undef can map to any value previously written, but for a non-undef initializing allocation it means we may end up memsetting the new global repeatedly. In particular, this means it's not legal to fold the memset into the initializer of the global. Differential Revision: https://reviews.llvm.org/D117503	2022-01-17 15:06:23 -08:00
Nikita Popov	12bee2c054	[GlobalOpt] Drop an incorrect check This was a last-minute addition to D117249, and of course I ended up inverting the condition in a way that caused an uninitialized memory read. I've dropped it entirely, as I don't think we actually care whether the size is zero or not here. The previous code wasn't checking this either.	2022-01-17 10:10:56 +01:00
Nikita Popov	499f1ca79f	[GlobalOpt] Use generic type when converting malloc to global The malloc to global transform currently determines the type of the global by looking at bitcasts of the malloc. This is limited (the transform fails if there are multiple different types) and incompatible with opaque pointers. My initial approach was to construct an appropriate struct type based on usage in loads/stores. What this patch does instead is to always create an [i8 x AllocSize] global, without trying to guess types at all. This does mean that other transforms that require a certain global type may break. I fixed two of these in D117034 and D117223, which I believe should be sufficient to avoid regressions. In particular, the global SRA change should end up splitting the global into naturally-typed sub-globals, at which point all other optimizations should work. Differential Revision: https://reviews.llvm.org/D117092	2022-01-17 09:55:33 +01:00
Nikita Popov	4796b4ae7b	[GlobalOpt] Make global SRA offset based Currently global SRA uses the GEP structure to determine how to split the global. This patch instead analyses the loads and stores that are performed on the global, and collects which types are used at which offset, and then splits the global according to those. This is both more general, and works fine with opaque pointers. This is also closer to how ordinary SROA is performed. Differential Revision: https://reviews.llvm.org/D117223	2022-01-17 09:28:36 +01:00
Nikita Popov	1cbb456123	[GlobalOpt] Fix global to select transform under opaque pointers We need to check that the load/store type is also the same, as this is no longer implicitly checked through the pointer type.	2022-01-13 11:13:06 +01:00

723 Commits