PredicateInfo needs a no-op instruction to which the predicate can be
attached. Currently this is an ssa.copy intrinsic. This PR replaces it
with a no-op bitcast.
Using a bitcast is more efficient because we don't have the overhead of
an overloaded intrinsic. It also makes things slightly simpler overall.
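For illustration, a minimal sketch of how such a no-op can be created
with the C++ API (the helper name and insertion point are hypothetical,
not the actual PredicateInfo code):

  #include "llvm/IR/Instructions.h"
  using namespace llvm;

  // Hypothetical sketch: a bitcast of a value to its own type changes
  // nothing, but yields a fresh Value the predicate can be attached to,
  // without needing an overloaded intrinsic declaration.
  static Value *createPredicateNoOp(Value *Op, Instruction *InsertBefore) {
    return CastInst::Create(Instruction::BitCast, Op, Op->getType(),
                            Op->getName() + ".0", InsertBefore);
  }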
We currently generate code like this on x86 for a jump table with 5 elements,
assuming the call target is in rbx:
lea global_addr(%rip), %rax # initialize temporary rax with base address
mov %rbx, %rcx # initialize another temporary rcx for index (rbx will be used for the call, so it is still live)
sub %rax, %rcx # compute `address - base`
ror $0x3, %rcx # compute `(address - base) ror 3` i.e. index
cmp $0x4, %rcx # check index <= 4
ja .Ltrap
[...]
.Ltrap:
ud1
A more efficient instruction sequence, which needs only one temporary
register and one fewer instruction, is possible by subtracting the
address we are testing from the fixed address instead of vice versa:
lea (global_addr + 4*8)(%rip), %rax # initialize temporary rax with address of last element
sub %rbx, %rax # compute `last element - address`
ror $0x3, %rax # compute `(last element - address) ror 3` i.e. 4 - index
cmp $0x4, %rax # check 4 - index <= 4 (same as above)
ja .Ltrap
[...]
.Ltrap:
ud1
Change LowerTypeTests to generate that sequence. As a consequence, the
order of bits in the bitsets is reversed. Because it doesn't matter how we
do the subtraction on other architectures (to the best of my knowledge),
do so unconditionally.
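For illustration, the new check written out in plain C++ (a sketch under
the assumptions above: 8-byte jump table entries, hence the rotate by 3;
all names are illustrative):

  #include <bit>
  #include <cstdint>

  // LastElem is the address of the last jump table element.
  bool inJumpTable(uintptr_t Addr, uintptr_t LastElem, uintptr_t NumEntries) {
    uintptr_t Diff = LastElem - Addr;    // sub %rbx, %rax
    uintptr_t Idx = std::rotr(Diff, 3);  // ror: misalignment sets high bits
    return Idx <= NumEntries - 1;        // cmp/ja: one unsigned compare
  }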
Reviewers: fmayer, vitalybuka
Reviewed By: fmayer
Pull Request: https://github.com/llvm/llvm-project/pull/142887
It's possible for virtual constant propagation in whole program
devirtualization to create unaligned loads. We originally saw this with
4-byte aligned relative vtables where we could store 8-byte values
before/after the vtable. But since the vtable is 4-byte aligned and we
unconditionally do an 8-byte load, we can't guarantee that the stored
constant will always be aligned to 8 bytes. We can also see this with
normal vtables whenever a 1-byte char is stored in the vtable because
the offset calculation for the GEP doesn't take into account the
original vtable alignment.
This patch introduces two changes to virtual constant propagation:
1. Do not propagate constants whose preferred alignment is larger than
the vtable alignment. This is required because if the constants are
stored in the vtable, we can only guarantee the constant will be stored
at an address at most aligned to the vtable's alignment.
2. Round up the offset used in the GEP before the load to ensure it's at
an address suitably aligned such that we can load from it.
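A hedged sketch of the two rules (illustrative names, not the actual
WholeProgramDevirt code):

  #include "llvm/Support/MathExtras.h"
  #include <cstdint>

  // Rule 1: a constant stored before/after the vtable is only guaranteed
  // to be aligned to the vtable's own alignment.
  bool mayPropagate(uint64_t ConstPrefAlign, uint64_t VTableAlign) {
    return ConstPrefAlign <= VTableAlign;
  }

  // Rule 2: round the GEP byte offset up to the load's alignment so the
  // load is suitably aligned.
  uint64_t alignedLoadOffset(uint64_t ByteOffset, uint64_t LoadAlign) {
    return llvm::alignTo(ByteOffset, LoadAlign);
  }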
This patch updates tests to reflect this alignment change and adds some
cases for relative vtables.
When visiting BinaryOperator instructions during estimation of codesize
savings for a candidate specialization, don't bail when the other
operand is not found to be constant. This allows us to find more
constants than we otherwise would, for example `and(false, x)`.
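The idea, as a hedged sketch (hypothetical helper, not the actual
InstCostVisitor code):

  #include "llvm/IR/Constants.h"
  #include "llvm/IR/Instruction.h"
  using namespace llvm;

  // For some binary operators one known-constant operand already decides
  // the result, even when the other operand remains unknown.
  static Constant *foldWithOneKnownOperand(unsigned Opcode, Constant *C) {
    if (Opcode == Instruction::And && C->isNullValue())
      return C;     // and(false, x) == false
    if (Opcode == Instruction::Or && C->isAllOnesValue())
      return C;     // or(true, x) == true
    if (Opcode == Instruction::Mul && C->isNullValue())
      return C;     // mul(0, x) == 0
    return nullptr; // otherwise both operands are needed
  }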
Only compute the Latency component of a specialisation's Bonus when
necessary, to avoid unnecessarily computing the Block Frequency
Information for a Function.
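Roughly, the analysis is requested only on first use, e.g. (a sketch
assuming a FunctionAnalysisManager `FAM` is in scope; the analysis
manager caches the result):

  // BFI is only computed for functions whose latency bonus is actually
  // needed, instead of for every specialization candidate.
  auto GetBFI = [&FAM](Function &F) -> BlockFrequencyInfo & {
    return FAM.getResult<BlockFrequencyAnalysis>(F);
  };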
When propagating a constant to a select instruction we only consider the
condition operand as the use. I am extending the logic to also consider
the true and false values, in case the condition was found to be constant
in a previous propagation but the traversal halted at the select.
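A hedged sketch of the extended handling (illustrative, not the actual
visitor code):

  // If the condition is a known constant, pick the corresponding arm
  // once it becomes known; if both arms are the same constant, the
  // condition does not matter at all.
  static Constant *foldSelect(Constant *Cond, Constant *TrueC,
                              Constant *FalseC) {
    if (Cond)
      return Cond->isOneValue() ? TrueC : FalseC; // may still be null
    if (TrueC && TrueC == FalseC)
      return TrueC;
    return nullptr;
  }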
The new class implements a deduplication table to convert import list
elements:
{SourceModule, GUID, Definition/Declaration}
into 32-bit integers, and vice versa. This patch adds a unit test but
does not add a use yet.
To be precise, the deduplication table holds {SourceModule, GUID}
pairs. We use the bottom bit of the 32-bit integers to indicate whether
we have a definition or a declaration.
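A minimal sketch of the scheme (hypothetical shape, not the exact
class):

  #include "llvm/ADT/DenseMap.h"
  #include "llvm/ADT/StringRef.h"
  #include <cstdint>
  #include <tuple>
  #include <vector>
  using namespace llvm;

  // Intern {SourceModule, GUID} pairs into dense 32-bit IDs; the bottom
  // bit distinguishes definition (1) from declaration (0). The module
  // name strings are assumed to outlive the table.
  class ImportIDTableSketch {
    DenseMap<std::pair<StringRef, uint64_t>, uint32_t> Index;
    std::vector<std::pair<StringRef, uint64_t>> Table;

  public:
    uint32_t intern(StringRef Module, uint64_t GUID, bool IsDefinition) {
      auto [It, Inserted] = Index.try_emplace({Module, GUID}, Table.size());
      if (Inserted)
        Table.push_back({Module, GUID});
      return (It->second << 1) | (IsDefinition ? 1 : 0);
    }
    std::tuple<StringRef, uint64_t, bool> lookup(uint32_t ID) const {
      auto [Module, GUID] = Table[ID >> 1];
      return {Module, GUID, (ID & 1) != 0};
    }
  };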
A subsequent patch will collapse the import list hierarchy --
FunctionsToImportTy holding many instances of FunctionsToImportTy --
down to DenseSet<uint32_t> with each element indexing into the
deduplication table above. This will address multiple sources of
space inefficiency.
Move PassInstrumentationAnalysis into PassInstrumentation.h and stop
including it in PassManager.h (effectively inverting the direction of
the dependency).
Most places using PassManager are not interested in PassInstrumentation,
and we no longer have any uses of it in PassManager.h itself (only in
PassManagerImpl.h).
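In practice, code that needs PassInstrumentationAnalysis now includes
the dedicated header directly, e.g.:

  // Before: available transitively through PassManager.h.
  // After: include it explicitly; PassManager.h no longer pulls it in.
  #include "llvm/IR/PassInstrumentation.h"
  #include "llvm/IR/PassManager.h"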
Update the folder titles for targets in the monorepository that have not
been taken care of for some time. These are the folders that targets are
organized into in Visual Studio and Xcode
(`set_property(TARGET <target> PROPERTY FOLDER "<title>")`)
when using the respective CMake IDE generators.
* Ensure that every target is in a folder
* Use a folder hierarchy with each LLVM subproject as a top-level folder
* Use consistent folder names between subprojects
* When using target-creating functions from AddLLVM.cmake, automatically
  deduce the folder. This reduces the number of
  `set_property`/`set_target_properties` calls, but they are still
  necessary when `add_custom_target`, `add_executable`, `add_library`,
  etc. are used. An LLVM_SUBPROJECT_TITLE definition is used for that in
  each subproject's root CMakeLists.txt.
The `BlockFrequency` class abstracts `uint64_t` frequency values. Use it
more consistently in various APIs and disable implicit conversions to
make usage explicit.
- Use `BlockFrequency Freq` parameter for `setBlockFreq`,
`getProfileCountFromFreq` and `setBlockFreqAndScale` functions.
- Return `BlockFrequency` in `getEntryFreq()` functions.
- While at it, change some `const BlockFrequency &Freq` parameters to
  plain `BlockFrequency Freq`.
- Mark `BlockFrequency(uint64_t)` constructor as explicit.
- Add missing `BlockFrequency::operator!=`.
- Remove `uint64_t BlockFrequency::getMaxFrequency()`.
- Add `BlockFrequency BlockFrequency::max()` function.
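A hedged example of what usage looks like after the change (variable
names are illustrative):

  BlockFrequency Freq(42);                   // constructor is now explicit
  BFI.setBlockFreq(BB, Freq);                // takes BlockFrequency, not uint64_t
  BlockFrequency Entry = BFI.getEntryFreq(); // returns BlockFrequency
  if (Freq != Entry)                         // operator!= is now available
    Freq = BlockFrequency::max();            // replaces getMaxFrequency()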
Currently we make an arbitrary comparison between codesize and latency
in order to decide whether to keep a specialization or not. Sometimes
the latency savings are biased in favor of loops because of imprecise
block frequencies; therefore this metric contains a lot of noise. This
patch tries to address the problem as follows:
* Reject specializations whose codesize savings are less than X% of
the original function size.
* Reject specializations whose latency savings are less than Y% of
the original function size.
* Reject specializations whose inlining bonus is less than Z% of
the original function size.
I am not saying this is super precise, but at least X, Y and Z are
configurable, allowing us to tweak the cost model. Moreover, it lets
us prioritize codesize over latency, which is a less noisy metric.
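Reading the three rules literally, the gate looks roughly like this (a
sketch; the percentage constants are illustrative stand-ins for the
configurable knobs X, Y and Z):

  #include <cstdint>

  // Illustrative defaults; configurable in reality.
  constexpr uint64_t MinCodeSizePct = 20;
  constexpr uint64_t MinLatencyPct = 40;
  constexpr uint64_t MinInliningBonusPct = 300;

  // Reject a specialization unless every savings component clears its
  // percentage of the original function size.
  bool isProfitable(uint64_t CodeSizeSavings, uint64_t LatencySavings,
                    uint64_t InliningBonus, uint64_t FuncSize) {
    return CodeSizeSavings >= MinCodeSizePct * FuncSize / 100 &&
           LatencySavings >= MinLatencyPct * FuncSize / 100 &&
           InliningBonus >= MinInliningBonusPct * FuncSize / 100;
  }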
I am also increasing the minimum size a function should have to be
considered a candidate for specialization. Initially the cost of
a function was calculated as
CodeMetrics::NumInsts * InlineConstants::getInstrCost()
which D150464 later changed to just CodeMetrics::NumInsts, since
the metric is supposed to model TargetTransformInfo::TCK_CodeSize.
However, we omitted adjusting MinFunctionSize in that commit.
Differential Revision: https://reviews.llvm.org/D157123
Currently we only consider basic blocks with a unique predecessor when
estimating the size of dead code. However, we could expand this to
consider blocks with a back-edge, or blocks preceded by dead blocks.
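The current rule, sketched (hypothetical helper name):

  #include "llvm/IR/CFG.h"

  // Count a successor towards the dead-code estimate only when the
  // newly-dead block is its one and only predecessor.
  for (BasicBlock *Succ : successors(DeadBB))
    if (Succ->getUniquePredecessor() == DeadBB)
      estimateBlockAsDead(Succ); // hypothetical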
Differential Revision: https://reviews.llvm.org/D156903
Currently we use a combined metric TargetTransformInfo::TCK_SizeAndLatency
when estimating the specialization bonus. This is suboptimal, and in some
cases erroneous. For example we shouldn't be weighting the codesize decrease
attributed to constant propagation by the block frequency of the dead code.
Instead only the latency savings should be weighted by block frequency. The
total codesize savings from all the specialization arguments should be
deducted from the specialization cost.
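In a hedged sketch, the split looks like this (illustrative names):

  // Only the latency component is scaled by relative block frequency;
  // codesize savings count once, independent of how hot the block is.
  uint64_t weightedLatency(uint64_t LatencySavings, uint64_t BlockFreq,
                           uint64_t EntryFreq) {
    return LatencySavings * BlockFreq / EntryFreq;
  }
  // Profitability then weighs CodeSizeSavings directly against the
  // specialization cost, with weightedLatency(...) added on top.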
Differential Revision: https://reviews.llvm.org/D155103
This patch allows constant folding of PHIs when estimating the user
bonus. Phi nodes are a special case since some of their inputs may
remain unresolved until all the specialization arguments have been
processed by the InstCostVisitor. Therefore, we keep a list of dead
basic blocks and then lazily visit the Phi nodes once the user bonus
has been computed for all the specialization arguments.
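A hedged sketch of the deferred folding (hypothetical names):

  // Once all arguments are processed, fold each pending PHI whose live
  // incoming values all agree on one constant.
  for (PHINode *Phi : PendingPHIs) {
    Constant *Common = nullptr;
    bool Foldable = true;
    for (unsigned I = 0, E = Phi->getNumIncomingValues(); I != E; ++I) {
      if (DeadBlocks.contains(Phi->getIncomingBlock(I)))
        continue; // edges from dead blocks contribute no value
      Constant *In = findKnownConstant(Phi->getIncomingValue(I)); // hypothetical
      if (!In || (Common && In != Common)) {
        Foldable = false;
        break;
      }
      Common = In;
    }
    if (Foldable && Common)
      KnownConstants[Phi] = Common;
  }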
Differential Revision: https://reviews.llvm.org/D154852
This patch allows constant folding of PHIs when estimating the user
bonus. Phi nodes are a special case since some of their inputs may
remain unresolved until all the specialization arguments have been
processed by the InstCostVisitor. Therefore, we keep a list of dead
basic blocks and then lazily visit the Phi nodes once the user bonus
has been computed for all the specialization arguments.
Compared to the last revision, this one also fixes the bug reported on
Phabricator.
Differential Revision: https://reviews.llvm.org/D154852
Those are added by the SCCP Solver before invoking the Specializer.
They need to be removed, otherwise the destructor of PredicateInfo
complains.
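A sketch of the cleanup (assuming the standard IR APIs; not necessarily
the exact code):

  #include "llvm/ADT/STLExtras.h"
  #include "llvm/IR/InstIterator.h"
  #include "llvm/IR/IntrinsicInst.h"
  using namespace llvm;

  // Replace each ssa.copy with its operand and erase it before
  // PredicateInfo is destroyed.
  static void removeSSACopies(Function &F) {
    for (Instruction &I : make_early_inc_range(instructions(F))) {
      auto *II = dyn_cast<IntrinsicInst>(&I);
      if (!II || II->getIntrinsicID() != Intrinsic::ssa_copy)
        continue;
      II->replaceAllUsesWith(II->getOperand(0));
      II->eraseFromParent();
    }
  }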
Differential Revision: https://reviews.llvm.org/D156365
Reverting due to the crash reported in D154852.
Also reverting the subsequent commit as collateral damage:
"[FuncSpec] Split the specialization bonus into CodeSize and Latency."
Currently we use a combined metric TargetTransformInfo::TCK_SizeAndLatency
when estimating the specialization bonus. This is suboptimal, and in some
cases erroneous. For example we shouldn't be weighting the codesize decrease
attributed to constant propagation by the block frequency of the dead code.
Instead only the latency savings should be weighted by block frequency. The
total codesize savings from all the specialization arguments should be
deducted from the specialization cost.
Differential Revision: https://reviews.llvm.org/D155103
This patch allows constant folding of PHIs when estimating the user
bonus. Phi nodes are a special case since some of their inputs may
remain unresolved until all the specialization arguments have been
processed by the InstCostVisitor. Therefore, we keep a list of dead
basic blocks and then lazily visit the Phi nodes once the user bonus
has been computed for all the specialization arguments.
Differential Revision: https://reviews.llvm.org/D154852
As shown in D154820, the DataLayout-independent constant folding
interface is not good enough for handling GEPs. Instead we should
be using the DataLayout-aware constant folding interface. Since
there isn't a method to specifically handle GEPs, we can use the
one which folds generic instruction operands.
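Concretely, the DataLayout-aware entry point for generic instruction
operands can fold a GEP like so (sketch):

  #include "llvm/Analysis/ConstantFolding.h"
  using namespace llvm;

  // Fold `GEP` given its operands as constants, honoring the
  // DataLayout; returns null if no fold is possible.
  Constant *foldGEP(Instruction *GEP, ArrayRef<Constant *> Ops,
                    const DataLayout &DL) {
    return ConstantFoldInstOperands(GEP, Ops, DL);
  }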
Differential Revision: https://reviews.llvm.org/D154821
The InstCostVisitor is currently using the DataLayout-independent constant
folding interface. This is a workaround since we can't directly call
ConstantExpr::getGetElementPtr due to deprecation. This patch shows that
the constant folding interface we are using is not good enough.
Differential Revision: https://reviews.llvm.org/D154820
The specialization bonus is zero in some unittests because the basic blocks
containing the users of the constant arguments are executed less frequently
than the entry block. Sinking them into loops solves that.
Differential Revision: https://reviews.llvm.org/D153230
Instead of blindly traversing the use-def chain of constant arguments,
compute known constants along the way. Stop as soon as a user cannot
be replaced by a constant. Keep it light-weight by handling some basic
instruction types.
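A hedged sketch of the traversal (the tryFold helper is hypothetical):

  // Start from a constant argument, fold users transitively, and stop
  // at the first user that cannot be folded.
  SmallVector<Instruction *, 16> Worklist;
  DenseMap<Value *, Constant *> KnownConstants;
  KnownConstants[Arg] = C;
  for (User *U : Arg->users())
    if (auto *I = dyn_cast<Instruction>(U))
      Worklist.push_back(I);
  while (!Worklist.empty()) {
    Instruction *I = Worklist.pop_back_val();
    Constant *Folded = tryFold(I, KnownConstants); // hypothetical
    if (!Folded)
      continue; // not constant; don't traverse past this user
    KnownConstants[I] = Folded;
    for (User *U : I->users())
      if (auto *UI = dyn_cast<Instruction>(U))
        Worklist.push_back(UI);
  }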
Differential Revision: https://reviews.llvm.org/D150464
As reported on https://reviews.llvm.org/D150375#4367861 and
following, this change causes PDT invalidation issues. Revert
it and dependent commits.
This reverts commit 0524534d5220da5ecb2cd424a46520184d2be366.
This reverts commit ced90d1ff64a89a13479a37a3b17a411a3259f9f.
This reverts commit 9f992cc9350a7f7072a6dbf018ea07142ea7a7ed.
This reverts commit 1b1232047e83b69561fd64b9547cb0a0d374473a.
To do so we have to tweak the cost model such that specialization
does not trigger excessively.
Differential Revision: https://reviews.llvm.org/D150649
Instead of blindly traversing the use-def chain of constant arguments,
compute known constants along the way. Stop as soon as a user cannot
be replaced by a constant. Keep it light-weight by handling some basic
instruction types.
Differential Revision: https://reviews.llvm.org/D150464
Reverts 2dc7c7095153822ecd1a8f43aa4c185f9e80cc00 and instead repairs the
unittest properly. The test was broken in that it used references to
dead functions, assumed dead functions could reach code, assumed code
would not be deleted, and did not pre-query all assertion queries.
Arguably, the query AAs don't make it easy to use them outside the
Attributor pipeline; maybe we just should not (or should fix them
pessimistically). For now, the unittest is fixed.
This is a fairly large changeset, but it can be broken into a few
pieces:
- `llvm/Support/*TargetParser*` are all moved from the LLVM Support
component into a new LLVM Component called "TargetParser". This
potentially enables using tablegen to maintain this information, as
is shown in https://reviews.llvm.org/D137517. This cannot currently
be done, as llvm-tblgen relies on LLVM's Support component.
- This also moves two files from Support which use and depend on
information in the TargetParser:
- `llvm/Support/Host.{h,cpp}`, which contains functions for inspecting
  the host machine, primarily to support getting the host triple, but
  also for `-mcpu=native` support in e.g. Clang. This is fairly tightly
  intertwined with the information in `X86TargetParser.h`, so keeping
  them in the same component makes sense.
- `llvm/ADT/Triple.h` and `llvm/Support/Triple.cpp`, which contain
  the target triple parser and representation. This is very intertwined
  with the Arm target parser, because the Arm architecture version
  appears in canonical triples on Arm platforms.
- I moved the relevant unittests to their own directory.
And so, we end up with a single component that has all the information
about the following, which to me seems like a unified component:
- Triples that LLVM knows about
- Architecture names and CPUs that LLVM knows about
- CPU detection logic for LLVM
Given this, I have also moved `RISCVISAInfo.h` into this component, as
it seems to me to be part of that same set of functionality.
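After the move, include paths change accordingly, e.g.:

  // Old locations:
  //   #include "llvm/ADT/Triple.h"
  //   #include "llvm/Support/Host.h"
  // New locations in the TargetParser component:
  #include "llvm/TargetParser/Host.h"
  #include "llvm/TargetParser/Triple.h"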
If you get link errors in your components after this patch, you likely
need to add TargetParser into LLVM_LINK_COMPONENTS in CMake.
Differential Revision: https://reviews.llvm.org/D137838
We had two AAs for reachability, but it was very cumbersome to extend
them. We also had a fallback that used LLVM-core mechanisms and cached
the result. The new design shares the query code and interface nicely
between AAIntraFnReachability and AAInterFnReachability.
As part of the rewrite we also added the ExclusionSet to the queries.
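The shared query shape looks roughly like this (a sketch; the exact
signature may differ):

  // Can execution starting at `From` reach `To`, assuming the
  // instructions in `ExclusionSet` are never passed through?
  bool isAssumedReachable(Attributor &A, const Instruction &From,
                          const Instruction &To,
                          const AA::InstExclusionSetTy *ExclusionSet);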
This patch sorts unit test targets into directories corresponding to the
test source file directories to improve target navigation.
Reviewed By: smeenai
Differential Revision: https://reviews.llvm.org/D124810
This patch implements instruction reachability for the AAFunctionReachability
attribute. It is used to tell if a certain instruction can reach a function
transitively.
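Usage is along these lines (hedged sketch; the exact API may differ):

  // Ask whether instruction `I` can transitively reach function `Fn`.
  const auto &FnReach = A.getAAFor<AAFunctionReachability>(
      *this, IRPosition::function(*I.getFunction()), DepClassTy::REQUIRED);
  bool CanReach = FnReach.instructionCanReach(A, I, Fn);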
NOTE: I created a new commit based on D106720 and set the author back to
Kuter. Other metadata, etc. is wrong. I also addressed the
remaining review comments and fixed the unit test.
Differential Revision: https://reviews.llvm.org/D106720
This patch makes it possible to query callbase reachability
(can a callbase reach a function Fn transitively).
The patch moves the reachability query handling logic to a member class;
this class will have more users within the AA once we add other function
reachability queries.
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D106402
checkForAllInstructions was not handling declarations correctly.
It should return false when called on a declaration.
The patch also fixes a test case for AAFunctionReachability so that it
passes after the changes to checkForAllInstructions.
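The fix amounts to bailing out conservatively (sketch):

  // A declaration has no body, so "check for all instructions" cannot
  // be verified; return false instead of vacuously succeeding.
  if (!AssociatedFunction || AssociatedFunction->isDeclaration())
    return false;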
Differential Revision: https://reviews.llvm.org/D106625
This attribute uses Attributor's internal 'optimistic' call graph
information to answer queries about function call reachability.
Functions can become reachable over time as new call edges are
discovered.
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D104599
This patch makes use of the context bridges introduced in D83299 to make
AAValueConstantRange call site specific.
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D83744