llvm-project

Author	SHA1	Message	Date
Daniel Thornburgh	fecf609998	Reland "[LTO][LLD] Prevent invalid LTO libfunc transforms (#164916 )" (#190642 ) This reverts commit 1ec7e86b3a779df2a0af3f37e58c8f5b3a398d7f after issue #190072 was fixed.	2026-04-06 19:20:45 +00:00
Kewen Meng	1ec7e86b3a	Revert "[LTO][LLD] Prevent invalid LTO libfunc transforms (#164916 )" This reverts commit 8b21fe60b43fe358321bca904ae307406725c002. to unblock bot: https://lab.llvm.org/buildbot/#/builders/67/builds/1196	2026-03-30 22:25:25 -05:00
Daniel Thornburgh	8b21fe60b4	[LTO][LLD] Prevent invalid LTO libfunc transforms (#164916 ) In LTO, part of LLVM's middle-end runs after linking has finished. LTO's semantics depend on the complete set of extracted bitcode files being known at this time. If the middle-end inserts new calls to library functions (libfuncs) that are implemented in bitcode, this could extract new bitcode object files into the link. These cannot be compiled, leading to undefined symbol references. Additionally, the middle-end in LTO may reason that such library functions have no references, and it may internalize them, then manipulate their API or even delete them. Afterwards, it may emit a call to them, again producing undefined symbol references. This patch resolves the former issue by ensuring that the middle end emits no new references to symbols defined in bitcode, and it resolves the latter issue by ensuring that extracted bitcode for libfuncs is considered external, since new calls may be emitted to them at any time. The new semantics are not yet established for MachO LLD, which does not yet appear to have any special handling for libcalls in LTO. It also does not yet support distributed ThinLTO; doing so would require additional (de)serialization work. This is the patch referenced in @ilovepi's and my talk at the last LLVM devmeeting: "LT-Uh-Oh" Gemini 3.1 was used in porting to COFF and WASM LLDs.	2026-03-30 14:44:52 -07:00
Nuri Amari	23cb4e5f46	Support -fpass-plugin + -fthinlto-index together (#183525 ) Without this change, passing -fthinlto-index causes -fpass-plugin arguments to be ignored. We want to be able to use plugins with distributed thin-lto, so add support for this.	2026-03-06 10:17:01 -05:00
Tianle Liu	7f96336a22	[LTOBackend] Add -filter-save-modules= for save-temps to only dump specific modules. (#175696 ) Only if module's name contains a string in -filter-save-modules, will its temp BC files be saved. If -filter-save-modules= not set, all modules' BC files will be saved. This feature is more useful for ThinLto when huge numbers of modules are built separately. Using -filter-save-modules= can reduce build time and size of generated files, even avoid crash if some other unrelated files have issues during BC files dumping.	2026-01-19 09:34:25 +08:00
Alexis Engelke	f54df0d09e	[LLVM][NFC] Move PassPlugin from Passes to separate library This avoid pulling in the entire Passes library with all passes as dependencies when just referring to PassPlugin, which is in fact independent of the Passes themselves. Pull Request: https://github.com/llvm/llvm-project/pull/173279	2025-12-22 18:11:34 +01:00
Alexis Engelke	2634a2bda1	Revert "[LLVM][NFC] Move PassPlugin from Passes to Extensions lib" and subsequent commit This reverts commit d87b47d3a893b849cfd1ee5309b9fec2b0aec8cd. This reverts commit f7ed3d44a198bfe689a1aa284452e875d5bb8a55.	2025-12-22 15:51:37 +00:00
Alexis Engelke	d87b47d3a8	[LLVM][NFC] Move PassPlugin from Passes to Extensions lib This avoid pulling in the entire Passes library with all passes as dependencies when just referring to PassPlugin, which is in fact independent of the Passes themselves. Pull Request: https://github.com/llvm/llvm-project/pull/172478	2025-12-22 11:42:27 +01:00
Usha Gupta	edf80dce39	[LTO][Veclib] Fix vector library handling with LTO (#170638 ) Commit #167996 moved VecLib into TargetOptions and ensured clang properly sets it. However, some LTO backend code paths were still creating _TargetLibraryInfoImpl_ without passing the VecLib parameter from `TargetMachine::Options`. This PR completes the fix by ensuring that: _LTOBackend.cpp, ThinLTOCodeGenerator.cpp, UpdateCompilerUsed.cpp_ all pass `TM->Options.VecLib` when constructing _TargetLibraryInfoImpl_. Without this fix, vector library information (e.g., -fveclib=ArmPL) would not be properly recognized during LTO optimization and code generation, potentially causing incorrect optimizations or linker errors when vector library functions are referenced.	2025-12-09 11:36:53 +00:00
Matt Arsenault	04c81a9973	CodeGen: Add LibcallLoweringInfo analysis pass (#168622 ) The libcall lowering decisions should be program dependent, depending on the current module's RuntimeLibcallInfo. We need another related analysis derived from that plus the current function's subtarget to provide concrete lowering decisions. This takes on a somewhat unusual form. It's a Module analysis, with a lookup keyed on the subtarget. This is a separate module analysis from RuntimeLibraryAnalysis to avoid that depending on codegen. It's not a function pass to avoid depending on any particular function, to avoid repeated subtarget map lookups in most of the use passes, and to avoid any recomputation in the common case of one subtarget (and keeps it reusable across repeated compilations). This also switches ExpandFp and PreISelIntrinsicLowering as a sample function and module pass. Note this is not yet wired up to SelectionDAG, which is still using the LibcallLoweringInfo constructed inside of TargetLowering.	2025-12-03 22:00:12 +01:00
Teresa Johnson	683e2bf059	[ThinLTO] Make SummaryList private (NFC) (#164355 ) In preparation for a follow on change that will require checking every time a new summary is added to the SummaryList for a GUID, make the SummaryList private and require all accesses to go through one of two new interfaces. Most changes are to access the list via the read only getSummaryList() method, and the few that add new summaries (e.g. while building the combined summary) use the new addSummary() method.	2025-10-21 06:53:40 -07:00
Teresa Johnson	2a7e7e2ac4	[MemProf] Convert removal of memprof attrs and metadata to a pass (#163841 ) In preparation for a follow on fix that removes these attributes and metadata in non-LTO pipelines, convert updateMemProfAttributes to a new MemProfRemoveInfo pass that executes at the start of the LTO backend pass pipelines when we don't have an index indicating that we linked with a library support hot cold operator new. This is largely NFC from an end user perspective but changes where the removal can be observed, hence the test updates. A follow on change will use the new pass for non-LTO pipelines (for cases when the bitcode is initially matched with memprof data but we decide to complete the compile without LTO).	2025-10-16 12:25:51 -07:00
Jan Svoboda	a5569b4bd7	[llvm] Add `vfs::FileSystem` to `PassBuilder` (#160188 ) Some LLVM passes need access to the filesystem to read configuration files and similar. In some places, this is achieved by grabbing the VFS from `PGOOptions`, but some passes don't have access to these and resort to just calling `vfs::getRealFileSystem()`. This PR allows setting the VFS directly on `PassBuilder` that's able to pass it down to all passes that need it.	2025-09-25 10:15:47 -07:00
Tobias Stadler	dfbd76bda0	[Remarks] Restructure bitstream remarks to be fully standalone (#156715 ) Currently there are two serialization modes for bitstream Remarks: standalone and separate. The separate mode splits remark metadata (e.g. the string table) from actual remark data. The metadata is written into the object file by the AsmPrinter, while the remark data is stored in a separate remarks file. This means we can't use bitstream remarks with tools like opt that don't generate an object file. Also, it is confusing to post-process bitstream remarks files, because only the standalone files can be read by llvm-remarkutil. We always need to use dsymutil to convert the separate files to standalone files, which only works for MachO. It is not possible for clang/opt to directly emit bitstream remark files in standalone mode, because the string table can only be serialized after all remarks were emitted. Therefore, this change completely removes the separate serialization mode. Instead, the remark string table is now always written to the end of the remarks file. This requires us to tell the serializer when to finalize remark serialization. This automatically happens when the serializer goes out of scope. However, often the remark file goes out of scope before the serializer is destroyed. To diagnose this, I have added an assert to alert users that they need to explicitly call finalizeLLVMOptimizationRemarks. This change paves the way for further improvements to the remark infrastructure, including more tooling (e.g. #159784), size optimizations for bitstream remarks, and more. Pull Request: https://github.com/llvm/llvm-project/pull/156715	2025-09-22 16:41:39 +01:00
Alexandre Ganea	5cda2424c8	[LLD][COFF] Add more `--time-trace` tags for ThinLTO linking (#156471 ) In order to better see what's going on during ThinLTO linking, this PR adds more profile tags when using `--time-trace` on a `lld-link.exe` invocation. After PR, linking `clang.exe`: <img width="3839" height="2026" alt="Capture d’écran 2025-09-02 082021" src="https://github.com/user-attachments/assets/bf0c85ba-2f85-4bbf-a5c1-800039b56910" /> Linking a custom (Unreal Engine game) binary gives a completly different picture, probably because of using Unity files, and the sheer amount of input files (here, providing over 60 GB of .OBJs/.LIBs). <img width="1940" height="1008" alt="Capture d’écran 2025-09-02 102048" src="https://github.com/user-attachments/assets/60b28630-7995-45ce-9e8c-13f3cb5312e0" />	2025-09-05 15:28:19 -04:00
Kazu Hirata	64e89353b2	[LTO] Remove unused includes (NFC) (#141355 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-05-24 09:37:37 -07:00
Kewen12	851da60304	Revert "[LLVM] Use `reportFatalUsageError` for LTO usage errors" (#141000 ) The PR causes check-lld fail: >TEST 'lld :: COFF/lto-cache-errors.ll' Tested on local revert and pass the check. Reverts llvm/llvm-project#140955	2025-05-22 00:07:38 -04:00
bd1976bris	b4d2e502e0	[LLVM] Use `reportFatalUsageError` for LTO usage errors (#140955 ) Usage errors in `LTOBackend.cpp` were previously, misleadingly, reported as internal crashes. This PR updates `LTOBackend.cpp` to use `reportFatalUsageError` for reporting usage-related issues. LLVM Issue: https://github.com/llvm/llvm-project/issues/140953 Internal Tracker: TOOLCHAIN-17744	2025-05-21 23:21:59 +01:00
Alexey Karyakin	eac7466448	[LTO] Fix a crash with thin LTO caching and asm output (#138203 ) The `CacheStream::commit()` function (defined in Caching.cpp) deletes the underlying raw stream. Some output streamers may hold a pointer to it, which then will outlive the stream object. In particular, MCAsmStreamer keeps the pointer to the raw stream though a separate `formatted_raw_stream` object, which buffers data and there is no path to explicitly flush this data. Before this change, the buffered data was flushed during the MCAsmStreamer destructor. After #136121, this happened after the `commit()` function is called. Therefore, it caused a crash because the `formatted_raw_stream` object tries to write the buffered data into a deleted raw stream. Even if we don't delete the stream to avoid the crash, it would be too late as the output stream cannot accept data after commit(). Fixes: #138194.	2025-05-14 10:33:57 -05:00
Nikita Popov	b492ec5899	[ErrorHandling] Add reportFatalInternalError + reportFatalUsageError (NFC) (#138251 ) This implements the result of the discussion at: https://discourse.llvm.org/t/rfc-report-fatal-error-and-the-default-value-of-gencrashdialog/73587 There are two different use cases for report_fatal_error, so replace it with two functions reportFatalInternalError() and reportFatalUsageError(). The former indicates a bug in LLVM and generates a crash dialog. The latter does not. The names have been suggested by rnk and people seemed to like them. This replaces a lot of the usages that passed an explicit value for GenCrashDiag. I did not bulk replace remaining report_fatal_error usage -- they probably require case by case review for which function to use.	2025-05-05 12:10:03 +02:00
anjenner	c3f815ba82	Modify the localCache API to require an explicit commit on CachedFile… (#136121 ) …Stream. CachedFileStream has previously performed the commit step in its destructor, but this means its only recourse for error handling is report_fatal_error. Modify this to add an explicit commit() method, and call this in the appropriate places with appropriate error handling for the location. Currently the destructor of CacheStream gives an assert failure in Debug builds if commit() was not called. This will help track down any remaining uses of the API that assume the old destructior behaviour. In Release builds we fall back to the previous behaviour and call report_fatal_error if the commit fails. This is version 2 of this PR, superseding reverted PR https://github.com/llvm/llvm-project/pull/115331 . I have incorporated a change to the testcase to make it more reliable on Windows, as well as two follow-up changes (`df79000896` and `b0baa1d8bd`) that were also reverted when 115331 was reverted. --------- Co-authored-by: Augie Fackler <augie@google.com> Co-authored-by: Vitaly Buka <vitalybuka@google.com>	2025-04-22 09:45:15 +01:00
Nikita Popov	f137c3d592	[TargetRegistry] Accept Triple in createTargetMachine() (NFC) (#130940 ) This avoids doing a Triple -> std::string -> Triple round trip in lots of places, now that the Module stores a Triple.	2025-03-12 17:35:09 +01:00
Douglas Yung	1d763f3833	Revert "Modify the localCache API to require an explicit commit on CachedFile… (#115331 )" This reverts commit ce9e1d3c15ed6290f1cb07b482939976fa8115cd. The unittest added in this commit seems to be flaky causing random failure on buildbots: - https://lab.llvm.org/buildbot/#/builders/46/builds/13235 - https://lab.llvm.org/buildbot/#/builders/46/builds/13232 - https://lab.llvm.org/buildbot/#/builders/46/builds/13228 - https://lab.llvm.org/buildbot/#/builders/46/builds/13224 - https://lab.llvm.org/buildbot/#/builders/46/builds/13220 - https://lab.llvm.org/buildbot/#/builders/46/builds/13210 - https://lab.llvm.org/buildbot/#/builders/46/builds/13208 - https://lab.llvm.org/buildbot/#/builders/46/builds/13207 - https://lab.llvm.org/buildbot/#/builders/46/builds/13202 - https://lab.llvm.org/buildbot/#/builders/46/builds/13196 and - https://lab.llvm.org/buildbot/#/builders/180/builds/14266 - https://lab.llvm.org/buildbot/#/builders/180/builds/14254 - https://lab.llvm.org/buildbot/#/builders/180/builds/14250 - https://lab.llvm.org/buildbot/#/builders/180/builds/14245 - https://lab.llvm.org/buildbot/#/builders/180/builds/14244 - https://lab.llvm.org/buildbot/#/builders/180/builds/14226	2025-03-08 23:54:57 +00:00
anjenner	ce9e1d3c15	Modify the localCache API to require an explicit commit on CachedFile… (#115331 ) …Stream. CachedFileStream has previously performed the commit step in its destructor, but this means its only recourse for error handling is report_fatal_error. Modify this to add an explicit commit() method, and call this in the appropriate places with appropriate error handling for the location. Currently the destructor of CacheStream gives an assert failure in Debug builds if commit() was not called. This will help track down any remaining uses of the API that assume the old destructior behaviour. In Release builds we fall back to the previous behaviour and call report_fatal_error if the commit fails.	2025-03-07 17:58:36 +00:00
Kito Cheng	733ad3fdeb	[LTO] Override TargetABI from module flags if present when creating TargetMachine (#126497 ) …argetMachine RISC-V's data layout is determined by the ABI, not just the target triple. However, the TargetMachine is created using the data layout from the target triple, which is not always correct. This patch uses the target ABI from the module and passes it to the TargetMachine, ensuring that the data layout is set correctly according to the ABI. The same problem will happen with other targets like MIPS, but unfortunately, MIPS didn't emit the target-abi into the module flags, so this patch only fixes the issue for RISC-V. NOTE: MIPS with -mabi=n32 can trigger the same issue. Another possible solution is add new parameter to the TargetMachine constructor, but that would require changes in all the targets.	2025-03-07 14:09:26 +08:00
Nikita Popov	979c275097	[IR] Store Triple in Module (NFC) (#129868 ) The module currently stores the target triple as a string. This means that any code that wants to actually use the triple first has to instantiate a Triple, which is somewhat expensive. The change in #121652 caused a moderate compile-time regression due to this. While it would be easy enough to work around, I think that architecturally, it makes more sense to store the parsed Triple in the module, so that it can always be directly queried. For this change, I've opted not to add any magic conversions between std::string and Triple for backwards-compatibilty purses, and instead write out needed Triple()s or str()s explicitly. This is because I think a decent number of them should be changed to work on Triple as well, to avoid unnecessary conversions back and forth. The only interesting part in this patch is that the default triple is Triple("") instead of Triple() to preserve existing behavior. The former defaults to using the ELF object format instead of unknown object format. We should fix that as well.	2025-03-06 10:27:47 +01:00
Teresa Johnson	799955eb17	[ThinLTO] Skip opt pipeline and summary wrapper pass on empty modules (#120143 ) Follow up to PR118508, to avoid unnecessary compile time for an empty combind regular LTO module if all modules end up being ThinLTO only. This required minor changes to a few tests to ensure they weren't empty.	2025-01-10 19:33:20 -08:00
Rahul Joshi	5c8c90d821	[NFC][LLVM] Fix a couple of build warnings (#116490 ) - Fix `HANDLE_EXTENSION` macro redefinition warning in LTOBackend.cpp - Fix "unnecessary brackets" around rf/df variable definitions warning.	2024-11-17 07:54:41 -08:00
Kyungwoo Lee	dc85d5263e	[CGData][ThinLTO] Global Outlining with Two-CodeGen Rounds (#90933 ) This feature is enabled by `-codegen-data-thinlto-two-rounds`, which effectively runs the `-codegen-data-generate` and `-codegen-data-use` in two rounds to enable global outlining with ThinLTO. 1. The first round: Run both optimization + codegen with a scratch output. Before running codegen, we serialize the optimized bitcode modules to a temporary path. 2. From the scratch object files, we merge them into the codegen data. 3. The second round: Read the optimized bitcode modules and start the codegen only this time. Using the codegen data, the machine outliner effectively performs the global outlining. Depends on #90934, #110461 and #110463. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753.	2024-10-09 15:37:41 -07:00
Kyungwoo Lee	c1959813d6	[CGData][ThinLTO][NFC] Prep for two-codegen rounds (#90934 ) This is NFC for https://github.com/llvm/llvm-project/pull/90933. - Create a lambda function, `RunBackends`, to group the backend operations into a single function. - Explicitly pass the `CodeGenOnly` argument to thinBackend, instead of depending on a configuration value. Depends on https://github.com/llvm/llvm-project/pull/90304. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753.	2024-10-03 09:58:01 -07:00
Kazu Hirata	3563907969	[LTO] Turn ImportMapTy into a proper class (NFC) (#105748 ) This patch turns type alias ImportMapTy into a proper class to provide a more intuitive interface like: ImportList.addDefinition(...) as opposed to: FunctionImporter::addDefinition(ImportList, ...) Also, this patch requires all non-const accesses to go through addDefinition, maybeAddDeclaration, and addGUID while providing const accesses via: const ImportMapTyImpl &getImportMap() const { return ImportMap; } I realize ImportMapTy may not be the best name as a class (maybe OK as a type alias). I am not renaming ImportMapTy in this patch at least because there are 47 mentions of ImportMapTy under llvm/.	2024-08-22 21:56:01 -07:00
Kazu Hirata	3082a381f5	[LTO] Introduce helper functions to add GUIDs to ImportList (NFC) (#105555 ) The new helper functions make the intent clearer while hiding implementation details, including how we handle previously added entries. Note that: - If we are adding a GUID as a GlobalValueSummary::Definition, then we override a previously added GlobalValueSummary::Declaration entry for the same GUID. - If we are adding a GUID as a GlobalValueSummary::Declaration, then a previously added GlobalValueSummary::Definition entry for the same GUID takes precedence, and no change is made.	2024-08-22 12:06:47 -07:00
macurtis-amd	26e455bac0	[lld][LTO] Teach LTO to print pipeline passes (#101018 ) I found this useful while debugging code generation differences between old and new offloading drivers. No functional change (intended).	2024-07-29 15:56:43 -04:00
Joel E. Denny	d29fdfbc4e	[LTO] Avoid assert fail on failed pass plugin load (#96691 ) Without this patch, passing -load-pass-plugin=nonexistent.so to llvm-lto2 produces a backtrace because LTOBackend.cpp does not handle the error correctly: ``` Failed to load passes from 'nonexistant.so'. Request ignored. Expected<T> must be checked before access or destruction. Unchecked Expected<T> contained error: Could not load library 'nonexistant.so': nonexistant.so: cannot open shared object file: No such file or directoryPLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace. ``` Any tool using `lto::Config::PassPlugins` should suffer similarly. Based on the message "Request ignored" and the continue statement, the intention was apparently to continue on failure to load a plugin. However, no one appears to rely on that behavior now given that it crashes instead, and terminating is consistent with opt.	2024-06-26 14:51:24 -04:00
Mingming Liu	8d9db947b7	Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#95482 ) Make `FunctionsToImportTy` an `unordered_map` rather than `DenseMap`. Credit goes to jvoung@ for the 'DenseMap -> unordered_map' change. This is a reland of https://github.com/llvm/llvm-project/pull/92718 * `DenseMap` allocates space for a large number of key/value pairs and wastes space when the number of elements are small. * While init bucket size is zero [1], it quickly allocates buckets for 64 elements [2] when the number of elements is small (for example, 3 or 4 elements). The programmer manual [3] also mentions it could waste space. * Experiments show `FunctionsToImportTy.size()` is smaller than 4 for multiple binaries with high indexing ram usage. `unordered_map` grows factor is at most 2 in llvm libc [4] for insert operations. With this change, `ComputeCrossModuleImport` ram increase is smaller than 0.5G on a couple of binaries with high indexing ram usage. A wider range of (pre-release) tests pass. [1] `ad79a14c9e/llvm/include/llvm/ADT/DenseMap.h (L431-L432)` [2] `ad79a14c9e/llvm/include/llvm/ADT/DenseMap.h (L849)` [3] https://llvm.org/docs/ProgrammersManual.html#llvm-adt-densemap-h [4] `ad79a14c9e/libcxx/include/__hash_table (L1525-L1526)` Original commit message The goal is to populate `declaration` import status if a new flag `-import-declaration` is on. * For in-process ThinLTO, the `declaration` status is visible to backend `function-import` pass, so `FunctionImporter::importFunctions` should read the import status and be no-op for declaration summaries. Basically, the postlink pipeline is updated to keep its current behavior (import definitions), but not updated to handle `declaration` summaries. Two use cases ([better call-graph sort](https://discourse.llvm.org/t/rfc-for-better-call-graph-sort-build-a-more-complete-call-graph-by-adding-more-indirect-call-edges/74029#support-cross-module-function-declaration-import-5) or [cross-module auto-init](https://github.com/llvm/llvm-project/pull/87597#discussion_r1556067195)) would use this bit differently. * For distributed ThinLTO, the `declaration` status is not serialized to bitcode. As discussed, https://github.com/llvm/llvm-project/pull/87600 will do this.	2024-06-20 10:50:31 -07:00
Abhina Sree	d3342e5b92	[SystemZ][z/OS] Continue marking text files with OF_Text (#95111 ) Text files should be opened with OF_Text to have the correct encoding.	2024-06-12 09:22:21 -04:00
Mingming Liu	707f4de428	Revert "Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92718 ) (#94503 ) This reverts commit e33db249b53fb70dce62db3ebd82d42239bd1d9d. The change from set to map increases memory usage, and caused indexing OOM in some applications. Need to profile offline to bring the memory usage down.	2024-06-05 10:06:55 -07:00
Mingming Liu	e33db249b5	Reland "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92718 ) The original PR is reviewed in https://github.com/llvm/llvm-project/pull/88024, and this PR adds one line (`b9f04d199d`) to fix test Limit to one thread for in-process ThinLTO to test `LLVM_DEBUG` log. - This should fix build bot failure like https://lab.llvm.org/buildbot/#/builders/259/builds/4727 and https://lab.llvm.org/buildbot/#/builders/9/builds/43876 - I could repro the failure and see interleaved log messages by using `-thinlto-threads=all` Original Commit Message: The goal is to populate `declaration` import status if a new flag `-import-declaration` is on. * For in-process ThinLTO, the `declaration` status is visible to backend `function-import` pass, so `FunctionImporter::importFunctions` should read the import status and be no-op for declaration summaries. Basically, the postlink pipeline is updated to keep its current behavior (import definitions), but not updated to handle `declaration` summaries. Two use cases ([better call-graph sort](https://discourse.llvm.org/t/rfc-for-better-call-graph-sort-build-a-more-complete-call-graph-by-adding-more-indirect-call-edges/74029#support-cross-module-function-declaration-import-5) or [cross-module auto-init](https://github.com/llvm/llvm-project/pull/87597#discussion_r1556067195)) would use this bit differently. * For distributed ThinLTO, the `declaration` status is not serialized to bitcode. As discussed, https://github.com/llvm/llvm-project/pull/87600 will do this.	2024-05-20 08:55:31 -07:00
Kazu Hirata	32ae9a28a5	[llvm] Use SmallString::str (NFC) (#92712 )	2024-05-19 22:48:06 -07:00
Mingming Liu	6b0733e3a3	Revert "[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option" (#92715 ) Reverts llvm/llvm-project#88024 Build bot failures (https://lab.llvm.org/buildbot/#/builders/259/builds/4727 and https://lab.llvm.org/buildbot/#/builders/9/builds/43876)	2024-05-19 22:42:18 -07:00
Mingming Liu	8de7890572	[ThinLTO] Populate declaration import status except for distributed ThinLTO under a default-off new option (#88024 ) The goal is to populate `declaration` import status if a new flag`-import-declaration` is on. * For in-process ThinLTO, the `declaration` status is visible to backend `function-import` pass, so `FunctionImporter::importFunctions` should read the import status and be no-op for declaration summaries. Basically, the postlink pipeline is updated to keep its current behavior (import definitions), but not updated to handle `declaration` summaries. Two use cases (better call-graph sort and cross-module auto-init) would use this bit differently. * For distributed ThinLTO, the `declaration` status is not serialized to bitcode. As discussed, https://github.com/llvm/llvm-project/pull/87600 will do this. [1] https://discourse.llvm.org/t/rfc-for-better-call-graph-sort-build-a-more-complete-call-graph-by-adding-more-indirect-call-edges/74029#support-cross-module-function-declaration-import-5 [2] https://github.com/llvm/llvm-project/pull/87597#discussion_r1556067195	2024-05-19 22:22:47 -07:00
Pierre van Houtryve	e86ebe4ff8	[LTO] Allow target-specific module splittting (#83128 ) Allow targets to implement custom module splitting logic for --lto-partitions, see #89245 https://discourse.llvm.org/t/rfc-lto-target-specific-module-splittting/77252	2024-04-22 08:59:18 +02:00
Mehdi Amini	716042a63f	Rename llvm::ThreadPool -> llvm::DefaultThreadPool (NFC) (#83702 ) The base class llvm::ThreadPoolInterface will be renamed llvm::ThreadPool in a subsequent commit. This is a breaking change: clients who use to create a ThreadPool must now create a DefaultThreadPool instead.	2024-03-05 18:00:46 -08:00
Igor Kudrin	ec24094b56	[LTO] Remove Config.UseDefaultPipeline (#82587 ) This option is not used. It was added in [D122133](https://reviews.llvm.org/D122133), 5856f30b, with the only usage in `ClangLinkerWrapper.cpp`, which was later updated in a1d57fc2, and then finally removed in [D142650](https://reviews.llvm.org/D142650), 6185246f.	2024-02-23 01:05:06 +07:00
Arthur Eubanks	93cdd1b5cf	[PGO] Add ability to mark cold functions as optsize/minsize/optnone (#69030 ) The performance of cold functions shouldn't matter too much, so if we care about binary sizes, add an option to mark cold functions as optsize/minsize for binary size, or optnone for compile times [1]. Clang patch will be in a future patch. This is intended to replace `shouldOptimizeForSize(Function&, ...)`. We've seen multiple cases where calls to this expensive function, if not careful, can blow up compile times. I will clean up users of that function in a followup patch. Initial version: https://reviews.llvm.org/D149800 [1] https://discourse.llvm.org/t/rfc-new-feature-proposal-de-optimizing-cold-functions-using-pgo-info/56388	2024-02-12 14:52:08 -08:00
Arthur Eubanks	64573da4bf	[IR] Add "Large Data Threshold" module metadata (#66797 ) This allows us to not have to pass -mllvm flags to set the large data threshold for (in-LLD/not-distributed) ThinLTO. Follows https://reviews.llvm.org/D52322, which did the same for the code model. Since the large data threshold is tied to the code model and we disallow mixing different code models, do the same for the large data threshold.	2023-09-19 16:36:30 -07:00
Teresa Johnson	546ec641b4	Restore "[MemProf] Use new option/pass for profile feedback and matching" This restores commit b4a82b62258c5f650a1cccf5b179933e6bae4867, reverted in 3ab7ef28eebf9019eb3d3c4efd7ebfd160106bb1 because it was thought to cause a bot failure, which ended up being unrelated to this patch set. Differential Revision: https://reviews.llvm.org/D154856	2023-07-11 13:16:20 -07:00
JP Lehr	3ab7ef28ee	Revert "[MemProf] Use new option/pass for profile feedback and matching" This reverts commit b4a82b62258c5f650a1cccf5b179933e6bae4867. Broke AMDGPU OpenMP Offload buildbot	2023-07-11 05:44:42 -04:00
Teresa Johnson	b4a82b6225	[MemProf] Use new option/pass for profile feedback and matching Previously the MemProf profile was expected to be in the same profile file as a normal PGO profile, passed via the usual -fprofile-use= option, and was matched in the same pass. To simplify profile preparation, since the raw MemProf profile requires the binary for symbolization and may be simpler to index separately from the raw PGO profile, and also to enable providing a MemProf profile for a SamplePGO build, separate out the MemProf feedback option and matching pass. This patch adds the -fmemory-profile-use=${file} option, and the provided file is passed down to LLVM and ultimately used in a new MemProfUsePass which performs the matching of just the memory profile contents of that file. Note that a single profile file containing both normal PGO and MemProf profile data is still supported, and the relevant profile data is matched by the appropriate matching pass(es) based on which option(s) the profile is provided with (the same profile file can be supplied to both feedback options). Differential Revision: https://reviews.llvm.org/D154856	2023-07-10 16:42:56 -07:00
Matthew Voss	a1ca3af31e	[llvm] A Unified LTO Bitcode Frontend Here's a high level summary of the changes in this patch. For more information on rational, see the RFC. (https://discourse.llvm.org/t/rfc-a-unified-lto-bitcode-frontend/61774). - Add config parameter to LTO backend, specifying which LTO mode is desired when using unified LTO. - Add unified LTO flag to the summary index for efficiency. Unified LTO modules can be detected without parsing the module. - Make sure that the ModuleID is generated by incorporating more types of symbols. Differential Revision: https://reviews.llvm.org/D123803	2023-07-05 14:53:14 -07:00

1 2 3 4 5

227 Commits