llvm-project

Author	SHA1	Message	Date
Fangrui Song	48d0eb5181	[CodeGen] Simplify EmitAssemblyHelper and emitBackendOutput Prepare for -ftime-report change (#122225).	2025-01-09 21:23:52 -08:00
Vitaly Buka	4c8fdc2954	[nfc][BoundsChecking] Rename BoundsCheckingOptions into Options (#122359 )	2025-01-09 20:38:13 -08:00
Vitaly Buka	9c2de994a1	[nfc][BoundsChecking] Refactor BoundsCheckingOptions (#122346 ) Remove ReportingMode and ReportingOpts.	2025-01-09 20:19:01 -08:00
Mingjie Xu	2a1632824d	[tysan] Convert TySan from function+module pass to just module pass (#120667 ) As mentioned in https://github.com/llvm/llvm-project/pull/118989, all sanitizers but tsan are converted to just module pass for easier maintenance. This patch removes the TySan function pass, convert TySan from function+module pass to just module pass.	2025-01-08 11:25:32 +08:00
Thurston Dang	5bb650345d	Remove -bounds-checking-unique-traps (replace with -fno-sanitize-merge=local-bounds) (#120682 ) #120613 removed -ubsan-unique-traps and replaced it with -fno-sanitize-merge (introduced in #120511), which allows fine-grained control of which UBSan checks to prevent merging. This analogous patch removes -bound-checking-unique-traps, and allows it to be controlled via -fno-sanitize-merge=local-bounds. Most of this patch is simply plumbing through the compiler flags into the bounds checking pass. Note: this patch subtly changes -fsanitize-merge (the default) to also include -fsanitize-merge=local-bounds. This is different from the previous behavior, where -fsanitize-merge (or the old -ubsan-unique-traps) did not affect local-bounds (requiring the separate -bounds-checking-unique-traps). However, we argue that the new behavior is more intuitive. Removing -bounds-checking-unique-traps and merging its functionality into -fsanitize-merge breaks backwards compatibility; we hope that this is acceptable since '-mllvm -bounds-checking-unique-traps' was an experimental flag.	2024-12-20 10:07:44 -08:00
Vitaly Buka	c2aee50620	[ubsan] Runtime and driver support for local-bounds (#120515 ) Implements ``-f[no-]sanitize-trap=local-bounds``, and ``-f[no-]sanitize-recover=local-bounds``. LLVM part is here #120513.	2024-12-19 16:38:07 -08:00
Vitaly Buka	55e87a79b9	[BoundsChecking] Add parameters to pass (#119894 ) This check is a part of UBSAN, but does not support verbose output like other UBSAN checks. This is a step to fix that.	2024-12-17 22:07:14 -08:00
Florian Hahn	c135f6ffe2	[TySan] Add initial Type Sanitizer support to Clang) (#76260 ) This patch introduces the Clang components of type sanitizer: a sanitizer for type-based aliasing violations. It is based on Hal Finkel's https://reviews.llvm.org/D32198. The Clang changes are mostly formulaic, the one specific change being that when the TBAA sanitizer is enabled, TBAA is always generated, even at -O0. It goes together with the corresponding LLVM changes (https://github.com/llvm/llvm-project/pull/76259) and compiler-rt changes (https://github.com/llvm/llvm-project/pull/76261) PR: https://github.com/llvm/llvm-project/pull/76260	2024-12-17 15:13:42 +00:00
Chris Apple	0f776f1df9	[rtsan][clang] NFC: Move rtsan init to addSanitizers (#119904 )	2024-12-13 13:00:09 -08:00
Chris Apple	4a65861402	[rtsan][llvm] Remove function pass, only support module pass (#119739 ) Most of the other sanitizers are now only module level passes. This moves all functionality into the module pass, and removes the function pass.	2024-12-13 08:50:36 -08:00
Vitaly Buka	7787328dd6	[ubsan] Improve lowering of @llvm.allow.ubsan.check (#119013 ) This fix the case, when single hot inlined callsite, prevent checks for all other. This helps to reduce number of removed checks up to 50% (deppedes on `cutoff-hot` value) . `ScalarOptimizerLateEPCallback` was happening during CGSCC walk, after each inlining, but this is effectively after inlining. Example, order in comments: ``` static void overflow() { // 1. Inline get/set if possible // 2. Simplify // 3. LowerAllowCheckPass set(get() + get()); } void test() { // 4. Inline // 5. Nothing for LowerAllowCheckPass overflow(); } ``` With this patch it will look like: ``` static void overflow() { // 1. Inline get/set if possible // 2. Simplify set(get() + get()); } void test() { // 3. Inline // 4. Simplify overflow(); } // Later, after inliner CGSCC walk complete: // 5. LowerAllowCheckPass for `overflow` // 6. LowerAllowCheckPass for `test` ```	2024-12-07 16:12:58 -08:00
Chris Apple	ca3180ad6e	[LLVM][rtsan] Add module pass to initialize rtsan (#118989 ) This allows shared libraries instrumented with RTSan to be initialized. This approach directly mirrors the approach in Tsan, Asan and many of the other sanitizers	2024-12-06 11:29:11 -08:00
Kazu Hirata	e8a6624325	[CodeGen] Remove unused includes (NFC) (#116459 ) Identified with misc-include-cleaner.	2024-11-16 07:37:13 -08:00
Shilei Tian	390300d9f4	[PassBuilder] Add `ThinOrFullLTOPhase` to optimizer pipeline (#114577 )	2024-11-03 23:25:29 -05:00
Shilei Tian	dc45ff1d2a	[PassBuilder] Add `ThinOrFullLTOPhase` to early simplication EP call backs (#114547 ) The early simplication pipeline is used in non-LTO and (Thin/Full)LTO pre-link stage. There are some passes that we want them in non-LTO mode, but not at LTO pre-link stage. The control is missing currently. This PR adds the support. To demonstrate the use, we only enable the internalization pass in non-LTO mode for AMDGPU because having it run in pre-link stage causes some issues.	2024-11-03 23:24:10 -05:00
Paul Kirth	b01e2a8b56	[llvm] Allow always dropping all llvm.type.test sequences Currently, the `DropTypeTests` parameter only fully works with phi nodes and llvm.assume instructions. However, we'd like CFI to work in conjunction with FatLTO, in so far as the bitcode section should be able to contain the CFI instrumentation, while any incompatible bits are dropped when compiling the object code. To do that, we need to drop the llvm.type.test instructions everywhere, and not just their uses in phi nodes. This patch updates the LowerTypeTest pass so that uses are removed, and replaced with `true` in all cases, and not just in phi nodes. Addressing this will allow us to fix #112053 by modifying the FatLTO pipeline. Reviewers: pcc, nikic Reviewed By: pcc Pull Request: https://github.com/llvm/llvm-project/pull/112787	2024-10-30 16:56:30 -07:00
Kyungwoo Lee	dc85d5263e	[CGData][ThinLTO] Global Outlining with Two-CodeGen Rounds (#90933 ) This feature is enabled by `-codegen-data-thinlto-two-rounds`, which effectively runs the `-codegen-data-generate` and `-codegen-data-use` in two rounds to enable global outlining with ThinLTO. 1. The first round: Run both optimization + codegen with a scratch output. Before running codegen, we serialize the optimized bitcode modules to a temporary path. 2. From the scratch object files, we merge them into the codegen data. 3. The second round: Read the optimized bitcode modules and start the codegen only this time. Using the codegen data, the machine outliner effectively performs the global outlining. Depends on #90934, #110461 and #110463. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753.	2024-10-09 15:37:41 -07:00
Kyungwoo Lee	c1959813d6	[CGData][ThinLTO][NFC] Prep for two-codegen rounds (#90934 ) This is NFC for https://github.com/llvm/llvm-project/pull/90933. - Create a lambda function, `RunBackends`, to group the backend operations into a single function. - Explicitly pass the `CodeGenOnly` argument to thinBackend, instead of depending on a configuration value. Depends on https://github.com/llvm/llvm-project/pull/90304. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753.	2024-10-03 09:58:01 -07:00
Rahman Lavaee	7b7747dc1d	Reapply "Deprecate the `-fbasic-block-sections=labels` option." (#110039 ) This reapplies commit 1911a50fae8a441b445eb835b98950710d28fc88 with a minor fix in lld/ELF/LTO.cpp which sets Options.BBAddrMap when `--lto-basic-block-sections=labels` is passed.	2024-09-25 22:03:10 -07:00
Kazu Hirata	639a0afa99	Revert "Deprecate the `-fbasic-block-sections=labels` option. (#107494 )" This reverts commit 1911a50fae8a441b445eb835b98950710d28fc88. Several bots are failing: https://lab.llvm.org/buildbot/#/builders/190/builds/6519 https://lab.llvm.org/buildbot/#/builders/3/builds/5248 https://lab.llvm.org/buildbot/#/builders/18/builds/4463	2024-09-25 12:34:43 -07:00
Rahman Lavaee	1911a50fae	Deprecate the `-fbasic-block-sections=labels` option. (#107494 ) This feature is supported via the newer option `-fbasic-block-address-map`. Using the old option still works by delegating to the newer option, while a warning is printed to show deprecation.	2024-09-25 12:03:38 -07:00
Fangrui Song	4a9da96dc6	[clang] Add cc1 --output-asm-variant= to set output syntax 2fcaa549a824efeb56e807fcf750a56bf985296b (2010) added cc1as option `-output-asm-variant` (untested) to set the output syntax. `clang -cc1as -filetype asm -output-asm-variant 1` allows AT&T input and Intel output (`AssemblerDialect` is also used by non-x86 targets). This patch renames the cc1as option (to avoid collision with -o) and makes it available for cc1 to set output syntax. This allows different input & output syntax: ``` echo 'asm("mov $1, %eax");' \| clang -xc - -S -o - -Xclang --output-asm-variant=1 ``` Note: `AsmWriterFlavor` (with a misleading name), used to initialize MCAsmInfo::AssemblerDialect, is primarily used for assembly input, not for output. Therefore, `echo 'asm("mov $1, %eax");' \| clang -x c - -mllvm --x86-asm-syntax=intel -S -o -`, which achieves a similar goal before Clang 19, was unintended. Close #109157 Pull Request: https://github.com/llvm/llvm-project/pull/109360	2024-09-24 15:59:33 -07:00
Youngsuk Kim	7db641af13	[clang] Don't call raw_string_ostream::flush() (NFC) Don't call raw_string_ostream::flush(), which is essentially a no-op. As specified in the docs, raw_string_ostream is always unbuffered	2024-09-19 17:18:10 -05:00
nebulark	f5ba3e1fa6	[CodeView] Flatten cmd args in frontend for LF_BUILDINFO (#106369 )	2024-09-16 19:29:42 +02:00
Antonio Frighetto	2ae968a0d9	[Instrumentation] Move out to Utils (NFC) (#108532 ) Utility functions have been moved out to Utils. Minor opportunity to drop the header where not needed.	2024-09-15 21:07:40 -07:00
Kazu Hirata	5c0d61e318	[LTO] Reduce memory usage for import lists (#106772 ) This patch reduces the memory usage for import lists by employing memory-efficient data structures. With this patch, an import list for a given destination module is basically DenseSet<uint32_t> with each element indexing into the deduplication table containing tuples of: {SourceModule, GUID, Definition/Declaration} In one of our large applications, the peak memory usage goes down by 9.2% from 6.120GB to 5.555GB during the LTO indexing step. This patch addresses several sources of space inefficiency associated with std::unordered_map: - std::unordered_map<GUID, ImportKind> takes up 16 bytes because of padding even though ImportKind only carries one bit of information. - std::unordered_map uses pointers to elements, both in the hash table proper and for collision chains. - We allocate an instance of std::unordered_map for each {Destination Module, Source Module} pair for which we have at least one import. Most import lists have less than 10 imports, so the metadata like the size of std::unordered_map and the pointer to the hash table costs a lot relative to the actual contents.	2024-09-01 08:36:06 -07:00
Tarun Prabhu	a1441ca747	[flang][Driver] Add support for -mllvm -print-pipeline-passes The behavior deliberately mimics that of clang. Ideally, -print-pipeline-passes should be a first-class driver option. Notes to this effect have been added in the appropriate places in both flang and clang. --------- Co-authored-by: Tarun Prabhu <tarun.prabhu@gmail.com>	2024-08-29 16:21:43 -06:00
Chris Apple	f77e8f765e	[clang][rtsan] Reland realtime sanitizer codegen and driver (#102622 ) This reverts commit a1e9b7e646b76bf844e8a9a101ebd27de11992ff This relands commit d010ec6af8162a8ae4e42d2cac5282f83db0ce07 No modifications from the original patch. It was determined that the ubsan build failure was happening even after the revert, some examples: https://lab.llvm.org/buildbot/#/builders/159/builds/4477 https://lab.llvm.org/buildbot/#/builders/159/builds/4478 https://lab.llvm.org/buildbot/#/builders/159/builds/4479	2024-08-23 08:16:52 -07:00
Chris Apple	a1e9b7e646	Revert "[clang][rtsan] Introduce realtime sanitizer codegen and drive… (#105744 ) …r (#102622)" This reverts commit d010ec6af8162a8ae4e42d2cac5282f83db0ce07. Build failure: https://lab.llvm.org/buildbot/#/builders/159/builds/4466	2024-08-22 15:19:41 -07:00
Chris Apple	d010ec6af8	[clang][rtsan] Introduce realtime sanitizer codegen and driver (#102622 ) Introduce the `-fsanitize=realtime` flag in clang driver Plug in the RealtimeSanitizer PassManager pass in Codegen, and attribute a function based on if it has the `[[clang::nonblocking]]` function effect.	2024-08-22 14:08:24 -07:00
Fangrui Song	eb549da9e5	[Driver] Add -Wa, options -mmapsyms={default,implicit} -Wa,-mmapsyms=implicit enables the alternative mapping symbol scheme discussed at #99718. While not conforming to the current aaelf64 ABI, the option is invaluable for those with full control over their toolchain, no reliance on weird relocatable files, and a strong focus on minimizing both relocatable and executable sizes. The option is discouraged when portability of the relocatable objects is a concern. https://maskray.me/blog/2024-07-21-mapping-symbols-rethinking-for-efficiency elaborates the risk. Pull Request: https://github.com/llvm/llvm-project/pull/104542	2024-08-22 09:20:53 -07:00
Fangrui Song	3333ec1183	[Driver] Make CodeGenOptions name match MCTargetOptions names * Initialize `X86RelaxRelocations`. * Fix #96860 test to actually test -Wa,-msse2avx for non-x86.	2024-08-15 15:52:20 -07:00
Peter Rong	74e4694b8c	[LTO] enable `ObjCARCContractPass` only on optimized build (#101114 ) \#92331 tried to make `ObjCARCContractPass` by default, but it caused a regression on O0 builds and was reverted. This patch trys to bring that back by: 1. reverts the [revert](`1579e9ca9c`). 2. `createObjCARCContractPass` only on optimized builds. Tests are updated to refelect the changes. Specifically, all `O0` tests should not include `ObjCARCContractPass` Signed-off-by: Peter Rong <PeterRong@meta.com>	2024-08-09 13:04:25 -07:00
Fangrui Song	04a1a3482c	[Driver] Add -Wa, options --crel and --allow-experimental-crel The two options are discussed in a few comments around https://github.com/llvm/llvm-project/pull/91280#issuecomment-2099344079 * -Wa,--crel: error "-Wa,--allow-experimental-crel must be specified to use -Wa,--crel..." * -Wa,--allow-experimental-crel: no-op * -Wa,--crel,--allow-experimental-crel: enable CREL in the integrated assembler (#91280) MIPS's little-endian n64 ABI messed up the `r_info` field in relocations. While this could be fixed with CREL, my intention is to avoid complication in assembler/linker. The implementation simply doesn't allow CREL for MIPS. Link: https://discourse.llvm.org/t/rfc-crel-a-compact-relocation-format-for-elf/77600 Pull Request: https://github.com/llvm/llvm-project/pull/97378	2024-07-03 13:45:48 -07:00
Alexander Shaposhnikov	3402a1a4d2	[Clang] Enable nsan instrumentation pass (#97359 ) Enable nsan instrumentation pass	2024-07-02 16:57:30 -07:00
Jacob Lambert	2264544e2d	[clang][CodeGen] Remove unnecessary ShouldLinkFiles conditional (#96951 ) We have reworked the bitcode linking option to no longer link twice if post-optimization linking is requested. As such, we no longer need to conditionally link bitcodes supplied via -mlink-bitcode-file, as there is no danger of linking them twice	2024-06-28 14:35:29 -07:00
Egor Pasko	cab81dd038	[EntryExitInstrumenter] Move passes out of clang into LLVM default pipelines (#92171 ) Move EntryExitInstrumenter(PostInlining=true) to as late as possible and EntryExitInstrumenter(PostInlining=false) to an early pre-inlining stage (but skip for ThinLTO post-link). This should fix the issues reported in https://github.com/rust-lang/rust/issues/92109 and https://github.com/llvm/llvm-project/issues/52853. These are caused by https://reviews.llvm.org/D97608.	2024-05-31 12:48:45 -07:00
Nikita Popov	1579e9ca9c	Revert "Run ObjCContractPass in Default Codegen Pipeline (#92331 )" This reverts commit 8cc8e5d6c6ac9bfc888f3449f7e424678deae8c2. This reverts commit dae55c89835347a353619f506ee5c8f8a2c136a7. Causes major compile-time regressions for unoptimized builds.	2024-05-24 08:14:26 +02:00
Nuri Amari	8cc8e5d6c6	Run ObjCContractPass in Default Codegen Pipeline (#92331 ) Prior to this patch, when using -fthinlto-index= the ObjCARCContractPass isn't run prior to CodeGen, and instruction selection fails on IR containing arc intrinsics. This patch is motivated by that usecase. The pass was previously added in various places codegen is performed. This patch adds the pass to the default codegen pipepline, makes sure it bails immediately if no arc intrinsics are found, and removes the adhoc scheduling of the pass. Co-authored-by: Nuri Amari <nuriamari@fb.com>	2024-05-23 10:04:55 -07:00
Jacob Lambert	11a6799740	[clang][CodeGen] Omit pre-opt link when post-opt is link requested (#85672 ) Currently, when the -relink-builtin-bitcodes-postop option is used we link builtin bitcodes twice: once before optimization, and again after optimization. With this change, we omit the pre-opt linking when the option is set, and we rename the option to the following: -Xclang -mlink-builtin-bitcodes-postopt (-Xclang -mno-link-builtin-bitcodes-postopt) The goal of this change is to reduce compile time. We do lose the theoretical benefits of pre-opt linking, but in practice these are small than the overhead of linking twice. However we may be able to address this in a future patch by adjusting the position of the builtin-bitcode linking pass. Compilations not setting the option are unaffected	2024-05-08 08:11:15 -07:00
Petr Hosek	8bcb073705	[Clang] -fseparate-named-sections option (#91028 ) When set, the compiler will use separate unique sections for global symbols in named special sections (e.g. symbols that are annotated with __attribute__((section(...)))). Doing so enables linker GC to collect unused symbols without having to use a different section per-symbol.	2024-05-07 09:18:55 -07:00
Arthur Eubanks	1b79a34a56	[clang] Add flag to experiment with cold function attributes (#89298 ) To be removed and promoted to a proper driver flag if experiments turn out fruitful. For now, this can be experimented with `-mllvm -pgo-cold-func-opt=[optsize\|minsize\|optnone\|default] -mllvm -enable-pgo-force-function-attrs`. Original LLVM patch for this functionality: #69030	2024-04-19 09:30:47 -07:00
Vitaly Buka	49f0b536fd	[UBSAN] Rename `remove-traps` to `lower-allow-check` (#84853 )	2024-04-04 21:29:46 -07:00
Vitaly Buka	b76eb1ddfb	[clang][CodeGen] Remove SimplifyCFGPass preceding RemoveTrapsPass (#84852 ) There is no performance difference after switching to `llvm.experimental.hot`.	2024-04-04 17:47:16 -07:00
Vitaly Buka	18380c522a	[UBSAN][HWASAN] Remove redundant flags (#87709 ) Presense of `cutoff-hot` or `random-skip-rate` should be enough to trigger optimization.	2024-04-04 14:32:30 -07:00
Vitaly Buka	633bc3bfda	[CodeGen][NFC] Make an opt<> static	2024-04-02 15:47:04 -07:00
Vitaly Buka	e93b5f5a47	[ubsan][NFC] Remove recently added `cl::init(false)` Extracted from #84858	2024-04-01 13:37:42 -07:00
Vitaly Buka	eaa71a97f9	[clang] Add optional pass to remove UBSAN traps using PGO (#84214 ) With #83471 it reduces UBSAN overhead from 44% to 6%. Measured as "Geomean difference" on "test-suite/MultiSource/Benchmarks" with PGO build. On real large server binary we see 95% of code is still instrumented, with 10% -> 1.5% UBSAN overhead improvements. We can pass this test only with subset of UBSAN, so base overhead is smaller. We have followup patches to improve it even further.	2024-03-11 12:07:28 -07:00
Fangrui Song	a331937197	[MC] Move CompressDebugSections/RelaxELFRelocations from TargetOptions/MCAsmInfo to MCTargetOptions The convention is for such MC-specific options to reside in MCTargetOptions. However, CompressDebugSections/RelaxELFRelocations do not follow the convention: `CompressDebugSections` is defined in both TargetOptions and MCAsmInfo and there is forwarding complexity. Move the option to MCTargetOptions and hereby simplify the code. Rename the misleading RelaxELFRelocations to X86RelaxRelocations. llvm-mc -relax-relocations and llc -x86-relax-relocations can now be unified.	2024-03-06 23:19:59 -08:00
Paul Kirth	7d8b50aaab	[clang][fat-lto-objects] Make module flags match non-FatLTO pipelines (#83159 ) In addition to being rather hard to follow, there isn't a good reason why FatLTO shouldn't just share the same code for setting module flags for (Thin)LTO. This patch simplifies the logic and makes sure we use set these flags in a consistent way, independent of FatLTO. Additionally, we now test that output in the .llvm.lto section actually matches the output from Full and Thin LTO compilation.	2024-02-28 19:11:55 -08:00

1 2 3 4 5 ...

864 Commits