llvm-project

Author	SHA1	Message	Date
Oliver Hunt	7575bda7e1	[clang][Obj-C][PAC] Add support for authenticating block metadata (#152978 ) Introduces the use of pointer authentication to protect the invocation, copy and dispose, reference, and descriptor pointers in Objective-C block objects. Resolves #141176	2025-08-22 14:18:22 +02:00
Koakuma	5c69f70244	[SPARC][Driver] Move feature mode selection to Arch/Sparc.cpp (#149652 ) This is so that it's performed also for flang and not just for clang. This should fix https://github.com/llvm/llvm-project/issues/138494. (cherry picked from commit 38fc453afdb6a4511b7c8e189f12a92559ecc396)	2025-07-22 10:38:31 +02:00
Oliver Hunt	451a9ce9ff	[clang][ObjC][PAC] Add ptrauth protections to objective-c (#147899 ) This PR introduces the use of pointer authentication to objective-c[++]. This includes: * __ptrauth qualifier support for ivars * protection of isa and super fields * protection of SEL typed ivars * protection of class_ro_t data * protection of methodlist pointers and content	2025-07-14 19:32:18 -07:00
Joseph Huber	535d6917ec	[Clang] Extract offloading code from static libs with 'offload-arch=' (#147823 ) Summary: The semantics of static libraries only extract stuff that's used. We somewhat extend this behavior with the linker wrapper only doing this to fat binaries that match any found architectures. However, this has some unfortunate effects when the user uses static libraries. This is somewhat of a hack, but we now assume that if the user specified `--offload-arch=` on the link job, they definitely want that architecture to be used if it exists. This patch just forces extraction of those libraries which resolves an issue observed with some customers. The old behavior will still be present if the user does `--offload-link` with no offloading architecture present, and for the vast, vast majority of cases this will change nothing. Fixes: https://github.com/llvm/llvm-project/issues/147788	2025-07-11 10:26:24 -05:00
Artem Chikin	a7091951f0	[APINotes] Add support for capturing all possible versioned APINotes without applying them Swift-versioned API notes get applied at PCM constrution time relying on '-fapinotes-swift-version=X' argument to pick the appropriate version. This change adds a new APINotes application mode with '-fswift-version-independent-apinotes' which causes all versioned API notes to get recorded into the PCM wrapped in 'SwiftVersionedAttr' instances. The expectation in this mode is that the Swift client will perform the required transformations as per the API notes on the client side, when loading the PCM, instead of them getting applied on the producer side. This will allow the same PCM to be usable by Swift clients building with different language versions. In addition to versioned-wrapping the various existing API notes annotations which are carried in declaration attributes, this change adds a new attribute for two annotations which were previously applied directly to the declaration at the PCM producer side: 1) Type and 2) Nullability annotations with 'SwiftTypeAttr' and 'SwiftNullabilityAttr', respectively. The logic to apply these two annotations to a declaration is refactored into API.	2025-07-10 19:19:18 +01:00
Nilanjana Basu	2fc4a4a9d3	[Driver][SamplePGO] Enable -fsample-profile-use-profi (#146795 ) Since profile inference improves sample coverage, it should be turned on by default.	2025-07-09 14:53:28 -07:00
Shunsuke Watanabe	c9900015a9	[flang] Add -fcomplex-arithmetic= option and select complex division algorithm (#146641 ) This patch adds an option to select the method for computing complex number division. It uses `LoweringOptions` to determine whether to lower complex division to a runtime function call or to MLIR's `complex.div`, and `CodeGenOptions` to select the computation algorithm for `complex.div`. The available option values and their corresponding algorithms are as follows: - `full`: Lower to a runtime function call. (Default behavior) - `improved`: Lower to `complex.div` and expand to Smith's algorithm. - `basic`: Lower to `complex.div` and expand to the algebraic algorithm. See also the discussion in the following discourse post: https://discourse.llvm.org/t/optimization-of-complex-number-division/83468 --------- Co-authored-by: Tarun Prabhu <tarunprabhu@gmail.com>	2025-07-09 13:43:54 +09:00
Aaron Ballman	60630310e9	[clang-cl] Support /std:clatest (#147284 ) cl.exe has /std:clatest, analogous to /std:c++latest, which sets the language mode to the latest standard. For C, that's C23. This adds support for the option. Fixes #147233	2025-07-07 09:15:23 -04:00
Eli Friedman	2aa0f0a3bd	[AArch64] Add option -msve-streaming-vector-bits= . (#144611 ) This is similar to -msve-vector-bits, but for streaming mode: it constrains the legal values of "vscale", allowing optimizations based on that constraint. This also fixes conversions between SVE vectors and fixed-width vectors in streaming functions with -msve-vector-bits and -msve-streaming-vector-bits. This rejects any use of arm_sve_vector_bits types in streaming functions; if it becomes relevant, we could add arm_sve_streaming_vector_bits types in the future. This doesn't touch the __ARM_FEATURE_SVE_BITS define.	2025-07-03 13:44:38 -07:00
Nilanjana Basu	d4331344ac	[Clang][Driver][SamplePGO] Introduce -fno_sample_profile_use_profi flag for SamplePGO (#145957 ) This flag allows opting out of using profile inference pass for SamplePGO.	2025-07-02 14:41:30 -07:00
Orlando Cazalet-Hyams	b29fea6eeb	[KeyInstr][Clang][NFC] Don't set -dwarf-use-key-instructions (#144115 ) Now PR 144104 has landed the flag is true by default (each DISubprogram tracks whether or not it's using key instructions).	2025-06-30 12:30:35 +01:00
Florian Mayer	8d2034cf68	[clang] Add flag fallow-runtime-check-skip-hot-cutoff (#145999 ) Co-authored-by: Kazu Hirata <kazu@google.com>	2025-06-27 13:46:54 -07:00
Finn Plummer	e93a0d0d1e	[HLSL][RootSignature] Add `fdx-rootsignature-version` option to specify root signature version (#144813 ) This pr provides the ability to specify the root signature version as a compiler option and to retain this in the root signature decl. It also updates the methods to serialize the version when dumping the declaration and to output the version when generating the metadata. - Update `DXContainer.hI` to define the root signature versions - Update `Options.td` and `LangOpts.h` to define the `fdx-rootsignature-version` compiler option - Update `Options.td` to provide an alias `force-rootsig-ver` in clang-dxc - Update `Decl.[h\|cpp]` and `SeamHLSL.cpp` so that `RootSignatureDecl` will retain its version type - Updates `CGHLSLRuntime.cpp` to generate the extra metadata field - Add tests to illustrate Resolves https://github.com/llvm/llvm-project/issues/126557. Note: this does not implement validation based on versioning. https://github.com/llvm/llvm-project/issues/129940 is required to retrieve the version and use it for validations.	2025-06-24 16:21:24 -07:00
sivadeilra	0a3c5c42a1	Add support for Windows Secure Hot-Patching (redo) (#145565 ) (This is a re-do of #138972, which had a minor warning in `Clang.cpp`.) This PR adds some of the support needed for Windows hot-patching. Windows implements a form of hot-patching. This allows patches to be applied to Windows apps, drivers, and the kernel, without rebooting or restarting any of these components. Hot-patching is a complex technology and requires coordination between the OS, compilers, linkers, and additional tools. This PR adds support to Clang and LLVM for part of the hot-patching process. It enables LLVM to generate the required code changes and to generate CodeView symbols which identify hot-patched functions. The PR provides new command-line arguments to Clang which allow developers to identify the list of functions that need to be hot-patched. This PR also allows LLVM to directly receive the list of functions to be modified, so that language front-ends which have not yet been modified (such as Rust) can still make use of hot-patching. This PR: * Adds a `MarkedForWindowsHotPatching` LLVM function attribute. This attribute indicates that a function should be _hot-patched_. This generates a new CodeView symbol, `S_HOTPATCHFUNC`, which identifies any function that has been hot-patched. This attribute also causes accesses to global variables to be indirected through a `_ref_` global variable. This allows hot-patched functions to access the correct version of a global variable; the hot-patched code needs to access the variable in the _original_ image, not the patch image. Adds a `AllowDirectAccessInHotPatchFunction` LLVM attribute. This attribute may be placed on global variable declarations. It indicates that the variable may be safely accessed without the `_ref_` indirection. Adds two Clang command-line parameters: `-fms-hotpatch-functions-file` and `-fms-hotpatch-functions-list`. The `-file` flag may point to a text file, which contains a list of functions to be hot-patched (one function name per line). The `-list` flag simply directly identifies functions to be patched, using a comma-separated list. These two command-line parameters may also be combined; the final set of functions to be hot-patched is the union of the two sets. * Adds similar LLVM command-line parameters: `--ms-hotpatch-functions-file` and `--ms-hotpatch-functions-list`. * Adds integration tests for both LLVM and Clang. * Adds support for dumping the new `S_HOTPATCHFUNC` CodeView symbol. Although the flags are redundant between Clang and LLVM, this allows additional languages (such as Rust) to take advantage of hot-patching support before they have been modified to generate the required attributes. Credit to @dpaoliello, who wrote the original form of this patch.	2025-06-24 14:56:55 -07:00
Qinkun Bao	4b4782bc86	Revert "Add support for Windows Secure Hot-Patching" (#145553 ) Reverts llvm/llvm-project#138972	2025-06-24 13:11:52 -04:00
sivadeilra	26d318e4a9	Add support for Windows Secure Hot-Patching (#138972 ) This PR adds some of the support needed for Windows hot-patching. Windows implements a form of hot-patching. This allows patches to be applied to Windows apps, drivers, and the kernel, without rebooting or restarting any of these components. Hot-patching is a complex technology and requires coordination between the OS, compilers, linkers, and additional tools. This PR adds support to Clang and LLVM for part of the hot-patching process. It enables LLVM to generate the required code changes and to generate CodeView symbols which identify hot-patched functions. The PR provides new command-line arguments to Clang which allow developers to identify the list of functions that need to be hot-patched. This PR also allows LLVM to directly receive the list of functions to be modified, so that language front-ends which have not yet been modified (such as Rust) can still make use of hot-patching. This PR: * Adds a `MarkedForWindowsHotPatching` LLVM function attribute. This attribute indicates that a function should be _hot-patched_. This generates a new CodeView symbol, `S_HOTPATCHFUNC`, which identifies any function that has been hot-patched. This attribute also causes accesses to global variables to be indirected through a `_ref_` global variable. This allows hot-patched functions to access the correct version of a global variable; the hot-patched code needs to access the variable in the _original_ image, not the patch image. Adds a `AllowDirectAccessInHotPatchFunction` LLVM attribute. This attribute may be placed on global variable declarations. It indicates that the variable may be safely accessed without the `_ref_` indirection. Adds two Clang command-line parameters: `-fms-hotpatch-functions-file` and `-fms-hotpatch-functions-list`. The `-file` flag may point to a text file, which contains a list of functions to be hot-patched (one function name per line). The `-list` flag simply directly identifies functions to be patched, using a comma-separated list. These two command-line parameters may also be combined; the final set of functions to be hot-patched is the union of the two sets. * Adds similar LLVM command-line parameters: `--ms-hotpatch-functions-file` and `--ms-hotpatch-functions-list`. * Adds integration tests for both LLVM and Clang. * Adds support for dumping the new `S_HOTPATCHFUNC` CodeView symbol. Although the flags are redundant between Clang and LLVM, this allows additional languages (such as Rust) to take advantage of hot-patching support before they have been modified to generate the required attributes. Credit to @dpaoliello, who wrote the original form of this patch.	2025-06-24 09:22:38 -07:00
Yaxun (Sam) Liu	44936c8d13	[CUDA][HIP] add options `--[no-]offload-inc` (#140106 ) Currently there is only option -nogpuinc for disabling the default CUDA/HIP wrapper headers. However, there are situations where -nogpuinc needs to be overriden for enabling CUDA/HIP wrapper headers. This patch adds --[no-]offload-inc for that purpose. When both exist, the last wins. -nogpuinc and -nocudainc are now alias to --no-offload-inc.	2025-06-23 11:02:06 -04:00
Kazu Hirata	7cbb141155	[clang] Migrate away from ArrayRef(std::nullopt) (NFC) (#144982 ) ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to new comers, I would like to move away from the constructor and eventually remove it. This patch takes care of the clang side of the migration.	2025-06-19 23:29:50 -07:00
Shilei Tian	633e740e34	[Clang][AMDGPU][Driver] Add `avail-extern-gv-in-addrspace-to-local` option when ThinTLO is enabled (#144914 ) On AMDGPU, we need an extra argument `-avail-extern-gv-in-addrspace-to-local=3` to privatize LDS global variables when ThinLTO is enabled.	2025-06-19 14:32:20 -04:00
Stephen Tozer	36af7345df	Reapply "[Clang] Enable -fextend-variable-liveness at -Og (#118026 )" Relands this feature after several fixes: * Force fake uses to be emitted before musttail calls (#136867) * Added soften-float legalization for fake uses (#142714) * Treat fake uses as size-less instructions in a SystemZ assert (#144390) If further issues with fake uses are found then this may be reverted again, but all currently-known issues are resolved. This reverts commit 2dc6e98169baeb1f73036da0ea50fd828d8323d0.	2025-06-19 16:22:50 +01:00
Mary Kassayova	c377ce1216	[AArch64][VecLib] Add libmvec support for AArch64 targets (#143696 ) This patch adds support for the `libmvec` vector library on AArch64 targets. Currently, all `libmvec` functions in GLIBC version 2.40 are supported. The full list of math functions enabled can be found [here](`96abd59bf2/sysdeps/aarch64/fpu/Versions`) (up to GLIBC 2.40). Previously, `libmvec` was only supported on x86_64 targets. Attempts to use it on AArch64 resulted in the following error from Clang: `unsupported option 'libmvec' for target 'aarch64'`.	2025-06-17 11:07:43 +01:00
Ying Yi	6f29837659	Reland: "[Frontend][PCH]-Add support for ignoring PCH options (-ignore-pch). (#142409 )" (#143614 ) Visual Studio has an argument to ignore all PCH related switches. clang-cl has also support option /Y-. Having the same option in clang would be helpful. This commit is to add support for ignoring PCH options (-ignore-pch). The commit includes: 1. Implement -ignore-pch as a Driver option. 2. Add a Driver test and a PCH test. 3. Add a section of -ignore-pch to user manual. 4. Add a release note for the new option '-ignore-pch'. The change since the original landing: 1. preprocessing-only mode doesn't imply that -include-pch is disabled. Co-authored-by: Matheus Izvekov <mizvekov@gmail.com>	2025-06-17 10:54:22 +01:00
Daniel Paoliello	2488f26d15	[win][x64] Unwind v2 3/n: Add support for requiring unwind v2 to be used (equivalent to MSVC's /d2epilogunwindrequirev2) (#143577 ) #129142 added support for emitting Windows x64 unwind v2 information, but it was "best effort". If any function didn't follow the requirements for v2 it was silently downgraded to v1. There are some parts of Windows (specifically kernel-mode code running on Xbox) that require v2, hence we need the ability to fail the compilation if v2 can't be used. This change also adds a heuristic to check if there might be too many unwind codes, it's currently conservative (i.e., assumes that certain prolog instructions will use the maximum number of unwind codes). Future work: attempting to chain unwind info across multiple tables if there are too many unwind codes due to epilogs and adding a heuristic to detect if an epilog will be too far from the end of the function.	2025-06-16 15:06:41 -07:00
Yaxun (Sam) Liu	7232c07eb9	Reland [HIP] use offload wrapper for non-device-only non-rdc (#143964 ) Fixed a typo: - auto Section = (Prefix + "llvm_offload_entries").str(); + auto Section = (Prefix + "_offload_entries").str(); which broke buildbot e.g. https://lab.llvm.org/buildbot/#/builders/208/builds/1948	2025-06-12 21:41:41 -04:00
Yaxun (Sam) Liu	8890706db6	Revert "Reland [HIP] use offload wrapper for non-device-only non-rdc (#132869 ) (#143964 )" This reverts commit 22f9b4aa1dad597d908be77be1e10ba4c77330ce.	2025-06-12 21:33:05 -04:00
Yaxun (Sam) Liu	22f9b4aa1d	Reland [HIP] use offload wrapper for non-device-only non-rdc (#132869 ) (#143964 ) Fixed two issues: 1. assertion with -flto. the linker wrapper action is missing for wrapping the device binary. Added it for -flto. 2. when there are two HIP files, the kernels in the second file were not found. This is because the -r option of linker wrapper assumes offload entries section of HIP to be hip_offloading_entries but it is actually llvm_offload_entries, causing the offload entries sections not made unique for different object files. Fixed and tested working for both -fgpu-rdc and -fno-gpu-rdc case with and without -r	2025-06-12 20:08:55 -04:00
Feng Zou	703e446022	[Clang] Add check for -mstack-alignment (#143124 ) Currently the assertion in Alignment.h is triggered if a wrong value is passed -mstack-alignment option: ``` Assertion `(Value == 0 \|\| llvm::isPowerOf2_64(Value)) && "Alignment is neither 0 nor a power of 2"' failed. ``` Added check in clang driver for the value of -mstack-alignment option, and emitted an error message when the wrong value was passed.	2025-06-13 06:45:28 +08:00
Shunsuke Watanabe	c431618041	[Clang][Driver] Override complex number calculation method by -fno-fast-math (#132680 ) This patch fixes a bug where -fno-fast-math doesn't revert the complex number calculation method to the default. The priority of overriding options related to complex number calculations differs slightly from GCC, as discussed in: https://discourse.llvm.org/t/the-priority-of-fno-fast-math-regarding-complex-number-calculations/84679	2025-06-12 10:19:26 +09:00
Ying Yi	c926bff560	Revert "[Frontend][PCH]-Add support for ignoring PCH options (-ignore-pch). (#142409 )" This reverts commit 4fb81f11cea0fce9f1f5409fcd55ae3aba70d368.	2025-06-10 17:19:58 +01:00
Cameron McInally	cde1035a2f	[flang] Add support for -mrecip[=<list>] (#143418 ) This patch adds support for the -mrecip command line option. The parsing of this options is equivalent to Clang's and it is implemented by setting the "reciprocal-estimates" function attribute. Also move the ParseMRecip(...) function to CommonArgs, so that Flang is able to make use of it as well. --------- Co-authored-by: Cameron McInally <cmcinally@nvidia.com>	2025-06-10 08:25:33 -06:00
MaggieYingYi	4fb81f11ce	[Frontend][PCH]-Add support for ignoring PCH options (-ignore-pch). (#142409 ) Visual Studio has an argument to ignore all PCH related switches. clang-cl has also support option /Y-. Having the same option in clang would be helpful. This commit is to add support for ignoring PCH options (-ignore-pch). The commit includes: 1. Implement -ignore-pch as a Driver option. 2. Add a Driver test and a PCH test. 3. Add a section of -ignore-pch to user manual. 4. Add a release note for the new option '-ignore-pch'. Code reviewed by: Matheus Izvekov <mizvekov@gmail.com>	2025-06-10 12:52:44 +01:00
Joseph Huber	f5e499a338	Revert "[HIP] use offload wrapper for non-device-only non-rdc (#132869 )" (#143432 ) This breaks a lot of new driver HIP compilation. We should probably revert this for now until we can make a fixed version. ```c++ static __global__ void print() { printf("%s\n", "foo"); } void b(); int main() { hipLaunchKernelGGL(print, dim3(1), dim3(1), 0, 0); auto y = hipDeviceSynchronize(); b(); } ``` ```c++ static __global__ void print() { printf("%s\n", "bar"); } void b() { hipLaunchKernelGGL(print, dim3(1), dim3(1), 0, 0); auto y = hipDeviceSynchronize(); } ``` ```console $ clang++ a.hip b.hip --offload-arch=gfx1030 --offload-new-driver $ ./a.out foo foo ``` ```console $ clang++ a.hip b.hip --offload-arch=gfx1030 --offload-new-driver -flto <crash> ``` This reverts commit d54c28b9c1396fa92d9347ac1135da7907121cb8.	2025-06-09 17:18:49 -05:00
Cameron McInally	a42bb8b57a	[Driver] Move CommonArgs to a location visible by the Frontend Drivers (#142800 ) This patch moves the CommonArgs utilities into a location visible by the Frontend Drivers, so that the Frontend Drivers may share option parsing code with the Compiler Driver. This is useful when the Frontend Drivers would like to verify that their incoming options are well-formed and also not reinvent the option parsing wheel. We already see code in the Clang/Flang Drivers that is parsing and verifying its incoming options. E.g. OPT_ffp_contract. This option is parsed in the Compiler Driver, Clang Driver, and Flang Driver, all with slightly different parsing code. It would be nice if the Frontend Drivers were not required to duplicate this Compiler Driver code. That way there is no/low maintenance burden on keeping all these parsing functions in sync. Along those lines, the Frontend Drivers will now have a useful mechanism to verify their incoming options are well-formed. Currently, the Frontend Drivers trust that the Compiler Driver is not passing back junk in some cases. The Language Drivers may even accept junk with no error at all. E.g.: `clang -cc1 -mprefer-vector-width=junk test.c' With this patch, we'll now be able to tighten up incomming options to the Frontend drivers in a lightweight way. --------- Co-authored-by: Cameron McInally <cmcinally@nvidia.com> Co-authored-by: Shafik Yaghmour <shafik.yaghmour@intel.com>	2025-06-06 17:59:24 -04:00
Peter Collingbourne	d1b0b4bb44	Add -funique-source-file-identifier option. This option complements -funique-source-file-names and allows the user to use a different unique identifier than the source file path. Reviewers: teresajohnson Reviewed By: teresajohnson Pull Request: https://github.com/llvm/llvm-project/pull/142901	2025-06-05 10:52:01 -07:00
Omar Ahmed	06ee672fc5	[clang] Move opt level in clang toolchain to clang::ConstructJob start (#141036 ) We currently transfer the opt level from the user clang call to CC1 args at the end of the `ConstructJob` function, this might lead to bugs as `ConstructJob` is a big function and we easily could add a change that would return early from it. That would cause the opt level to not be transferred to CC1 args and lead to wrong opt level compilation and would be hard to spot. This PR moves the opt level to the beginning of the function as opt level should be a direct transfer without any problems, it also removes the redundancy where it was added 2 times through the function.	2025-05-28 07:40:00 -06:00
Kazu Hirata	6c37341943	[Driver] Remove unused includes (NFC) (#141448 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-05-26 09:13:36 -07:00
Kazu Hirata	031cf05f01	[Driver] Use StringRef::consume_front (NFC) (#141412 )	2025-05-25 10:55:11 -07:00
Kazu Hirata	8323335496	[clang] Use llvm::any_of (NFC) (#141314 )	2025-05-23 23:59:38 -07:00
Sebastian Kreutzer	8d0a484983	[XRay] Fix argument parsing with offloading (#140748 ) (#141043 ) This PR addressed issue #140748 to support XRay instrumentation on the host side when using offloading. It makes the following changes: - Initializes `XRayArgs` using the processed toolchain arguments instead of the raw input. - Removes the current caching mechanism of `XRayArgs` in the `ToolChain` class, as this is error-prone and potential benefits are questionable. For reference, `SanitizierArgs`, which is constructed in a similar manner but is much more complex, does not use any caching. - Adds driver tests to verify that XRay flags are set correctly with offloading and `-Xarch_host`.	2025-05-22 09:06:24 -05:00
Rohit Aggarwal	54f2b45c98	[Clang][Driver][fveclib] Fix target parsing for -fveclib=AMDLIBM option (#140544 ) The behavior of -fveclib=AMDLIBM should be similar to -fveclib=libmvec. Example - Error message for unsupported target usage should be same. We are handling the missed cases for -fveclib=AMDLIBM and aligning it to -fveclib=libmvec usage. --------- Co-authored-by: Rohit Aggarwal <Rohit.Aggarwal@amd.com>	2025-05-20 13:35:17 +01:00
Orlando Cazalet-Hyams	8f03e1a9d5	[KeyInstr][Clang] Add Clang option -g[no-]key-instructions (#134627 ) This needs to be driver level to pass an -mllvm flag to LLVM, though this may change soon as the -mllvm flag will soon not be necessary. Keep the flag help-hidden as the feature is under development. This patch is part of a stack that teaches Clang to generate Key Instructions metadata for C and C++. RFC: https://discourse.llvm.org/t/rfc-improving-is-stmt-placement-for-better-interactive-debugging/82668 The feature is only functional in LLVM if LLVM is built with CMake flag LLVM_EXPERIMENTAL_KEY_INSTRUCTIONs. Eventually that flag will be removed.	2025-05-20 11:22:36 +01:00
Juan Manuel Martinez Caamaño	fbf08a68b8	[ObjectiveC] -rewrite-objc was treating inputs as preprocessed files (#137623 ) `-rewrite-objc` passes `-x objective-c++-cpp-output` as input type to the preprocessor job. This is not correct since we would be preprocessing a preprocessed file. The correct input type is `objective-c++`.	2025-05-12 09:50:15 +02:00
Daniel Paoliello	97a58b04c6	[aarch64][x86][win] Add compiler support for MSVC's /funcoverride flag (Windows kernel loader replaceable functions) (#125320 ) Adds support for MSVC's undocumented `/funcoverride` flag, which marks functions as being replaceable by the Windows kernel loader. This is used to allow functions to be upgraded depending on the capabilities of the current processor (e.g., the kernel can be built with the naive implementation of a function, but that function can be replaced at boot with one that uses SIMD instructions if the processor supports them). For each marked function we need to generate: * An undefined symbol named `<name>_$fo$`. * A defined symbol `<name>_$fo_default$` that points to the `.data` section (anywhere in the data section, it is assumed to be zero sized). * An `/ALTERNATENAME` linker directive that points from `<name>_$fo$` to `<name>_$fo_default$`. This is used by the MSVC linker to generate the appropriate metadata in the Dynamic Value Relocation Table. Marked function must never be inlined (otherwise those inline sites can't be replaced). Note that I've chosen to implement this in AsmPrinter as there was no way to create a `GlobalVariable` for `<name>_$fo$` that would result in a symbol being emitted (as nothing consumes it and it has no initializer). I tried to have `llvm.used` and `llvm.compiler.used` point to it, but this didn't help. Within LLVM I referred to this feature as "loader replaceable" as "function override" already has a different meaning to C++ developers... I also took the opportunity to extract the feature symbol generation code used by both AArch64 and X86 into a common function in AsmPrinter.	2025-05-09 14:56:38 -07:00
Daniel Paoliello	72c3ed6745	[win][x64] Unwind v2 3/n: Add support for emitting unwind v2 information (equivalent to MSVC /d2epilogunwind) (#129142 ) Adds support for emitting Windows x64 Unwind V2 information, includes support `/d2epilogunwind` in clang-cl. Unwind v2 adds information about the epilogs in functions such that the unwinder can unwind even in the middle of an epilog, without having to disassembly the function to see what has or has not been cleaned up. Unwind v2 requires that all epilogs are in "canonical" form: * If there was a stack allocation (fixed or dynamic) in the prolog, then the first instruction in the epilog must be a stack deallocation. * Next, for each `PUSH` in the prolog there must be a corresponding `POP` instruction in exact reverse order. * Finally, the epilog must end with the terminator. This change adds a pass to validate epilogs in modules that have Unwind v2 enabled and, if they pass, emits new pseudo instructions to MC that 1) note that the function is using unwind v2 and 2) mark the start of the epilog (this is either the first `POP` if there is one, otherwise the terminator instruction). If a function does not meet these requirements, it is downgraded to Unwind v1 (i.e., these new pseudo instructions are not emitted). Note that the unwind v2 table only marks the size of the epilog in the "header" unwind code, but it's possible for epilogs to use different terminator instructions thus they are not all the same size. As a work around for this, MC will assume that all terminator instructions are 1-byte long - this still works correctly with the Windows unwinder as it is only using the size to do a range check to see if a thread is in an epilog or not, and since the instruction pointer will never be in the middle of an instruction and the terminator is always at the end of an epilog the range check will function correctly. This does mean, however, that the "at end" optimization (where an epilog unwind code can be elided if the last epilog is at the end of the function) can only be used if the terminator is 1-byte long. One other complication with the implementation is that the unwind table for a function is emitted during streaming, however we can't calculate the distance between an epilog and the end of the function at that time as layout hasn't been completed yet (thus some instructions may be relaxed). To work around this, epilog unwind codes are emitted via a fixup. This also means that we can't pre-emptively downgrade a function to Unwind v1 if one of these offsets is too large, so instead we raise an error (but I've passed through the location information, so the user will know which of their functions is problematic).	2025-05-09 10:42:10 -07:00
Lei Wang	b836f96b8f	[Coverage] Support -fprofile-list for cold function coverage (#136333 ) Add a new instrumentation section type `[sample-coldcov]` to support`-fprofile-list` for sample pgo based cold function coverage. Note that the current cold function coverage is based on sampling PGO pipeline, which is incompatible with the existing [llvm] option(see [PGOOptions](https://github.com/llvm/llvm-project/blob/main/llvm/include/llvm/Support/PGOOptions.h#L27-L43)), so we can't reuse the IR-PGO(-fprofile-instrument=llvm) flag.	2025-05-08 10:51:38 -07:00
Paul Walker	cb9683fad1	[Clang][Flang][Driver] Fix target parsing for -fveclib=libmvec option. (#138288 ) There are various places where the -fveclib option is parsed to determine whether its value is correct for the target. Unfortunately these places assume case-insensitivity and subsequently use "LIBMVEC" where the driver mandates "libmvec", thus rendering the diagnosistic useless. This PR corrects the naming along with similar incorrect uses within the test files.	2025-05-06 11:57:04 +01:00
Shilei Tian	8ae9a204f0	[Clang][Driver] Only enable internalization for OpenMP target offloading with ThinLTO on AMDGPU (#138547 )	2025-05-05 13:10:46 -04:00
Shilei Tian	2b62a311d7	[Clang][Driver] Enable internalization by default for AMDGPU (#138365 )	2025-05-02 21:32:46 -04:00
Alan Zhao	4a6c81dc0e	[clang] Implement JSON formatted -ftime-report (#137737 ) This patch adds a new flag, -ftime-report-json, which outputs the same information as -ftime-report but as JSON instead of -ftime-report's pretty printed format.	2025-04-30 13:43:05 -07:00
Prabhu Rajasekaran	6274cdb9a6	[clang] Fix UEFI Target info (#127290 ) For X64 UEFI targets set appropriate integer type sizes, and relevant ABI information. --------- Co-authored-by: Petr Hosek <phosek@google.com>	2025-04-30 10:07:57 -07:00

1 2 3 4 5 ...

1644 Commits