llvm-project

Author	SHA1	Message	Date
Hans Wennborg	e11ede5e90	Revert "[MS][clang] Add support for vector deleting destructors (#126240 )" This caused link errors when building with sancov. See comment on the PR. > Whereas it is UB in terms of the standard to delete an array of objects > via pointer whose static type doesn't match its dynamic type, MSVC > supports an extension allowing to do it. > Aside from array deletion not working correctly in the mentioned case, > currently not having this extension implemented causes clang to generate > code that is not compatible with the code generated by MSVC, because > clang always puts scalar deleting destructor to the vftable. This PR > aims to resolve these problems. > > Fixes https://github.com/llvm/llvm-project/issues/19772 This reverts commit d6942d54f677000cf713d2b0eba57b641452beb4.	2025-03-12 16:26:00 +01:00
Aiden Grossman	de132b2a0c	[Clang][CodeGen] Fix demangler invariant comment assertion (#130522 ) This patch makes the assertion (that is currently in a comment) that validates that names mangled by clang can be demangled by LLVM actually compile/work. There were some minor issues that needed to be fixed (like starts_with not being available on std::string and needing to call getDecl() on GD), and a logic issue that should be fixed in this patch. This enables just uncommenting the assertion to enable it within the compiler (minus needing to add the header file).	2025-03-09 19:56:40 -07:00
erichkeane	67960e5c08	[OpenACC] Ensure decl OpenACC constructs don't crash I initially implemented codegen to be a 'no-op' for these declarations, which I thought was properly implemented. However, when they are a top-level decl, we have a separate switch. This patch makes sure they are properly emitted at top-level as a no-op, and adds a test for both top-level and not top-level.	2025-03-07 08:05:19 -08:00
Steven Perron	6d4f8b1dbf	[HLSL] Fix resource wrapper declaration (#129100) The resource wrapper should have internal linkage because it contains a handle to the global resource, and it is not the actual global. Making this change exposed that we were zeroinitializing the resource, which is a problem. The handle cannot be zeroinitialized. This is changed to use poison instead. Fixes https://github.com/llvm/llvm-project/issues/122767. --------- Co-authored-by: Helena Kotas <hekotas@microsoft.com>	2025-03-05 14:02:39 -05:00
Mariya Podchishchaeva	d6942d54f6	[MS][clang] Add support for vector deleting destructors (#126240 ) Whereas it is UB in terms of the standard to delete an array of objects via pointer whose static type doesn't match its dynamic type, MSVC supports an extension allowing to do it. Aside from array deletion not working correctly in the mentioned case, currently not having this extension implemented causes clang to generate code that is not compatible with the code generated by MSVC, because clang always puts scalar deleting destructor to the vftable. This PR aims to resolve these problems. Fixes https://github.com/llvm/llvm-project/issues/19772	2025-03-04 09:17:50 +01:00
Yaxun (Sam) Liu	240f2269ff	Add clang atomic control options and attribute (#114841) Add option and statement attribute for controlling emitting of target-specific metadata to atomicrmw instructions in IR. The RFC for this attribute and option is https://discourse.llvm.org/t/rfc-add-clang-atomic-control-options-and-pragmas/80641. Originally a pragma was proposed, then it was changed to a clang attribute. This attribute allows users to specify one, two, or all three options and must be applied to a compound statement. The attribute can also be nested, with inner attributes overriding the options specified by outer attributes or the target's default options. These options will then determine the target-specific metadata added to atomic instructions in the IR. In addition to the attribute, three new compiler options are introduced: `-f[no-]atomic-remote-memory`, `-f[no-]atomic-fine-grained-memory`, `-f[no-]atomic-ignore-denormal-mode`. These compiler options allow users to override the default options through the Clang driver and front end. `-m[no-]unsafe-fp-atomics` is aliased to `-f[no-]ignore-denormal-mode`. In terms of implementation, the atomic attribute is represented in the AST by the existing AttributedStmt, with minimal changes to AST and Sema. During code generation in Clang, the CodeGenModule maintains the current atomic options, which are used to emit the relevant metadata for atomic instructions. RAII is used to manage the saving and restoring of atomic options when entering and exiting nested AttributedStmt.	2025-02-27 10:41:04 -05:00
Helena Kotas	2db8386867	[HLSL] Implement default constant buffer $Globals (2nd attempt) (#128589) All variable declarations in the global scope that are not resources, static or empty are implicitly added to implicit constant buffer `$Globals`. They are created in `hlsl_constant` address space and collected in an implicit `HLSLBufferDecl` node that is added to the AST at the end of the translation unit. Codegen is the same as for explicit constant buffers. Fixes #123801 This is a second attempt to implement this feature. The first attempt had to be reverted because of memory leaks. The problem was adding a `SmallVector` member on `HLSLBufferDecl` node to represent a list of default buffer declarations. When this vector needed to grow, it allocated memory that was never released, because all memory used by AST nodes must be allocated by `ASTContext` allocator and is released all at once. Destructors on AST nodes are never called. In this change, the list of default buffer declarations is collected in a `SmallVector` instance on `SemaHLSL`. The `HLSLBufferDecl` representing `$Globals` is created at the end of the translation unit when the number of declarations is known, and the list is copied into an array allocated by the `ASTContext` allocator.	2025-02-25 16:57:07 -08:00
Helena Kotas	6e5f26bba8	Revert "[HLSL] Implement default constant buffer `$Globals`" (#128112 ) Reverts llvm/llvm-project#125807 Reverting this change because of failing tests.	2025-02-20 18:39:38 -08:00
Helena Kotas	776cddacb1	[HLSL] Implement default constant buffer `$Globals` (#125807 ) All variable declarations in the global scope that are not resources, static or empty are implicitly added to implicit constant buffer `$Globals`. They are created in `hlsl_constant` address space and collected in an implicit `HLSLBufferDecl` node that is added to the AST at the end of the translation unit. Codegen is the same as for explicit constant buffers. Fixes #123801	2025-02-20 17:27:53 -08:00
Nick Sarnie	f3cd223838	[OpenMP][OpenMPIRBuilder] Add initial changes for SPIR-V target frontend support (#125920 ) As Intel is working to add support for SPIR-V OpenMP device offloading in upstream clang/liboffload, we need to modify the OpenMP frontend to allow SPIR-V as well as generate valid IR for SPIR-V. For example, we need the frontend to generate code to define and interact with device globals used in the DeviceRTL. This is the beginning of what I expect will be (many) other changes, but let's get started with something simple. --------- Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>	2025-02-10 16:16:40 +00:00
Mats Jun Larsen	e0fee55a55	[CodeGen] Replace of PointerType::get(Type) with opaque version (NFC) (#124771 ) Follow-up to https://github.com/llvm/llvm-project/issues/123569	2025-02-08 15:13:02 +00:00
Scott Constable	e223485c9b	[X86] Extend kCFI with a 3-bit arity indicator (#121070 ) Kernel Control Flow Integrity (kCFI) is a feature that hardens indirect calls by comparing a 32-bit hash of the function pointer's type against a hash of the target function's type. If the hashes do not match, the kernel may panic (or log the hash check failure, depending on the kernel's configuration). These hashes are computed at compile time by applying the xxHash64 algorithm to each mangled canonical function (or function pointer) type, then truncating the result to 32 bits. This hash is written into each indirect-callable function header by encoding it as the 32-bit immediate operand to a `MOVri` instruction, e.g.: ``` __cfi_foo: nop nop nop nop nop nop nop nop nop nop nop movl $199571451, %eax # hash of foo's type = 0xBE537FB foo: ... ``` This PR extends x86-based kCFI with a 3-bit arity indicator encoded in the `MOVri` instruction's register (reg) field as follows: \| Arity Indicator \| Description \| Encoding in reg field \| \| --------------- \| --------------- \| --------------- \| \| 0 \| 0 parameters \| EAX \| \| 1 \| 1 parameter in RDI \| ECX \| \| 2 \| 2 parameters in RDI and RSI \| EDX \| \| 3 \| 3 parameters in RDI, RSI, and RDX \| EBX \| \| 4 \| 4 parameters in RDI, RSI, RDX, and RCX \| ESP \| \| 5 \| 5 parameters in RDI, RSI, RDX, RCX, and R8 \| EBP \| \| 6 \| 6 parameters in RDI, RSI, RDX, RCX, R8, and R9 \| ESI \| \| 7 \| At least one parameter may be passed on the stack \| EDI \| For example, if `foo` takes 3 register arguments and no stack arguments then the `MOVri` instruction in its kCFI header would instead be written as: ``` movl $199571451, %ebx # hash of foo's type = 0xBE537FB ``` This PR will benefit other CFI approaches that build on kCFI, such as FineIBT. For example, this proposed enhancement to FineIBT must be able to infer (at kernel init time) which registers are live at an indirect call target: https://lkml.org/lkml/2024/9/27/982. 
If the arity bits are available in the kCFI function header, then this information is trivial to infer. Note that there is another existing PR proposal that includes the 3-bit arity within the existing 32-bit immediate field, which introduces different security properties: https://github.com/llvm/llvm-project/pull/117121.	2025-02-06 10:54:22 +08:00
Chandler Carruth	cd269fee05	[StrTable] Switch Clang builtins to use string tables This both reapplies #118734, the initial attempt at this, and updates it significantly. First, it uses the newly added `StringTable` abstraction for string tables, and simplifies the construction to build the string table and info arrays separately. This should reduce any `constexpr` compile time memory or CPU cost of the original PR while significantly improving the APIs throughout. It also restructures the builtins to support sharding across several independent tables. This accomplishes three improvements from the original PR: 1) It improves the APIs used significantly. 2) When builtins are defined from different sources (like SVE vs MVE in AArch64), this allows each of them to build their own string table independently rather than having to merge the string tables and info structures. 3) It allows each shard to factor out a common prefix, often cutting the size of the strings needed for the builtins by a factor of two. The second point is important both to allow different mechanisms of construction (for example a `.def` file and a tablegen'ed `.inc` file, or different tablegen'ed `.inc` files) and because it reduces the sizes of these tables, which is valuable given how large they are in some cases. The third builds on that size reduction. Initially, we use this new sharding rather than merging tables in AArch64, LoongArch, RISCV, and X86. Mostly this helps ensure the system works, as without further changes these still push scaling limits. Subsequent commits will more deeply leverage the new structure, including using the prefix capabilities which cannot be easily factored out here and require deep changes to the targets.	2025-02-04 18:04:57 +00:00
Owen Anderson	8f025f2a93	[clang] Do not emit template parameter objects as COMDATs when they have internal linkage. (#125448 ) Per the ELF spec, section groups may only contain local symbols if those symbols are only referenced from within the section group. [1] In the case of template parameter objects, they can be referenced from outside the group when the type of the object was declared in an anonymous namespace. In that case, we can't place the object in a COMDAT. This matches GCC's linkage behavior on the test input. [1]: https://www.sco.com/developers/gabi/latest/ch4.sheader.html#section_groups	2025-02-03 23:26:22 +13:00
Daniel Paoliello	845cc968e9	[clang][llvm][aarch64][win] Add a clang flag and module attribute for import call optimization, and remove LLVM flag (#122831 ) Switches import call optimization from being enabled by an LLVM flag to instead using a module attribute, and creates a new Clang flag that will set that attribute. This addresses the concern raised in the original PR: <https://github.com/llvm/llvm-project/pull/121516#discussion_r1911763991> This change also only creates the Called Global info if the module attribute is present, addressing this concern: <https://github.com/llvm/llvm-project/pull/122762#pullrequestreview-2547595934>	2025-01-30 09:51:43 -08:00
Jason Rice	abc8812df0	[Clang][P1061] Add structured binding packs (#121417) This is an implementation of P1061, Structured Bindings can introduce a Pack, without the ability to use packs outside of templates. There are a couple of ways the AST could have been sliced, so let me know what you think. The only part of this change that I am unsure of is the serialization/deserialization stuff. I followed the implementation of other Exprs, but I do not really know how it is tested. Thank you for your time considering this. --------- Co-authored-by: Yanzuo Liu <zwuis@outlook.com>	2025-01-29 21:43:52 +01:00
Hervé Poussineau	71d6287f5b	[Clang][MIPS] Create correct linker arguments for Windows toolchains (#121041 )	2025-01-20 15:11:26 +08:00
Alexandros Lamprineas	b93ffa8e4a	[FMV][AArch64] Changes in fmv-features metadata. (#122192 ) * We want the default version to have this attribute too otherwise it becomes indistinguishable from non-versioned functions. * We don't need the '+' unlike target-features which can negate. This will allow using the parsing API of target_version/clones for the metadata too.	2025-01-10 17:50:35 +00:00
Alexandros Lamprineas	8e65940161	[FMV][AArch64] Simplify version selection according to ACLE. (#121921 ) Currently, the more features a version has, the higher its priority is. We are changing ACLE https://github.com/ARM-software/acle/pull/370 as follows: "Among any two versions, the higher priority version is determined by identifying the highest priority feature that is specified in exactly one of the versions, and selecting that version."	2025-01-08 18:59:07 +00:00
Alexandros Lamprineas	93011fe2a5	[FMV][AArch64][clang] Emit fmv-features metadata in LLVM IR. (#118544 ) We need to be able to propagate information about FMV attribute strings from C/C++ source to LLVM IR. This is necessary so that we can distinguish which target-features are coming from the cmdline, which are coming from the target attribute, and which are coming from feature dependency expansion. We need this for static resolution of calls in LLVM. Here's a motivating example: Suppose you have target_version("i8mm+dotprod") and target_version("fcma"). The first version clearly has higher priority. Now suppose you specify -march=armv8-a+i8mm on the command line. Then the versions would have target-features "+i8mm,+dotprod" and "+i8mm,+fcma" respectively. If you are using those to deduce version priority, then you would incorrectly deduce that the second version was higher priority than the first.	2025-01-07 08:51:23 +00:00
Alexandros Lamprineas	6586c676b4	[FMV][AArch64] Emit mangled default version if explicitly specified. (#120022 ) Currently we need at least one more version other than the default to trigger FMV. However we would like a header file declaration __attribute__((target_version("default"))) void f(void); to guarantee that there will be f.default	2024-12-19 12:06:46 +00:00
Florian Hahn	c135f6ffe2	[TySan] Add initial Type Sanitizer support to Clang (#76260) This patch introduces the Clang components of type sanitizer: a sanitizer for type-based aliasing violations. It is based on Hal Finkel's https://reviews.llvm.org/D32198. The Clang changes are mostly formulaic, the one specific change being that when the TBAA sanitizer is enabled, TBAA is always generated, even at -O0. It goes together with the corresponding LLVM changes (https://github.com/llvm/llvm-project/pull/76259) and compiler-rt changes (https://github.com/llvm/llvm-project/pull/76261). PR: https://github.com/llvm/llvm-project/pull/76260	2024-12-17 15:13:42 +00:00
Daniil Kovalev	f65a21a4ec	[PAC][ELF][AArch64] Support signed personality function pointer (#119361 ) Re-apply #113148 after revert in #119331 If function pointer signing is enabled, sign personality function pointer stored in `.DW.ref.__gxx_personality_v0` section with IA key, 0x7EAD = `ptrauth_string_discriminator("personality")` constant discriminator and address diversity enabled.	2024-12-16 10:24:09 +03:00
Daniil Kovalev	ef2e590e7b	Revert "[PAC][ELF][AArch64] Support signed personality function pointer" (#119331 ) Reverts llvm/llvm-project#113148 See buildbot failure https://lab.llvm.org/buildbot/#/builders/190/builds/11048	2024-12-10 09:12:25 +03:00
Daniil Kovalev	4fb1cda660	[PAC][ELF][AArch64] Support signed personality function pointer (#113148 ) If function pointer signing is enabled, sign personality function pointer stored in `.DW.ref.__gxx_personality_v0` section with IA key, 0x7EAD = `ptrauth_string_discriminator("personality")` constant discriminator and address diversity enabled.	2024-12-10 08:48:09 +03:00
Oleksandr T.	071da9261b	[Clang] ensure mangled names are valid identifiers before being suggested in ifunc/alias attributes notes (#118170 ) Fixes #112205 --- Commit that introduced this feature - `9306ef9750`	2024-12-02 18:16:47 +02:00
Alexandros Lamprineas	88c2af80fa	[NFC][clang][FMV][TargetInfo] Refactor API for FMV feature priority. (#116257) Currently we have code with target hooks in CodeGenModule shared between X86 and AArch64 for sorting MultiVersionResolverOptions. Those are used when generating IFunc resolvers for FMV. The RISCV target has different criteria for sorting, therefore it repeats sorting after calling CodeGenFunction::EmitMultiVersionResolver. I am moving the FMV priority logic into TargetInfo, so that it can be implemented by the TargetParser, which then makes it possible to query it from llvm. Here is an example of why this is handy: https://github.com/llvm/llvm-project/pull/87939	2024-11-28 09:22:05 +00:00
Alexandros Lamprineas	56eb559b1d	[clang][FMV] Fix crash with cpu_specific attribute. (#115762 ) When dealing with cpu_specific GlobalDecl, GetOrCreateMultiVersionResolver should immediately return the already created llvm function if it exists. Fixes https://github.com/llvm/llvm-project/issues/115299.	2024-11-26 07:45:15 +00:00
Viktoriia Bakalova	3de21477c4	[clang][codegen] Mention the invariant that LLVM demangler should be … (#117346 ) …able to handle mangled names generated by clang. https://discourse.llvm.org/t/rfc-clang-diagnostic-for-demangling-failures/82835/8 Since we're putting the work on the above RFC on hold, let's leave a comment in the source code pointing to prior efforts and the suggestion of further steps.	2024-11-25 17:23:10 +01:00
Nuno Lopes	b0afa6bab9	[clang] Change some placeholders from undef to poison [NFC]	2024-11-19 15:18:40 +00:00
Daniil Kovalev	3b162f73d8	[PAC][clang] Add signed GOT cc1 flag (#96160 ) Add `-fptrauth-elf-got` clang cc1 flag and set `ptrauth_elf_got` preprocessor feature and `PointerAuthELFGOT` LangOption correspondingly. No additional checks like ensuring OS binary format is ELF are performed: it should be done on clang driver level when a pauth-enabled environment implying signed GOT enabled is requested. If the cc1 flag is passed, "ptrauth-elf-got" IR module flag is set.	2024-11-19 10:20:15 +03:00
Kazu Hirata	e8a6624325	[CodeGen] Remove unused includes (NFC) (#116459 ) Identified with misc-include-cleaner.	2024-11-16 07:37:13 -08:00
Eli Friedman	6bd3f2e898	[clang codegen] Add CreateRuntimeFunction overload that takes a clang type. (#113506 ) Correctly computing the LLVM types/attributes is complicated in general, so add a variant which does that for you.	2024-11-14 14:35:40 -08:00
Chuanqi Xu	259eaa6878	[C++20] [Modules] Fix the duplicated static initializer problem (#114193) Reproducer: ``` //--- a.cppm export module a; int func(); static int a = func(); //--- a.cpp import a; ``` The `func()` should only execute once. However, before this patch we would somehow import `static int a` from a.cppm incorrectly and initialize it again. This is super bad and can cause serious runtime misbehavior. And also surprisingly, it looks like the root cause of the problem is simply some oversight in choosing APIs.	2024-10-30 17:27:04 +08:00
Congcong Cai	bd6c430dcb	[clang codegen] avoid crashing when emitting init func for global variable with flexible array init (#113336) Fixes: #113187 Avoid creating an init function, since clang does not support global variables with flexible array init. Creating one would cause an assertion failure later.	2024-10-23 09:21:27 +08:00
Boaz Brickner	09cc75e2cc	[clang] Deduplicate the logic that only warns once when stack is almost full (#112552 ) Zero diff in behavior.	2024-10-18 10:11:14 +02:00
Helena Kotas	7dbfa7b981	[HLSL] Add handle initialization for simple resource declarations (#111207) Adds `@_init_resource_bindings()` function to module initialization that includes `handle.fromBinding` intrinsic calls for simple resource declarations. Arrays of resources or resources inside user defined types are not supported yet. While this unblocks our progress on [Compile a runnable shader from clang](https://github.com/llvm/wg-hlsl/issues/7) milestone, this is probably not the way we would like to handle resource binding initialization going forward. Ideally, it should be done via the resource class constructors in order to support dynamic resource binding or unbounded arrays of resources. Depends on PRs #110327 and #111203. Part 1 of #105076	2024-10-17 17:59:08 -07:00
Steven Perron	2c8ecb3272	[HLSL][SPIRV] Use Spirv target codegen (#112573) When the arch in the triple is "spirv", the default target codegen is currently used. We should be using the spir-v target codegen. This will be used to have SPIR-V specific lowering of the HLSL types.	2024-10-16 12:46:45 -04:00
Boaz Brickner	c978f0f7ac	[clang] Fix segmentation fault caused by stack overflow on deeply nested expressions (#111701 ) Done by calling clang::runWithSufficientStackSpace(). Added CodeGenModule::runWithSufficientStackSpace() method similar to the one in Sema to provide a single warning when this triggers Fixes: #111699	2024-10-14 14:06:50 +02:00
Michał Górny	387b37af1a	[LLVM] [Clang] Support for Gentoo `t64` triples (64-bit time_t ABIs) (#111302 ) Gentoo is planning to introduce a `t64` suffix for triples that will be used by 32-bit platforms that use 64-bit `time_t`. Add support for parsing and accepting these triples, and while at it make clang automatically enable the necessary glibc feature macros when this suffix is used. An open question is whether we can backport this to LLVM 19.x. After all, adding new triplets to Triple sounds like an ABI change — though I suppose we can minimize the risk of breaking something if we move new enum values to the very end.	2024-10-14 11:18:04 +00:00
Rahul Joshi	fa789dffb1	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752 ) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).	2024-10-11 05:26:03 -07:00
Piyou Chen	f658c1bf4a	Recommit "[RISCV][FMV] Support target_version" (#111096)" (#111333) Fix the buildbot failure caused by a heap use-after-free error. Original message: This patch enables the `target_version` attribute for the RISC-V target. The proposal of the `target_version` syntax can be found at https://github.com/riscv-non-isa/riscv-c-api-doc/pull/48 (which has landed), as modified by the proposed https://github.com/riscv-non-isa/riscv-c-api-doc/pull/85 (which adds the priority syntax). The `target_version` attribute will trigger the function multi-versioning feature and act like the `target_clones` attribute. See https://github.com/llvm/llvm-project/pull/85786 for the implementation of `target_clones`.	2024-10-08 16:26:55 +08:00
Piyou Chen	1e5e153485	Revert "[RISCV][FMV] Support target_version" (#111096 ) Reverts llvm/llvm-project#99040 due to https://lab.llvm.org/buildbot/#/builders/190/builds/7052	2024-10-04 12:02:39 +08:00
Piyou Chen	7ab488e92c	[RISCV][FMV] Support target_version (#99040) This patch enables the `target_version` attribute for the RISC-V target. The proposal of the `target_version` syntax can be found at https://github.com/riscv-non-isa/riscv-c-api-doc/pull/48 (which has landed), as modified by the proposed https://github.com/riscv-non-isa/riscv-c-api-doc/pull/85 (which adds the priority syntax). The `target_version` attribute will trigger the function multi-versioning feature and act like the `target_clones` attribute. See https://github.com/llvm/llvm-project/pull/85786 for the implementation of `target_clones`.	2024-10-04 11:02:45 +08:00
Alex Voicu	e203a67f4c	[cuda][HIP] `__constant__` should imply constant (#110182 ) Currently, `__constant__` variables do not get unconditionally marked as `constant` in IR, which seems a bit odd given their definition. This is generally inconsequential for NVPTX/AMDGPU, since said variables get emitted in the constant address space for those BEs. However, it is potentially significant for e.g. HIP-on-SPIR-V cases, as SPIR-V does not allow casts to/from the constant AS (`UniformConstant`), which forces `__constant__` variables to be emitted in the global AS, thus making IR constness meaningful.	2024-09-29 01:22:52 +01:00
Ming-Yi Lai	9f33eb861a	[clang][RISCV] Introduce command line options for RISC-V Zicfilp CFI This patch enables the following command line flags for RISC-V targets: + `-fcf-protection=branch` turns on forward-edge control-flow integrity conditioning + `-mcf-branch-label-scheme=unlabeled\|func-sig` selects the label scheme used in the forward-edge CFI conditioning	2024-09-26 18:30:43 +08:00
Congcong Cai	eca5949031	[codegen][NFC] add static mark for internal usage variable and function (#109431 ) Detect by clang-tidy misc-use-internal-linkage	2024-09-24 07:25:07 +08:00
Thurston Dang	b89bb7775d	Reapply "[HLSL] set alwaysinline on HLSL functions (#106588 )" This reverts commit 4a63f4d301c0e044073e1b1f8f110015ec1778a1. It was reverted because of a buildbot breakage, but the fix-forward has landed (https://github.com/llvm/llvm-project/pull/109023).	2024-09-17 22:54:52 +00:00
Thurston Dang	4a63f4d301	Revert "[HLSL] set alwaysinline on HLSL functions (#106588)" This reverts commit a729e706de3fc6ebee49ede3c50afb47f2e29191. Reason: buildbot failure (https://lab.llvm.org/buildbot/#/builders/25/builds/2541): 'Clang :: CodeGenHLSL/builtins/StructuredBuffer-subscript.hlsl' failed	2024-09-17 21:06:36 +00:00
Greg Roth	a729e706de	[HLSL] set alwaysinline on HLSL functions (#106588) HLSL inlines all its functions by default. This uses the alwaysinline attribute to make the alwaysinliner pass inline any function not explicitly marked noinline by the user or autogeneration. The alwaysinline marking takes place in `SetLLVMFunctionAttributesForDefinitions`, where all other inlining interactions are determined. The outermost entry function is marked noinline because there's no reason to inline it. Any user calls to an entry function will instead call the internal mangled version of the entry function. Adds tests for function and constructor inlining and augments some existing tests to verify correct inlining of implicitly created functions as well. Incidentally restores a RUN line that I believe was mistakenly removed as part of #88918. Fixes #89282	2024-09-17 10:09:42 -07:00

2172 Commits