llvm-project

Author	SHA1	Message	Date
Joseph Huber	9888f0c3c4	[Clang] Add builtins for masked vector loads / stores (#154464 ) Summary: Clang has support for boolean vectors, these builtins expose the LLVM instruction of the same name. This differs from a manual load and select by potentially suppressing traps from deactivated lanes. Fixes: https://github.com/llvm/llvm-project/issues/107753	2025-08-20 13:33:32 -05:00
Fraser Cormack	8b128388b5	[clang] Introduce elementwise ctlz/cttz builtins (#131995 ) These builtins are modeled on the clzg/ctzg builtins, which accept an optional second argument. This second argument is returned if the first argument is 0. These builtins unconditionally exhibit zero-is-undef behaviour, regardless of target preference for the other ctz/clz builtins. The builtins have constexpr support. Fixes #154113	2025-08-20 12:18:28 +01:00
Chaitanya Koparkar	c3bf73bc4a	[clang] Add elementwise fshl/fshr builtins (#153113 ) This patch implements `__builtin_elementwise_fshl` and `__builtin_elementwise_fshr` builtins. These map to the fshl/fshr intrinsics described here: - https://llvm.org/docs/LangRef.html#llvm-fshl-intrinsic - https://llvm.org/docs/LangRef.html#llvm-fshr-intrinsic Fixes https://github.com/llvm/llvm-project/issues/152555.	2025-08-12 20:57:55 +09:00
Nikita Popov	c23b4fbdbb	[IR] Remove size argument from lifetime intrinsics (#150248 ) Now that #149310 has restricted lifetime intrinsics to only work on allocas, we can also drop the explicit size argument. Instead, the size is implied by the alloca. This removes the ability to only mark a prefix of an alloca alive/dead. We never used that capability, so we should remove the need to handle that possibility everywhere (though many key places, including stack coloring, did not actually respect this).	2025-08-08 11:09:34 +02:00
Bill Wendling	49a24b3116	[CodeGen][counted_by] Support use of the comma operator (#151776 ) Writing something like this: __builtin_dynamic_object_size((0, p->array), 0) is equivalent to writing this: __builtin_dynamic_object_size(p->array, 0) though the former will give a warning about the first value being unused.	2025-08-01 17:28:08 -07:00
Bill Wendling	254b90fa95	[CodeGen][counted_by] See past parentheses and no-op casts (#151266 ) Parentheses and no-op casts don't change the value. Skip past them to get to a MemberExpr. Fixes #151236	2025-07-30 14:37:05 -07:00
Wenju He	e0dd22fab1	[Clang] Add elementwise maximumnum/minimumnum builtin functions (#149775 ) Addresses https://github.com/llvm/llvm-project/issues/112164. minimumnum and maximumnum intrinsics were added in 5bf81e53dbea. The new built-ins can be used for implementing OpenCL math function fmax and fmin in #128506.	2025-07-23 08:34:35 +08:00
jofrn	15d36aa4ce	[clang][CodeGen] Preserve addrspace of enqueue_kernel builtin. (#148062 ) __enqueue_kernel_varargs' last parameter is in addrspace(5), but CodeGen currently misses this qualifier. This commit fixes the code to preserve the qualifier by referencing Alloca, which has its casts removed, rather than TmpPtr.	2025-07-11 17:00:28 -04:00
Adam Glass	9a0a9764f3	[Clang][AArch64] _interlockedbittestand{set,reset}64_{acq,rel,nf} support for AArch64 (#145980 ) Adds _interlockedbittestand{set,reset}64_{acq,rel,nf} support for AArch64	2025-06-26 17:20:27 -07:00
Mariya Podchishchaeva	ad87d951c9	[clang] Fix __builtin_mul_overflow for big _BitInts (#145497 ) For long enough _BitInt types we use different types for memory, storing-loading and other operations. Makes sure it is correct for mixed sign __builtin_mul_overflow cases. Using pointer element type as a result type doesn't work, because it will be "in-memory" type which is usually bigger than "operations" type and that caused crashes because clang was trying to emit trunc to a bigger type. Fixes https://github.com/llvm/llvm-project/issues/144771	2025-06-25 10:57:48 +02:00
Kazu Hirata	7cbb141155	[clang] Migrate away from ArrayRef(std::nullopt) (NFC) (#144982 ) ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to new comers, I would like to move away from the constructor and eventually remove it. This patch takes care of the clang side of the migration.	2025-06-19 23:29:50 -07:00
Kazu Hirata	c01532177f	[clang] Remove unused includes (NFC) (#144285 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-06-15 21:00:36 -07:00
Thurston Dang	428afa62b0	[ubsan] Add more -fsanitize-annotate-debug-info checks (#141997 ) This extends https://github.com/llvm/llvm-project/pull/138577 to more UBSan checks, by changing SanitizerDebugLocation (formerly SanitizerScope) to add annotations if enabled for the specified ordinals. Annotations will use the ordinal name if there is exactly one ordinal specified in the SanitizerDebugLocation; otherwise, it will use the handler name. Updates the tests from https://github.com/llvm/llvm-project/pull/141814. --------- Co-authored-by: Vitaly Buka <vitalybuka@google.com>	2025-06-06 14:59:32 -07:00
Oliver Hunt	93314bd946	[clang][PAC] Add __builtin_get_vtable_pointer (#139790 ) With pointer authentication it becomes non-trivial to correctly load the vtable pointer of a polymorphic object. __builtin_get_vtable_pointer is a function that performs the load and performs the appropriate authentication operations if necessary.	2025-06-04 00:21:20 -07:00
Victor Lomuller	c474f8f240	[clang][SPIRV] Add builtin for OpGenericCastToPtrExplicit and its SPIR-V friendly binding (#137805 ) The patch introduce __builtin_spirv_generic_cast_to_ptr_explicit which is lowered to the llvm.spv.generic.cast.to.ptr.explicit intrinsic. The SPIR-V builtins are now split into 3 differents file: BuiltinsSPIRVCore.td, BuiltinsSPIRVVK.td for Vulkan specific builtins, BuiltinsSPIRVCL.td for OpenCL specific builtins and BuiltinsSPIRVCommon.td for common ones. The patch also introduces a new header defining its SPIR-V friendly equivalent (__spirv_GenericCastToPtrExplicit_ToGlobal, __spirv_GenericCastToPtrExplicit_ToLocal and __spirv_GenericCastToPtrExplicit_ToPrivate). The functions are declared as aliases to the new builtin allowing C-like languages to have a definition to rely on as well as gaining proper front-end diagnostics. The motivation for the header is to provide a stable binding for applications or library (such as SYCL) and allows non SPIR-V targets to provide an implementation (via libclc or similar to how it is done for gpuintrin.h).	2025-05-29 15:19:40 +02:00
Orlando Cazalet-Hyams	351f15ba82	Reapply "[KeyIntsr][Clang] Builtins atoms (#134651 )" This reverts commit 894a0dd57f81211f9e431d9e84f2856d34f46993 with tests fixed. This patch is part of a stack that teaches Clang to generate Key Instructions metadata for C and C++. RFC: https://discourse.llvm.org/t/rfc-improving-is-stmt-placement-for-better-interactive-debugging/82668 The feature is only functional in LLVM if LLVM is built with CMake flag LLVM_EXPERIMENTAL_KEY_INSTRUCTIONs. Eventually that flag will be removed.	2025-05-29 10:40:34 +01:00
Orlando Cazalet-Hyams	894a0dd57f	Revert "[KeyIntsr][Clang] Builtins atoms (#134651 )" This reverts commit b14799e9e0ed2cae7cbce45c413233336b151fea. Breaks downstream bots.	2025-05-28 18:31:08 +01:00
Orlando Cazalet-Hyams	b14799e9e0	[KeyIntsr][Clang] Builtins atoms (#134651 ) This patch is part of a stack that teaches Clang to generate Key Instructions metadata for C and C++. RFC: https://discourse.llvm.org/t/rfc-improving-is-stmt-placement-for-better-interactive-debugging/82668 The feature is only functional in LLVM if LLVM is built with CMake flag LLVM_EXPERIMENTAL_KEY_INSTRUCTIONs. Eventually that flag will be removed.	2025-05-28 18:19:16 +01:00
Kazu Hirata	8075c15f54	[CodeGen] Remove unused includes (NFC) (#141418 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-05-25 10:55:28 -07:00
Peter Collingbourne	b20801646a	CodeGen: Fix implementation of __builtin_trivially_relocate. The builtin is documented to copy `count` elements, but the implementation copies `count` bytes. Fix that. Reviewers: cor3ntin, ojhunt Pull Request: https://github.com/llvm/llvm-project/pull/140312	2025-05-23 17:02:49 -07:00
Pierre van Houtryve	0c96c65169	[clang][CodeGen] Fix crash on non-natural type in CheckAtomicAlignment (#141053 ) In some specific scenarios, `Ptr.getElementType()` won't be a primitive type or a vector of primitive types, and thus `getScalarSizeInBits()` returns zero. Use the datalayout to get the proper size of the type instead of making an implicit assumption that the type is a simple primitive type. Solves SWDEV-534184	2025-05-22 16:45:46 +02:00
choikwa	77de8a0c0a	[AMDGPU][clang] provide device implementation for __builtin_logb and … (#129347 ) …__builtin_scalbn Clang generates library calls for __builtin_* functions which can be a problem for GPUs that cannot handle them. This patch generates call to device implementation for __builtin_logb and ldexp intrinsic for __builtin_scalbn.	2025-05-19 14:11:31 -04:00
Jessica Clarke	864f0ff4ef	[clang][IR] Overload @llvm.thread.pointer to support non-AS0 targets (#132489 ) Thread-local globals live, by default, in the default globals address space, which may not be 0, so we need to overload @llvm.thread.pointer to support other address spaces, and use the default globals address space in Clang.	2025-05-14 21:51:56 +01:00
Bill Wendling	9ae3bce175	[Clang][counted_by] Add support for 'counted_by' on struct pointers (#137250 ) The 'counted_by' attribute is now available for pointers in structs. It generates code for sanity checks as well as __builtin_dynamic_object_size() calculations. For example: struct annotated_ptr { int count; char buf __attribute__((counted_by(count))); }; If the pointer's type is 'void ', use the 'sized_by' attribute, which works similarly to 'counted_by', but can handle the 'void' base type: struct annotated_ptr { int count; void buf __attribute__((sized_by(count))); }; If the 'count' field member occurs after the pointer, use the '-fexperimental-late-parse-attributes' flag during compilation. Note that 'counted_by' cannot be applied to a pointer to an incomplete type, because the size isn't known. struct foo; struct annotated_ptr { int count; struct foo buf __attribute__((counted_by(count))); /* invalid */ }; Signed-off-by: Bill Wendling <morbo@google.com>	2025-05-13 16:01:36 -07:00
Matt Arsenault	5ae2aed218	clang: Remove dest LangAS argument from performAddrSpaceCast (#138866 ) It isn't used and is redundant with the result pointer type argument. A more reasonable API would only have LangAS parameters, or IR parameters, not both. Not all values have a meaningful value for this. I'm also not sure why we have this at all, it's not overridden by any targets and further simplification is possible.	2025-05-09 14:24:54 +02:00
cor3ntin	300d4026f7	[Clang] Implement the core language parts of P2786 - Trivial relocation (#127636 ) This adds - The parsing of `trivially_relocatable_if_eligible`, `replaceable_if_eligible` keywords - `__builtin_trivially_relocate`, implemented in terms of memmove. In the future this should - Add the appropriate start/end lifetime markers that llvm does not have (`start_lifetime_as`) - Add support for ptrauth when that's upstreamed - the `__builtin_is_cpp_trivially_relocatable` and `__builtin_is_replaceable` traits Fixes #127609	2025-05-06 14:13:32 +02:00
Kazu Hirata	f002f300c5	[clang] Remove unused local variables (NFC) (#138453 )	2025-05-04 10:51:40 -07:00
YunQiang Su	58b5df09dc	Clang: Add elementwise minnum/maxnum builtin functions (#129207 ) With https://github.com/llvm/llvm-project/pull/112852, we claimed that llvm.minnum and llvm.maxnum should treat +0.0>-0.0, while libc doesn't require fmin(3)/fmax(3) for it. To make llvm.minnum/llvm.maxnum easy to use, we define the builtin functions for them, include __builtin_elementwise_minnum __builtin_elementwise_maxnum All of them support _Float16, __bf16, float, double, long double.	2025-04-14 13:49:32 +08:00
Farzon Lotfi	16c84c4475	[DirectX] Add target builtins (#134439 ) - fixes #132303 - Moves dot2add from a language builtin to a target builtin. - Sets the scaffolding for Sema checks for DX builtins - Setup DirectX backend as able to have target builtins - Adds a DX TargetBuiltins emitter in `clang/lib/CodeGen/TargetBuiltins/DirectX.cpp`	2025-04-07 12:06:57 -04:00
Nikita Popov	b384d6d6cc	[CodeGen] Don't include CGDebugInfo.h in CodeGenFunction.h (NFC) (#134100 ) This is an expensive header, only include it where needed. Move some functions out of line to achieve that. This reduces time to build clang by ~0.5% in terms of instructions retired.	2025-04-03 08:04:19 +02:00
Jonathan Thackray	a1a74c9e80	[NFC][clang] Remove superfluous header files after refactor in #132252 (#132495 ) Remove superfluous header files after refactor in #132252	2025-03-26 14:45:00 +00:00
Jonathan Thackray	7f920e2e5f	[NFC][clang] Split clang/lib/CodeGen/CGBuiltin.cpp into target-specific files (#132252 ) clang/lib/CodeGen/CGBuiltin.cpp is over 1MB long (>23k LoC), and can take minutes to recompile (depending on compiler and host system) when modified, and 5 seconds for clangd to update for every edit. Splitting this file was discussed in this thread: https://discourse.llvm.org/t/splitting-clang-s-cgbuiltin-cpp-over-23k-lines-long-takes-1min-to-compile/ and the idea has received a number of +1 votes, hence this change.	2025-03-21 19:09:39 +00:00
Phoebe Wang	09feaa9261	Revert "[X86][AVX10.2] Support YMM rounding new instructions (#101825 )" (#132362 ) This reverts commit 0dba5381d8c8e4cadc32a067bf2fe5e3486ae53d. YMM rounding was removed from AVX10 whitepaper. Ref: https://cdrdv2.intel.com/v1/dl/getContent/784343 The MINMAX and SATURATING CONVERT instructions will be removed as a follow up.	2025-03-21 20:12:57 +08:00
Rahul Joshi	88a51d2392	[Clang][NFC] Code cleanup in CGBuiltin.cpp (#132060 ) - Use `Intrinsic::` directly instead of `llvm::Intrinsic::`. - Eliminate redundant `nullptr` for some `CreateIntrinsic` calls. - Eliminate redundant `ArrayRef` casts. - Use C++17 structured binding instead of `std::tie`.	2025-03-20 15:46:28 -07:00
Aaron Ballman	d781ac1cf0	[C23] Add __builtin_c23_va_start (#131166 ) This builtin is supported by GCC and is a way to improve diagnostic behavior for va_start in C23 mode. C23 no longer requires a second argument to the va_start macro in support of variadic functions with no leading parameters. However, we still want to diagnose passing more than two arguments, or diagnose when passing something other than the last parameter in the variadic function. This also updates the freestanding <stdarg.h> header to use the new builtin, same as how GCC works. Fixes #124031	2025-03-15 11:01:53 -04:00
Juan Manuel Martinez Caamaño	7decd04626	[Clang] Add __builtin_elementwise_exp10 in the same fashion as exp/exp2 (#130746 ) Clang has __builtin_elementwise_exp and __builtin_elementwise_exp2 intrinsics, but no __builtin_elementwise_exp10. There doesn't seem to be a good reason not to expose the exp10 flavour of this intrinsic too. This commit introduces this intrinsic following the same pattern as the exp and exp2 versions. Fixes: SWDEV-519541	2025-03-12 09:20:29 +01:00
Benjamin Maxwell	fb397ab1e5	Reland "[clang] Lower modf builtin using `llvm.modf` intrinsic" (#130761 ) Reverts `c40f0fe434` Original description: This updates the existing modf[f\|l] builtin to be lowered via the llvm.modf.* intrinsic (rather than directly to a library call). The Windows 32-bit x86 missing `modff` symbol issue should have been solved in: https://github.com/llvm/llvm-project/pull/130636.	2025-03-11 14:55:33 +00:00
Hans Wennborg	c40f0fe434	Revert "Reland "[clang] Lower modf builtin using `llvm.modf` intrinsic" (#129885 )" This broke modff calls on 32-bit x86 Windows. See comment on the PR. > This updates the existing modf[f\|l] builtin to be lowered via the > llvm.modf.* intrinsic (rather than directly to a library call). > > The legalization issues exposed by the original PR (#126750) should have > been fixed in #128055 and #129264. This reverts commit cd1d9a8fab05524a27ffdb251f6def37786b5cc1.	2025-03-10 16:35:03 +01:00
Chris B	e85e29c299	[HLSL] select scalar overloads for vector conditions (#129396 ) This PR adds scalar/vector overloads for vector conditions to the `select` builtin, and updates the sema checking and codegen to allow scalars to extend to vectors. Fixes #126570	2025-03-09 16:01:12 -05:00
Matt Arsenault	bfea84946d	clang: Hack around opencl enqueue_block using wrong ABI for aggregrate (#130011 ) EmitAggExprToLValue started wrapping the temporary alloca in an addrspacecast at some point. We take the direct type from this as the pointer argument for the runtime function type, but this isn't correct. Technically, we should be querying the target's ABI for what IR to produce for this sequence. The assumption seems to always have been that this will be indirectly passed with byval (or byref). I started working on a patch to go through the ABI handling, but it seems to require more time and/or clang expertise than I have at the moment.	2025-03-06 23:13:28 +07:00
Benjamin Maxwell	cd1d9a8fab	Reland "[clang] Lower modf builtin using `llvm.modf` intrinsic" (#129885 ) Reverts llvm/llvm-project#127987 Original description: This updates the existing modf[f\|l] builtin to be lowered via the llvm.modf.* intrinsic (rather than directly to a library call). The legalization issues exposed by the original PR (#126750) should have been fixed in #128055 and #129264.	2025-03-06 09:43:59 +00:00
Deric C.	b4ecebe745	[HLSL] [DXIL] Implement the AddUint64 HLSL function and the UAddc DXIL op (#127137 ) Fixes #99205. - Implements the HLSL intrinsic `AddUint64` used to perform unsigned 64-bit integer addition by using pairs of unsigned 32-bit integers instead of native 64-bit types - The LLVM intrinsic `uadd_with_overflow` is used in the implementation of `AddUint64` in `CGBuiltin.cpp` - The DXIL op `UAddc` was defined in `DXIL.td`, and a lowering of the LLVM intrinsic `uadd_with_overflow` to the `UAddc` DXIL op was implemented in `DXILOpLowering.cpp` Notes: - `__builtin_addc` was not able to be used to implement `AddUint64` in `hlsl_intrinsics.h` because its `CarryOut` argument is a pointer, and pointers are not supported in HLSL - A lowering of the LLVM intrinsic `uadd_with_overflow` to SPIR-V [already exists](https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/SPIRV/llvm-intrinsics/uadd.with.overflow.ll) - When lowering the LLVM intrinsic `uadd_with_overflow` to the `UAddc` DXIL op, the anonymous struct type `{ i32, i1 }` is replaced with a named struct type `%dx.types.i32c`. This aspect of the implementation may be changed when issue #113192 gets addressed - Fixes issues mentioned in the comments on the original PR #125319 --------- Co-authored-by: Finn Plummer <50529406+inbelic@users.noreply.github.com> Co-authored-by: Farzon Lotfi <farzonlotfi@microsoft.com> Co-authored-by: Chris B <beanz@abolishcrlf.org> Co-authored-by: Justin Bogner <mail@justinbogner.com>	2025-03-05 17:04:10 -08:00
YunQiang Su	4bd3427321	IRBuilder: Add FMFSource parameter to CreateMaxNum/CreateMinNum (#129173 ) In https://github.com/llvm/llvm-project/pull/112852, we claimed that llvm.minnum and llvm.maxnum should treat +0.0>-0.0, while libc doesn't require fmin(3)/fmax(3) for it. Let's add FMFSource parameter to CreateMaxNum and CreateMinNum, so that they can be used by CodeGenFunction::EmitBuiltinExpr of Clang.	2025-03-04 08:49:35 +08:00
metkarpoonam	23efe734fc	[HLSL] Add "or" intrinsic (#128979 ) Include HLSL or_intrinsic, add codegen in CGBuiltin, and the corresponding tests in or.hlsl. Additionally, incorporate logical-operator-errors to handle both 'and' and 'or' semantic diagnostics.	2025-02-28 13:54:13 -07:00
Virginia Cangelosi	2477f82db9	[clang] Update SVE load and store intrinsics to have FP8 variants (#126726 )	2025-02-28 14:20:59 +00:00
Heejin Ahn	40566fd674	[WebAssembly] Generate invokes with llvm.wasm.(re)throw (#128105 ) Even though `__builtin_wasm_throw`, which is lowered down to `llvm.wasm.throw`, throws, ```cpp try { __builtin_wasm_throw(0, obj); } catch (...) { } ``` does not generate `invoke`. This is because we have assumed the intrinsic cannot be invoked, which doesn't make much sense. (See #128104 for the historical context) #128104 made `llvm.wasm.throw` intrinsic invokable in the backend. This actually generates `invoke`s in Clang for `__builtin_wasm_throw`. While we're at it, this also generates `invoke`s for `__builtin_wasm_rethrow`, which is actually not used anywhere in C++ support. I haven't deleted it just in case in may have uses later. (For example, to support rethrow functionality that carries stack trace with exnref) Depends on #128104 for the CI to pass. Fixes #124710.	2025-02-25 14:12:35 -08:00
Deric Cheung	305d273894	Reland "[HLSL] Implement the reflect HLSL function" (#125599 ) This PR relands #122992. A reland was attempted before (#123853), but it [failed to pass the `sanitizer-aarch64-linux-bootstrap-hwasan` buildbot](https://github.com/llvm/llvm-project/pull/123853#issuecomment-2608389396) due to the test `llvm/test/CodeGen/SPIRV/opencl/reflect-error.ll` The issue has since been patched thanks to @vitalybuka, so the PR is safe to reland without any changes. See https://github.com/llvm/llvm-project/pull/125599#discussion_r1966650839 and https://github.com/llvm/llvm-project/pull/125599#discussion_r1966650839	2025-02-24 16:46:59 -08:00
Benjamin Maxwell	5a7ee431fd	[clang] Add `__builtin_sincospi` that lowers to `llvm.sincospi.` (#127065 ) This (only) adds the `__builtin` variant which lowers to the `llvm.sincospi.` intrinsic when `-fno-math-errno` is set.	2025-02-21 16:49:41 +00:00
Benjamin Maxwell	d595fc91ae	Revert "[clang] Lower modf builtin using `llvm.modf` intrinsic" (#127987 ) Reverts llvm/llvm-project#126750 Revering while I investigate: https://lab.llvm.org/buildbot/#/builders/72/builds/8406	2025-02-20 10:25:18 +00:00
Deric Cheung	1c762c288f	[HLSL] Implement the 'and' HLSL function (#127098 ) Addresses #125604 - Implements `and` as an HLSL builtin function - The `and` HLSL builtin function gets lowered to the the LLVM `and` instruction	2025-02-19 11:22:46 -08:00

1 2 3 4 5 ...

2168 Commits