llvm-project

Author	SHA1	Message	Date
Shilei Tian	ce01e4e2f6	[Clang][OpenCL][AMDGPU] Use `byref` for aggregate OpenCL kernel arguments (#134892 ) Due to a previous workaround allowing kernels to be called from other functions, Clang currently doesn't use the `byref` attribute for aggregate kernel arguments. The issue was recently resolved in https://github.com/llvm/llvm-project/pull/115821. With that fix, we can now enable the use of `byref` consistently across all languages. Co-authored-by: Matt Arsenault <Matthew.Arsenault@amd.com> Fixes SWDEV-247226. Co-authored-by: Matt Arsenault <Matthew.Arsenault@amd.com>	2025-04-13 10:17:55 -04:00
Yingwei Zheng	8b40a09bf5	[Clang][CodeGen][UBSan] Remove redundant `EmitCheckValue` calls. NFCI (#135141 ) `EmitCheckValue` is called inside `EmitCheck`: `b122956390/clang/lib/CodeGen/CGExpr.cpp (L3739)` The outside calls are redundant because `EmitCheckValue(EmitCheckValue(V))` always returns `EmitCheckValue(V)`. Required by https://github.com/llvm/llvm-project/pull/135135.	2025-04-12 15:35:45 +08:00
Shilei Tian	9e90e10e76	[AMDGPU][Clang] Add builtins for gfx12 ray tracing intrinsics (#135224 )	2025-04-11 09:33:32 -04:00
Yingwei Zheng	04c38981a9	[Clang][CodeGen] Do not set inbounds flag in `EmitMemberDataPointerAddress` when the base pointer is null (#130952 ) See also https://github.com/llvm/llvm-project/pull/130734 for the original motivation. This pattern (`container_of`) is also widely used by real-world programs. Examples: `1d89d7d5d7/llvm/include/llvm/IR/SymbolTableListTraits.h (L77-L87)` `a2a53cb728/src/util-inl.h (L134-L137)` https://github.com/search?q=%29nullptr-%3E&type=code	2025-04-11 10:51:08 +08:00
Yingwei Zheng	1711996805	[Clang][CodeGen] Do not set inbounds flag for struct GEP with null base pointers (#130734 ) In the LLVM middle-end we want to fold `gep inbounds null, idx -> null`: https://alive2.llvm.org/ce/z/5ZkPx- This pattern is common in real-world programs (https://github.com/dtcxzyw/llvm-opt-benchmark/pull/55#issuecomment-1870963906). Generally, it exists in some (actually) unreachable blocks, which is introduced by JumpThreading. However, some old-style offsetof macros are still widely used in real-world C/C++ code (e.g., hwloc/slurm/luajit). To avoid breaking existing code and inconvenience to downstream users, this patch removes the inbounds flag from the struct gep if the base pointer is null.	2025-04-11 09:04:23 +08:00
Oliver Hunt	1cd59264aa	[RFC] Initial implementation of P2719 (#113510 ) This is a basic implementation of P2719: "Type-aware allocation and deallocation functions" described at http://wg21.link/P2719 The proposal includes some more details but the basic change in functionality is the addition of support for an additional implicit parameter in operators `new` and `delete` to act as a type tag. Tag is of type `std::type_identity<T>` where T is the concrete type being allocated. So for example, a custom type specific allocator for `int` say can be provided by the declaration of void operator new(std::type_identity<int>, size_t, std::align_val_t); void operator delete(std::type_identity<int>, void, size_t, std::align_val_t); However this becomes more powerful by specifying templated declarations, for example template <typename T> void operator new(std::type_identity<T>, size_t, std::align_val_t); template <typename T> void operator delete(std::type_identity<T>, void, size_t, std::align_val_t);); Where the operators being resolved will be the concrete type being operated over (NB. A completely unconstrained global definition as above is not recommended as it triggers many problems similar to a general override of the global operators). These type aware operators can be declared as either free functions or in class, and can be specified with or without the other implicit parameters, with overload resolution performed according to the existing standard parameter prioritisation, only with type parameterised operators having higher precedence than non-type aware operators. The only exception is destroying_delete which for reasons discussed in the paper we do not support type-aware variants by default.	2025-04-10 17:13:10 -07:00
Deric C.	727f3921e7	[DirectX] Implement Shader Flags Analysis for ResMayNotAlias (#131070 ) Fixes #112270 Completed ACs: - `-res-may-alias` clang-dxc command-line option added - It inserts and sets a module metadata flag `dx.resmayalias` to 1 - Shader flag set appropriately: - The flag IS NOT set if DXIL Version <= 1.6 OR the command-line option `-res-may-alias` is specified - Otherwise the flag IS set when: - DXIL Version > 1.7 AND function uses UAVs, OR - DXIL Version <= 1.7 AND UAVs present globally - Add tests - Tests for Shader Models 6.6, 6.7, and 6.8 corresponding to DXIL Versions 1.6, 1.7, and 1.8 - Tests (`res-may-alias-0.ll`/`res-may-alias-1.ll`) for when the module metadata flag `dx.resmayalias` is set to 0 or 1 respectively - A frontend test (`res-may-alias.hlsl`) for testing that that the command-line option `-res-may-alias` inserts `dx.resmayalias` module metadata correctly	2025-04-10 16:06:48 -07:00
Farzon Lotfi	589e1c73d0	[HLSL] Add support for modulo of floating point scalar and vectors (#135125 ) fixes #135122 SemaExpr.cpp - Make all doubles fail. Add sema support for float scalars and vectors when language mode is HLSL. CGExprScalar.cpp - Allow emit frem when language mode is HLSL.	2025-04-10 14:27:49 -04:00
Aaron Ballman	5c8ba28c75	[C11] Implement WG14 N1285 (temporary lifetimes) (#133472 ) This feature largely models the same behavior as in C++11. It is technically a breaking change between C99 and C11, so the paper is not being backported to older language modes. One difference between C++ and C is that things which are rvalues in C are often lvalues in C++ (such as the result of a ternary operator or a comma operator). Fixes #96486	2025-04-10 08:12:14 -04:00
Nathan Gauër	a625bc60e2	[HLSL][SPIR-V] Add hlsl_private address space for SPIR-V (#133464 ) This is an alternative to https://github.com/llvm/llvm-project/pull/122103 In SPIR-V, private global variables have the Private storage class. This PR adds a new address space which allows frontend to emit variable with this storage class when targeting this backend. This is covered in this proposal: llvm/wg-hlsl@4c9e11a This PR will cause addrspacecast to show up in several cases, like class member functions or assignment. Those will have to be handled in the backend later on, particularly to fixup pointer storage classes in some functions. Before this change, global variable were emitted with the 'Function' storage class, which was wrong.	2025-04-10 10:55:10 +02:00
Yingwei Zheng	2257f51431	Revert "[Clang][CodeGen][UBSan] Add more precise attributes to recoverable ubsan handlers" (#135130 ) Reverts llvm/llvm-project#130990 Breaks buildbot https://lab.llvm.org/buildbot/#/builders/186/builds/8072	2025-04-10 13:15:55 +08:00
Yingwei Zheng	0283bb3afc	[Clang][CodeGen][UBSan] Add more precise attributes to recoverable ubsan handlers (#130990 ) This patch adds `memory(argmem: read, inaccessiblemem: readwrite) mustprogress` to recoverable ubsan handlers in order to unblock some memory/loop optimizations. It provides an average of 3% performance improvement on llvm-test-suite (except for 49 test failures due to ubsan diagnostics). Closes https://github.com/llvm/llvm-project/issues/130093.	2025-04-10 11:09:45 +08:00
Deric C.	747d4a952b	[DirectX] Implement UseNativeLowPrecision shader flag analysis (#134288 ) Fixes #112267 Implement the shader flag analysis to set the UseNativeLowPrecision DXIL module flag. The flag is only able to be set when the command-line flag `-enable-16bit-types` is passed to clang-dxc, or equivalently `-fnative-half-type` is passed to clang. When the command-line flag is passed, a module metadata flag called "dx.nativelowprec" is set to 1. The DXILShaderFlags shader flags analysis checks that the module metadata flag "dx.nativelowprec" is set to 1 and the DXIL Version is 1.2 or greater before setting the UseNativeLowPrecision DXIL module flag.	2025-04-09 18:14:23 -07:00
Yingwei Zheng	c0480738cb	[Clang][CodeGen] Respect -fwrapv-pointer when emitting struct GEPs (#134269 ) This patch turns off inbounds/nuw flags for member accesses when `-fwrapv-pointer` is set. Closes https://github.com/llvm/llvm-project/issues/132449. It is required by https://github.com/llvm/llvm-project/pull/130734.	2025-04-10 00:48:18 +08:00
Yaxun (Sam) Liu	d54c28b9c1	[HIP] use offload wrapper for non-device-only non-rdc (#132869 ) Currently HIP still uses offload bundler for non-rdc mode for the new offload driver. This patch switches to use offload wrapper for non-device-only non-rdc mode when new offload driver is enabled. This makes the rdc and non-rdc compilation more consistent and speeds up compilation since the offload wrapper supports parallel compilation for different GPU arch's. It is implemented by adding a linker wrapper action for each assemble action of input file. Linker wrapper action differentiates this special type of work vs normal linker wrapper work by the fle type. This type of work results in object instead of image. The linker wrapper adds "-r" for it and only includes the object file as input, not the host libraries. For device-only non-RDC mode, the new driver keeps the original behavior.	2025-04-09 09:13:21 -04:00
Alex MacLean	a6853cd9af	[NVPTX] Auto-Upgrade llvm.nvvm.atomic.load.{inc,dec}.32 (#134111 ) These intrinsics can be upgrade to an atomicrmw instruction.	2025-04-08 13:44:11 -07:00
Jakub Ficek	a5509d62a7	[clang] fp options fix for __builtin_convertvector (#134102 ) Add missing CGFPOptionsRAII for fptoi and itofp cases	2025-04-08 11:36:48 +02:00
Pedro Lobo	bb5006169f	[CodeGen] Change placeholder from `undef` to `poison` (#134731 ) Fill default values of a map with `poison` instead of `undef`. There should be no functional difference as the default values are overridden later.	2025-04-08 09:50:48 +01:00
Orlando Cazalet-Hyams	308654608c	[Clang][NFC] Move some static functions into CodeGenFunction (#134634 ) Patches in the Key Instructions (KeyInstr) stack need to access CGF in these functions. 2 CGF fields are passed to these functions already; at this point it felt natural to promote them to CGF methods.	2025-04-08 08:44:10 +01:00
Aniket Lal	642481a428	[Clang][OpenCL][AMDGPU] Allow a kernel to call another kernel (#115821 ) This feature is currently not supported in the compiler. To facilitate this we emit a stub version of each kernel function body with different name mangling scheme, and replaces the respective kernel call-sites appropriately. Fixes https://github.com/llvm/llvm-project/issues/60313 D120566 was an earlier attempt made to upstream a solution for this issue. --------- Co-authored-by: anikelal <anikelal@amd.com>	2025-04-08 10:29:30 +05:30
Sarah Spall	01bc672b8a	[HLSL] Desugar ConstantArrayType when calculating cbuffer field layout (#134683 ) When calculating the layout for a cbuffer field, if that field is a ConstantArrayType, desguar it before casting it to a ConstantArrayType. Closes #134668 --------- Co-authored-by: Eli Friedman <efriedma@quicinc.com>	2025-04-07 15:25:47 -07:00
Jan Leyonberg	fbc8335311	[MLIR][OpenMP] Add codegen for teams reductions (#133310 ) This patch adds the lowering of teams reductions from the omp dialect to LLVM-IR. Some minor cleanup was done in clang to remove an unused parameter.	2025-04-07 12:47:16 -04:00
Farzon Lotfi	16c84c4475	[DirectX] Add target builtins (#134439 ) - fixes #132303 - Moves dot2add from a language builtin to a target builtin. - Sets the scaffolding for Sema checks for DX builtins - Setup DirectX backend as able to have target builtins - Adds a DX TargetBuiltins emitter in `clang/lib/CodeGen/TargetBuiltins/DirectX.cpp`	2025-04-07 12:06:57 -04:00
Farzon Lotfi	82103dfae9	Revert "Reland [Clang][Cmake] fix libtool duplicate member name warnings" (#134656 ) Reverts llvm/llvm-project#133850	2025-04-07 10:00:53 -04:00
Farzon Lotfi	0d71d9ab28	Reland [Clang][Cmake] fix libtool duplicate member name warnings (#133850 ) fixes https://github.com/llvm/llvm-project/issues/133199 As of the third commit the fix to the linker missing references in `Targets/DirectX.cpp` found in https://github.com/llvm/llvm-project/pull/133776 was fixed by moving `HLSLBufferLayoutBuilder.cpp` to `clang/lib/CodeGen/Targets/`. It fixes the circular reference issue found in https://github.com/llvm/llvm-project/pull/133619 for all `-DBUILD_SHARED_LIBS=ON` builds by removing `target_link_libraries` from the sub directory cmake files. testing for amdgpu offload was done via `cmake -B ../llvm_amdgpu -S llvm -GNinja -C offload/cmake/caches/Offload.cmake -DCMAKE_BUILD_TYPE=Release` PR https://github.com/llvm/llvm-project/pull/132252 Created a second file that shared <TargetName>.cpp in clang/lib/CodeGen/CMakeLists.txt For example There were two AMDGPU.cpp's one in TargetBuiltins and the other in Targets. Even though these were in different directories libtool warns that it might not distinguish them because they share the same base name. There are two potential fixes. The easy fix is to rename one of them and keep one cmake file. That solution though doesn't future proof this problem in the event of a third <TargetName>.cpp and it seems teams want to just use the target name https://github.com/llvm/llvm-project/pull/132252#issuecomment-2758178483. The alternative fix that this PR went with is to seperate the cmake files into their own sub directories as static libs.	2025-04-07 09:53:07 -04:00
Jay Foad	e2fe78797f	[Clang] Use "syncscope" instead of "synchscope". NFC. (#134616 ) This matches the spelling of the keyword in LLVM IR.	2025-04-07 13:32:36 +01:00
Nikita Popov	87a4215ed1	[Clang] Always verify LLVM IR inputs (#134396 ) We get a lot of issues that basically boil down to "I passed malformed LLVM IR to clang and it crashed". Clang does not perform IR verification by default in (non-assertion-enabled) release builds, and that's sensible for IR that Clang itself produces, which is expected to always be valid. However, if people pass in their own handwritten IR, we should report if it is malformed, instead of crashing. We should also report it in a way that does not produce a crash trace and ask for a bug report, as currently happens in assertions-enabled builds. This aligns the behavior with how opt/llc work.	2025-04-07 09:18:47 +02:00
Mats Jun Larsen	a641910531	[clang][CGObjC] Remove unused ExternalProtocolPtrTy (NFC) (#133870 ) This function was previously used to get a type to the protocol that was used to bitcast the initializer of GenerateProtocol. This bitcast has later been removed (thanks to opaque pointers), but the member was left behind. History: - 020de3254acc3 used ExternalProtocolPtrTy - 34ee69b4ce662 removes the bitcast Also technically part of #123569	2025-04-05 09:01:36 +00:00
Florian Mayer	30f2e92c69	[clang] [sanitizer] predict trap checks succeed (#134310 ) Trap checks fail at most once (when the program crashes).	2025-04-04 10:58:08 -07:00
Mariya Podchishchaeva	22130ca486	[MS][clang] Fix crash on deletion of array of pointers (#134088 ) Sometimes a non-array delete is treated as delete[] when input pointer is pointer to array. With vector deleting destructors support we now generate a virtual destructor call instead of simple loop over the elements. This patch adjusts the codepath that generates virtual call to expect the case of pointer to array.	2025-04-04 09:37:28 +02:00
Mats Jun Larsen	d579622b1e	[clang][CGObjC] Prefer PointerType::get with LLVMContext over Type (NFC) (#133871 ) Part of #123569	2025-04-04 07:18:01 +00:00
Phoebe Wang	897f9a51b9	[X86][AVX10.2] Replace nepbh with bf16 to match with others, NFCI (#134240 )	2025-04-04 11:27:39 +08:00
NAKAMURA Takumi	4088c70f4e	CGHLSLBuiltins.cpp: Suppress a warning in #131237 [-Wunused-variable]	2025-04-04 11:05:46 +09:00
Sumit Agarwal	996cf5dc67	[HLSL] Implement dot2add intrinsic (#131237 ) Resolves #99221 Key points: For SPIRV backend, it decompose into a `dot` followed a `add`. - [x] Implement dot2add clang builtin, - [x] Link dot2add clang builtin with hlsl_intrinsics.h - [x] Add sema checks for dot2add to CheckHLSLBuiltinFunctionCall in SemaHLSL.cpp - [x] Add codegen for dot2add to EmitHLSLBuiltinExpr in CGBuiltin.cpp - [x] Add codegen tests to clang/test/CodeGenHLSL/builtins/dot2add.hlsl - [x] Add sema tests to clang/test/SemaHLSL/BuiltIns/dot2add-errors.hlsl - [x] Create the int_dx_dot2add intrinsic in IntrinsicsDirectX.td - [x] Create the DXILOpMapping of int_dx_dot2add to 162 in DXIL.td - [x] Create the dot2add.ll and dot2add_errors.ll tests in llvm/test/CodeGen/DirectX/	2025-04-03 16:23:09 -06:00
Andy Kaylor	13aac46332	[clang][NFC] Refactor CodeGen's hasBooleanRepresentation (#134159 ) The ClangIR upstreaming project needs the same logic for hasBooleanRepresentation() that is currently implemented in the standard clang codegen. In order to share this code, this change moves the implementation of this function into the AST Type class. No functional change is intended by this change. The ClangIR use of this function will be added separately in a later change.	2025-04-03 14:03:25 -07:00
gbMattN	59074a3760	[ASan] Add metadata to renamed instructions so ASan doesn't use the i… (#119387 ) …ncorrect name Clang needs variables to be represented with unique names. This means that if a variable shadows another, its given a different name internally to ensure it has a unique name. If ASan tries to use this name when printing an error, it will print the modified unique name, rather than the variable's source code name Fixes #47326	2025-04-03 15:27:14 +01:00
Yingwei Zheng	61907ebd76	[Clang][CodeGen] Do not use the GEP result to infer offset and result type (#134221 ) If `CreateConstInBoundsGEP2_32` returns a constant null/gep, the cast to GetElementPtrInst will fail. This patch uses two static helpers `GEPOperator::accumulateConstantOffset/GetElementPtrInst::getIndexedType` to infer offset and result type instead of depending on the GEP result. This patch is extracted from https://github.com/llvm/llvm-project/pull/130734.	2025-04-03 18:03:42 +08:00
Nikita Popov	b384d6d6cc	[CodeGen] Don't include CGDebugInfo.h in CodeGenFunction.h (NFC) (#134100 ) This is an expensive header, only include it where needed. Move some functions out of line to achieve that. This reduces time to build clang by ~0.5% in terms of instructions retired.	2025-04-03 08:04:19 +02:00
Sami Tolvanen	acc6bcdc50	Support alternative sections for patchable function entries (#131230 ) With -fpatchable-function-entry (or the patchable_function_entry function attribute), we emit records of patchable entry locations to the __patchable_function_entries section. Add an additional parameter to the command line option that allows one to specify a different default section name for the records, and an identical parameter to the function attribute that allows one to override the section used. The main use case for this change is the Linux kernel using prefix NOPs for ftrace, and thus depending on__patchable_function_entries to locate traceable functions. Functions that are not traceable currently disable entry NOPs using the function attribute, but this creates a compatibility issue with -fsanitize=kcfi, which expects all indirectly callable functions to have a type hash prefix at the same offset from the function entry. Adding a section parameter would allow the kernel to distinguish between traceable and non-traceable functions by adding entry records to separate sections while maintaining a stable function prefix layout for all functions. LKML discussion: https://lore.kernel.org/lkml/Y1QEzk%2FA41PKLEPe@hirez.programming.kicks-ass.net/	2025-04-02 21:53:55 +00:00
Sarah Spall	60efed3f20	[HLSL] Update __builtin_hlsl_dot builtin Sema Checking to fix error when passed an array literal 1u.xxxx (#133941 ) update dot builtin sema checking and codegen new test fix tests Closes #133659	2025-04-02 12:27:01 -07:00
Mariya Podchishchaeva	8a691cc615	[MS][clang] Make sure vector deleting dtor calls correct operator delete (#133950 ) During additional testing I spotted that vector deleting dtor calls operator delete, not operator delete[] when performing array deletion. This patch fixes that.	2025-04-02 09:25:43 +02:00
Steven Perron	16603d838c	[HLSL] Add SPIR-V target type for RWStructuredBuffers (#133468 ) This PR adds the target type for main storage for HLSL raw buffer types. It does not handle the counter variables that are associated with those buffers. This is implementing part of https://github.com/llvm/wg-hlsl/blob/main/proposals/0018-spirv-resource-representation.md. We do not handle other HLSL raw buffer types.	2025-04-01 16:59:46 -04:00
Zahira Ammarguellat	aa73124e51	Fix complex long double division with -mno-x87. (#133152 ) The combination of `-fcomplex-arithmetic=promoted` and `mno-x87` for `double` complex division is leading to a crash. See https://godbolt.org/z/189G957oY This patch fixes that.	2025-04-01 11:10:51 -04:00
Nathan Gauër	da5fb4213f	[Clang][SPIR-V] Fix convergence tokens for dtor (#133469 ) Destructor calls were emitted without convergence intrinsics when building for SPIR-V, which means invalid IR since we mixed controlled and non-controlled convergence.	2025-04-01 11:03:30 +02:00
Lukacma	6c3adaafe3	[AARCH64][Neon] switch to using bitcasts in arm_neon.h where appropriate (#127043 ) Currently arm_neon.h emits C-style casts to do vector type casts. This relies on implicit conversion between vector types to be enabled, which is currently deprecated behaviour and soon will disappear. To ensure NEON code will keep working afterwards, this patch changes all this vector type casts into bitcasts. Co-authored-by: Momchil Velikov <momchil.velikov@arm.com>	2025-04-01 09:45:16 +01:00
Farzon Lotfi	bdae91b08b	Revert "[Clang][Cmake] fix libtool duplicate member name warnings" (#133795 ) Reverts llvm/llvm-project#133619	2025-03-31 17:00:38 -04:00
Farzon Lotfi	cc2b432614	[Clang][Cmake] fix libtool duplicate member name warnings (#133619 ) fixes #133199 PR #132252 Created a second file that shared `<TargetName>.cpp` in `clang/lib/CodeGen/CMakeLists.txt` For example There were two `AMDGPU.cpp`'s one in `TargetBuiltins` and the other in `Targets`. Even though these were in different directories `libtool` warns that it might not distinguish them because they share the same base name. There are two potential fixes. The easy fix is to rename one of them and keep one cmake file. That solution though doesn't future proof this problem in the event of a third `<TargetName>.cpp` and it seems teams want to just use the target name https://github.com/llvm/llvm-project/pull/132252#issuecomment-2758178483. The alternative fix is to seperate the cmake files into their own sub directories. I chose to create static libraries. It might of been possible to build an OBJECT, but I only saw examples of this in compiler-rt and test directories so assumed there was a reason it wasn't used.	2025-03-31 14:21:22 -04:00
Helena Kotas	dcc2faecd8	[HLSL] Fix codegen to support classes in `cbuffer` (#132828 ) Fixes #132309	2025-03-31 10:05:59 -07:00
Alan Zhao	c5b3fe2094	[clang] Automatically add the `returns_twice` attribute to certain functions even if `-fno-builtin` is set (#133511 ) Certain functions require the `returns_twice` attribute in order to produce correct codegen. However, `-fno-builtin` removes all knowledge of functions that require this attribute, so this PR modifies Clang to add the `returns_twice` attribute even if `-fno-builtin` is set. This behavior is also consistent with what GCC does. It's not (easily) possible to get the builtin information from `Builtins.td` because `-fno-builtin` causes Clang to never initialize any builtins, so functions never get tokenized as functions/builtins that require `returns_twice`. Therefore, the most straightforward solution is to explicitly hard code the function names that require `returns_twice`. Fixes #122840	2025-03-31 09:42:34 -07:00
Rahul Joshi	74b7abf154	[IRBuilder] Add new overload for CreateIntrinsic (#131942 ) Add a new `CreateIntrinsic` overload with no `Types`, useful for creating calls to non-overloaded intrinsics that don't need additional mangling.	2025-03-31 08:10:34 -07:00

... 6 7 8 9 10 ...

18194 Commits