llvm-project

Author	SHA1	Message	Date
Weining Lu	e7a8a7d497	[Clang][LoongArch] Fix ABI handling of empty structs in C++ to match GCC behaviour GCC doesn't ignore non-zero-length array of empty structures in C++ while clang does. What this patch did is to match GCC's behaviour although this rule is not documented in psABI. Similar to D142327 for RISCV. Reviewed By: xry111, xen0n Differential Revision: https://reviews.llvm.org/D156116	2023-08-08 09:07:11 +08:00
Alex Bradbury	e3c57fdd84	[clang][RISCV] Fix bug in ABI handling of empty structs with hard FP calling conventions in C++ As reported in <https://github.com/llvm/llvm-project/issues/58929>, Clang's handling of empty structs in the case of small structs that may be eligible to be passed using the hard FP calling convention doesn't match g++. In general, C++ record fields are never empty unless [[no_unique_address]] is used, but the RISC-V FP ABI overrides this. After this patch, fields of structs that contain empty records will be ignored, even in C++, when considering eligibility for the FP calling convention ('flattening'). It isn't explicitly noted in the RISC-V psABI, but arrays of empty records will disqualify a struct for consideration of using the FP calling convention in g++. This patch matches that behaviour. The psABI issue <https://github.com/riscv-non-isa/riscv-elf-psabi-doc/issues/358> seeks to clarify this. This patch was previously committed but reverted after a bug was found. This recommit adds additional logic to prevent that bug (adding an extra check for when a candidate from detectFPCCEligibleStructHelper may not be valid). Differential Revision: https://reviews.llvm.org/D142327	2023-08-07 10:45:22 +01:00
wangpc	edb5056300	[RISCV] Use correct LMUL!=1 types for __attribute__((riscv_rvv_vector_bits(N))) We used to convert them to M1 types in arguments and return value, which causes failures in CodeGen since it is not legal to insert subvectors with LMUL>1 to M1 vectors. Fixes 64266 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D156779	2023-08-02 01:21:14 +08:00
Yaxun (Sam) Liu	ac72531043	[Driver] Add `-f[no-]offload-uniform-block` By default, clang assumes HIP kernels are launched with uniform block size, which is the case for kernels launched through triple chevron or hipLaunchKernelGGL. Clang adds uniform-work-group-size function attribute to HIP kernels to allow the backend to do optimizations on that. However, in some rare cases, HIP kernels can be launched through hipExtModuleLaunchKernel where global work size is specified, which may result in non-uniform block size. To be able to support non-uniform block size for HIP kernels, an option `-f[no-]offload-uniform-block is added. This option is generic for offloading languages. Its default value is on for CUDA/HIP and off otherwise. Make -cl-uniform-work-group-size an alias to -foffload-uniform-block. Reviewed by: Siu Chi Chan, Matt Arsenault, Fangrui Song, Johannes Doerfert Differential Revision: https://reviews.llvm.org/D155213 Fixes: SWDEV-406592	2023-07-27 16:36:02 -04:00
Amy Huang	27dab4d305	Reland "Try to implement lambdas with inalloca parameters by forwarding without use of inallocas."t This reverts commit 8ed7aa59f489715d39d32e72a787b8e75cfda151. Differential Revision: https://reviews.llvm.org/D154007	2023-07-26 16:13:36 -07:00
Jirui Wu	c9de04ea64	[ARM] Fixing ABI mismatch for packed structs passed as function arguments Previously when a packed struct, containing vector data types such as uint16x8_t, is passed as a function argument, the alignment of the struct used by the function caller and the alignment used by the callee to load the argument from stack does not match. This patch implements section 6.8.2, stage C.4 of the Procedure Call Standard for the Arm 64-bit Architecture (AAPCS64): "If the argument is an HFA, an HVA, a Quad-precision Floating-point or short vector type then the NSAA is rounded up to the next multiple of 8 if its natural alignment is ≤ 8 or the next multiple of 16 if its natural alignment is ≥ 16." This ensures the alignments of the packed structs used as function arguments are the same as described in the AAPCS64 for both the caller and callee. Reference: AAPCS64 (https://github.com/ARM-software/abi-aa/blob/latest-release/aapcs64/aapcs64.rst) Reviewed By: olista01, rjmccall, tmatheson Differential Revision: https://reviews.llvm.org/D146242	2023-07-26 17:33:06 +01:00
Johannes Doerfert	08a220764b	Reapply "[OpenMP] Add the `ompx_attribute` clause for target directives" This reverts commit 0d12683046ca75fb08e285f4622f2af5c82609dc and reapplies ef9ec4bbcca2fa4f64df47bc426f1d1c59ea47e2 with an extension to fix the Flang build. Differential Revision: https://reviews.llvm.org/D156184	2023-07-25 10:40:35 -07:00
Aaron Ballman	0d12683046	Revert "[OpenMP] Add the `ompx_attribute` clause for target directives" This reverts commit ef9ec4bbcca2fa4f64df47bc426f1d1c59ea47e2. The changes broke several bots: https://lab.llvm.org/buildbot/#/builders/176/builds/3408 https://lab.llvm.org/buildbot/#/builders/198/builds/4028 https://lab.llvm.org/buildbot/#/builders/197/builds/8491 https://lab.llvm.org/buildbot/#/builders/197/builds/8491	2023-07-25 07:57:36 -04:00
Johannes Doerfert	ef9ec4bbcc	[OpenMP] Add the `ompx_attribute` clause for target directives CUDA and HIP have kernel attributes to tune the code generation (in the backend). To reuse this functionality for OpenMP target regions we introduce the `ompx_attribute` clause that takes these kernel attributes and emits code as if they had been attached to the kernel fuction (which is implicitly generated). To limit the impact, we only support three kernel attributes: `amdgpu_waves_per_eu`, for AMDGPU `amdgpu_flat_work_group_size`, for AMDGPU `launch_bounds`, for NVPTX The existing implementations of those attributes are used for error checking and code generation. `ompx_attribute` can be attached to any executable target region and it can hold more than one kernel attribute. Differential Revision: https://reviews.llvm.org/D156184	2023-07-24 22:04:45 -07:00
Weining Lu	983f248832	Revert "[Clang][LoongArch] Fix ABI handling of empty structs in C++ to match GCC behaviour" This reverts commit c299efbf284fae92bb8d7d29110f76912f089daf. Reason to revert: D142327 has been reverted which is depended by this change.	2023-07-25 09:18:06 +08:00
Weining Lu	c299efbf28	[Clang][LoongArch] Fix ABI handling of empty structs in C++ to match GCC behaviour GCC doesn't ignore non-zero-length array of empty structures in C++ while clang does. What this patch did is to match GCC's behaviour although this rule is not documented in psABI. Similar to D142327 for RISCV. Reviewed By: xry111, xen0n Differential Revision: https://reviews.llvm.org/D156116	2023-07-25 08:59:40 +08:00
Alex Bradbury	0fa004e072	Revert "[clang][RISCV] Fix ABI handling of empty structs with hard FP calling conventions in C++" This reverts commit 17a58b3ca7ec18585e9ea8ed8b39d72fe36fb6cb and the minor documentation fix 569e99a471f618b7fdf045d5e96f21d3e3a7f898. An issue was reported in https://reviews.llvm.org/D142327#inline-1510301 so reverting until it can be investigated and fixed.	2023-07-24 16:58:48 +01:00
Alex Bradbury	17a58b3ca7	[clang][RISCV] Fix ABI handling of empty structs with hard FP calling conventions in C++ As reported in <https://github.com/llvm/llvm-project/issues/58929>, Clang's handling of empty structs in the case of small structs that may be eligible to be passed using the hard FP calling convention doesn't match g++. In general, C++ record fields are never empty unless [[no_unique_address]] is used, but the RISC-V FP ABI overrides this. After this patch, fields of structs that contain empty records will be ignored, even in C++, when considering eligibility for the FP calling convention ('flattening'). See also the relevant psABI issue <https://github.com/riscv-non-isa/riscv-elf-psabi-doc/issues/358> which seeks to clarify the documentation. Fixes https://github.com/llvm/llvm-project/issues/58929 Differential Revision: https://reviews.llvm.org/D142327	2023-07-24 10:24:34 +01:00
Sergio Afonso	63ca93c7d1	[OpenMP][OMPIRBuilder] Rename IsEmbedded and IsTargetCodegen flags This patch renames the `OpenMPIRBuilderConfig` flags to reduce confusion over their meaning. `IsTargetCodegen` becomes `IsGPU`, whereas `IsEmbedded` becomes `IsTargetDevice`. The `-fopenmp-is-device` compiler option is also renamed to `-fopenmp-is-target-device` and the `omp.is_device` MLIR attribute is renamed to `omp.is_target_device`. Getters and setters of all these renamed properties are also updated accordingly. Many unit tests have been updated to use the new names, but an alias for the `-fopenmp-is-device` option is created so that external programs do not stop working after the name change. `IsGPU` is set when the target triple is AMDGCN or NVIDIA PTX, and it is only valid if `IsTargetDevice` is specified as well. `IsTargetDevice` is set by the `-fopenmp-is-target-device` compiler frontend option, which is only added to the OpenMP device invocation for offloading-enabled programs. Differential Revision: https://reviews.llvm.org/D154591	2023-07-10 14:14:16 +01:00
Youngsuk Kim	6f986bffc5	[clang] Remove CGBuilderTy::CreateElementBitCast `CGBuilderTy::CreateElementBitCast()` no longer does what its name suggests. Remove remaining in-tree uses by one of the following methods. * drop the call entirely * fold it to an `Address` construction * replace it with `Address::withElementType()` This is a NFC cleanup effort. Reviewed By: barannikov88, nikic, jrtc27 Differential Revision: https://reviews.llvm.org/D154285	2023-07-02 10:40:16 -04:00
Youngsuk Kim	5f32baf17d	[clang] Replace uses of CreateElementBitCast (NFC) Partial progress towards replacing uses of CreateElementBitCast, as it no longer does what its name suggests. Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D154229	2023-06-30 17:35:36 -04:00
Youngsuk Kim	474ec69419	[clang] Replace uses of CGBuilderTy::CreateElementBitCast (NFC) Partial progress towards replacing `CreateElementBitCast`, as it no longer does what its name suggests. Either replace its uses with `Address::withElementType()`, or remove them if no longer needed. Reviewed By: barannikov88, nikic Differential Revision: https://reviews.llvm.org/D153314	2023-06-27 10:38:54 -04:00
Amy Huang	8ed7aa59f4	Revert "Try to implement lambdas with inalloca parameters by forwarding without use of inallocas." Causes a clang crash (see crbug.com/1457256). This reverts commit 015049338d7e8e0e81f2ad2f94e5a43e2e3f5220.	2023-06-22 11:42:33 -07:00
Amy Huang	015049338d	Try to implement lambdas with inalloca parameters by forwarding without use of inallocas. Differential Revision: https://reviews.llvm.org/D137872	2023-06-20 17:30:20 -07:00
Youngsuk Kim	44e63ffe2b	[clang] Replace uses of CGBuilderTy::CreateElementBitCast (NFC) * Add `Address::withElementType()` as a replacement for `CGBuilderTy::CreateElementBitCast`. * Partial progress towards replacing `CreateElementBitCast`, as it no longer does what its name suggests. Either replace its uses with `Address::withElementType()`, or remove them if no longer needed. * Remove unused parameter 'Name' of `CreateElementBitCast` Reviewed By: barannikov88, nikic Differential Revision: https://reviews.llvm.org/D153196	2023-06-18 04:13:15 +03:00
Sergei Barannikov	992cb98462	[clang][CodeGen] Break up TargetInfo.cpp [8/8] This commit breaks up CodeGen/TargetInfo.cpp into a set of .cpp files, one file per target. There are no functional changes, mostly just code moving. Non-code-moving changes are: A virtual destructor has been added to DefaultABIInfo to pin the vtable to a cpp file. * A few methods of ABIInfo and DefaultABIInfo were split into declaration + definition in order to reduce the number of transitive includes. * Several functions that used to be static have been placed in clang::CodeGen namespace so that they can be accessed from other cpp files. RFC: https://discourse.llvm.org/t/rfc-splitting-clangs-targetinfo-cpp/69883 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D148094	2023-06-17 07:14:50 +03:00

1 2 3 4 5

221 Commits