llvm-project

Author	SHA1	Message	Date
Jon Roelofs	b3f3c0c633	[clang][AArch64] Put soft-float ABI checks under isSoftFloat(). NFC	2024-09-11 13:37:45 -07:00
Helena Kotas	becb03f3c6	[DirectX] Add DirectXTargetCodeGenInfo (#104856 ) Adds target codegen info class for DirectX. For now it always translates `__hlsl_resource_t` handle to `target("dx.TypedBuffer", i32, 1, 0, 1)` (`RWBuffer<int>`). More work is needed to determine the actual target exp type and parameters based on the resource handle attributes. Part 1/2 of #95952	2024-09-10 12:41:08 -07:00
Lei Huang	ea9204505c	Fix codegen for transparent_union function params (#104816 ) Update codegen for func param with transparent_union attr to be that of the first union member. This is a followup to #101738 to fix non-ppc codegen and closes #76773.	2024-09-09 11:01:22 -04:00
Alex Voicu	ad435bcc14	[clang][CodeGen][SPIR-V][AMDGPU] Tweak AMDGCNSPIRV ABI to allow for the correct handling of aggregates passed to kernels / functions. (#102776 ) The AMDGPU kernel ABI is not directly representable in SPIR-V, since it relies on passing aggregates `byref`, and SPIR-V only encodes `byval` (which the AMDGPU BE disallows for kernel arguments). As a temporary solution to this mismatch, we add special handling for AMDGCN flavoured SPIR-V, whereby aggregates are passed as direct, both to kernels and to normal functions. This is not ideal (there are pathological cases where performance is heavily impacted), but empirically robust and guaranteed to work as the AMDGPU BE retains handling of `direct` passing for legacy reasons. We will revisit this in the future, but as it stands it is enough to pass a wide array of integration tests and generates correct SPIR-V and correct reverse translation into LLVM IR. The amdgpu-kernel-arg-pointer-type test is updated via the automated script, and thus becomes quite noisy.	2024-08-21 13:16:59 +01:00
Lei Huang	f95026dbf6	[PowerPC] Fix codegen for transparent_union function params (#101738 ) Update codegen for func param with transparent_union attr to be that of the first union member. PPC fix for: https://github.com/llvm/llvm-project/issues/76773	2024-08-19 12:17:44 -04:00
Jon Roelofs	019ef52275	[clang][AArch64] Point the nofp ABI check diagnostics at the callee (#103392 ) ... whereever we have the Decl for it, and even when we don't keep the SourceLocation of it aimed at the call site. Fixes: #102983	2024-08-14 07:38:14 -07:00
Longsheng Mou	a27f40e5d9	[X86_64] Fix empty field error in vaarg of C++. (#101639 ) Such struct types: ``` struct { struct{} a; long long b; }; stuct { struct{} a; double b; }; ``` For such structures, Lo is NoClass and Hi is Integer/SSE. And when this structure argument is passed, the high part is passed at offset 8 in memory. So we should do special handling for these types in EmitVAArg.Fix https://github.com/llvm/llvm-project/issues/79790 and fix https://github.com/llvm/llvm-project/issues/86371.	2024-08-13 11:35:23 +08:00
Vladislav Belov	635d20e9e7	[RISCV] full support for riscv_rvv_vector_bits attribute (#100110 ) Add support for using attribute((rvv_vector_bits(N))), when N < 8. It allows using all fixed length vector mask types regardless VLEN value.	2024-08-08 12:45:20 +03:00
Longsheng Mou	4461b69022	[X86_32][C++] fix 0 sized struct case in vaarg. (#86388 ) struct SuperEmpty { struct{ int a[0];} b;}; Such 0 sized structs in c++ mode can not be ignored in i386 for that c++ fields are never empty.But when EmitVAArg, its size is 0, so that va_list not increase.Maybe we can just Ignore this kind of arguments, like X86_64 did. Fix #86385.	2024-08-02 09:20:49 +08:00
Sander de Smalen	389679d5f9	Reland: "[Clang] Demote always_inline error to warning for mismatching SME attrs" (#100991 ) (#100996 ) Test `aarch64-sme-inline-streaming-attrs.c` caused some buildbot failures, because the test was missing a `REQUIRES: aarch64-registered target`. This was because we've demoted the error to a warning, which then resulted in a different error message, because Clang can't actually CodeGen the IR.	2024-07-29 11:23:25 +01:00
Sander de Smalen	e3a3397209	Revert "[Clang] Demote always_inline error to warning for mismatching SME attrs" (#100991 ) Reverts llvm/llvm-project#100740	2024-07-29 10:19:28 +01:00
Sander de Smalen	5430f73b50	[Clang] Demote always_inline error to warning for mismatching SME attrs (#100740 ) PR #77936 introduced a diagnostic to avoid calls being inlined into functions with a different streaming mode, because inlining those functions may result in different runtime behaviour. This was necessary because LLVM doesn't check whether inlining is possible and thus blindly inlines the function without checking feasibility. In practice however, this introduces an artificial restriction that the user may not be able to work around. Calling an `always_inline` function from some header file that is out of the control of the user would result in an error that the user cannot remedy. Therefore, this patch demotes the error into a warning (for calls from streaming[-compatible] -> non-streaming), but the proper fix would be to fix the AlwaysInliner in LLVM to avoid inlining when it has analyzed the callee and has determined that inlining is not possible. Calling an always_inline function for calls from non-streaming -> streaming will remain an error, because there is little pre-existing code for SME, so it is expected that the header file can be modified by the user (e.g. by using `__arm_streaming_compatible` if the code is claimed to be compatible).	2024-07-29 09:29:30 +01:00
Matt Arsenault	e108853ac8	clang: Allow targets to set custom metadata on atomics (#96906 ) Use this to replace the emission of the amdgpu-unsafe-fp-atomics attribute in favor of per-instruction metadata. In the future new fine grained controls should be introduced that also cover the integer cases. Add a wrapper around CreateAtomicRMW that appends the metadata, and update a few use contexts to use it.	2024-07-26 09:57:28 +04:00
James Y Knight	e59a619acf	Clang: don't unnecessarily convert inline-asm operands to x86mmx in IR. (#98273 ) The SelectionDAG asm-lowering code can already handle conversion of other vector types to MMX if needed.	2024-07-23 13:22:24 -04:00
Ulrich Weigand	9af3628ce7	[SystemZ] Fix transparent_union calling convention The SystemZ ABI code was missing code to handle the transparent_union extension. Arguments of such types are specified to be passed like the first member of the union, instead of according to the usual ABI calling convention for aggregates. This did not make much difference in practice as the SystemZ ABI already specifies that 1-, 2-, 4- or 8-byte aggregates are passed in registers. However, there is a difference if the first member of the transparent union is a scalar integer type smaller than word size - if passed as a scalar, it needs to be zero- or sign-extended to word size, while if passed as aggregate, it is not. Fixed by adding code to handle transparent_union similar to what is done on other targets.	2024-07-18 17:43:28 +02:00
Joseph Huber	486d00eca6	[NVPTX] Implement variadic functions using IR lowering (#96015 ) Summary: This patch implements support for variadic functions for NVPTX targets. The implementation here mainly follows what was done to implement it for AMDGPU in https://github.com/llvm/llvm-project/pull/93362. We change the NVPTX codegen to lower all variadic arguments to functions by-value. This creates a flattened set of arguments that the IR lowering pass converts into a struct with the proper alignment. The behavior of this function was determined by iteratively checking what the NVCC copmiler generates for its output. See examples like https://godbolt.org/z/KavfTGY93. I have noted the main methods that NVIDIA uses to lower variadic functions. 1. All arguments are passed in a pointer to aggregate. 2. The minimum alignment for a plain argument is 4 bytes. 3. Alignment is dictated by the underlying type 4. Structs are flattened and do not have their alignment changed. 5. NVPTX never passes any arguments indirectly, even very large ones. This patch passes the tests in the `libc` project currently, including support for `sprintf`.	2024-07-12 17:09:48 -05:00
Daniel Kiss	f34a1654d6	[NFC][Clang] Move set functions out BranchProtectionInfo. (#98451 ) To reduce build times move them to TargetCodeGenInfo. Refactor of #98329	2024-07-12 10:58:34 +02:00
Daniel Kiss	e03f66516d	[Clang][ARM] Call constructor on BranchTargetInfo. (#98307 ) Otherwise members will be uninitialised.	2024-07-10 17:26:55 +02:00
Daniel Kiss	1782810b84	[Clang][ARM][AArch64] Alway emit protection attributes for functions. (#82819 ) So far branch protection, sign return address, guarded control stack attributes are only emitted as module flags to indicate the functions need to be generated with those features. The problem is in case of an LTO build the module flags are merged with the `min` rule which means if one of the module is not build with sign return address then the features will be turned off for all functions. Due to the functions take the branch-protection and sign-return-address features from the module flags. The sign-return-address is function level option therefore it is expected functions from files that is compiled with -mbranch-protection=pac-ret to be protected. The inliner might inline functions with different set of flags as it doesn't consider the module flags. This patch adds the attributes to all functions and drops the checking of the module flags for the code generation. Module flag is still used for generating the ELF markers. Also drops the "true"/"false" values from the branch-protection-enforcement, branch-protection-pauth-lr, guarded-control-stack attributes as presence of the attribute means it is on absence means off and no other option. Releand with test fixes.	2024-07-10 11:32:41 +02:00
Daniel Kiss	4b2daeccc7	Revert "[Clang][ARM][AArch64] Alway emit protection attributes for functions." (#98284 ) Reverts llvm/llvm-project#82819	2024-07-10 10:22:38 +02:00
Daniel Kiss	e15d67cfc2	[Clang][ARM][AArch64] Alway emit protection attributes for functions. (#82819 ) So far branch protection, sign return address, guarded control stack attributes are only emitted as module flags to indicate the functions need to be generated with those features. The problem is in case of an LTO build the module flags are merged with the `min` rule which means if one of the module is not build with sign return address then the features will be turned off for all functions. Due to the functions take the branch-protection and sign-return-address features from the module flags. The sign-return-address is function level option therefore it is expected functions from files that is compiled with -mbranch-protection=pac-ret to be protected. The inliner might inline functions with different set of flags as it doesn't consider the module flags. This patch adds the attributes to all functions and drops the checking of the module flags for the code generation. Module flag is still used for generating the ELF markers. Also drops the "true"/"false" values from the branch-protection-enforcement, branch-protection-pauth-lr, guarded-control-stack attributes as presence of the attribute means it is on absence means off and no other option.	2024-07-10 10:06:14 +02:00
Sudharsan Veeravalli	d65f423202	[RISCV] Handle empty structs/unions passing in C++ (#97315 ) According to RISC-V integer calling convention empty structs or union arguments or return values are ignored by C compilers which support them as a non-standard extension. This is not the case for C++, which requires them to be sized types. Fixes #97285	2024-07-08 18:17:51 -07:00
Phoebe Wang	7e01e64714	[X86][vectorcall] Do not consume register for indirect return value (#97939 ) This is how MSVC handles it. https://godbolt.org/z/Eav3vx7cd	2024-07-08 21:23:08 +08:00
Tomas Matheson	fa6d38d61a	[AArch64][TargetParser] Split FMV and extensions (#92882 ) FMV extensions are really just mappings from FMV feature names to lists of backend features for codegen. Split them out into their own separate file.	2024-06-20 15:33:21 +01:00
Mariya Podchishchaeva	6d973b4548	[clang][CodeGen] Return RValue from `EmitVAArg` (#94635 ) This should simplify handling of resulting value by the callers.	2024-06-17 13:29:20 +02:00
Jon Chesterfield	8516f54e6a	[AMDGPU] Implement variadic functions by IR lowering (#93362 ) This is a mostly-target-independent variadic function optimisation and lowering pass. It is only enabled for AMDGPU in this initial commit. The purpose is to make C style variadic functions a zero cost abstraction. They are lowered to equivalent IR which is then amenable to other optimisations. This is inherently slightly target specific but much less so than one might expect - the C varargs interface heavily constrains the ABI design divergence. The pass is primarily tested from webassembly. This is because wasm has a straightforward variadic lowering strategy which coincides exactly with what this pass transforms code into and a struct passing convention with few cases to check. Adding further targets conventions is straightforward and elided from this patch primarily to simplify the review. Implemented in other branches are Linux X86, AMD64, AArch64 and NVPTX. Testing for targets that have existing lowering for va_arg from clang is most efficiently done by checking that clang \| opt completely elides the variadic syntax from test cases. The lowering produces a struct for each call site which can be inspected to check the various alignment and indirections are correct. AMDGPU presently has no variadic support other than some ad hoc printf handling. Combined with the pass being inactive on all other targets landing this represents strict increase in capability with zero risk. Testing and refining will continue post commit. In addition to the compiler tests included here, a self contained x64 clang/musl toolchain was constructed using the "lowering" instead of the systemv ABI and used to build various C programs like lua and libxml2.	2024-06-06 10:44:53 +01:00
Jon Chesterfield	794457f6f9	[amdgpu] Pass variadic arguments without splitting (#94083 ) Pass variadic arguments without changing their type, unlike the fixed ones. Fixed arguments are modified to better fit into registers. This patch leaves those unchanged. Splitting struct types into individual fields and packing small structs into integers works well for passing via registers. Variadic arguments are currently unimplemented in the backend. They're likely to be implemented as a pointer to stack memory in which case register-themed optimisations are inapplicable. Splitting the struct into fields makes it difficult to implement va_arg robustly. The rules around padding and alignment to inverse the struct splitting could be constructed, but at high complexity and no particular advantage. Passing types as-is means there is a 1:1 correspondence with the type information va_arg has to work with and the parameter type at the call site. This is an ABI change, but as the only functions affected are variadic ones which are presently a compilation error, not a functional break. Factored out of the larger #93362 and can land independently.	2024-06-04 13:10:10 +01:00
Jon Chesterfield	b2d7d72ff2	[AArch64] Use ptrmask for vaarg stack alignment (#92836 )	2024-05-21 03:22:20 +01:00
Ahmed Bougacha	3575d23ca8	[clang][CodeGen] Remove unused LValue::getAddress CGF arg. (#92465 ) This is in effect a revert of f139ae3d93797, as we have since gained a more sophisticated way of doing extra IRGen with the addition of RawAddress in #86923.	2024-05-20 10:23:04 -07:00
Florian Hahn	8a4cbeada9	[Clang] Unbreak build take 2 using uint64_t() explicitly.	2024-05-15 15:49:03 +01:00
Florian Hahn	da116bd82c	[Clang] Use ULL for std::max constant argument to fix build failure. getKnownMinValue returns uint64_t, use ULL to make sure the second arg is also 64 bit.	2024-05-15 15:37:52 +01:00
Koakuma	c2fba6df94	[clang][SPARC] Treat empty structs as if it's a one-bit type in the CC (#90338 ) Make sure that empty structs are treated as if it has a size of one bit in function parameters and return types so that it occupies a full argument and/or return register slot. This fixes crashes and miscompilations when passing and/or returning empty structs. Reviewed by: @s-barannikov	2024-05-15 20:49:28 +07:00
Lukacma	421862f8e4	[Clang] Fix incorrect passing of _BitInt args (#90741 ) This patch removes incorrect `byval` attribute from pointer argument passed with >128 bit long _BitInt types.	2024-05-15 10:51:32 +01:00
Phoebe Wang	5bde8017a1	[X86][vectorcall] Pass built types byval when xmm0~6 exhausted (#91846 ) This is how MSVC handles it. https://godbolt.org/z/fG386bjnf	2024-05-13 08:31:49 +08:00
Sander de Smalen	6a6fcbffbb	[Clang][AArch64] NFC: Add IsArmStreamingFunction. Simple refactoring to make a single interface that checks if a FunctionDecl is a __arm[_locally]_streaming function.	2024-05-07 15:33:24 +00:00
ostannard	1fd196c8df	[AArch64] Diagnose more functions when FP not enabled (#90832 ) When using a hard-float ABI for a target without FP registers, it's not possible to correctly generate code for functions with arguments which must be passed in floating-point registers. This is diagnosed in CodeGen instead of Sema, to more closely match GCC's behaviour around inline functions, which is relied on by the Linux kernel. Previously, this only checked function signatures as they were code-generated, but this missed some cases: * Calls to functions not defined in this translation unit. * Calls through function pointers. * Calls to variadic functions, where the variadic arguments have a floating-point type. This adds checks to function calls, as well as definitions, so that these cases are correctly diagnosed.	2024-05-07 09:17:05 +01:00
Timm Baeder	3d56ea05b6	[clang][NFC] Fix FieldDecl::isUnnamedBitfield() capitalization (#89048 ) We always capitalize bitfield as "BitField".	2024-04-18 07:39:29 +02:00
Longsheng Mou	000f2b5163	[X86_64] fix arg pass error in struct. (#86902 ) ``` typedef long long t67 __attribute__((aligned (4))); struct s67 { int a; t67 b; }; void f67(struct s67 x) { } ``` When classify: a: Lo = Integer, Hi = NoClass b: Lo = Integer, Hi = NoClass struct S: Lo = Integer, Hi = NoClass ``` define dso_local void @f67(i64 %x.coerce) { ``` In this case, only one i64 register is used when the structure parameter is transferred, which is obviously incorrect.So we need to treat the split case specially. fix https://github.com/llvm/llvm-project/issues/85387.	2024-04-09 19:57:35 -07:00
Longsheng Mou	956b47b486	[X86_32] Teach X86_32 va_arg to ignore empty structs. (#86075 ) Empty structs are ignored for parameter passing purposes, but va_arg was incrementing the pointer anyway for that the size of empty struct in c++ is 1 byte, which could lead to va_list getting out of sync. Fix #86057.	2024-04-03 19:12:12 +08:00
Akira Hatanaka	84780af4b0	[CodeGen][arm64e] Add methods and data members to Address, which are needed to authenticate signed pointers (#86923 ) To authenticate pointers, CodeGen needs access to the key and discriminators that were used to sign the pointer. That information is sometimes known from the context, but not always, which is why `Address` needs to hold that information. This patch adds methods and data members to `Address`, which will be needed in subsequent patches to authenticate signed pointers, and uses the newly added methods throughout CodeGen. Although this patch isn't strictly NFC as it causes CodeGen to use different code paths in some cases (e.g., `mergeAddressesInConditionalExpr`), it doesn't cause any changes in functionality as it doesn't add any information needed for authentication. In addition to the changes mentioned above, this patch introduces class `RawAddress`, which contains a pointer that we know is unsigned, and adds several new functions for creating `Address` and `LValue` objects. This reapplies d9a685a9dd589486e882b722e513ee7b8c84870c, which was reverted because it broke ubsan bots. There seems to be a bug in coroutine code-gen, which is causing EmitTypeCheck to use the wrong alignment. For now, pass alignment zero to EmitTypeCheck so that it can compute the correct alignment based on the passed type (see function EmitCXXMemberOrOperatorMemberCallExpr).	2024-03-28 06:54:36 -07:00
smanna12	1095f71bdf	[NFC][Clang] Fix potential dereferencing of nullptr (#86759 ) This patch replaces dyn_cast<> with cast<> to resolve potential static analyzer bugs for 1. Dereferencing a pointer issue with nullptr GVar when calling addAttribute() in AIXTargetCodeGenInfo::setTargetAttributes(clang::Decl const , llvm::GlobalValue , clang::CodeGen::CodeGenModule &). 2. Dereferencing a pointer issue with nullptr GG when calling getCorrespondingConstructor() in DeclareImplicitDeductionGuidesForTypeAlias(clang::Sema &, clang::TypeAliasTemplateDecl *, clang::SourceLocation). 3. Dereferencing a pointer issue with nullptr CurrentBT when calling getKind() in ComplexExprEmitter::GetHigherPrecisionFPType(clang::QualType).	2024-03-27 20:20:22 -05:00
Akira Hatanaka	f75eebab88	Revert "[CodeGen][arm64e] Add methods and data members to Address, which are needed to authenticate signed pointers (#86721 )" (#86898 ) This reverts commit d9a685a9dd589486e882b722e513ee7b8c84870c. The commit broke ubsan bots.	2024-03-27 18:14:04 -07:00
Akira Hatanaka	d9a685a9dd	[CodeGen][arm64e] Add methods and data members to Address, which are needed to authenticate signed pointers (#86721 ) To authenticate pointers, CodeGen needs access to the key and discriminators that were used to sign the pointer. That information is sometimes known from the context, but not always, which is why `Address` needs to hold that information. This patch adds methods and data members to `Address`, which will be needed in subsequent patches to authenticate signed pointers, and uses the newly added methods throughout CodeGen. Although this patch isn't strictly NFC as it causes CodeGen to use different code paths in some cases (e.g., `mergeAddressesInConditionalExpr`), it doesn't cause any changes in functionality as it doesn't add any information needed for authentication. In addition to the changes mentioned above, this patch introduces class `RawAddress`, which contains a pointer that we know is unsigned, and adds several new functions for creating `Address` and `LValue` objects. This reapplies 8bd1f9116aab879183f34707e6d21c7051d083b6. The commit broke msan bots because LValue::IsKnownNonNull was uninitialized.	2024-03-27 12:24:49 -07:00
Chris B	28ddbd4a86	[NFC] Refactor ConstantArrayType size storage (#85716 ) In PR #79382, I need to add a new type that derives from ConstantArrayType. This means that ConstantArrayType can no longer use `llvm::TrailingObjects` to store the trailing optional Expr*. This change refactors ConstantArrayType to store a 60-bit integer and 4-bits for the integer size in bytes. This replaces the APInt field previously in the type but preserves enough information to recreate it where needed. To reduce the number of places where the APInt is re-constructed I've also added some helper methods to the ConstantArrayType to allow some common use cases that operate on either the stored small integer or the APInt as appropriate. Resolves #85124.	2024-03-26 14:15:56 -05:00
Akira Hatanaka	b311756450	Revert "[CodeGen][arm64e] Add methods and data members to Address, which are needed to authenticate signed pointers (#67454 )" (#86674 ) This reverts commit 8bd1f9116aab879183f34707e6d21c7051d083b6. It appears that the commit broke msan bots.	2024-03-26 07:37:57 -07:00
Longsheng Mou	9c8dd5e6f6	[X86_64] fix SSE type error in vaarg. (#86377 ) tweak the position of the ++neededSSE when Lo is NoClass and Hi is SSE. Fix #86371.	2024-03-26 09:19:42 +08:00
Akira Hatanaka	8bd1f9116a	[CodeGen][arm64e] Add methods and data members to Address, which are needed to authenticate signed pointers (#67454 ) To authenticate pointers, CodeGen needs access to the key and discriminators that were used to sign the pointer. That information is sometimes known from the context, but not always, which is why `Address` needs to hold that information. This patch adds methods and data members to `Address`, which will be needed in subsequent patches to authenticate signed pointers, and uses the newly added methods throughout CodeGen. Although this patch isn't strictly NFC as it causes CodeGen to use different code paths in some cases (e.g., `mergeAddressesInConditionalExpr`), it doesn't cause any changes in functionality as it doesn't add any information needed for authentication. In addition to the changes mentioned above, this patch introduces class `RawAddress`, which contains a pointer that we know is unsigned, and adds several new functions for creating `Address` and `LValue` objects.	2024-03-25 18:05:42 -07:00
hstk30-hw	631248dcd2	[X86_64] fix empty structure vaarg in c++ (#77907 ) SizeInBytes of empty structure is 0 in C, while 1 in C++. And empty structure argument of the function is ignored in X86_64 backend.As a result, the value of variable arguments in C++ is incorrect. fix #77036 Co-authored-by: Longsheng Mou <moulongsheng@huawei.com>	2024-03-21 09:25:24 +08:00
ostannard	ef395a492a	[AArch64] Add soft-float ABI (#84146 ) This is re-working of #74460, which adds a soft-float ABI for AArch64. That was reverted because it causes errors when building the linux and fuchsia kernels. The problem is that GCC's implementation of the ABI compatibility checks when using the hard-float ABI on a target without FP registers does it's checks after optimisation. The previous version of this patch reported errors for all uses of floating-point types, which is stricter than what GCC does in practice. This changes two things compared to the first version: * Only check the types of function arguments and returns, not the types of other values. This is more relaxed than GCC, while still guaranteeing ABI compatibility. * Move the check from Sema to CodeGen, so that inline functions are only checked if they are actually used. There are some cases in the linux kernel which depend on this behaviour of GCC.	2024-03-19 13:58:51 +00:00
Kuba (Brecka) Mracek	b84ce99799	[clang] Define SwiftInfo for RISCVTargetCodeGenInfo (#82152 ) For Embedded Swift, let's unblock building for RISC-V boards (e.g. ESP32-C6). This isn't trying to add full RISC-V support to Swift / Embedded Swift, it's just fixing the immediate blocker (not having SwiftInfo defined blocks all compilations).	2024-03-13 20:04:30 -07:00

1 2 3 4 5

221 Commits