llvm-project

Author	SHA1	Message	Date
CarolineConcatto	aaba8406c5	[NFC][Clang][AArch64]Refactor implementation of Neon vectors MFloat8… (#114804 ) …x8 and MFloat8x16 This patch adds MFloat8 as a TypeFlag and Kind on Neon to generate the typedefs using emitNeonTypeDefs. It does not need any change in Clang, because SEMA and CodeGen use the Builtins defined in AArch64SVEACLETypes.def	2024-11-21 10:29:28 +00:00
Rahul Joshi	63aa8cf6be	[NFC][Clang][TableGen] Fix file header comments (#116491 )	2024-11-17 07:54:10 -08:00
CarolineConcatto	91aad9bfb2	[Clang][AArch64]Fix Name and Mangle name for scalar fp8 (#114983 ) The scalar __mfp8 type has the wrong name and mangle name in AArch64SVEACLETypes.def According to the ACLE[1] the name should be __mfp8 This patch fixes this problem by replacing the Name __MFloat8_t by __mfp8 and the Mangle Name __MFloat8_t by u6__mfp8 And we revert the incorrect typedef in NeonEmitter. [1]https://github.com/ARM-software/acle	2024-11-15 09:19:39 +00:00
Kazu Hirata	173529104d	[TableGen] Use heterogenous lookups with std::map (NFC) (#115682 ) Heterogenous lookups allow us to call find with StringRef, avoiding a temporary heap allocation of std::string.	2024-11-11 07:34:42 -08:00
Kazu Hirata	a44ee8ec1c	[TableGen] Use heterogenous lookups with std::map (NFC) (#115633 ) Heterogenous lookups allow us to call find with StringRef, avoiding a temporary heap allocation of std::string.	2024-11-10 07:24:27 -08:00
Momchil Velikov	1df5c94343	[AArch64] Implement FP8 floating-point mode helper intrinsics (#100608 ) Implement FP8 mode helper intrinsics (as inline functions) as specified in ACLE 2024Q3 "14.2 Helper intrinsics" https://github.com/ARM-software/acle/releases/download/r2024Q3/acle-2024Q3.pdf	2024-10-28 11:22:38 +00:00
CarolineConcatto	49940514e2	[CLANG][AArch64] Add the modal 8 bit floating-point scalar type (#97277 ) ARM ACLE PR#323[1] adds new modal types for 8-bit floating point intrinsic. From the PR#323: ``` ACLE defines the `__mfp8` type, which can be used for the E5M2 and E4M3 8-bit floating-point formats. It is a storage and interchange only type with no arithmetic operations other than intrinsic calls. ```` The type should be an opaque type and its format in undefined in Clang. Only defined in the backend by a status/format register, for AArch64 the FPMR. This patch is an attempt to the add the mfloat8_t scalar type. It has a parser and codegen for the new scalar type. The patch it is lowering to and 8bit unsigned as it has no format. But maybe we should add another opaque type. [1] https://github.com/ARM-software/acle/pull/323	2024-10-25 13:59:46 +01:00
Jay Foad	4dd55c567a	[clang] Use {} instead of std::nullopt to initialize empty ArrayRef (#109399 ) Follow up to #109133.	2024-10-24 10:23:40 +01:00
CarolineConcatto	6dad29aebc	[CLANG][AArch64]Add Neon vectors for mfloat8_t (#99865 ) This patch adds these new vector sizes for neon: mfloat8x16_t and mfloat8x8_t According to the ARM ACLE PR#323[1]. [1] ARM-software/acle#323	2024-10-23 13:23:18 +01:00
Rahul Joshi	9b422d14f3	[Clang][TableGen] Use const pointers for various Init objects in NeonEmitter (#112317 ) Use const pointers for various Init objects in NeonEmitter. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-10-15 15:48:42 -07:00
Rahul Joshi	e9dbdb20f2	[Clang][TableGen] Change NeonEmitter to use const Record * (#110597 ) This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-10-01 10:47:09 -07:00
Rahul Joshi	0e948bfd31	[NFC][clang][TableGen] Remove redundant llvm:: namespace qualifier (#108627 ) Remove llvm:: from .cpp files, and add "using namespace llvm" if needed.	2024-09-16 06:35:34 -07:00
Kazu Hirata	bae275f65e	[TableGen] Avoid repeated map lookups (NFC) (#108675 )	2024-09-14 07:39:00 -07:00
Rahul Joshi	a4b1617368	[clang][TableGen] Change NeonEmitter to use const RecordKeeper (#108501 ) Change NeonEmitter to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-09-13 07:52:37 -07:00
Rahul Joshi	1651014960	[TableGen] Change SetTheory set/vec to use const Record * (#107692 ) Change SetTheory::RecSet/RecVec to use const Record pointers.	2024-09-09 08:47:42 -07:00
SpencerAbson	1f70fcefa9	[Clang][AArch64] Add customisable immediate range checking to NEON (#100278 ) This patch moves NEON immediate argument specification and checking to the system currently shared by both SVE and SME. In its current form, the TableGen definition of a NEON intrinsic cannot control how its immediate arguments are range-checked, this information must be inferred from the name of the intrinsic by NeonEmitter, which also assumes that any NEON instruction will only ever receive a single immediate argument. For SVE/SME instrinsics, this information is more conveniently supplied in the TableGen definition. As a result, for each immediate argument, NEON instructions must define - The index of the immediate argument to be checked - The type of immediate range check to be performed, (e.g., ImmCheckShiftRight) - The index of the argument whose type defines the context of this immediate check (base type, vector size). - Difference from SVE/SME If this definition generates a polymorphic NEON builtin, the base type defined by this argument is overwritten by that of the type code supplied to the overloaded builtin call. This third argument is omitted in some cases due to this. Here is an example for [`vfma_laneq`](https://developer.arm.com/architectures/instruction-sets/intrinsics/#f:@navigationhierarchiessimdisa=[Neon]&q=vfma_laneq) - The immediate is supplied in argument 3 - The immediate is used as an index into the lanes of argument 2 - So we must perform an immediate check on argument 3, based on the type information of argument 2. - `ImmCheck<3, ImmCheckLaneIndex, 2>` During this work, we discovered that the existing immediate range-checking system was largely untested, which made it difficult to make reliable progress. Missing tests have been added to verify this implementation against all intrinsics which take constrained immediate arguments. All test immediate range checking tests for NEON intrinsics are moved to a dedicated directory `clang/test/Sema/aarch64-neon-immediate-ranges/`.	2024-09-06 13:12:37 +01:00
Rahul Joshi	d7da79f2cd	[NFC][SetTheory] Refactor to use const pointers and range loops (#105544 ) - Refactor SetTheory code to use const pointers when possible. - Use auto for variables initialized using dyn_cast<>. - Use range based for loops and early continue.	2024-08-22 05:47:31 -07:00
Lukacma	0284b4b4b6	[Clang][NEON] Add neon target guard to intrinsics (#99870 ) This patch improves reported error when NEON intrinsics are used without neon target feature.	2024-07-22 14:21:31 +01:00
Lukacma	c1622cae10	Revert "[Clang][NEON] Add neon target guard to intrinsics" (#99864 ) Reverts llvm/llvm-project#98624	2024-07-22 12:32:47 +01:00
Lukacma	dc82c774a7	[Clang][NEON] Add neon target guard to intrinsics (#98624 ) This patch improves reported error when NEON intrinsics are used without neon target feature.	2024-07-22 12:03:59 +01:00
Lukacma	8a46bbbc22	[Clang] Remove preprocessor guards and global feature checks for NEON (#95224 ) To enable function multi-versioning (FMV), current checks which rely on cmd line options or global macros to see if target feature is present need to be removed. This patch removes those for NEON and also implements changes to NEON header file as proposed in [ACLE](https://github.com/ARM-software/acle/pull/321).	2024-06-25 17:19:42 +02:00
Eli Friedman	8c9f45e2de	[ARM64EC] Fix arm_neon.h on ARM64EC. (#88572 ) Since 97fe519d, in ARM64EC mode, we don't define `__aarch64__`. Fix various preprocessor guards to account for this.	2024-04-16 17:08:02 -07:00
Kazu Hirata	e6bafbe726	[TableGen] Use StringRef::consume_{front,back} (NFC)	2024-01-25 18:17:24 -08:00
Sam Tebbs	945c645acb	[AArch64][SME] Warn when using a streaming builtin from a non-streaming function (#75487 ) This PR adds a warning that's emitted when a non-streaming or non-streaming-compatible builtin is called in an unsuitable function. Uses work by Kerry McLaughlin. This is a re-upload of #74064 and fixes a compile time increase.	2023-12-18 09:32:34 +00:00
Sam Tebbs	342384ca05	Revert "[AArch64][SME] Warn when using a streaming builtin from a non-streaming function" (#75449 ) Reverts llvm/llvm-project#74064	2023-12-14 09:31:55 +00:00
Sam Tebbs	2e45326b08	[AArch64][SME] Warn when using a streaming builtin from a non-streaming function (#74064 ) This PR adds a warning that's emitted when a non-streaming or non-streaming-compatible builtin is called in an unsuitable function. Uses work by Kerry McLaughlin.	2023-12-14 00:11:04 +00:00
CarolineConcatto	ed2d497291	[Clang][AArch64] Add fix vector types to header into SVE (#73258 ) This patch is needed for the reduction instructions in sve2.1 It add a new header to sve with all the fixed vector types. The new types are only added if neon is not declared.	2023-12-13 08:59:41 +00:00
Simon Pilgrim	141122ece3	[TableGen] Use StringRef::starts_with/ends_with instead of startswith/endswith. NFC. startswith/endswith wrap starts_with/ends_with and will eventually go away (to more closely match string_view)	2023-11-03 17:53:56 +00:00
Kazu Hirata	dd27036ff7	[TableGen] Modernize OverloadInfo (NFC)	2023-09-04 13:35:26 -07:00
Lucas Prates	2b7ac62606	[AArch64][RCPC3] Add Neon intrinsics for LDAP1 and STL1 This adds new intrisics to support the LDAP1 and STL1 Advanced SIMD (Neon) instructions introduced as part of FEAT_LRCPC3. The new intrinsics `vldap1(q)_lane`/`vstl1(q)_lane` generate IR code similar to the existing `vld1(q)_lane/st1(q)_lane` ones, but capturing the difference in the atomic release/acquire memory model. The LLVM code generation changes to ensure that this instruction pair is lowered to the correct LDAP1/STL1 instructions will be covered in a separate commit. Based on a patch by Sam Elliott. Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D153128	2023-07-07 12:31:55 +01:00
Dimitry Andric	db49231639	[clang][BFloat] Avoid redefining bfloat16_t in arm_neon.h As of https://reviews.llvm.org/D79708, clang-tblgen generates `arm_neon.h`, `arm_sve.h` and `arm_bf16.h`, and all those generated files will contain a typedef of `bfloat16_t`. However, `arm_neon.h` and `arm_sve.h` include `arm_bf16.h` immediately before their own typedef: #include <arm_bf16.h> typedef __bf16 bfloat16_t; With a recent version of clang (I used 16.0.1) this results in warnings: /usr/lib/clang/16/include/arm_neon.h:38:16: error: redefinition of typedef 'bfloat16_t' is a C11 feature [-Werror,-Wtypedef-redefinition] Since `arm_bf16.h` is very likely supposed to be the one true place where `bfloat16_t` is defined, I propose to delete the duplicate typedefs from the generated `arm_neon.h` and `arm_sve.h`. Reviewed By: sdesmalen, simonbutcher Differential Revision: https://reviews.llvm.org/D148822	2023-05-03 17:54:58 +02:00
Manna, Soumi	38ecb9767c	[NFC][clang] Fix Coverity bugs with AUTO_CAUSES_COPY Reported by Coverity: AUTO_CAUSES_COPY Unnecessary object copies can affect performance. 1. Inside "ExtractAPIVisitor.h" file, in clang::extractapi::impl::ExtractAPIVisitorBase<<unnamed>::BatchExtractAPIVisitor>::VisitFunctionDecl(clang::FunctionDecl const ): Using the auto keyword without an & causes the copy of an object of type DynTypedNode. 2. Inside "NeonEmitter.cpp" file, in <unnamed>::Intrinsic::Intrinsic(llvm::Record , llvm::StringRef, llvm::StringRef, <unnamed>::TypeSpec, <unnamed>::TypeSpec, <unnamed>::ClassKind, llvm::ListInit , <unnamed>::NeonEmitter &, llvm::StringRef, llvm::StringRef, bool, bool): Using the auto keyword without an & causes the copy of an object of type Type. 3. Inside "MicrosoftCXXABI.cpp" file, in <unnamed>::MSRTTIBuilder::getClassHierarchyDescriptor(): Using the auto keyword without an & causes the copy of an object of type MSRTTIClass. 4. Inside "CGGPUBuiltin.cpp" file, in clang::CodeGen::CodeGenFunction::EmitAMDGPUDevicePrintfCallExpr(clang::CallExpr const ): Using the auto keyword without an & causes the copy of an object of type CallArg. 5. Inside "SemaDeclAttr.cpp" file, in threadSafetyCheckIsSmartPointer(clang::Sema &, clang::RecordType const ): Using the auto keyword without an & causes the copy of an object of type CXXBaseSpecifier. 6. Inside "ComputeDependence.cpp" file, in clang::computeDependence(clang::DesignatedInitExpr ): Using the auto keyword without an & causes the copy of an object of type Designator. 7. Inside "Format.cpp" file, In clang::format::affectsRange(llvm::ArrayRef<clang::tooling::Range>, unsigned int, unsigned int): Using the auto keyword without an & causes the copy of an object of type Range. Reviewed By: tahonermann Differential Revision: https://reviews.llvm.org/D149074	2023-04-24 14:52:55 -07:00
Kazu Hirata	9cf4419e24	[clang] Use std::optional instead of llvm::Optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-02 15:54:57 -08:00
Kazu Hirata	f7dffc28b3	Don't include None.h (NFC) I've converted all known uses of None to std::nullopt, so we no longer need to include None.h. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 11:24:26 -08:00
Kazu Hirata	5891420e68	[clang] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 11:54:46 -08:00
David Green	b879f99f0e	[AArch64][ARM] Alter most of arm_neon.h to be target-based, not preprocessor based. Similar to D131064, this alters most of the intrinsics in arm_neon.h to be target based, not preprocessor based. The intrinsics that are changed are the ones with obvious target features (fp16, fp16fml, cryptos, i8mm and bf16). The ones that are not yet altered are the ones without target features like rdma (8.1) and complex (8.3). Those will be switched in a followup patch that allows targeting architecture versions. The existing ArchGuard in arm_neon.td is split into ArchGuard that still adds ifdef defines (for example for intrinsics that require __aarch64__), and TargetGuards for intrinsics dependant on target features. From there the TargetGuards are used in two ways: - For intrinsics emitted as functions, __attribute__((target(TargetGuard))) is added to the definition of the function. Along with the existing always_inline intrinsic, this will give a compile time error if the function is used in a context where the target feature is not available. - For intrinsics emitted as macros, the __builtins are emitted into arm_neon.inc using TARGET_BUILTIN as opposed to BUILTIN, which includes the target feature and gives an error if the builtin is found in a function without the required features, similar to arm_sve.h. The second method requires that the intrinsics be separable from the existing _v intrinsics used in other types. For example __builtin_neon_splat_lane_bf16 is used as opposed to __builtin_neon_splat_lane_v. There are some adjustments to the CGBuiltin to account for intrinsics that can be treated similarly, except for their target features. Differential Revision: https://reviews.llvm.org/D132034	2022-10-11 09:09:16 +01:00
David Green	4987ae8462	[ARM][AArch64] Dont use macros for half instrinsics in NeonEmitter We don't require arm_neon.h fp16 intrinsics to be treated as macros any more. Differential Revision: https://reviews.llvm.org/D131504	2022-10-03 15:27:23 +01:00
Fangrui Song	3f18f7c007	[clang] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D131346	2022-08-08 09:12:46 -07:00
Gabriel Ravier	5674a3c880	Fixed a number of typos I went over the output of the following mess of a command: (ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less) and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Differential Revision: https://reviews.llvm.org/D130827	2022-08-01 13:13:18 -04:00
Nick Desaulniers	5a2e56b70e	[Clang][NeonEmitter] emit ret decl first for -Wdeclaration-after-statement The generated arm_neon.h header isn't -Wdeclaration-after-statement compliant when targeting -mbig-endian. Update the generator to declare the return value, if any, first before any other arguments that might need to be "reversed" from little endian to big. Another approach would have been to try to ignore this warning in system headers, though that might not be precise for tokens involved in macro expansion. See also: https://reviews.llvm.org/D116833#3236209. Link: https://github.com/ClangBuiltLinux/linux/issues/1603 Fixes: https://github.com/llvm/llvm-project/issues/54062 Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D122189	2022-03-23 09:40:43 -07:00
Kazu Hirata	17d4bd3d78	[clang] Fix bugprone argument comments (NFC) Identified with bugprone-argument-comment.	2022-01-09 00:19:49 -08:00
Kazu Hirata	f4ffcab178	Remove redundant string initialization (NFC) Identified by readability-redundant-string-init.	2022-01-01 12:34:11 -08:00
Kazu Hirata	0542d15211	Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2021-12-26 09:39:26 -08:00
Kazu Hirata	16ceb44e62	[clang] Use llvm::{count,count_if,find_if,all_of,none_of} (NFC)	2021-10-25 09:14:45 -07:00
Kazu Hirata	4bd46501c3	Use llvm::any_of and llvm::none_of (NFC)	2021-10-24 17:35:33 -07:00
Kazu Hirata	dccfaddc6b	[clang] Use StringRef::contains (NFC)	2021-10-21 08:58:19 -07:00
Ryan Santhiraraja	2c25efcbd3	[AArch64] Adding SHA3 Intrinsics support This patch adds the following SHA3 Intrinsics: vsha512hq_u64, vsha512h2q_u64, vsha512su0q_u64, vsha512su1q_u64 veor3q_u8 veor3q_u16 veor3q_u32 veor3q_u64 veor3q_s8 veor3q_s16 veor3q_s32 veor3q_s64 vrax1q_u64 vxarq_u64 vbcaxq_u8 vbcaxq_u16 vbcaxq_u32 vbcaxq_u64 vbcaxq_s8 vbcaxq_s16 vbcaxq_s32 vbcaxq_s64 Note need to include +sha3 and +crypto when building from the front-end Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D96381	2021-02-22 12:09:20 +00:00
Florian Hahn	51d5991f04	[Clang] Add AArch64 VCMLA LANE variants. This patch adds the LANE variants for VCMLA on AArch64 as defined in "Arm Neon Intrinsics Reference for ACLE Q3 2020" [1] This patch also updates `dup_typed` to accept constant type strings directly. Based on a patch by Tim Northover. [1] https://developer.arm.com/documentation/ihi0073/latest Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D93014	2021-01-05 16:14:00 +00:00
Fangrui Song	c70f36865e	Use basic_string::find(char) instead of basic_string::find(const char *s, size_type pos=0) Many (StringRef) cannot be detected by clang-tidy performance-faster-string-find.	2020-12-16 23:28:32 -08:00
Fangrui Song	a2f922140f	[TableGen] Delete 11 unused declarations	2020-12-06 13:21:07 -08:00

1 2 3 4

193 Commits