llvm-project

Author	SHA1	Message	Date
Alexandros Lamprineas	d01576bb60	[TargetParser][AArch64] Add alias for FEAT_RDM. (#80540 ) This patch allows using the name "rdma" as an alias for "rdm". The name makes its way to target attributes as well as the command line via the -march and -mcpu options. The motivation was originally to support this in Function Multi Versioning but it also makes sense to align with GCC on the command line.	2024-03-01 21:06:08 +00:00
Brandon Wu	8fd011ecc6	[RISCV] Add getFeaturesForCPU function support (#83269 ) This function parse the cpu and return it's supported features placed in EnabledFeatures. It is same as the one in X86TargetParser and also is used in IREE.	2024-03-01 13:12:09 +08:00
Freddy Ye	1fe6be8794	[X86] Support APXF to enable __builtin_cpu_supports. (#80636 ) For referring, APX's spec: https://cdrdv2.intel.com/v1/dl/getContent/784266 APX's index in libgcc: https://github.com/gcc-mirror/gcc/blob/master/gcc/common/config/i386/i386-cpuinfo.h#L267	2024-02-23 15:18:42 +08:00
Jonas Devlieghere	1f57800fe6	[llvm] Handle BridgeOS in Triple::parseOS This fixes the TripleTest unit test.	2024-02-19 13:51:35 -08:00
Jonas Devlieghere	1b26c25f7e	[llvm] Upstream the BridgeOS triple enum value (NFC) This upstreams the BridgeOS target triple enum value.	2024-02-19 12:20:23 -08:00
Tomas Matheson	d022f32c73	Revert "[ARM] __ARM_ARCH macro definition fix (#81493 )" This reverts commit 89c1bf1230e011f2f0e43554c278205fa1819de5. This has been unimplemenented for a while, and GCC does not implement it, therefore we need to consider whether we should just deprecate it in the ACLE instead.	2024-02-19 12:19:16 +00:00
Lang Hames	80c25daca8	[TargetParser] Expose Triple::getArchPointerBitWidth. (#82086 ) The getArchPointerBitWidth method provides direct access to the pointer size for arithmetic or switch statements, instead of requiring clients to use the isArch16Bit / isArch32Bit / isArch64Bit predicates.	2024-02-17 13:21:38 -08:00
Pierre van Houtryve	43c7eb5d7b	[AMDGPU] Replace '.' with '-' in generic target names (#81718 ) The dot is too confusing for tools. Output temporaries would have '10.3-generic' so tools could parse it as an extension, device libs & the associated clang driver logic are also confused by the dot. After discussions, we decided it's better to just remove the '.' from the target name than fix each issue one by one.	2024-02-14 15:19:04 +01:00
James Westwood	89c1bf1230	[ARM] __ARM_ARCH macro definition fix (#81493 ) This patch changes how the macro __ARM_ARCH is defined to match its defintion in the ACLE. In ACLE 5.4.1, __ARM_ARCH is defined as equal to the major architecture version for ISAs up to and including v8. From v8.1 onwards, its definition is changed to include minor versions, such that for an architecture vX.Y, __ARM_ARCH = X*100 + Y. Before this patch, LLVM defined __ARM_ARCH using only the major architecture version for all architecture versions. This patch adds functionality to define __ARM_ARCH correctly for architectures greater than or equal to v8.1.	2024-02-13 15:12:35 +00:00
Pierre van Houtryve	f93aa5157a	[AMDGPU] Introduce GFX9/10.1/10.3/11 Generic Targets (#76955 ) These generic targets include multiple GPUs and will, in the future, provide a way to build once and run on multiple GPU, at the cost of less optimization opportunities. Note that this is just doing the compiler side of things, device libs an runtimes/loader/etc. don't know about these targets yet, so none of them actually work in practice right now. This is just the initial commit to make LLVM aware of them. This contains the documentation changes for both this change and #76954 as well.	2024-02-12 10:18:20 +01:00
Philipp Tomsich	fbba818a78	[AArch64] Add the Ampere1B core (#81297 ) The Ampere1B is Ampere's third-generation core implementing a superscalar, out-of-order microarchitecture with nested virtualization, speculative side-channel mitigation and architectural support for defense against ROP/JOP style software attacks. Ampere1B is an ARMv8.7+ implementation, adding support for the FEAT WFxT, FEAT CSSC, FEAT PAN3 and FEAT AFP extensions. It also includes all features of the second-generation Ampere1A, such as the Memory Tagging Extension and SM3/SM4 cryptography instructions.	2024-02-09 15:22:09 -08:00
Natalie Chouinard	3b57b647a9	[HLSL][SPIR-V] Add create.handle intrinsic (#81038 ) Add a SPIR-V target-specific intrinsic for creating handles, which is used for lowering HLSL resources types like RWBuffer. `llvm/lib/TargetParser/Triple.cpp`: SPIR-V intrinsics use "spv" as the target prefix, not "spirv". As far as I can tell, this is the first one that is used via the `CGBuiltin` codepath, which relies on `getArchTypePrefix`, so I've corrected it here. `clang/lib/Basic/Targets/SPIR.h`: When records are laid out in the lowering from AST to IR, they were incorrectly offset because these Pointer attributes were defaulting to 32. Related to #81036	2024-02-08 14:35:44 -05:00
ZijunZhaoCCK	34c4a0fa2b	[Driver] Report invalid target triple versions for all environment types. (#78655 ) Followup for https://github.com/llvm/llvm-project/pull/75373 1. Make this feature not just available for android, but everyone. 2. Correct some target triples. 3. Add opencl to the environment type list.	2024-02-04 16:42:15 -08:00
Freddy Ye	72105605d1	[X86] Support more ISAs to enable __builtin_cpu_supports (#79086 ) This patch will also expand supports for attribute/target, while the priority of newly supported ISAs will be set to zero.	2024-02-04 21:21:12 +08:00
Alexandros Lamprineas	260fe032c2	[llvm][AArch64][TargetParser][NFC] Use parseArchExtension in parseModifier. (#80427 ) This allows making changes in parseArchExtension to make their way in the command line as well, not only in target attributes.	2024-02-03 12:29:50 +00:00
Mariusz Sikora	cfddb59be2	[AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (#78414 ) …bf8 instructions Add VOP1, VOP1_DPP8, VOP1_DPP16, VOP3, VOP3_DPP8, VOP3_DPP16 instructions that were supported on GFX940 (MI300): - V_CVT_F32_FP8 - V_CVT_F32_BF8 - V_CVT_PK_F32_FP8 - V_CVT_PK_F32_BF8 - V_CVT_PK_FP8_F32 - V_CVT_PK_BF8_F32 - V_CVT_SR_FP8_F32 - V_CVT_SR_BF8_F32 --------- Co-authored-by: Mateja Marjanovic <mateja.marjanovic@amd.com> Co-authored-by: Mirko Brkušanin <Mirko.Brkusanin@amd.com>	2024-01-24 12:21:15 +01:00
Lucas Duarte Prates	6bbaad1ed4	[ARM] Introduce the v9.5-A architecture version to Arm targets (#78994 ) This introduces the Armv9.5-A architecture version to the Arm backend, following on from the existing implementation for AArch64 targets. Mode details about the Armv9.5-A architecture version can be found at: * https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/arm-a-profile-architecture-developments-2023 * https://developer.arm.com/documentation/ddi0602/2023-09/	2024-01-23 14:39:15 +00:00
Jay Foad	e21b0b083e	[AMDGPU] Remove gws feature from GFX12 (#78711 ) This was already done for LLVM. This patch just updates the Clang builtin handling to match.	2024-01-19 15:45:53 +00:00
Jay Foad	ed12388082	[AMDGPU] Do not emit `V_DOT2C_F32_F16_e32` on GFX12 (#78709 ) That instruction is not supported on GFX12. Added a testcase which previously crashed without this change. Co-authored-by: pvanhout <pierre.vanhoutryve@amd.com>	2024-01-19 14:36:27 +00:00
Natalie Chouinard	c21f48e5ad	[HLSL][SPIR-V] Add Vulkan to target triple (#76749 ) Add support for specifying the logical SPIR-V target environment in the triple as Vulkan. When compiling HLSL, this replaces the DirectX Shader Model with a Vulkan environment instead. Currently, the only supported combinations of SPIR-V version and Vulkan environment are: - Vulkan 1.2 and SPIR-V 1.5 - Vulkan 1.3 and SPIR-V 1.6 Fixes #70051	2024-01-18 12:52:00 -05:00
Mariusz Sikora	3e6589f21c	[AMDGPU][GFX12] Add 16 bit atomic fadd instructions (#75917 ) - image_atomic_pk_add_f16 - image_atomic_pk_add_bf16 - ds_pk_add_bf16 - ds_pk_add_f16 - ds_pk_add_rtn_bf16 - ds_pk_add_rtn_f16 - flat_atomic_pk_add_f16 - flat_atomic_pk_add_bf16 - global_atomic_pk_add_f16 - global_atomic_pk_add_bf16 - buffer_atomic_pk_add_f16 - buffer_atomic_pk_add_bf16	2024-01-18 14:01:09 +01:00
Mariusz Sikora	264fd9e13e	[AMDGPU][NFC] Rename feature FP8Insts to FP8ConversionInsts (#78439 )	2024-01-18 08:46:53 +01:00
Freddy Ye	f3a4de395c	[X86] Support "f16c" and "avx512fp16" for __builtin_cpu_supports (#78384 ) This resolves issue #65320. This also supports clarify sapphirerapids and cooperlake for cpu_specific/dispatch.	2024-01-18 09:22:04 +08:00
Jie Fu	b647a34c9f	[AArch64] Fix -Wreturn-type in AArch64TargetParser.cpp (NFC) llvm-project/llvm/lib/TargetParser/AArch64TargetParser.cpp:157:1: error: non-void function does not return a value in all control paths [-Werror,-Wreturn-type] } ^ 1 error generated.	2024-01-18 07:54:55 +08:00
Cyndy Ishida	735adbf1a8	[llvm] Teach MachO about XROS (#78373 ) Add support for XROS to encode in Mach-O file formats.	2024-01-17 10:35:20 -08:00
ostannard	13e977d173	[AArch64][Driver] Better handling of target feature dependencies (#78270 ) Currently there are several bits of code in the AArch64 driver which attempt to enforce dependencies between optional features in the -march= and -mcpu= options. However, these are based on the list of feature names being enabled/disabled, so they have a lot of logic to consider the order in which features were turned on and off, which doesn't scale well as dependency chains get longer. This patch moves the code handling these dependencies to TargetParser, and changes them to use a Bitset of enabled features. This makes it easy to check which features are enabled, and is converted back to a list of LLVM feature names once all of the command-line options are parsed. The motivating example for this was the -mcpu=cortex-r82+nofp option. Previously, the code handling the dependency between the fp16 and fp16fml extensions did not consider the nofp modifier, so it added +fullfp16 to the feature list. This should have been disabled by the +nofp modifier, and also the backend did follow the dependency between fullfp16 and fp, resulting in fp being turned back on in the backend. Most of the dependencies added to AArch64TargetParser.h weren't known about by clang before, I built that list by checking what the backend thinks the dependencies between SubtargetFeatures are.	2024-01-17 16:20:34 +00:00
Anatoly Trosinenko	7f7bbb9b44	[AArch64] Use parseArchExtension function. NFC (#78158 )	2024-01-17 12:57:36 +03:00
Cyndy Ishida	7d3466b5c3	[llvm] Introduce XROS platform (#77707 ) Accepts but otherwise ignores visionOS/XROS target triples as Darwin based platform. https://developer.apple.com/documentation/visionos	2024-01-16 16:32:10 -08:00
Alexandros Lamprineas	a690e86753	[AArch64] Add native CPU detection for Microsoft Azure Cobalt 100. (#77793 ) This patch extends the -mcpu/mtune=native support to handle the Microsoft Azure Cobalt 100 CPU as a Neoverse N2. We expect users to use -mcpu=neoverse-n2 when targeting this CPU and all the architecture and codegen decisions to be identical. The only difference is that the Microsoft Azure Cobalt 100 has a different Implementer ID in the /proc/cpuinfo entry that needs to be detected in getHostCPUNameForARM appropriately.	2024-01-16 09:40:13 +00:00
Jie Fu	d553934770	[AArch64] Add missing field 'GuardedControlStack' initializer (NFC) llvm-project/llvm/lib/TargetParser/ARMTargetParserCommon.cpp:143:39: error: missing field 'GuardedControlStack' initializer [-Werror,-Wmissing-field-initializers] PBP = {"none", "a_key", false, false}; ^ 1 error generated.	2024-01-11 21:35:04 +08:00
John Brawn	40d5c2bcd4	[clang][AArch64] Add a -mbranch-protection option to enable GCS (#75486 ) -mbranch-protection=gcs (enabled by -mbranch-protection=standard) causes generated objects to be marked with the gcs feature. This is done via the guarded-control-stack module flag, in a similar way to branch-target-enforcement and sign-return-address. Enabling GCS causes the GNU_PROPERTY_AARCH64_FEATURE_1_GCS bit to be set on generated objects. No code generation changes are required, as GCS just requires that functions are called using BL and returned from using RET (or other similar variant instructions), which is already the case.	2024-01-11 12:53:23 +00:00
ZijunZhaoCCK	f6dbd4cc5f	Make clang report invalid target versions. (#75373 ) Clang always silently ignores garbage target versions and this makes debug harder. So clang will report when target versions are invalid.	2024-01-08 20:46:05 -08:00
Weining Lu	7e186d366d	[LoongArch] Fix the procossor series mask Refer PRID_SERIES_MASK definition in linux kernel: arch/loongarch/include/asm/cpu.h.	2024-01-03 14:01:05 +08:00
Aiden Grossman	3ddf368524	[X86] Fix warning in cpu detection due to unsigned comparison a15532d7647a8a4b7fd2889bd97f6f72f273c4bf landed a patch that added support for detecting more AMD znver2 CPUs and cleaned up some of the surrounding code, including the znver3 detection. Since one model group is 00h-0fh, I adjusted the check to include checking if the value is greater than zero. Since the value is unsigned, this is always true and gcc warns on it. This patch removes the comparison with zero to get rid of the compiler warning.	2023-12-29 17:58:53 -08:00
Freddy Ye	19e784604c	[X86] Remove RAO-INT from Grandridge (#76420 ) According to latest spec: https://cdrdv2.intel.com/v1/dl/getContent/671368	2023-12-28 10:06:54 +08:00
Kazu Hirata	f5f2c313ae	[llvm] Use StringRef::consume_front (NFC)	2023-12-25 12:33:00 -08:00
Aiden Grossman	a15532d764	[X86] Add CPU detection for more znver2 CPUs (#74955 ) This patch adds proper detection support for more znver2 CPUs. Specifically, this adds in support for CPUs codenamed Renoir, Lucienne, and Mendocino. This was originally proposedfor Renoir in https://reviews.llvm.org/D96220 and got approved, but slipped through the cracks. However, there is still a demand for this feature. In addition to adding support for more znver2 CPUs, this patch also includes some additional refactoring and comments related to cpu model information for zen CPUs. Fixes https://github.com/llvm/llvm-project/issues/74934.	2023-12-21 23:39:28 -08:00
Tomas Matheson	7bd17212ef	Re-land "[AArch64] Codegen support for FEAT_PAuthLR" (#75947 ) This reverts commit 9f0f5587426a4ff24b240018cf8bf3acc3c566ae. Fix expensive checks failure by properly marking register def for ADR.	2023-12-21 18:32:55 +00:00
Tomas Matheson	9f0f558742	Revert "[AArch64] Codegen support for FEAT_PAuthLR" This reverts commit 5992ce90b8c0fac06436c3c86621fbf6d5398ee5. Builtbot failures with expensive checks enabled.	2023-12-21 16:25:55 +00:00
Tomas Matheson	5992ce90b8	[AArch64] Codegen support for FEAT_PAuthLR - Adds a new +pc option to -mbranch-protection that will enable the use of PC as a diversifier in PAC branch protection code. - When +pauth-lr is enabled (-march=armv9.5a+pauth-lr) in combination with -mbranch-protection=pac-ret+pc, the new 9.5-a instructions (pacibsppc, retaasppc, etc) are used. Documentation for the relevant instructions can be found here: https://developer.arm.com/documentation/ddi0602/2023-09/Base-Instructions/ Co-authored-by: Lucas Prates <lucas.prates@arm.com>	2023-12-21 14:18:33 +00:00
Matt Arsenault	629412938b	Triple: Fix handling of macos with unexpected target arches (#75469 ) Some tools with a specified target arch, but no full triple default to the host triple. On macos hosts, this would then force using macho on targets that didn't expect it, resulting in assertions. We should also probably emit explicit errors if the object format is specified on targets which don't handle it.	2023-12-18 21:28:33 +07:00
Kazu Hirata	038871ae68	[llvm] Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-16 14:47:16 -08:00
Kazu Hirata	586ecdf205	[llvm] Use StringRef::{starts,ends}_with (NFC) (#74956 ) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-11 21:01:36 -08:00
Jonathan Thackray	8758e648da	[ARM][AArch32] Add support for AArch32 Cortex-M52 CPU (#74822 ) Cortex-M52 is an Armv8.1 AArch32 CPU. Technical specifications available at: https://developer.arm.com/processors/cortex-m52	2023-12-08 15:04:08 +00:00
Shengchen Kan	6d6baef5c9	[X86] Support CFE flags for APX features (#74199 ) Positive options: -mapx-features=<comma-separated-features> Negative options: -mno-apx-features=<comma-separated-features> -m[no-]apx-features is designed to be able to control separate APX features. Besides, we also support the flag -m[no-]apxf, which can be used like an alias of -m[no-]apx-features=< all APX features covered by CPUID APX_F> Behaviour when positive and negative options are used together: For boolean flags, the last one wins -mapxf -mno-apxf -> -mno-apxf -mno-apxf -mapxf -> -mapxf For flags that take a set as arguments, it sets the mask by order of the flags -mapx-features=egpr,ndd -mno-apx-features=egpr -> -egpr,+ndd -mapx-features=egpr -mno-apx-features=egpr,ndd -> -egpr,-ndd -mno-apx-features=egpr -mapx-features=egpr,ndd -> +egpr,+ndd -mno-apx-features=egpr,ndd -mapx-features=egpr -> -ndd,+egpr The design is aligned with gcc https://gcc.gnu.org/pipermail/gcc-patches/2023-August/628905.html	2023-12-04 19:22:56 +08:00
Jay Foad	cf1e0c0b07	[AMDGPU] Define new targets gfx1200 and gfx1201 (#73133 ) Define target names and ELF numbers for new GFX12 targets gfx1200 and gfx1201. For now they behave identically to GFX11.	2023-11-23 16:44:05 +00:00
Jay Foad	9b374a800d	[AMDGPU] Add some clang-format off/on markers This keeps clang-format happy on future patches.	2023-11-23 09:50:55 +00:00
Jay Foad	e0d93d5aaa	[AMDGPU] Reindent some tables This keeps clang-format happy on future patches.	2023-11-23 09:49:03 +00:00
Jonathan Thackray	066c4524bc	[AArch64] Add support for Cortex-A520, Cortex-A720 and Cortex-X4 CPUs (#72395 ) Cortex-A520, Cortex-A720 and Cortex-X4 are Armv9.2 AArch64 CPUs. Technical Reference Manual for Cortex-A520: https://developer.arm.com/documentation/102517/latest/ Technical Reference Manual for Cortex-A720: https://developer.arm.com/documentation/102530/latest/ Technical Reference Manual for Cortex-X4: https://developer.arm.com/documentation/102484/latest/ Patch co-authored by: Sivan Shani <sivan.shani@arm.com>	2023-11-16 22:08:58 +00:00
Lucas Duarte Prates	59b2301508	[AArch64] Introduce the Armv9.5-A architecture version (#72392 ) This introduces the Armv9.5-A architecture version, including the relevant command-line option for -march. Mode details about the Armv9.5-A architecture version can be found at: * https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/arm-a-profile-architecture-developments-2023 * https://developer.arm.com/documentation/ddi0602/2023-09/ Patch by Oliver Stannard.	2023-11-16 15:38:32 +00:00

1 2 3 4

159 Commits