llvm-project

Author	SHA1	Message	Date
Mariusz Sikora	cfddb59be2	[AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (#78414 ) …bf8 instructions Add VOP1, VOP1_DPP8, VOP1_DPP16, VOP3, VOP3_DPP8, VOP3_DPP16 instructions that were supported on GFX940 (MI300): - V_CVT_F32_FP8 - V_CVT_F32_BF8 - V_CVT_PK_F32_FP8 - V_CVT_PK_F32_BF8 - V_CVT_PK_FP8_F32 - V_CVT_PK_BF8_F32 - V_CVT_SR_FP8_F32 - V_CVT_SR_BF8_F32 --------- Co-authored-by: Mateja Marjanovic <mateja.marjanovic@amd.com> Co-authored-by: Mirko Brkušanin <Mirko.Brkusanin@amd.com>	2024-01-24 12:21:15 +01:00
Lucas Duarte Prates	6bbaad1ed4	[ARM] Introduce the v9.5-A architecture version to Arm targets (#78994 ) This introduces the Armv9.5-A architecture version to the Arm backend, following on from the existing implementation for AArch64 targets. Mode details about the Armv9.5-A architecture version can be found at: * https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/arm-a-profile-architecture-developments-2023 * https://developer.arm.com/documentation/ddi0602/2023-09/	2024-01-23 14:39:15 +00:00
Jay Foad	e21b0b083e	[AMDGPU] Remove gws feature from GFX12 (#78711 ) This was already done for LLVM. This patch just updates the Clang builtin handling to match.	2024-01-19 15:45:53 +00:00
Jay Foad	ed12388082	[AMDGPU] Do not emit `V_DOT2C_F32_F16_e32` on GFX12 (#78709 ) That instruction is not supported on GFX12. Added a testcase which previously crashed without this change. Co-authored-by: pvanhout <pierre.vanhoutryve@amd.com>	2024-01-19 14:36:27 +00:00
Natalie Chouinard	c21f48e5ad	[HLSL][SPIR-V] Add Vulkan to target triple (#76749 ) Add support for specifying the logical SPIR-V target environment in the triple as Vulkan. When compiling HLSL, this replaces the DirectX Shader Model with a Vulkan environment instead. Currently, the only supported combinations of SPIR-V version and Vulkan environment are: - Vulkan 1.2 and SPIR-V 1.5 - Vulkan 1.3 and SPIR-V 1.6 Fixes #70051	2024-01-18 12:52:00 -05:00
Mariusz Sikora	3e6589f21c	[AMDGPU][GFX12] Add 16 bit atomic fadd instructions (#75917 ) - image_atomic_pk_add_f16 - image_atomic_pk_add_bf16 - ds_pk_add_bf16 - ds_pk_add_f16 - ds_pk_add_rtn_bf16 - ds_pk_add_rtn_f16 - flat_atomic_pk_add_f16 - flat_atomic_pk_add_bf16 - global_atomic_pk_add_f16 - global_atomic_pk_add_bf16 - buffer_atomic_pk_add_f16 - buffer_atomic_pk_add_bf16	2024-01-18 14:01:09 +01:00
Mariusz Sikora	264fd9e13e	[AMDGPU][NFC] Rename feature FP8Insts to FP8ConversionInsts (#78439 )	2024-01-18 08:46:53 +01:00
Freddy Ye	f3a4de395c	[X86] Support "f16c" and "avx512fp16" for __builtin_cpu_supports (#78384 ) This resolves issue #65320. This also supports clarify sapphirerapids and cooperlake for cpu_specific/dispatch.	2024-01-18 09:22:04 +08:00
Jie Fu	b647a34c9f	[AArch64] Fix -Wreturn-type in AArch64TargetParser.cpp (NFC) llvm-project/llvm/lib/TargetParser/AArch64TargetParser.cpp:157:1: error: non-void function does not return a value in all control paths [-Werror,-Wreturn-type] } ^ 1 error generated.	2024-01-18 07:54:55 +08:00
Cyndy Ishida	735adbf1a8	[llvm] Teach MachO about XROS (#78373 ) Add support for XROS to encode in Mach-O file formats.	2024-01-17 10:35:20 -08:00
ostannard	13e977d173	[AArch64][Driver] Better handling of target feature dependencies (#78270 ) Currently there are several bits of code in the AArch64 driver which attempt to enforce dependencies between optional features in the -march= and -mcpu= options. However, these are based on the list of feature names being enabled/disabled, so they have a lot of logic to consider the order in which features were turned on and off, which doesn't scale well as dependency chains get longer. This patch moves the code handling these dependencies to TargetParser, and changes them to use a Bitset of enabled features. This makes it easy to check which features are enabled, and is converted back to a list of LLVM feature names once all of the command-line options are parsed. The motivating example for this was the -mcpu=cortex-r82+nofp option. Previously, the code handling the dependency between the fp16 and fp16fml extensions did not consider the nofp modifier, so it added +fullfp16 to the feature list. This should have been disabled by the +nofp modifier, and also the backend did follow the dependency between fullfp16 and fp, resulting in fp being turned back on in the backend. Most of the dependencies added to AArch64TargetParser.h weren't known about by clang before, I built that list by checking what the backend thinks the dependencies between SubtargetFeatures are.	2024-01-17 16:20:34 +00:00
Anatoly Trosinenko	7f7bbb9b44	[AArch64] Use parseArchExtension function. NFC (#78158 )	2024-01-17 12:57:36 +03:00
Cyndy Ishida	7d3466b5c3	[llvm] Introduce XROS platform (#77707 ) Accepts but otherwise ignores visionOS/XROS target triples as Darwin based platform. https://developer.apple.com/documentation/visionos	2024-01-16 16:32:10 -08:00
Alexandros Lamprineas	a690e86753	[AArch64] Add native CPU detection for Microsoft Azure Cobalt 100. (#77793 ) This patch extends the -mcpu/mtune=native support to handle the Microsoft Azure Cobalt 100 CPU as a Neoverse N2. We expect users to use -mcpu=neoverse-n2 when targeting this CPU and all the architecture and codegen decisions to be identical. The only difference is that the Microsoft Azure Cobalt 100 has a different Implementer ID in the /proc/cpuinfo entry that needs to be detected in getHostCPUNameForARM appropriately.	2024-01-16 09:40:13 +00:00
Jie Fu	d553934770	[AArch64] Add missing field 'GuardedControlStack' initializer (NFC) llvm-project/llvm/lib/TargetParser/ARMTargetParserCommon.cpp:143:39: error: missing field 'GuardedControlStack' initializer [-Werror,-Wmissing-field-initializers] PBP = {"none", "a_key", false, false}; ^ 1 error generated.	2024-01-11 21:35:04 +08:00
John Brawn	40d5c2bcd4	[clang][AArch64] Add a -mbranch-protection option to enable GCS (#75486 ) -mbranch-protection=gcs (enabled by -mbranch-protection=standard) causes generated objects to be marked with the gcs feature. This is done via the guarded-control-stack module flag, in a similar way to branch-target-enforcement and sign-return-address. Enabling GCS causes the GNU_PROPERTY_AARCH64_FEATURE_1_GCS bit to be set on generated objects. No code generation changes are required, as GCS just requires that functions are called using BL and returned from using RET (or other similar variant instructions), which is already the case.	2024-01-11 12:53:23 +00:00
ZijunZhaoCCK	f6dbd4cc5f	Make clang report invalid target versions. (#75373 ) Clang always silently ignores garbage target versions and this makes debug harder. So clang will report when target versions are invalid.	2024-01-08 20:46:05 -08:00
Weining Lu	7e186d366d	[LoongArch] Fix the procossor series mask Refer PRID_SERIES_MASK definition in linux kernel: arch/loongarch/include/asm/cpu.h.	2024-01-03 14:01:05 +08:00
Aiden Grossman	3ddf368524	[X86] Fix warning in cpu detection due to unsigned comparison a15532d7647a8a4b7fd2889bd97f6f72f273c4bf landed a patch that added support for detecting more AMD znver2 CPUs and cleaned up some of the surrounding code, including the znver3 detection. Since one model group is 00h-0fh, I adjusted the check to include checking if the value is greater than zero. Since the value is unsigned, this is always true and gcc warns on it. This patch removes the comparison with zero to get rid of the compiler warning.	2023-12-29 17:58:53 -08:00
Freddy Ye	19e784604c	[X86] Remove RAO-INT from Grandridge (#76420 ) According to latest spec: https://cdrdv2.intel.com/v1/dl/getContent/671368	2023-12-28 10:06:54 +08:00
Kazu Hirata	f5f2c313ae	[llvm] Use StringRef::consume_front (NFC)	2023-12-25 12:33:00 -08:00
Aiden Grossman	a15532d764	[X86] Add CPU detection for more znver2 CPUs (#74955 ) This patch adds proper detection support for more znver2 CPUs. Specifically, this adds in support for CPUs codenamed Renoir, Lucienne, and Mendocino. This was originally proposedfor Renoir in https://reviews.llvm.org/D96220 and got approved, but slipped through the cracks. However, there is still a demand for this feature. In addition to adding support for more znver2 CPUs, this patch also includes some additional refactoring and comments related to cpu model information for zen CPUs. Fixes https://github.com/llvm/llvm-project/issues/74934.	2023-12-21 23:39:28 -08:00
Tomas Matheson	7bd17212ef	Re-land "[AArch64] Codegen support for FEAT_PAuthLR" (#75947 ) This reverts commit 9f0f5587426a4ff24b240018cf8bf3acc3c566ae. Fix expensive checks failure by properly marking register def for ADR.	2023-12-21 18:32:55 +00:00
Tomas Matheson	9f0f558742	Revert "[AArch64] Codegen support for FEAT_PAuthLR" This reverts commit 5992ce90b8c0fac06436c3c86621fbf6d5398ee5. Builtbot failures with expensive checks enabled.	2023-12-21 16:25:55 +00:00
Tomas Matheson	5992ce90b8	[AArch64] Codegen support for FEAT_PAuthLR - Adds a new +pc option to -mbranch-protection that will enable the use of PC as a diversifier in PAC branch protection code. - When +pauth-lr is enabled (-march=armv9.5a+pauth-lr) in combination with -mbranch-protection=pac-ret+pc, the new 9.5-a instructions (pacibsppc, retaasppc, etc) are used. Documentation for the relevant instructions can be found here: https://developer.arm.com/documentation/ddi0602/2023-09/Base-Instructions/ Co-authored-by: Lucas Prates <lucas.prates@arm.com>	2023-12-21 14:18:33 +00:00
Matt Arsenault	629412938b	Triple: Fix handling of macos with unexpected target arches (#75469 ) Some tools with a specified target arch, but no full triple default to the host triple. On macos hosts, this would then force using macho on targets that didn't expect it, resulting in assertions. We should also probably emit explicit errors if the object format is specified on targets which don't handle it.	2023-12-18 21:28:33 +07:00
Kazu Hirata	038871ae68	[llvm] Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-16 14:47:16 -08:00
Kazu Hirata	586ecdf205	[llvm] Use StringRef::{starts,ends}_with (NFC) (#74956 ) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-11 21:01:36 -08:00
Jonathan Thackray	8758e648da	[ARM][AArch32] Add support for AArch32 Cortex-M52 CPU (#74822 ) Cortex-M52 is an Armv8.1 AArch32 CPU. Technical specifications available at: https://developer.arm.com/processors/cortex-m52	2023-12-08 15:04:08 +00:00
Shengchen Kan	6d6baef5c9	[X86] Support CFE flags for APX features (#74199 ) Positive options: -mapx-features=<comma-separated-features> Negative options: -mno-apx-features=<comma-separated-features> -m[no-]apx-features is designed to be able to control separate APX features. Besides, we also support the flag -m[no-]apxf, which can be used like an alias of -m[no-]apx-features=< all APX features covered by CPUID APX_F> Behaviour when positive and negative options are used together: For boolean flags, the last one wins -mapxf -mno-apxf -> -mno-apxf -mno-apxf -mapxf -> -mapxf For flags that take a set as arguments, it sets the mask by order of the flags -mapx-features=egpr,ndd -mno-apx-features=egpr -> -egpr,+ndd -mapx-features=egpr -mno-apx-features=egpr,ndd -> -egpr,-ndd -mno-apx-features=egpr -mapx-features=egpr,ndd -> +egpr,+ndd -mno-apx-features=egpr,ndd -mapx-features=egpr -> -ndd,+egpr The design is aligned with gcc https://gcc.gnu.org/pipermail/gcc-patches/2023-August/628905.html	2023-12-04 19:22:56 +08:00
Jay Foad	cf1e0c0b07	[AMDGPU] Define new targets gfx1200 and gfx1201 (#73133 ) Define target names and ELF numbers for new GFX12 targets gfx1200 and gfx1201. For now they behave identically to GFX11.	2023-11-23 16:44:05 +00:00
Jay Foad	9b374a800d	[AMDGPU] Add some clang-format off/on markers This keeps clang-format happy on future patches.	2023-11-23 09:50:55 +00:00
Jay Foad	e0d93d5aaa	[AMDGPU] Reindent some tables This keeps clang-format happy on future patches.	2023-11-23 09:49:03 +00:00
Jonathan Thackray	066c4524bc	[AArch64] Add support for Cortex-A520, Cortex-A720 and Cortex-X4 CPUs (#72395 ) Cortex-A520, Cortex-A720 and Cortex-X4 are Armv9.2 AArch64 CPUs. Technical Reference Manual for Cortex-A520: https://developer.arm.com/documentation/102517/latest/ Technical Reference Manual for Cortex-A720: https://developer.arm.com/documentation/102530/latest/ Technical Reference Manual for Cortex-X4: https://developer.arm.com/documentation/102484/latest/ Patch co-authored by: Sivan Shani <sivan.shani@arm.com>	2023-11-16 22:08:58 +00:00
Lucas Duarte Prates	59b2301508	[AArch64] Introduce the Armv9.5-A architecture version (#72392 ) This introduces the Armv9.5-A architecture version, including the relevant command-line option for -march. Mode details about the Armv9.5-A architecture version can be found at: * https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/arm-a-profile-architecture-developments-2023 * https://developer.arm.com/documentation/ddi0602/2023-09/ Patch by Oliver Stannard.	2023-11-16 15:38:32 +00:00
Yeting Kuo	75d6795e42	[RISCV][Clang][TargetParser] Support getting feature unaligned-scalar-mem from mcpu. (#71513 ) This patch reference ac1ffd3caca12c254e0b8c847aa8ce8e51b6cfbf to suppot a soft coding way to identify whether a cpu has a feature `unaligned-scalar-mem` by `RISCVProcessors.td`. This patch does not provide test case since there is no risc-v cpu support `unaligned-scalar-mem` in llvm upstream now.	2023-11-08 00:05:16 +08:00
Andrew Kaster	7ef6b21c3e	[llvm] Add triple for SerenityOS Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D154395	2023-11-02 05:28:12 -04:00
Alpha Abdoulaye	88d00a6897	Reland [dsymutil] Add support for mergeable libraries (#70256 ) Reland https://reviews.llvm.org/D158124 Fixed `-fpermissive` error reported by gcc only.	2023-10-26 10:45:08 -07:00
Philip Reames	717946f9eb	Revert "[dsymutil] Add support for mergeable libraries" This reverts commit 122c89b271af30b86536cad7bac64ea9c56615ed. Change does not build, with errors such as: In file included from ../llvm-project/llvm/tools/dsymutil/DebugMap.h:24, from ../llvm-project/llvm/tools/dsymutil/DwarfLinkerForBinary.h:13, from ../llvm-project/llvm/tools/dsymutil/DwarfLinkerForBinary.cpp:9: ../llvm-project/llvm/tools/dsymutil/RelocationMap.h:60:17: error: declaration of ‘llvm::dsymutil::SymbolMapping llvm::dsymutil::ValidReloc::SymbolMapping’ changes meaning of ‘SymbolMapping’ [-fpermissive] 60 \| SymbolMapping SymbolMapping; \| ^~~~~~~~~~~~~ ../llvm-project/llvm/tools/dsymutil/RelocationMap.h:36:8: note: ‘SymbolMapping’ declared here as ‘struct llvm::dsymutil::SymbolMapping’ 36 \| struct SymbolMapping { \| ^~~~~~~~~~~~~ In file included from ../llvm-project/llvm/tools/dsymutil/DwarfLinkerForBinary.h:13, from ../llvm-project/llvm/tools/dsymutil/DwarfLinkerForBinary.cpp:9: ../llvm-project/llvm/tools/dsymutil/DebugMap.h:198:32: error: declaration of ‘std::optional<llvm::dsymutil::RelocationMap> llvm::dsymutil::DebugMapObject::RelocationMap’ changes meaning of ‘RelocationMap’ [-fpermissive] 198 \| std::optional<RelocationMap> RelocationMap; \| ^~~~~~~~~~~~~ In file included from ../llvm-project/llvm/tools/dsymutil/DebugMap.h:24, from ../llvm-project/llvm/tools/dsymutil/DwarfLinkerForBinary.h:13, from ../llvm-project/llvm/tools/dsymutil/DwarfLinkerForBinary.cpp:9: ../llvm-project/llvm/tools/dsymutil/RelocationMap.h:76:7: note: ‘RelocationMap’ declared here as ‘class llvm::dsymutil::RelocationMap’ 76 \| class RelocationMap { \| ^~~~~~~~~~~~~	2023-10-24 11:54:18 -07:00
Alpha Abdoulaye	122c89b271	[dsymutil] Add support for mergeable libraries This adds support in dsymutil for mergeable libraries [1]. dsymutil reads a new stab emitted by ld, allowing it to operate on dynamic libraries instead of object files. It also now loads the DWARF files associated to the libraries, and build the debug map for each binary from the list of symbols exported by the library. For each Debug Map Object, there is a new associated Relocation Map which is serialized from the information retrieved in the original debug_info (or debug_addr) section of the .o file. The final DWARF file has multiple compile units, so the offsets information of the relocations are adjusted relatively to the compile unit they will end up belonging to, inside the final linked DWARF file. [1] https://developer.apple.com/documentation/xcode/configuring-your-project-to-use-mergeable-libraries Differential revision: https://reviews.llvm.org/D158124	2023-10-24 10:39:08 -07:00
Brad Smith	a63dc79d11	[Clang][OHOS] Keep ARM ABI selection logic in sync between Clang and LLVM (#68656 )	2023-10-22 08:48:41 +03:00
Dominik Wójt	b3c4f64943	[ARM] fix "+fp.dp" in multilib selection (#67412 ) When the FPU was selected with "+(no)fp(.dp)" extensions in "-march" or "-mcpu" options, the FPU used for multilib selection was still the default one for given architecture or CPU.	2023-10-19 11:27:33 +01:00
Freddy Ye	278e533ee9	[X86] Support -march=pantherlake,clearwaterforest (#69277 )	2023-10-19 15:11:15 +08:00
Phoebe Wang	cfbf0a500f	[X86][RFC] Support AVX10 options (#67278 ) AVX10 Architecture Specification: https://cdrdv2.intel.com/v1/dl/getContent/784267 AVX10 Technical Paper: https://cdrdv2.intel.com/v1/dl/getContent/784343 RFC: https://discourse.llvm.org/t/rfc-design-for-avx10-options-support/73672	2023-10-19 07:52:50 +08:00
Ivan Kosarev	096eba148d	[TargetParser][AMDGPU] Fix getArchEntry(). (#69222 ) It's supposed to return null when an unknown target id is passed.	2023-10-17 14:54:29 +01:00
Freddy Ye	819ac45d1c	[X86] Add USER_MSR instructions. (#68944 ) For more details about this instruction, please refer to the latest ISE document: https://www.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html	2023-10-16 10:12:53 +08:00
Brad Smith	7cfe32d4d8	[Driver] Hook up Haiku ARM support (#67222 )	2023-10-09 00:49:53 -04:00
prabhukr	720e3bacbd	[Basic] Support 64-bit x86 target for UEFI Adding support for X86_64 UEFI target to begin with. Reviewed By: phosek, MaskRay Differential Revision: https://reviews.llvm.org/D152206	2023-09-28 11:36:22 -07:00
prabhukr	d5ccbaff98	Revert "[Basic] Support 64-bit x86 target for UEFI" This reverts commit 315a407086b0ab302d0293b720d7f9b3e8f6ffa9. The new test added fails to link the unit tests correctly and breaks certain buildbots.	2023-09-28 09:01:30 -07:00
prabhukr	315a407086	[Basic] Support 64-bit x86 target for UEFI Adding support for X86_64 UEFI target to begin with. Reviewed By: phosek, MaskRay Differential Revision: https://reviews.llvm.org/D152206	2023-09-27 20:23:11 -07:00

1 2 3

144 Commits