llvm-project

Author	SHA1	Message	Date
Rainer Orth	06e0a32178	[CMake] Fix BUILD_SHARED_LIBS build on Solaris LLVM currently doesn't build with `-DBUILD_SHARED_LIBS=ON` on Solaris: `libLLVMTargetParser.so` uses `libkstat` functions without linking it. Tested on `amd64-pc-solaris2.11` and `sparcv9-sun-solaris2.11`. Differential Revision: <https://reviews.llvm.org/D158846	2023-09-15 10:00:49 +02:00
David Spickett	99594ba30a	[clang][ARM] Enable --print-supported-extensions for ARM (#66083 ) ``` $ ./bin/clang --target=arm-linux-gnueabihf --print-supported-extensions <...> All available -march extensions for ARM crc crypto sha2 aes dotprod <...> ``` This follows the format set by RISC-V and AArch64. As for AArch64, ARM doesn't have versioned extensions like RISC-V does. So there is only 1 column, which contains the name. Any extension without a "feature" is hidden as these cannot be used with -march.	2023-09-13 10:10:57 +01:00
hassnaaHamdi	491a1cd09e	[AArch64]: Refactor target parser to use Bitset. (#65423 ) Use Bitset instead of BitMasking for the Architecture Extensions, as the number of extensions will exceed the bitmask bits eventually.	2023-09-12 14:54:33 +01:00
Fangrui Song	2bdf5aa5df	[Driver] Properly report error for unsupported powerpc darwin/macos triples The removal started at https://reviews.llvm.org/D50989 and https://reviews.llvm.org/D75494 removed the Triple support. Without recognizing Darwin triples as Mach-O, we will get assertion error in ToolChains/Darwin.h due to the universal binary mechanism. Fix #47698 --- This requires fixing many misuses of llc -march= and llvm-mc -arch= ( commits 806761a7629df268c8aed49657aeccffa6bca449 and 252c42354eca54274ed7b10c32c73c6937478e8b).	2023-09-11 18:53:51 -07:00
Nico Weber	cc2013061e	Revert "[Driver] Properly report error for unsupported powerpc darwin/macos triples" This reverts commit 9f77facfce3ca23213c1de2e3e4c969b5187e29d. The change unintentionally changed lots of codegen, see https://github.com/llvm/llvm-project/issues/47698#issuecomment-1714548640 Also revert a follow-up: This reverts commit b40a5bead2cb95c90ecd8c0fa566722e6133e01c.	2023-09-11 14:08:59 -07:00
Nathan Gauër	53b6a169e4	[SPIR-V] Add SPIR-V logical triple. Clang implements SPIR-V with both Physical32 and Physical64 addressing models. This commit adds a new triple value for the Logical addressing model. Differential Revision: https://reviews.llvm.org/D155978	2023-09-11 10:15:24 +02:00
David Spickett	90db4193f8	[clang][AArch64] Add --print-supported-extensions support (#65466 ) This follows the RISC-V work done in 4b40ced4e5ba10b841516b3970e7699ba8ded572. This uses AArch64's target parser instead. We just list the names, without the "+" on them, which matches RISC-V's format. ``` $ ./bin/clang -target aarch64-linux-gnu --print-supported-extensions clang version 18.0.0 (https://github.com/llvm/llvm-project.git 154da8aec20719c82235a6957aa6e461f5a5e030) Target: aarch64-unknown-linux-gnu Thread model: posix InstalledDir: <...> All available -march extensions for AArch64 aes b16b16 bf16 brbe crc crypto cssc <...> ``` Since our extensions don't have versions in the same way there's just one column with the name in. Any extension without a feature name (including the special "none") is not listed as those cannot be passed to -march, they're just for the backend. For example the MTE extension can be added with "+memtag" but MTE2 and MTE3 do not have feature names so they cannot be added to -march. This does not attempt to tackle the fact that clang allows invalid combinations of AArch64 extensions, it simply lists the possible options. It's still up to the user to ask for something sensible. Equally, this has no context of what CPU is being selected. Neither does the RISC-V option, the user has to be aware of that. I've added a target parser test, and a high level clang test that checks RISC-V and AArch64 work and that Intel, that doesn't support this, shows the correct error.	2023-09-11 08:25:02 +01:00
Fangrui Song	9f77facfce	[Driver] Properly report error for unsupported powerpc darwin/macos triples The removal started at https://reviews.llvm.org/D50989 and https://reviews.llvm.org/D75494 removed the Triple support. Without recognizing Darwin triples as Mach-O, we will get assertion error in ToolChains/Darwin.h due to the universal binary mechanism. Fix #47698	2023-09-10 13:06:27 -07:00
Phoebe Wang	24194090e1	[X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask instructions for AVX512 features This is an alternative of D157485 and a pre-feature to support AVX10. AVX10 Architecture Specification: https://cdrdv2.intel.com/v1/dl/getContent/784267 AVX10 Technical Paper: https://cdrdv2.intel.com/v1/dl/getContent/784343 RFC: https://discourse.llvm.org/t/rfc-design-for-avx10-feature-support/72661 Based on the feedbacks from LLVM and GCC community, we have agreed to start from supporting `-m[no-]evex512` on existing AVX512 features. The option `-mno-evex512` can be used with `-mavx512xxx` to build binaries that can run on both legacy AVX512 targets and AVX10-256. There're still arguments about what's the expected behavior when this option as well as `-mavx512xxx` used together with `-mavx10.1-256`. We decided to defer the support of `-mavx10.1` after we made consensus. Or furthermore, we start from supporting AVX10.2 and not providing any AVX10.1 options. Reviewed By: RKSimon, skan Differential Revision: https://reviews.llvm.org/D159250	2023-09-08 22:47:22 +08:00
Phoebe Wang	0856efbf88	Revert "[X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask instructions for AVX512 features" This reverts commit 7dd48cc24de2d54d40527432cbee8a9d97a8a4f7. Causing buildbot failure.	2023-09-07 21:59:01 +08:00
Phoebe Wang	7dd48cc24d	[X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask instructions for AVX512 features This is an alternative of D157485 and a pre-feature to support AVX10. AVX10 Architecture Specification: https://cdrdv2.intel.com/v1/dl/getContent/784267 AVX10 Technical Paper: https://cdrdv2.intel.com/v1/dl/getContent/784343 RFC: https://discourse.llvm.org/t/rfc-design-for-avx10-feature-support/72661 Based on the feedbacks from LLVM and GCC community, we have agreed to start from supporting `-m[no-]evex512` on existing AVX512 features. The option `-mno-evex512` can be used with `-mavx512xxx` to build binaries that can run on both legacy AVX512 targets and AVX10-256. There're still arguments about what's the expected behavior when this option as well as `-mavx512xxx` used together with `-mavx10.1-256`. We decided to defer the support of `-mavx10.1` after we made consensus. Or furthermore, we start from supporting AVX10.2 and not providing any AVX10.1 options. Reviewed By: RKSimon, skan Differential Revision: https://reviews.llvm.org/D159250	2023-09-07 21:38:35 +08:00
Brad Smith	5165593a97	Delete CloudABI support After this D108637 and with FreeBSD -current and now 14 dropping support for CloudABI I think it is time to consider deleting the CloudABI support. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D158920	2023-08-29 22:57:30 -04:00
Craig Topper	5487b67caf	[X86] Merge FeatureInfos_WithPLUS and FeatureInfos. NFC Store the string with the '+' in FeatureInfos. Drop the '+' at runtime for the users that don't want it. Reviewed By: RKSimon, FreddyYe Differential Revision: https://reviews.llvm.org/D158814	2023-08-27 22:39:44 -07:00
Brad Smith	2a105105a6	Delete Ananas support After looking at this further I think the Ananas support should be removed. They stopped using Clang. There have never been any releases either; as in source only, and the backend is not maintained. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D158946	2023-08-27 18:43:23 -04:00
Fangrui Song	27da15381c	[X86] __builtin_cpu_supports: support x86-64{,-v2,-v3,-v4} GCC 12 (https://gcc.gnu.org/PR101696) allows __builtin_cpu_supports("x86-64") (and -v2 -v3 -v4). This patch ports the feature. * Add `FEATURE_X86_64_{BASELINE,V2,V3,V4}` to enum ProcessorFeatures, but keep CPU_FEATURE_MAX unchanged to make FeatureInfos/FeatureInfos_WithPLUS happy. * Change validateCpuSupports to allow `x86-64{,-v2,-v3,-v4}` * Change getCpuSupportsMask to return `std::array<uint32_t, 4>` where `x86-64{,-v2,-v3,-v4}` set bits `FEATURE_X86_64_{BASELINE,V2,V3,V4}`. * `target("x86-64")` and `cpu_dispatch(x86_64)` are invalid. Tested by commit 9de3b35ac9159d5bae6e6796cb91e4f877a07189 Close https://github.com/llvm/llvm-project/issues/59961 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D158811	2023-08-25 20:56:25 -07:00
Brad Smith	24eaf7858b	Cleanup remaining bits for Minix, Contiki and Myriad Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D158806	2023-08-25 16:37:55 -04:00
Yaxun (Sam) Liu	b8a9c50f22	[AMDGPU] Add target feature gws to clang Reviewed by: Matt Arsenault Differential Revision: https://reviews.llvm.org/D158367	2023-08-25 11:50:47 -04:00
Fangrui Song	a208b68401	[X86TargetParser] Simplify X86_FEATURE_COMPAT assert. NFC	2023-08-24 21:31:39 -07:00
Fangrui Song	7a41af8604	[X86] Support arch=x86-64{,-v2,-v3,-v4} for target_clones attribute GCC 12 (https://gcc.gnu.org/PR101696) allows `arch=x86-64` `arch=x86-64-v2` `arch=x86-64-v3` `arch=x86-64-v4` in the target_clones function attribute. This patch ports the feature. * Set KeyFeature to `x86-64{,-v2,-v3,-v4}` in `Processors[]`, to be used by X86TargetInfo::multiVersionSortPriority * builtins: change `__cpu_features2` to an array like libgcc. Define `FEATURE_X86_64_{BASELINE,V2,V3,V4}` and depended ISA feature bits. * CGBuiltin.cpp: update EmitX86CpuSupports to handle `arch=x86-64*`. Close https://github.com/llvm/llvm-project/issues/55830 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D158329	2023-08-23 22:08:55 -07:00
Craig Topper	4f12c0b7d6	[X86] Remove FeatureBitset from X86TargetParser.cpp. NFC Use the templated Bitset added in D158576. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D158682	2023-08-23 20:49:55 -07:00
Fangrui Song	eff105b1bd	Bitset: remove some GCC<6.2 workarounds related to bitwise operators GCC<6.2 has been unsupported since April 2022 (commit 4c72deb613d9d8838785b431facb3eb480fb2f51). X86TargetParser.cpp has another workaround that the other 2 nearly identical places don't have. Remove them as well. Reviewed By: arsenm, craig.topper Differential Revision: https://reviews.llvm.org/D158687	2023-08-23 18:43:58 -07:00
Freddy Ye	6acff5390d	[X86] Support -march=gracemont gracemont has some different tuning features from alderlake. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D158046	2023-08-21 08:49:01 +08:00
Jon Roelofs	6affdbad93	Remove a reference to rdar://5972456 The details in the radar don't seem all that relevant.	2023-08-09 18:11:28 -07:00
Weining Lu	f62c9252fc	[LoongArch] Support -march=native and -mtune= As described in [1][2], `-mtune=` is used to select the type of target microarchitecture, defaults to the value of `-march`. The set of possible values should be a superset of `-march` values. Currently possible values of `-march=` and `-mtune=` are `native`, `loongarch64` and `la464`. D136146 has supported `-march={loongarch64,la464}` and this patch adds support for `-march=native` and `-mtune=`. A new ProcessorModel called `loongarch64` is defined in LoongArch.td to support `-mtune=loongarch64`. `llvm::sys::getHostCPUName()` returns `generic` on unknown or future LoongArch CPUs, e.g. the not yet added `la664`, leading to `llvm::LoongArch::isValidArchName()` failing to parse the arch name. In this case, use `loongarch64` as the default arch name for 64-bit CPUs. Two preprocessor macros are defined based on user-provided `-march=` and `-mtune=` options and the defaults. - __loongarch_arch - __loongarch_tune Note that, to work with `-fno-integrated-cc1` we leverage cc1 options `-target-cpu` and `-tune-cpu` to pass driver options `-march=` and `-mtune=` respectively because cc1 needs these information to define macros in `LoongArchTargetInfo::getTargetDefines`. [1]: https://github.com/loongson/LoongArch-Documentation/blob/2023.04.20/docs/LoongArch-toolchain-conventions-EN.adoc [2]: https://github.com/loongson/la-softdev-convention/blob/v0.1/la-softdev-convention.adoc Reviewed By: xen0n, wangleiat, steven_wu, MaskRay Differential Revision: https://reviews.llvm.org/D155824	2023-08-09 10:29:50 +08:00
Freddy Ye	dc7c0181ef	[X86] Promote VAES, SHA512, SM4 implied feature to AVX2 Reviewed By: skan Differential Revision: https://reviews.llvm.org/D155662	2023-08-04 10:43:34 +08:00
Craig Topper	2a5e3f4c6c	[X86] Workaround possible CPUID bug in Sandy Bridge. Don't access leaf 7 subleaf 1 unless subleaf 0 says it is supported via EAX. Intel documentation says invalid subleaves return 0. We had been relying on that behavior instead of checking the max sublef number. It appears that some Sandy Bridge CPUs return at least the subleaf 0 EDX value for subleaf 1. Best guess is that this is a bug in a microcode patch since all of the bits we're seeing set in EDX were introduced after Sandy Bridge was originally released. This is causing avxvnniint16 to be incorrectly enabled with -march=native on these CPUs. Reviewed By: pengfei, anna Differential Revision: https://reviews.llvm.org/D156963	2023-08-03 08:12:01 -07:00
Steven Wu	42c9354a92	Revert "Reland "[LoongArch] Support -march=native and -mtune="" This reverts commit c56514f21b2cf08eaa7ac3a57ba4ce403a9c8956. This commit adds global state that is shared between clang driver and clang cc1, which is not correct when clang is used with `-fno-integrated-cc1` option (no integrated cc1). The -march and -mtune option needs to be properly passed through cc1 command-line and stored in TargetInfo.	2023-07-31 16:57:06 -07:00
Freddy Ye	c9d92e6638	[X86] Support -march=arrowlake,arrowlake-s,lunarlake Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D156239	2023-07-28 15:05:54 +08:00
Freddy Ye	cafbcfa086	[X86] Update Model value for Raptor Lake. Reviewed By: pengfei, skan Differential Revision: https://reviews.llvm.org/D156285	2023-07-26 15:33:15 +08:00
Weining Lu	c56514f21b	Reland "[LoongArch] Support -march=native and -mtune=" As described in [1][2], `-mtune=` is used to select the type of target microarchitecture, defaults to the value of `-march`. The set of possible values should be a superset of `-march` values. Currently possible values of `-march=` and `-mtune=` are `native`, `loongarch64` and `la464`. D136146 has supported `-march={loongarch64,la464}` and this patch adds support for `-march=native` and `-mtune=`. A new ProcessorModel called `loongarch64` is defined in LoongArch.td to support `-mtune=loongarch64`. `llvm::sys::getHostCPUName()` returns `generic` on unknown or future LoongArch CPUs, e.g. the not yet added `la664`, leading to `llvm::LoongArch::isValidArchName()` failing to parse the arch name. In this case, use `loongarch64` as the default arch name for 64-bit CPUs. And these two preprocessor macros are defined: - __loongarch_arch - __loongarch_tune [1]: https://github.com/loongson/LoongArch-Documentation/blob/2023.04.20/docs/LoongArch-toolchain-conventions-EN.adoc [2]: https://github.com/loongson/la-softdev-convention/blob/v0.1/la-softdev-convention.adoc Reviewed By: xen0n, wangleiat Differential Revision: https://reviews.llvm.org/D155824	2023-07-26 10:26:38 +08:00
Weining Lu	212d6aa0da	Revert "[LoongArch] Support -march=native and -mtune=" This reverts commit 92c06114b2ea9900a3364fb395988dfb065758f7.	2023-07-25 23:32:15 +08:00
Weining Lu	92c06114b2	[LoongArch] Support -march=native and -mtune= As described in [1][2], `-mtune=` is used to select the type of target microarchitecture, defaults to the value of `-march`. The set of possible values should be a superset of `-march` values. Currently possible values of `-march=` and `-mtune=` are `native`, `loongarch64` and `la464`. D136146 has supported `-march={loongarch64,la464}` and this patch adds support for `-march=native` and `-mtune=`. A new ProcessorModel called `loongarch64` is defined in LoongArch.td to support `-mtune=loongarch64`. `llvm::sys::getHostCPUName()` returns `generic` on unknown or future LoongArch CPUs, e.g. the not yet added `la664`, leading to `llvm::LoongArch::isValidArchName()` failing to parse the arch name. In this case, use `loongarch64` as the default arch name for 64-bit CPUs. And these two preprocessor macros are defined: - __loongarch_arch - __loongarch_tune [1]: https://github.com/loongson/LoongArch-Documentation/blob/2023.04.20/docs/LoongArch-toolchain-conventions-EN.adoc [2]: https://github.com/loongson/la-softdev-convention/blob/v0.1/la-softdev-convention.adoc Differential Revision: https://reviews.llvm.org/D155824	2023-07-25 21:01:51 +08:00
Freddy Ye	6d23a3faa4	[X86] Support -march=graniterapids-d and update -march=graniterapids Reviewed By: pengfei, RKSimon, skan Differential Revision: https://reviews.llvm.org/D155798	2023-07-25 13:48:31 +08:00
Freddy Ye	5cc4b1059b	[X86] Update features for sierraforest, grandridge Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D155784	2023-07-25 11:00:41 +08:00
Freddy Ye	1c154bd755	[X86] Add AVX-VNNI-INT16 instructions. For more details about these instructions, please refer to the latest ISE document: https://www.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewed By: pengfei, skan Differential Revision: https://reviews.llvm.org/D155145	2023-07-20 14:31:16 +08:00
Freddy Ye	049d6a3f42	[X86] Add SM4 instructions. For more details about these instructions, please refer to the latest ISE document: https://www.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewed By: pengfei, skan Differential Revision: https://reviews.llvm.org/D155148	2023-07-20 13:35:15 +08:00
Freddy Ye	c6f66de21a	[X86] Add SM3 instructions. For more details about these instructions, please refer to the latest ISE document: https://www.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D155147	2023-07-20 10:24:16 +08:00
Freddy Ye	fc3b7874b6	[X86] Add SHA512 instructions. For more details about this instruction, please refer to the latest ISE document: https://www.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewed By: RKSimon, skan Differential Revision: https://reviews.llvm.org/D155146	2023-07-20 09:44:44 +08:00
Weining Lu	c4eb880d43	Revert "[LoongArch] Change 'using namespace llvm;' to 'namespace llvm {' in LoongArchTargetParser.cpp. NFC" This reverts commit 586321467a0d48573ae643e837a6c4eaf6bc75db. Reason to revert: These changes are inconsistent with the [[https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions\|LLVM stype guide]].	2023-07-19 13:08:47 +08:00
Weining Lu	586321467a	[LoongArch] Change 'using namespace llvm;' to 'namespace llvm {' in LoongArchTargetParser.cpp. NFC And change 'using namespace llvm::LoongArch' to 'namespace LoongArch {' to simplify the code a little bit.	2023-07-18 16:51:26 +08:00
Weining Lu	ef9421dcf1	[LoongArch] Remove useless 'invalid' and 'none' feature and arch names. NFC	2023-07-18 16:51:23 +08:00
Jay Foad	92542f2a40	[AMDGPU] Add targets gfx1150 and gfx1151 This is the target definition only. Currently they are treated the same as GFX 11.0.x. Differential Revision: https://reviews.llvm.org/D155429	2023-07-17 13:06:12 +01:00
Jon Roelofs	dc078e6eaa	TargetParser: fix getProcessTriple in universal builds The bug happens when you build e.g. an x64_64;arm64 JIT with LLVM_HOST_TRIPLE=x86_64-apple-macos, and then run it on an apple-m1 not under Rosetta. In that case, sys::getProcessTriple() will return an x86_64 triple, not an arm64 one. Differential revision: https://reviews.llvm.org/D138449	2023-07-14 13:44:43 -07:00
Freddy Ye	a10dccf271	[X86] Support some Intel CPUs for cpu_specific/dispatch feature Reviewed By: RKSimon, skan Differential Revision: https://reviews.llvm.org/D154493	2023-07-07 13:47:33 +08:00
Freddy Ye	7717c0071d	[X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling This refactor patch means to remove CPU_SPECIFIC* MACROs in X86TargetParser.def and move those information into ProcInfo of X86TargetParser.cpp. Since these two files both maintain a table with redundant info such as cpuname and its features supported. CPU_SPECIFIC* MACROs define some different information. This patch dealt with them in these ways when moving: 1.mangling This is now moved to Mangling in ProcInfo and directly initialized at array of Processors. CPUs don't support cpu_dispatch/specific are assigned '\0' as mangling. 2.CPU alias The alias cpu will also be initialized in array of Processors, its attributes will be same as its alias target cpu. Same feature list, same mangling. 3.TUNE_NAME Before my change, some cpu names support cpu_dispatch/specific are not supported in X86.td, which means optimizer/backend doesn't recognize them. So they use a different TUNE_NAME to generate in IR. In this patch, I added these missing cpu support at X86.td by utilizing existing Features and XXXTunings, so that each cpu name can directly use its own name as TUNE_NAME to be supported by optimizer/backend. 4.Feature list The feature list of one CPU maintained in X86TargetParser.def is not same as the one in X86TargetParser.cpp. It only maintains part of features of one CPU (features defined by X86_FEATURE_COMPAT). While X86TargetParser.cpp maintains a complete one. This patch abandons the feature list maintained by CPU_SPECIFIC* MACROs because assigning a CPU with a complete one doesn't affect the functionality of cpu_dispatch/specific. Except these four info, since some of CPUs supported by cpu_dispatch/specific doesn's support clang options like -march, -mtune before, this patch also kept this behavior still by adding another member OnlyForCPUDispatchSpecific in ProcInfo. Reviewed By: pengfei, RKSimon Differential Revision: https://reviews.llvm.org/D151696	2023-07-05 17:32:00 +08:00
Freddy Ye	71249fd71b	[NFC][X86] Add missing CPUID related changes for AMX-COMPLEX.	2023-06-30 15:15:37 +08:00
Freddy Ye	a9256a2e04	[x86] Add missing FeatureCMOV in frontend targets. The missing info is gathered from X86.td. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D154181	2023-06-30 13:19:15 +08:00
Job Noorman	8de9f2b558	Move SubtargetFeature.h from MC to TargetParser SubtargetFeature.h is currently part of MC while it doesn't depend on anything in MC. Since some LLVM components might have the need to work with target features without necessarily needing MC, it might be worthwhile to move SubtargetFeature.h to a different location. This will reduce the dependencies of said components. Note that I choose TargetParser as the destination because that's where Triple lives and SubtargetFeatures feels related to that. This issues came up during a JITLink review (D149522). JITLink would like to avoid a dependency on MC while still needing to store target features. Reviewed By: MaskRay, arsenm Differential Revision: https://reviews.llvm.org/D150549	2023-06-26 11:20:08 +02:00
Yaxun (Sam) Liu	c0f0d50653	[HIP] emit macro `__HIP_NO_IMAGE_SUPPORT` HIP texture/image support is optional as some devices do not have image instructions. A macro __HIP_NO_IMAGE_SUPPORT is defined for device not supporting images (`d0448aa4c4/docs/reference/kernel_language.md (L426)` ) Currently the macro is defined by HIP header based on predefined macros for GPU, e.g __gfx*__ , which is error prone. This patch let clang emit the predefined macro. Reviewed by: Matt Arsenault, Artem Belevich Differential Revision: https://reviews.llvm.org/D151349	2023-06-14 22:53:41 -04:00
Kazu Hirata	143e131aa7	Do not unnecessarily include StringSwitch.h	2023-06-11 13:19:22 -07:00

1 2

91 Commits