llvm-project

Author	SHA1	Message	Date
Pawan Nirpal	a5ba6067d6	[Clang][NFC] Use Hex Encoding for Intel CPU CPUID family (#153004 ) Use Hex Encoding for CPUID family to match number format with Intel ISE rev.58: https://cdrdv2.intel.com/v1/dl/getContent/671368	2025-08-14 18:36:34 +02:00
Daniel Paoliello	7694856fdd	Fix TargetParserTests for big-endian hosts (#152407 ) The new `sys::detail::getHostCPUNameForARM` for Windows (#151596) was implemented using a C++ bit-field, which caused the associated unit tests to fail on big-endian machines as it assumed a little-endian layout. This change switches from the C++ bit-field to LLVM's `BitField` type instead.	2025-08-06 16:50:28 -07:00
Daniel Paoliello	a418fa7cdc	[win][aarch64] Add support for detecting the Host CPU on Arm64 Windows (#151596 ) Uses the `CP 4000` registry keys under `HKLM\HARDWARE\DESCRIPTION\System\CentralProcessor\*` to get the Implementer and Part, which is then provided to a modified form of `getHostCPUNameForARM` to map to a CPU. On my local Surface Pro 11 `llc --version` reports: ``` > .\build\bin\llc.exe --version LLVM (http://llvm.org/): LLVM version 22.0.0git Optimized build with assertions. Default target: aarch64-pc-windows-msvc Host CPU: oryon-1 ```	2025-08-06 11:39:41 -07:00
Daniel Paoliello	4adce336f4	[win][arm64ec] Fixes to unblock building LLVM and Clang as Arm64EC (#150068 ) These changes allow LLVM and Clang to be built with Clang targeting Arm64EC using the MSVC linker. Built with these options: ``` -DLLVM_ENABLE_PROJECTS="clang" -DLLVM_HOST_TRIPLE=arm64ec-pc-windows-msvc -DCMAKE_C_COMPILER=clang-cl.exe -DCMAKE_C_COMPILER_TARGET=arm64ec-pc-windows-msvc -DCMAKE_CXX_COMPILER=clang-cl.exe -DCMAKE_CXX_COMPILER_TARGET=arm64ec-pc-windows-msvc -DCMAKE_LINKER_TYPE=MSVC ```	2025-07-31 09:30:05 -07:00
Kazu Hirata	96bde11e30	[TargetParser] Remove const from a return type (NFC) (#149255 ) getHostCPUFeatures constructs and returns a temporary instance of StringMap<bool>. We don't need const on the return type.	2025-07-17 07:22:51 -07:00
Elvina Yakubova	69835d8f6d	[clang][AArch64] Parse more features in getHostCPUFeatures (#146323 ) Add parsing of some crypto features to display them properly when -mcpu=native is used	2025-07-09 11:43:08 +01:00
Ricardo Jesus	84e54515bc	[AArch64] Add support for -mcpu=gb10. (#146515 ) This patch adds support for -mcpu=gb10 (NVIDIA GB10). This is a big.LITTLE cluster of Cortex-X925 and Cortex-A725 cores. The appropriate MIDR numbers are added to detect them in -mcpu=native. We did not add an -mcpu=cortex-x925.cortex-a725 option because GB10 does include the crypto instructions which we want on by default, and the current convention is to not enable such extensions for Arm Cortex cores in -mcpu where they are optional in the IP. Relevant GCC patch: https://gcc.gnu.org/pipermail/gcc-patches/2025-June/687005.html	2025-07-07 11:14:26 +01:00
Kazu Hirata	29b2b2263f	[TargetParser] Use StringRef::consume_front (NFC) (#147202 ) While we are at it, this patch switches to a range-based for loop.	2025-07-06 19:05:45 -07:00
Pengcheng Wang	ce621041c2	[RISCV] Get host CPU name via hwprobe (#142745 ) We can get the `mvendorid/marchid/mimpid` via hwprobe and then we can compare these IDs with those defined in processors to find the CPU name. With this change, `-mcpu/-mtune=native` can set the proper name.	2025-06-12 16:39:57 +08:00
Ties Stuij	269f5fe91e	[AARCH64] Add support for Cortex-A320 (#139055 ) This patch adds initial support for the recently announced Armv9 Cortex-A320 processor. For more information, including the Technical Reference Manual, see: https://developer.arm.com/Processors/Cortex-A320 --------- Co-authored-by: Oliver Stannard <oliver.stannard@arm.com>	2025-05-09 16:24:48 +01:00
Ulrich Weigand	80267f8148	Support z17 processor name and scheduler description (#135254 ) The recently announced IBM z17 processor implements the architecture already supported as "arch15" in LLVM. This patch adds support for "z17" as an alternate architecture name for arch15. This patch also add the scheduler description for the z17 processor, provided by Jonas Paulsson.	2025-04-11 00:20:58 +02:00
Ricardo Jesus	847e46ca01	[AArch64] Add initial support for -mcpu=olympus. (#132368 ) This patch adds support for the NVIDIA Olympus core. This does not add any special tuning decisions, and those may come later.	2025-03-25 08:09:04 +00:00
Craig Topper	5dc815503f	[RISCV] Add ESWIN EIC770X (SiFive P550) to getHostCPUNameForRISCV. (#125277 ) This enables -mcpu=native for the HiFive Premier P550 board.	2025-01-31 17:12:34 -08:00
tangaac	19834b4623	[LoongArch] Support sc.q instruction for 128bit cmpxchg operation (#116771 ) Two options for clang -mno-scq: Disable sc.q instruction. -mscq: Enable sc.q instruction. The default is -mno-scq.	2025-01-23 12:11:07 +08:00
Ulrich Weigand	8424bf207e	[SystemZ] Add support for new cpu architecture - arch15 This patch adds support for the next-generation arch15 CPU architecture to the SystemZ backend. This includes: - Basic support for the new processor and its features. - Detection of arch15 as host processor. - Assembler/disassembler support for new instructions. - Exploitation of new instructions for code generation. - New vector (signed\|unsigned\|bool) __int128 data types. - New LLVM intrinsics for certain new instructions. - Support for low-level builtins mapped to new LLVM intrinsics. - New high-level intrinsics in vecintrin.h. - Indicate support by defining __VEC__ == 10305. Note: No currently available Z system supports the arch15 architecture. Once new systems become available, the official system name will be added as supported -march name.	2025-01-20 19:30:21 +01:00
Mads Marquart	a082cc145f	Add Apple M4 host detection (#117530 ) Add Apple M4 host detection, which fixes https://github.com/rust-lang/rust/issues/133414. Also add support for older ARM families (this is likely never going to get used, since only macOS is officially supported as host OS, but nice to have for completeness sake). Error handling (checking `CPUFAMILY_UNKNOWN`) is also included here. Finally, add links to extra documentation to make it easier for others to update this in the future. NOTE: These values are taken from `mach/machine.h` the Xcode 16.2 SDK, and has been confirmed on an M4 Max in https://github.com/rust-lang/rust/issues/133414#issuecomment-2499123337.	2025-01-16 08:15:12 -08:00
Craig Topper	fd38a95586	[TargetParser] Use StringRef::split that takes a char separator instead of StringRef separator. NFC	2025-01-04 12:31:46 -08:00
pcc	38eaea73ca	TargetParser: AArch64: Add part numbers for Apple CPUs. Part numbers taken from: https://github.com/AsahiLinux/m1n1/blob/main/src/chickens.c Reviewers: ahmedbougacha, jroelofs Reviewed By: jroelofs Pull Request: https://github.com/llvm/llvm-project/pull/119777	2024-12-12 16:52:58 -08:00
Kinoshita Kotaro	a1197a2ca8	[AArch64] Add initial support for FUJITSU-MONAKA (#118432 ) This patch adds initial support for FUJITSU-MONAKA CPU (-mcpu=fujitsu-monaka). The scheduling model will be corrected in the future.	2024-12-09 09:56:02 +09:00
Phoebe Wang	a63931292b	[X86] Fix typo of gracemont (#118486 )	2024-12-03 20:56:52 +08:00
Phoebe Wang	3348b4688f	[X86][compiler-rt] Split CPU names even they have the same subtype (#118237 ) Fixes: #118205	2024-12-02 18:51:19 +08:00
tangaac	427be07675	[LoongArch] Support amcas[_db].{b/h/w/d} instructions. (#114189 ) Two options for clang: -mlamcas & -mno-lamcas. Enable or disable amcas[_db].{b/h} instructions. The default is -mno-lamcas. Only works on LoongArch64.	2024-11-27 17:36:13 +08:00
tangaac	f4379db496	[LoongArch] Support LA V1.1 feature that div.w[u] and mod.w[u] instructions with inputs not signed-extended. (#116764 ) Two options for clang -mdiv32: Use div.w[u] and mod.w[u] instructions with input not sign-extended. -mno-div32: Do not use div.w[u] and mod.w[u] instructions with input not sign-extended. The default is -mno-div32.	2024-11-26 21:57:29 +08:00
Weining Lu	e70f9e2096	[LoongArch] Remove the added in #116762	2024-11-25 09:33:55 +08:00
tangaac	1d4602070f	[LoongArch] Support LA V1.1 feature ld-seq-sa that don't generate dbar 0x700. (#116762 ) Two options for clang -mld-seq-sa: Do not generate load-load barrier instructions (dbar 0x700) -mno-ld-seq-sa: Generate load-load barrier instructions (dbar 0x700) The default is -mno-ld-seq-sa	2024-11-22 17:34:15 +08:00
Freddy Ye	97836bed63	Reland "[X86] Support -march=diamondrapids (#113881 )" (#116564 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-11-18 10:40:32 +08:00
Freddy Ye	90e92239bd	Revert "[X86] Support -march=diamondrapids (#113881 )" (#116563 ) This reverts commit 826b845c9e97448395431be3e4e5da585bd98c5e.	2024-11-18 08:45:28 +08:00
Freddy Ye	826b845c9e	[X86] Support -march=diamondrapids (#113881 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-11-18 08:31:17 +08:00
tangaac	2283d50447	[LoongArch] add la v1.1 features for sys::getHostCPUFeatures (#115832 ) Two features (i.e. `frecipe` and `lam-bh`) are added to `sys.getHostCPUFeatures`. More features will be added in future. In addition, this patch adds the features returned by `sys.getHostCPUFeature` when `-march=native`.	2024-11-14 11:25:32 +08:00
Elvina Yakubova	133f8fa233	Reland [clang][AArch64] Add getHostCPUFeatures to query for enabled f… (#115467 ) …eatures in cpu info Relands #97749. Fixed test by adding additional checks for system linux and target == host.	2024-11-13 09:10:56 +00:00
Malay Sanghi	f77101ea79	[X86][AMX] Support AMX-MOVRS (#115151 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-11-12 15:05:43 +08:00
Feng Zou	eddb79d56d	[X86][AMX] Support AMX-TF32 (#115625 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-11-11 15:24:18 +08:00
Phoebe Wang	8f4401374c	Reland "[X86][AMX] Support AMX-AVX512" (#115581 ) Resolve compile fail without SSE2.	2024-11-09 13:26:10 +08:00
Alan Zhao	ff22515430	Revert "[X86][AMX] Support AMX-AVX512" (#115570 ) Reverts llvm/llvm-project#114070 Reason: Causes `immintrin.h` to fail to compile if `-msse` and `-mno-sse2` are passed to clang: https://github.com/llvm/llvm-project/pull/114070#issuecomment-2465926700	2024-11-08 16:15:02 -08:00
Phoebe Wang	58a17e1bbc	[X86][AMX] Support AMX-AVX512 (#114070 )	2024-11-08 16:25:16 +08:00
Phoebe Wang	c72a751dab	[X86][AMX] Support AMX-TRANSPOSE (#113532 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-11-01 16:45:03 +08:00
Feng Zou	8127162427	[X86][AMX] Support AMX-FP8 (#113850 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-10-31 10:14:25 +08:00
Elvina Yakubova	80a09735ac	Revert "[clang][AArch64] Add getHostCPUFeatures to query for enabled … (#114066 ) …features in cpu info (#97749)" This reverts commit d732c0b13c55259177f2936516b6087d634078e0. This is breaking buildbots https://lab.llvm.org/buildbot/#/builders/190/builds/8413, https://lab.llvm.org/buildbot/#/builders/56/builds/10880 and a few others.	2024-10-29 14:43:01 +00:00
neildhickey	d732c0b13c	[clang][AArch64] Add getHostCPUFeatures to query for enabled features in cpu info (#97749 ) Add getHostCPUFeatures into the AArch64 Target Parser to query the cpuinfo for the device in the case where we are compiling with -mcpu=native. Add LLVM_CPUINFO environment variable to test mock /proc/cpuinfo files for -mcpu=native Co-authored-by: Elvina Yakubova <eyakubova@nvidia.com>	2024-10-29 13:34:43 +00:00
Freddy Ye	c4248fa3ed	[X86] Support MOVRS and AVX10.2 instructions. (#113274 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-10-25 09:00:19 +08:00
Freddy Ye	9e3d4653af	[X86] Update Model value for Arrow Lake. (#113273 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-10-23 09:44:26 +08:00
Albert Huang	aa2c0f35a1	[ARM] [AArch32] Add support for Arm China STAR-MC1 CPU (#110085 ) STAR-MC1 is an Armv8m CPU. Technical specifications available at: https://www.armchina.com/download/Documents/Application-Notes/Technical-Reference-Manual?infoId=160	2024-10-14 15:48:12 +01:00
RipleyTom	c5f7a32356	[X86] Add AMD Llano family detection (#111312 ) Very simple one liner, adds the missing detection for the Llano family which is essentially a refreshed K10: Documentation of the family id: https://en.wikichip.org/wiki/amd/cpuid#Family_18_.2812h.29 Documentation that it fits into amdfam10: https://en.wikipedia.org/wiki/AMD_10h#12h	2024-10-07 08:33:26 -07:00
Yingwei Zheng	bf895c714e	[RISCV] Bump hwprobe support to Linux 6.11 (#108578 ) This patch is the follow-up of https://github.com/llvm/llvm-project/pull/94352 with some updates: 1. Add support for more extensions for `zve`, `zimop`, `zc`, `zcmop` and `zawrs`. 2. Use `RISCV_HWPROBE_KEY_MISALIGNED_SCALAR_PERF` to check whether the processor supports fast misaligned scalar memory access. https://github.com/llvm/llvm-project/pull/108551 reminds me that the patch https://lore.kernel.org/all/20240809214444.3257596-1-evan@rivosinc.com/T/ has been merged. Address comment https://github.com/llvm/llvm-project/pull/94352#discussion_r1626056015. References: 1. constants: https://github.com/torvalds/linux/blame/v6.11-rc7/arch/riscv/include/uapi/asm/hwprobe.h 2. https://docs.kernel.org/arch/riscv/hwprobe.html 3. Related commits: 1. `zve` support: `de8f8282a9` 2. `zimop` support: `36f8960de8` 3. `zc` support: `0ad70db5eb` 4. `zcmop` support: `fc078ea317` 5. `zawrs` support: `244c18fbf6` 6. scalar misaligned perf: `c42e2f0767` and `1f5288874d`	2024-10-05 11:00:09 +08:00
Jubilee	b8028f6b87	[TargetParser][AArch64] Believe runtime feature detection (#95694 ) In https://github.com/llvm/llvm-project/issues/90365 it was reported that TargetParser arrives at the wrong conclusion regarding what features are enabled when attempting to detect "native" features on the Raspberry Pi 4, because it (correctly) detects it as a Cortex-A72, but LLVM (incorrectly) believes all Cortex-A72s have crypto enabled. Attempt to help ourselves by allowing runtime information derived from the host to contradict whatever we believe is "true" about the architecture.	2024-10-02 08:28:57 +01:00
Alex Bradbury	614aeda93b	[RISCV] Mark Zacas as non-experimental (#109651 ) The extension has been ratified for some time, but we kept it experimental (see #99898) due to <https://github.com/riscv-non-isa/riscv-elf-psabi-doc/issues/444>. The ABI issue has been resolved by #101023 so I believe there's no known barrier to moving Zacas to non-experimental.	2024-09-25 06:14:43 +01:00
Ganesh	02e4186d0b	[X86] AMD Zen 5 Initial enablement (#107964 ) This patch enables the basic skeleton enablement of AMD next gen zen5 CPUs.	2024-09-13 17:45:33 +01:00
Phoebe Wang	259ca9ee9c	Reland "[X86][AVX10.2] Support AVX10.2 option and VMPSADBW/VADDP[D,H,S] new instructions (#101452 )" (#101616 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/828965	2024-08-03 09:26:07 +08:00
Phoebe Wang	2e0588d5e1	Revert "[X86][AVX10.2] Support AVX10.2 option and VMPSADBW/VADDP[D,H,S] new instructions" (#101612 ) Reverts llvm/llvm-project#101452 There are several buildbot failed. Revert first.	2024-08-02 13:04:10 +08:00
Phoebe Wang	10bad2c8d7	[X86][AVX10.2] Support AVX10.2 option and VMPSADBW/VADDP[D,H,S] new instructions (#101452 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/828965	2024-08-02 12:10:50 +08:00

1 2 3

108 Commits