llvm-project

Author	SHA1	Message	Date
David Tenty	63195d3d7a	[NFC][CMake] quote ${CMAKE_SYSTEM_NAME} consistently (#154537 ) A CMake change included in CMake 4.0 makes `AIX` into a variable (similar to `APPLE`, etc.) `ff03db6657` However, `${CMAKE_SYSTEM_NAME}` unfortunately also expands exactly to `AIX` and `if` auto-expands variable names in CMake. That means you get a double expansion if you write: `if (${CMAKE_SYSTEM_NAME} MATCHES "AIX")` which becomes: `if (AIX MATCHES "AIX")` which is as if you wrote: `if (ON MATCHES "AIX")` You can prevent this by quoting the expansion of "${CMAKE_SYSTEM_NAME}", due to policy [CMP0054](https://cmake.org/cmake/help/latest/policy/CMP0054.html#policy:CMP0054) which is on by default in 4.0+. Most of the LLVM CMake already does this, but this PR fixes the remaining cases where we do not.	2025-08-20 12:45:41 -04:00
Phoebe Wang	99a1d5f7fa	[X86][APX] Remove CF feature from APXF and Diamond Rapids (#153751 ) Due to it results in more losses than gains.	2025-08-20 03:07:56 +00:00
Owen Anderson	4f683b10b5	[RISCV] When resolving extension implications, handle the default I/E case after implications are resolved. (#154353 ) This is needed to handle the scenario of an extension that implies FeatureStdExtE, as is the case for the downstream FeatureVendorXCheriot used for Cheriot support.	2025-08-20 09:59:23 +08:00
Pawan Nirpal	a5ba6067d6	[Clang][NFC] Use Hex Encoding for Intel CPU CPUID family (#153004 ) Use Hex Encoding for CPUID family to match number format with Intel ISE rev.58: https://cdrdv2.intel.com/v1/dl/getContent/671368	2025-08-14 18:36:34 +02:00
Andrei Safronov	48da8489f2	[Xtensa] Add esp32/esp8266 cpus implementation. (#152409 ) Add Xtensa esp32 and esp8266 cpus. Implement target parser to recognise Xtensa hardware features.	2025-08-12 15:17:36 +03:00
Daniel Paoliello	7694856fdd	Fix TargetParserTests for big-endian hosts (#152407 ) The new `sys::detail::getHostCPUNameForARM` for Windows (#151596) was implemented using a C++ bit-field, which caused the associated unit tests to fail on big-endian machines as it assumed a little-endian layout. This change switches from the C++ bit-field to LLVM's `BitField` type instead.	2025-08-06 16:50:28 -07:00
Daniel Paoliello	a418fa7cdc	[win][aarch64] Add support for detecting the Host CPU on Arm64 Windows (#151596 ) Uses the `CP 4000` registry keys under `HKLM\HARDWARE\DESCRIPTION\System\CentralProcessor\*` to get the Implementer and Part, which is then provided to a modified form of `getHostCPUNameForARM` to map to a CPU. On my local Surface Pro 11 `llc --version` reports: ``` > .\build\bin\llc.exe --version LLVM (http://llvm.org/): LLVM version 22.0.0git Optimized build with assertions. Default target: aarch64-pc-windows-msvc Host CPU: oryon-1 ```	2025-08-06 11:39:41 -07:00
zGoldthorpe	d7074b63ed	[Clang][AMDGPU] Add builtins for some buffer resource atomics (#149216 ) This patch exposes builtins for atomic `add`, `max`, and `min` operations that operate over buffer resource pointers.	2025-08-05 11:04:15 -06:00
Alex Voicu	06458fff87	[AMDGCNSPIRV][NFC] Add missing target features to AMDGCNSPIRV (#152057 ) `gfx1250` bring-up omitted updating the `amdgcnspirv` feature list, this fixes that oversight.	2025-08-05 15:29:48 +01:00
Matt Arsenault	e848959cb5	ARM: Remove CPU from computeTargetABI (#151983 ) The target CPU is a subtarget / function level concept, which should not influence the module level ABI decisions. No tests fail so it appears nothing is relying on this.	2025-08-05 10:21:50 +09:00
Stanislav Mekhanoshin	0988510ad4	[AMDGPU] gfx1250 v_perm_pk16_* instructions (#151773 )	2025-08-01 20:12:35 -07:00
Daniel Paoliello	4adce336f4	[win][arm64ec] Fixes to unblock building LLVM and Clang as Arm64EC (#150068 ) These changes allow LLVM and Clang to be built with Clang targeting Arm64EC using the MSVC linker. Built with these options: ``` -DLLVM_ENABLE_PROJECTS="clang" -DLLVM_HOST_TRIPLE=arm64ec-pc-windows-msvc -DCMAKE_C_COMPILER=clang-cl.exe -DCMAKE_C_COMPILER_TARGET=arm64ec-pc-windows-msvc -DCMAKE_CXX_COMPILER=clang-cl.exe -DCMAKE_CXX_COMPILER_TARGET=arm64ec-pc-windows-msvc -DCMAKE_LINKER_TYPE=MSVC ```	2025-07-31 09:30:05 -07:00
Stanislav Mekhanoshin	62187a60e6	[AMDGPU] Add gfx1250 v_cvt_sr_pk_bf16_f32 instruction (#151385 )	2025-07-30 14:02:03 -07:00
satyanarayana reddy janga	c03b0dd9f4	Add MTIA and META to triple (#150236 ) Ref: https://ai.meta.com/blog/next-generation-meta-training-inference-accelerator-AI-MTIA/ This PR contains 1. MTIA: Meta Training and Inference Accelerator as Environment. 2. Meta as the vendor. ### Testing Added a unittest for the relevant changes ### Reviewers @clayborg , @jeffreytan81 , @Jlalond	2025-07-28 10:03:20 -07:00
Kazu Hirata	83cb8b7bef	[llvm] Use a range-based for loop instead of {std,llvm}::for_each (NFC) (#150841 ) LLVM Coding Standards discourages {std,llvm}::for_each unless we already have a callable.	2025-07-27 10:43:52 -07:00
Stanislav Mekhanoshin	9deb7f6062	[AMDGPU] gfx1250 vmem prefetch target intrinsics and builtins (#150466 )	2025-07-24 12:13:59 -07:00
Kazu Hirata	3e53d4d386	[llvm] Remove unused includes (NFC) (#150265 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-23 15:18:46 -07:00
Alexandros Lamprineas	3ab64c5b29	[NFC][Clang][FMV] Make FMV priority data type future proof. (#150079 ) FMV priority is the returned value of a polymorphic function. On RISC-V and X86 targets a 32-bit value is enough. On AArch64 we currently need 64 bits and we will soon exceed that. APInt seems to be a suitable replacement for uint64_t, presumably with minimal compile time overhead. It allows bit manipulation, comparison and variable bit width.	2025-07-23 10:37:29 +01:00
Phoebe Wang	4fbe88fc46	[NFC] Add parentheses around arithmetic operand (#149489 )	2025-07-18 18:23:46 +08:00
Shilei Tian	7e105fbdbe	[AMDGPU] Add support for `v_tanh_f32` on gfx1250 (#149360 ) Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>	2025-07-17 15:42:35 -04:00
Kazu Hirata	96bde11e30	[TargetParser] Remove const from a return type (NFC) (#149255 ) getHostCPUFeatures constructs and returns a temporary instance of StringMap<bool>. We don't need const on the return type.	2025-07-17 07:22:51 -07:00
Djordje Todorovic	742147ba1b	[llvm-objcopy][libObject] Add RISC-V big-endian support (#146913 ) Add support for big-endian RISC-V ELF files: - Add riscv32be/riscv64be target architectures to Triple - Support elf32-bigriscv and elf64-bigriscv output targets in llvm-objcopy - Update ELFObjectFile to handle BE RISC-V format strings and architecture detection - Add BE RISC-V support to RelocationResolver - Add tests for new functionality This is a subset of a bigger RISC-V big-endian support patch, containing only the llvm-objcopy and libObject changes. Other changes will be added later.	2025-07-17 10:36:31 +02:00
Brad Smith	0d2e11f3e8	Remove Native Client support (#133661 ) Remove the Native Client support now that it has finally reached end of life.	2025-07-15 13:22:33 -04:00
Shilei Tian	d7ec80c897	[AMDGPU] Add support for `v_tanh_bf16` on gfx1250 (#147425 ) Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>	2025-07-14 16:30:18 -04:00
Phoebe Wang	d11fd66959	[X86] Remove WIDEKL feature from Pantherlake and Clearwaterforest (#148184 ) According to Intel [SDM088](https://cdrdv2.intel.com/v1/dl/getContent/671200) Volume 1, Chapter 2.4, Key Locker will be removed from 2025 onwards products.	2025-07-11 21:27:42 +08:00
Elvina Yakubova	69835d8f6d	[clang][AArch64] Parse more features in getHostCPUFeatures (#146323 ) Add parsing of some crypto features to display them properly when -mcpu=native is used	2025-07-09 11:43:08 +01:00
Shilei Tian	d258457d42	[AMDGPU] Add support for `v_cvt_f32_fp8` on gfx1250 (#147579 ) Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>	2025-07-08 16:21:24 -04:00
Tony Varghese	b4b150f8c9	[PowerPC][clang] Fix triple constructor ambiguity causing "unknown" target triple on AIX (#147488 ) PR #145685 introduced constructor overload ambiguity in the Triple class, causing `updateTripleOSVersion()` to construct Triple objects with `unknown` instead of the configured target triple (e.g., `powerpc-ibm-aix7.3.0.0`). This results in Clang driver errors like `error: unknown target triple 'unknown'`. Used `Twine` constructor with braced initialization to bypass ambiguity. --------- Co-authored-by: Tony Varghese <tony.varghese@ibm.com> Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2025-07-08 22:45:56 +05:30
Elvina Yakubova	bd6e9047dd	[LLVM][AArch64] Relax SVE codegen predicates for sm4 instructions (#147524 ) Adds sve-sm4 to reference FEAT_SVE_SM4 without specifically enabling SVE2.	2025-07-08 17:04:21 +01:00
Jim Lin	899a11ae32	[Triple][M68k] Add missing handling for target m68k in getDefaultExceptionHandling. (#147492 ) I encountered the assertion failure `Assertion TmpAsmInfo->getExceptionHandlingType() == getTargetTriple().getDefaultExceptionHandling() && "MCAsmInfo and Triple disagree on default exception handling type"' failed`.	2025-07-08 17:29:19 +08:00
Matt Arsenault	1121034dd1	Triple: Record default exception handling type Currently the default exception handling type is scattered across the backends in MCAsmInfo constructors. Allow this to be computed from the triple so the IR can centrally determine the set of ABI calls. Manually submitting, closes #147225	2025-07-08 00:13:19 +09:00
Ricardo Jesus	84e54515bc	[AArch64] Add support for -mcpu=gb10. (#146515 ) This patch adds support for -mcpu=gb10 (NVIDIA GB10). This is a big.LITTLE cluster of Cortex-X925 and Cortex-A725 cores. The appropriate MIDR numbers are added to detect them in -mcpu=native. We did not add an -mcpu=cortex-x925.cortex-a725 option because GB10 does include the crypto instructions which we want on by default, and the current convention is to not enable such extensions for Arm Cortex cores in -mcpu where they are optional in the IP. Relevant GCC patch: https://gcc.gnu.org/pipermail/gcc-patches/2025-June/687005.html	2025-07-07 11:14:26 +01:00
Kazu Hirata	29b2b2263f	[TargetParser] Use StringRef::consume_front (NFC) (#147202 ) While we are at it, this patch switches to a range-based for loop.	2025-07-06 19:05:45 -07:00
Kazu Hirata	f90025ebd9	[llvm] Compare std::optional<T> to values directly (NFC) (#146222 ) This patch transforms: X && *X == Y to: X == Y where X is of std::optional<T>, and Y is of T or similar.	2025-06-28 13:04:16 -07:00
Changpeng Fang	4729242878	AMDGPU: Add MC layer support for load transpose instructions for gfx1250 (#146024 ) Co-authored with @jayfoad	2025-06-26 22:30:31 -07:00
Paul Walker	635acfbfca	[LLVM][AArch64] Relax SVE/SME codegen predicates for crypto and bitperm instructions. (#145696 ) Adds sve-sha3 to reference FEAT_SVE_SHA3 without specifically enabling SVE2. The SVE2 requirement for AES, SHA3 and Bitperm is replaced with SVE for non-streaming function.	2025-06-26 13:01:07 +01:00
Matt Arsenault	fff720d641	Triple: Forward declare Twine and remove include (#145685 )	2025-06-26 15:26:04 +09:00
Matt Arsenault	fb6882719a	Triple: Remove workaround for gcc 4.0.3 (#145660 ) Use the Twine version instead of manually building a string	2025-06-25 23:57:56 +09:00
Matt Arsenault	7ff0d28f2e	Triple: Remove redundant member initializers (#145661 ) These are already initialized in the field definitions.	2025-06-25 21:04:07 +09:00
Matt Arsenault	38920964ba	ARM: Move ABI enum from TargetMachine to TargetParser (#144725 ) Consolidate ABI parsing logic in TargetParser where computeDefaultTargetABI is defined, instead of splitting it into the backend. We need the full ABI information computable in RuntimeLibcallsInfo	2025-06-23 15:22:59 +09:00
Stanislav Mekhanoshin	69974658f0	[AMDGPU] Initial support for gfx1250 target. (#144965 ) This is just a stub for now.	2025-06-19 22:52:51 -07:00
Phoebe Wang	5cbed34404	[X86] Remove CLDEMOTE from Arrowlake and later hybrid processors (#144833 ) Decouple Arrowlake from Sierraforest because the later has CLDEMOTE feature.	2025-06-20 08:57:37 +08:00
Karlo Basioli	3de01d07c3	Fix bazel build after #144594 , mark variable as potentially unused (#144910 )	2025-06-19 16:16:03 +01:00
zhijian lin	bf79d4819e	[Reland] [PowerPC] frontend get target feature from backend with cpu name (#144594 ) 1. The PR proceeds with a backend target hook to allow front-ends to determine what target features are available in a compilation based on the CPU name. 2. Fix a backend target feature bug that supports HTM for Power8/9/10/11. However, HTM is only supported on Power8/9 according to the ISA. 3. All target features that are hardcoded in PPC.cpp can be retrieved from the backend target feature. I have double-checked that the hardcoded logic for inferring target features from the CPU in the frontend(PPC.cpp) is the same as in PPC.td. The reland patch addressed the comment https://github.com/llvm/llvm-project/pull/137670#discussion_r2143541120	2025-06-19 09:22:16 -04:00
Phoebe Wang	5875fafdc5	[X86] Remove CLDEMOTE from Alderlake and later hybrid processors (#144662 ) SDM doesn't list any hybrid processors in this feature. Besides, physical machine also reports not supported.	2025-06-19 10:30:47 +08:00
Matthias Braun	b0378e7ca9	[AArch64TargetParser]Fix reconstructFromParsedFeatures ignoring negative features (#142236 ) The `targetFeatureToExtension` function used by reconstructFromParsedFeatures only found positive `+FEATURE` strings, but not negative `-FEATURE` strings. Extend the function to handle both to fix `reconstructFromParsedFeatures`.	2025-06-16 12:55:12 -07:00
Kazu Hirata	84ff1bda29	[RISCV] Use StringRef in a range-based for loop (NFC) (#144243 ) When we iterate over std::vector<std::string>, we can directly assign each element to StringRef. We do not need to go through a separate statement.	2025-06-14 23:23:49 -07:00
Reid Kleckner	cbf27bf711	Revert " [PowerPC] frontend get target feature from backend with cpu name (#137670 )" This reverts commit 9208b343e962b9f1140ee345c0050a3920bdcbf2. TargetParser shouldn't re-run the PPC subtarget tablegen target, it should define its own `-gen-ppc-target-def` rule like all the other targets do in llvm/include/llvm/TargetParser/CMakeLists.txt . One user reported that there are incorrect CMake dependencies after this change, so I will roll this back in the meantime.	2025-06-12 19:56:41 +00:00
zhijian lin	9208b343e9	[PowerPC] frontend get target feature from backend with cpu name (#137670 ) 1. The PR proceeds with a backend target hook to allow front-ends to determine what target features are available in a compilation based on the CPU name. 2. Fix a backend target feature bug that supports HTM for Power8/9/10/11. However, HTM is only supported on Power8/9 according to the ISA. 3. All target features that are hardcoded in PPC.cpp can be retrieved from the backend target feature. I have double-checked that the hardcoded logic for inferring target features from the CPU in the frontend(PPC.cpp) is the same as in PPC.td.	2025-06-12 13:38:13 -04:00
Pengcheng Wang	ce621041c2	[RISCV] Get host CPU name via hwprobe (#142745 ) We can get the `mvendorid/marchid/mimpid` via hwprobe and then we can compare these IDs with those defined in processors to find the CPU name. With this change, `-mcpu/-mtune=native` can set the proper name.	2025-06-12 16:39:57 +08:00

1 2 3 4 5 ...

479 Commits