llvm-project

Author	SHA1	Message	Date
Ami-zhang	06f779b69d	Reland "[Clang][LoongArch] Support target attribute for function" (#142546 ) This relands #140700. I have updated the test case('targetattr.c') to resolve the test failure. Original PR resulted in test fail: https://lab.llvm.org/buildbot/#/builders/11/builds/16173 https://lab.llvm.org/buildbot/#/builders/202/builds/1531 Original description: Followup to #140700.	2025-06-03 15:57:50 +08:00
Paul Kirth	d93788fcbf	Revert "[Clang][LoongArch] Support target attribute for function" (#141998 ) Reverts llvm/llvm-project#140700 This breaks bots both in buildbot and downstream CI: - https://lab.llvm.org/buildbot/#/builders/11/builds/16173 - https://lab.llvm.org/buildbot/#/builders/202/builds/1531 - https://ci.chromium.org/ui/p/fuchsia/builders/toolchain.ci/clang-host-linux-x64/b8713537585914796017/overview	2025-05-29 11:26:44 -07:00
Ami-zhang	b359422eeb	[Clang][LoongArch] Support target attribute for function (#140700 ) This adds support under LoongArch for the target("..") attributes. The supported formats are: - "arch=<arch>" strings, that specify the architecture features for a function as per the -march=arch option. - "tune=<cpu>" strings, that specify the tune-cpu cpu for a function as per -mtune. - "<feature>", "no-<feature>" enabled/disables the specific feature.	2025-05-29 19:54:48 +08:00
Craig Topper	a0b6cfd975	[RISCV] Add MC layer support for XSfmm*. (#133031 ) This adds assembler/disassembler support for XSfmmbase 0.6 and related SiFive matrix multiplication extensions based on the spec here https://www.sifive.com/document-file/xsfmm-matrix-extensions-specification Functionality-wise, this is the same as the Zvma extension proposal that SiFive shared with the Attached Matrix Extension Task Group. The extension names and instruction mnemonics have been changed to use vendor prefixes. Note this is a non-conforming extension as the opcodes used here are in the standard opcode space in OP-V or OP-VE. --------- Co-authored-by: Brandon Wu <brandon.wu@sifive.com>	2025-05-21 08:26:35 -07:00
Iris Shi	bdf03fcff3	Revert "[llvm][NFC] Use `llvm::sort()`" (#140668 )	2025-05-20 11:27:03 +08:00
Iris Shi	061a7699f3	[llvm][NFC] Use `llvm::sort()` (#140335 )	2025-05-17 14:49:46 +08:00
Iris Shi	1e503d08e1	[RISCV][MC] Add support for Q extension (#139369 ) Closes #130217. https://github.com/riscv/riscv-isa-manual/blob/main/src/q-st-ext.adoc	2025-05-15 10:51:06 +08:00
Kazu Hirata	7ca4079504	[TargetParser] Use StringRef::consume_back (NFC) (#139416 )	2025-05-11 07:11:53 -07:00
Ties Stuij	269f5fe91e	[AARCH64] Add support for Cortex-A320 (#139055 ) This patch adds initial support for the recently announced Armv9 Cortex-A320 processor. For more information, including the Technical Reference Manual, see: https://developer.arm.com/Processors/Cortex-A320 --------- Co-authored-by: Oliver Stannard <oliver.stannard@arm.com>	2025-05-09 16:24:48 +01:00
no92	0505e3761b	[llvm] Add triples for managarm (#87845 ) This PR aims to add a target for [managarm](https://github.com/managarm/managarm). The targets `{x86_64,aarch64,riscv64}-pc-managarm-{kernel,mlibc}` will be supported. Discourse RFC: [discourse.llvm.org/t/rfc-new-proposed-managarm-support-for-llvm-and-clang-87845/85884](https://discourse.llvm.org/t/rfc-new-proposed-managarm-support-for-llvm-and-clang-87845/85884)	2025-05-06 23:21:22 -07:00
Shoaib Meenai	705ceff7c1	[TargetParser] Fix flaky installs of generated headers (#137853 ) The `llvm-headers` target wasn't depending on the generated TargetParser headers, so they'd be flakily installed or not installed depending on which order the build steps ran in. Add an explicit dependency to fix this, and switch to a single `target_parser_gen` target to mirror the pattern used by `intrinsics_gen` (which also fixes a few other missing dependencies). Switch `llvm-headers` to use `add_dependencies` instead of `DEPENDS` for the tablegen dependencies as well, since `DEPENDS` is only intended for creating a file-level dependency on the output of an `add_custom_command` in the same CMakeLists.txt (see `DEPENDS` under https://cmake.org/cmake/help/latest/command/add_custom_target.html).	2025-04-29 12:13:38 -07:00
Phoebe Wang	a87d8e9442	[X86][AVX512FP16] Decouple AVX512VL and AVX512DQ from AVX512FP16 (#137450 ) Fixes: #136209	2025-04-27 14:01:37 +08:00
Pengcheng Wang	6c33735343	[RISCV] Allow `Zicsr`/`Zifencei` to duplicate with `g` (#136842 ) This matches GCC and we supported it in LLVM 17/18. Fixes #136803	2025-04-27 11:12:47 +08:00
jeremyd2019	6900e90265	[LLVM][TargetParser] Handle -msys targets the same as -cygwin. (#136817 ) MSYS2 uses i686-pc-msys and x86_64-pc-msys as target, and is a fork of Cygwin. There's an effort underway to try to switch as much as possible to use -pc-cygwin targets, but the -msys target will be hanging around for the forseeable future.	2025-04-24 13:28:27 +03:00
Craig Topper	68d89e9316	[RISCV] Remove stale comment. NFC	2025-04-22 21:25:41 -07:00
Craig Topper	4c0ea476c4	[RISCV] Report error if Zilsd is used on RV64. (#136577 ) Fixes #136564.	2025-04-21 11:27:24 -07:00
Kazu Hirata	c4e9901b5b	[llvm] Use llvm::append_range (NFC) (#135931 )	2025-04-16 12:28:47 -07:00
Jack Styles	06da00ae2d	[ARM][Clang] Make `+nosimd` functional for AArch32 Targets (#130623 ) `+simd` and `+nosimd` are used to enable or disable NEON Instructions when compiling for ARM Targets. However, up until now, using these has not been possible. To enable this, these options are mapped to the relevant LLVM backend option (`+neon` and `-neon`) so it can be both enabled and disabled successfully by the user. Tests have been added to ensure this behaviour is maintained in the future, along with updates to existing tests as behaviour has now changed relating to the use of `+simd` and `+nosimd`. As `simd` has been mapped within the ARMTargetParser.def, support for this extension is also added for the `--print-support-extensions` command when the target is AArch32. This will print the `simd` option, along with the description that relates to the Neon feature. This previously was not possible as `simd` did not have a related Feature or Negative Feature. To make this functional as intended, MVE and MVE.FP now rely on their own Enum identifier, rather than `AEK_SIMD`. While SIMD does refer to both Neon and Helium technologies, in terms of command line options, SIMD relates to Neon. Helium relates to MVE and MVE.FP. The Enum now reflects this too.	2025-04-15 09:00:14 +01:00
Pengcheng Wang	e8e98683d7	[RISCV][NFC] Use bitmasks generated by TableGen So that we don't need to sync-up the table manually. Reviewers: BeMg, preames, lenary Reviewed By: BeMg Pull Request: https://github.com/llvm/llvm-project/pull/135600	2025-04-14 19:32:13 +08:00
Phoebe Wang	ebba554a32	[X86][AVX10] Remove VAES and VPCLMULQDQ feature from AVX10.1 (#135489 ) According to SDM, they require both VAES/VPCLMULQDQ and AVX10.1 CPUID bits. Fixes: #135394	2025-04-14 08:54:10 +08:00
Juan Manuel Martinez Caamaño	d6c1ef576f	[AMDGPU] vmem-to-lds-load-insts incoherence between TargetParser and AMDGPU.td (#135376 ) The vmem-to-lds-loads-insts feature is only available on gfx9/10. While target-parser was also enabling it for gfx6,7,8.	2025-04-11 16:31:04 +02:00
Ulrich Weigand	80267f8148	Support z17 processor name and scheduler description (#135254 ) The recently announced IBM z17 processor implements the architecture already supported as "arch15" in LLVM. This patch adds support for "z17" as an alternate architecture name for arch15. This patch also add the scheduler description for the z17 processor, provided by Jonas Paulsson.	2025-04-11 00:20:58 +02:00
Juan Manuel Martinez Caamaño	beae0e9f1a	[AMDGPU] Use a target feature to enable __builtin_amdgcn_global_load_lds on gfx9/10 (#133055 ) This patch introduces the `vmem-to-lds-load-insts` target feature, which can be used to enable builtins `__builtin_amdgcn_global_load_lds` and `__builtin_amdgcn_raw_ptr_buffer_load_lds` on platforms which have this feature. This feature is only available on gfx9/10. A limitation of using a common target feature for both builtins is that we could have made `__builtin_amdgcn_raw_ptr_buffer_load_lds` available on gfx6,7,8.	2025-04-02 20:00:09 +02:00
quic_hchandel	edef028029	[RISCV] Add Qualcomm uC Xqciio (External Input Output) extension (#132721 ) This extension adds two external input output instructions for non-memory-mapped device. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-28 19:47:29 -07:00
Tsukasa OI	a6de034f85	[RISCV] Combine 3 bit-manip extensions into `B` (#132858 ) Like cryptography extensions like `Zk`, `B` (a combination of `Zba`, `Zbb` and `Zbs` extensions) can be useful if we handle this extension as a combination. If all `Zba`, `Zbb` and `Zbs` extensions are enabled, it also enables the `B` extension.	2025-03-25 22:32:16 +08:00
Ricardo Jesus	847e46ca01	[AArch64] Add initial support for -mcpu=olympus. (#132368 ) This patch adds support for the NVIDIA Olympus core. This does not add any special tuning decisions, and those may come later.	2025-03-25 08:09:04 +00:00
Jesse Huang	20b5728b7b	[RISCV] Implement the implications of C extension (#132259 ) Implement the following implications according to the [Zc spec](https://github.com/riscvarchive/riscv-code-size-reduction/blob/main/Zc-specification/Zc.adoc#13-c) > As C defines the same instructions as Zca, Zcf and Zcd, the rule is that: > * C always implies Zca > * C+F implies Zcf (RV32 only) > * C+D implies Zcd	2025-03-22 14:48:52 +08:00
Sudharsan Veeravalli	e7107973b8	Recommit "[RISCV] Add Qualcomm uC Xqcisync (Sync Delay) extension (#132184 )" (#132520 ) With a minor fix for the build failures. Original message: This extension adds nine instructions, eight for non-memory-mapped devices synchronization and delay instruction. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli quic_svs@quicinc.com	2025-03-22 11:07:48 +05:30
Kazu Hirata	fe7776eab8	Revert "[RISCV] Add Qualcomm uC Xqcisync (Sync Delay) extension (#132184 )" This reverts commit 3840f787a21a66686f5d8bf61877d41f3a65f205. Multiple builtbot failures have been reported: https://github.com/llvm/llvm-project/pull/132184	2025-03-21 20:28:11 -07:00
quic_hchandel	3840f787a2	[RISCV] Add Qualcomm uC Xqcisync (Sync Delay) extension (#132184 ) This extension adds nine instructions, eight for non-memory-mapped devices synchronization and delay instruction. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-22 07:57:07 +05:30
quic_hchandel	0744d4926a	[RISCV] Add Qualcomm uC Xqcilb (Long Branch) extension (#131996 ) This extension adds two long branch instructions. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-20 11:14:53 +05:30
dong-miao	480202f0d1	[RISCV] Add Zilsd and Zclsd Extensions (#131094 ) This commit adds the Load/Store pair instructions (Zilsd) and Compressed Load/Store pair instructions (Zclsd). [Specification link](https://github.com/riscv/riscv-isa-manual/blob/main/src/zilsd.adoc).	2025-03-19 08:53:41 -07:00
Sudharsan Veeravalli	467e5a1d41	[RISCV] Add Qualcomm uC Xqcisim (Simulation Hint) extension (#128833 ) This extension adds 10 instructions that provide hints to the interface simulation environment. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/ This patch adds assembler only support.	2025-03-18 09:05:22 -07:00
quic_hchandel	036c6cb37c	[RISCV] Add Qualcomm uC Xqcibi (Branch Immediate) extension (#130779 ) This extension adds twelve conditional branch instructions that use an immediate operand for the source. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-18 15:18:43 +05:30
Kazu Hirata	e71686ed15	[TargetParser] Avoid repeated hash lookups (NFC) (#131555 )	2025-03-17 07:42:50 -07:00
u4f3	e61859f14d	[RISCV] Add Qualcomm uC Xqcili (load large immediates) extension (#130012 ) The Xqcili extension includes a two instructions that load large immediates than is available with the base RISC-V ISA. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support.	2025-03-13 11:13:02 -07:00
Craig Topper	90a8322399	[RISCV] Add an error that Xqccmp, Xqciac, and Xqcicm are not compatible with C+D or Zcd. (#130816 ) I was reviewing encodings to put the disassembling of vendor instructions after after standard instructions and found that these overlap with c.fldsp and c.fsdsp.	2025-03-12 08:24:34 -07:00
quic_hchandel	6e7e46cafe	[RISCV] Add Qualcomm uC Xqcibm (Bit Manipulation) extension (#129504 ) This extension adds thirty eight bit manipulation instructions. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.6 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-06 12:01:53 +05:30
Sam Elliott	5066d7b601	[RISCV] Add Xqccmp 0.1 Assembly Support (#128731 ) Xqccmp is a new spec by Qualcomm that makes a vendor-specific effort to solve the push/pop + frame pointers issue. Broadly, it takes the Zcmp instructions and reverse the order they push/pop registers in, which ends up matching the frame pointer convention. This extension adds a new instruction not present in Zcmp, `qc.cm.pushfp`, which will set `fp` to the incoming `sp` value after it has pushed the registers. This change duplicates the Zcmp implementation, with minor changes to mnemonics (for the `qc.` prefix), predicates, and the addition of `qc.cm.pushfp`. There is also new logic to prevent combining Xqccmp and Zcmp. Xqccmp is kept separate to Xqci for decoding/encoding etc, as the specs are separate today. Specification: https://github.com/quic/riscv-unified-db/releases/tag/Xqccmp_extension-0.1.0	2025-02-26 20:03:02 -08:00
quic_hchandel	538b898a83	[RISCV] Add Qualcomm uC Xqcilia (Large Immediate Arithmetic) extension (#124706 ) This extension adds eight 48 bit large arithmetic instructions. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/latest This patch adds assembler only support.	2025-02-24 08:04:29 -08:00
Fabian Ritter	8615f9aaff	[AMDGPU] Replace gfx940 and gfx941 with gfx942 in llvm (#126763 ) gfx940 and gfx941 are no longer supported. This is one of a series of PRs to remove them from the code base. This PR removes all non-documentation occurrences of gfx940/gfx941 from the llvm directory, and the remaining occurrences in clang. Documentation changes will follow. For SWDEV-512631	2025-02-19 10:20:48 +01:00
Craig Topper	0cc532b79e	[RISCV] Move the RISCVII namespaced enums into RISCVVType namespace in RISCVTargetParser.h. NFC (#127585 ) The VLMUL and policy enums originally lived in RISCVBaseInfo.h in the backend which is where everything else in the RISCVII namespace is defined. RISCVTargetParser.h is used by much more of the compiler and it doesn't really make sense to have 2 different namespaces exposed. These enums are both associated with VTYPE so using the RISCVVType namespace seems like a good home for them.	2025-02-18 08:27:25 -08:00
Jack Styles	d9af03ba80	[ARM] Ensure FPU Selection can select mode correctly (#124935 ) Previously, when selecting a Single Precision FPU, LLVM would ensure all elements of the Candidate FPU matched the InputFPU that was given. However, for cases such as Cortex-R52, there are FPU options where not all fields match exactly, for example NEON Support or Restrictions on the Registers available. This change ensures that LLVM can select the FPU correctly, removing the requirement for Neon Support and Restrictions for the Candidate FPU to be the same as the InputFPU.	2025-02-04 10:42:26 +00:00
Craig Topper	5dc815503f	[RISCV] Add ESWIN EIC770X (SiFive P550) to getHostCPUNameForRISCV. (#125277 ) This enables -mcpu=native for the HiFive Premier P550 board.	2025-01-31 17:12:34 -08:00
ssijaric-nv	16e9601e19	[Flang] Adjust the trampoline size for AArch64 and PPC (#118678 ) Set the trampoline size to match that in compiler-rt/lib/builtins/trampoline_setup.c and AArch64 and PPC lowering.	2025-01-27 08:02:18 -08:00
quic_hchandel	163935a48d	[RISCV] Add Qualcomm uC Xqcilo (Large Offset Load Store) extension (#123881 ) This extension adds eight 48 bit load store instructions. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/latest This patch adds assembler only support. --------- Co-authored-by: Harsh Chandel <hchandel@qti.qualcomm.com>	2025-01-23 10:14:25 +05:30
tangaac	19834b4623	[LoongArch] Support sc.q instruction for 128bit cmpxchg operation (#116771 ) Two options for clang -mno-scq: Disable sc.q instruction. -mscq: Enable sc.q instruction. The default is -mno-scq.	2025-01-23 12:11:07 +08:00
Ulrich Weigand	8424bf207e	[SystemZ] Add support for new cpu architecture - arch15 This patch adds support for the next-generation arch15 CPU architecture to the SystemZ backend. This includes: - Basic support for the new processor and its features. - Detection of arch15 as host processor. - Assembler/disassembler support for new instructions. - Exploitation of new instructions for code generation. - New vector (signed\|unsigned\|bool) __int128 data types. - New LLVM intrinsics for certain new instructions. - Support for low-level builtins mapped to new LLVM intrinsics. - New high-level intrinsics in vecintrin.h. - Indicate support by defining __VEC__ == 10305. Note: No currently available Z system supports the arch15 architecture. Once new systems become available, the official system name will be added as supported -march name.	2025-01-20 19:30:21 +01:00
Alex Voicu	b08b56381c	[NFC][AMDGPU] Clean-up feature parsing for AMDGCNSPIRV. (#123519 ) When we did the initial AMDGCNSPIRV commits we left the initialisation of the feature map in a relatively disorderly state. This change corrects that oversight: - We make sure that AMDGCNSPIRV actually advertises the union of all AMDGCN features, as some were not included; - We keep feature initialisation in sorted order to make it easy to pick an insertion point when features are added in the future.	2025-01-20 02:30:29 +00:00
Alexandros Lamprineas	831527a5ef	[FMV][GlobalOpt] Statically resolve calls to versioned functions. (#87939 ) To deduce whether the optimization is legal we need to compare the target features between caller and callee versions. The criteria for bypassing the resolver are the following: * If the callee's feature set is a subset of the caller's feature set, then the callee is a candidate for direct call. * Among such candidates the one of highest priority is the best match and it shall be picked, unless there is a version of the callee with higher priority than the best match which cannot be picked from a higher priority caller (directly or through the resolver). * For every higher priority callee version than the best match, there is a higher priority caller version whose feature set availability is implied by the callee's feature set. Example: Callers and Callees are ordered in decreasing priority. The arrows indicate successful call redirections. Caller Callee Explanation ========================================================================= mops+sve2 --+--> mops all the callee versions are subsets of the \| caller but mops has the highest priority \| mops --+ sve2 between mops and default callees, mops wins sve sve between sve and default callees, sve wins but sve2 does not have a high priority caller default -----> default sve (callee) implies sve (caller), sve2(callee) implies sve (caller), mops(callee) implies mops(caller)	2025-01-17 10:49:43 +00:00

1 2 3 4 5 ...

427 Commits