llvm-project

Author	SHA1	Message	Date
林克	6842cc5562	[RISCV] Add SpacemiT XSMTVDot (SpacemiT Vector Dot Product) extension. (#151706 ) The full spec can be found at spacemit-x60 processor support scope: Section 2.1.2.2 (Features): https://developer.spacemit.com/documentation?token=BWbGwbx7liGW21kq9lucSA6Vnpb#2.1 This patch only supports assembler.	2025-08-18 18:03:17 +08:00
Craig Topper	e67ec12640	[RISCV] Remove experimental from Smctr and Ssctr. (#153903 ) These extensions were ratified in November 2024.	2025-08-15 17:18:09 -07:00
Daniel Henrique Barboza	8e57689c34	[RISCV] add load/store misched/PostRA subtarget features (#149409 ) Some processors benefit more from store clustering than load clustering, and vice-versa, depending on factors that are exclusive to each one (e.g. macrofusions implemented). Likewise, certain optimizations benefits more from misched clustering than postRA clustering. Macrofusions are again an example: in a processor with store pair macrofusions, like the veyron-v1, it is observed that misched clustering increases the amount of macrofusions more than postRA clustering. This of course isn't necessarily true for other processors, but it shows that processors can benefit from a more fine grained control of clustering mutations, and each one is able to do it differently. Add 4 new subtarget features that deprecates the existing riscv-misched-load-store-clustering and riscv-postmisched-load-store-clustering options: - disable-misched-load-clustering and disable-misched-store-clustering: disable load/store clustering during misched; - disable-postmisched-load-clustering and disable-postmisched-store-clustering: disable load/store clustering during PostRA. Note that the new subtarget features disables specific stages of the default clustering settings. The default per se (load and store clustering for both misched and PostRA) is left untouched. Disable all clustering but misched-store-clustering for the veyron-v1 processor using the new features.	2025-08-06 09:08:25 -07:00
Craig Topper	0a2b979310	[RISCV] Add missing Zvl dependencies for XSfvqmaccdod/XSfvqmaccqoq/XSfvfwmaccqqq. (#150346 ) These have an LMUL=1 operand that must have a multiple of 16 or 32 elements in it. This places a lower bound on the VLEN.	2025-07-23 22:06:46 -07:00
quic_hchandel	0be51cff91	[RISCV] Add ISel patterns for Qualcomm uC Xqcicli extension (#148121 ) Add CodeGen patterns for conditional load immediate instructions	2025-07-15 12:13:57 +05:30
Jim Lin	22707fd4a5	[RISCV] Add Andes XAndesBFHCvt (Andes Scalar BFLOAT16) extension (#148563 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. The extension includes only two instructions: one for converting from f32 to f16, and another for converting from f16 to f32. This patch only implements MC support for XAndesBFHCvt.	2025-07-15 08:59:00 +08:00
Craig Topper	cc9b5c3480	[RISCV] Remove unused Predicates. NFC	2025-07-11 22:51:42 -07:00
Craig Topper	390fbe664c	[RISCV] Use Predicates instead of Added Complexity to prefer QC_SELECTEQI over QC_MVEQI. NFC (#148312 ) IMHO AddedComplexity should be used as a last resort. We should use other mechanism like Predicates and PatFrag predicates to give priority.	2025-07-11 22:34:54 -07:00
Luke Lau	7c812ea01a	[RISCV] Avoid vl toggles when lowering vector_splice/experimental_vp_splice and add +vl-dependent-latency tuning feature (#146746 ) When vectorizing a loop with a fixed-order recurrence we use a splice, which gets lowered to a vslidedown and vslideup pair. However with the way we lower it today we end up with extra vl toggles in the loop, especially with EVL tail folding, e.g: .LBB0_5: # %vector.body # =>This Inner Loop Header: Depth=1 sub a5, a2, a3 sh2add a6, a3, a1 zext.w a7, a4 vsetvli a4, a5, e8, mf2, ta, ma vle32.v v10, (a6) addi a7, a7, -1 vsetivli zero, 1, e32, m2, ta, ma vslidedown.vx v8, v8, a7 sh2add a6, a3, a0 vsetvli zero, a5, e32, m2, ta, ma vslideup.vi v8, v10, 1 vadd.vv v8, v10, v8 add a3, a3, a4 vse32.v v8, (a6) vmv2r.v v8, v10 bne a3, a2, .LBB0_5 Because the vslideup overwrites all but UpOffset elements from the vslidedown, we currently set the vslidedown's AVL to said offset. But in the vslideup we use either VLMAX or the EVL which causes a toggle. This increases the AVL of the vslidedown so it matches vslideup, even if the extra elements are overridden, to avoid the toggle. A new tuning feature +vl-dependent-latency has been added which keeps the old behaviour for microarchitectures that dynamically dispatch uops based on vl, e.g. sifive-x280. +vl-dependent-latency can be reused for the recently proposed Ovlt optimization directive if/when it's ratified: https://lists.riscv.org/g/tech-privileged/message/2487 If we wanted to aggressively optimise for vl at the expense of introducing more toggles we could probably look at doing this in RISCVVLOptimizer.	2025-07-09 11:09:13 +08:00
Jim Lin	3f33e7ba5b	[RISCV] Add Andes XAndesVSIntLoad (Andes Vector INT4 Load) extension (#147005 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only implements MC support for XAndesVSIntLoad. --------- Co-authored-by: Lino Hsing-Yu Peng <linopeng@andestech.com>	2025-07-07 13:01:22 +08:00
Jim Lin	61529d9e36	[RISCV] Remove implied extension Zvfhmin for XAndesVPackFPH (#146861 ) XAndesVPackFPH can actually be used independently without requiring Zvfhmin. Therefore, we remove the implicitly required Zvfhmin extension from XAndesVPackFPH and imply that the f extension is sufficient.	2025-07-04 10:16:20 +08:00
Jim Lin	f36ad98b27	[RISCV] Remove unneeded blank line between the features for XAndesVBFHCvt and XAndesVPackFPH. NFC.	2025-07-03 19:27:28 +08:00
UmeshKalappa	032966ff56	[RISCV] Added the MIPS prefetch extensions for MIPS RV64 P8700. (#145647 ) the extension enabled with xmipscbop. Please refer "MIPS RV64 P8700/P8700-F Multiprocessing System Programmer’s Guide" for more info on the extension at https://mips.com/wp-content/uploads/2025/06/P8700_Programmers_Reference_Manual_Rev1.84_5-31-2025.pdf	2025-07-03 10:59:10 +02:00
Craig Topper	3d51490622	[RISCV] Fix typo in the description of xsfvfwmaccqqq. (#145771 )	2025-06-25 15:46:45 -07:00
Craig Topper	c3b160bcaa	[RISCV] Remove -mattr=+no-rvc-hints (#145138 ) As far as I know binutils does not have a similar option and I don't know of a reason we shouldn't accept the RVC hint instructions. The wording in the spec in the past suggested that maybe these weren't valid instruction names, but that's been modified recently.	2025-06-25 08:24:24 -07:00
Sam Elliott	0fcced7d79	[RISCV][NFC] Zce always implies/requires Zca (#145442 )	2025-06-23 20:09:23 -07:00
Sam Elliott	a6eb5eee38	[RISCV][NFC] Remove hasStdExtCOrZca (#145139 ) As of 20b5728b7b1ccc4509a316efb270d46cc9526d69, C always enables Zca, so the check `C \|\| Zca` is equivalent to just checking for `Zca`. This replaces any uses of `HasStdExtCOrZca` with a new `HasStdExtZca` (with the same assembler description, to avoid changes in error messages), and simplifies everywhere where C++ needed to check for either C or Zca. The Subtarget function is just deprecated for the moment.	2025-06-23 10:49:47 -07:00
Sam Elliott	cb4f329004	[RISCV] Fix HasStdExtCOrZcfOrZce Syntax (#145141 )	2025-06-20 22:44:55 -07:00
Jim Lin	8ddada41df	[RISCV] Add Andes XAndesVBFHCvt (Andes Vector BFLOAT16 Conversion) extension (#144320 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. The instructions are similar to `Zvfbfmin` and the only difference with `Zvfbfmin` is that `XAndesVBFHCvt` doesn't have mask variant.	2025-06-18 09:17:46 +08:00
Sam Elliott	98c6c371d6	[RISCV] Xqccmp v0.3 (#137854 ) All the changes for v0.2 and v0.3 are either already implemented, or irrelevant to the compiler implementation.	2025-06-16 22:13:45 -07:00
Sam Elliott	c0ac95181e	[RISCV] Update Xqci to v0.13.0 (#144398 )	2025-06-16 22:12:12 -07:00
Jim Lin	483d19619c	[RISCV] Add tune features for Andes 45 series cpus (#143899 ) Add tune features TuneNoDefaultUnroll, TuneShortForwardBranchOpt and TunePostRAScheduler for Andes 45 series cpus.	2025-06-13 14:26:50 +08:00
Sam Elliott	6f6dc9c8ba	[RISCV] Implement Feature Bits for B, E, H (#143436 ) As defined in riscv-non-isa/riscv-c-api-doc#109.	2025-06-09 15:01:18 -07:00
Ying Chen	5483190216	[RISCV] Add shlcofideleg extension (#141572 ) This is for `shlcofideleg` extension, that supports delegating LCOFI interrupts to VS-mode. Spec: https://github.com/riscv/riscv-isa-manual/blob/main/src/hypervisor.adoc	2025-05-30 16:52:08 +08:00
Jerry Zhang Jian	3ef1b07a6c	[RISCV] add Double Trap extension requires Zicsr (#141016 ) - The double trap extension requires `mtval2' register, so add Zicsr as required extension Signed-off-by: Jerry Zhang Jian <jerry.zhangjian@sifive.com>	2025-05-23 00:27:04 +08:00
Craig Topper	a0b6cfd975	[RISCV] Add MC layer support for XSfmm*. (#133031 ) This adds assembler/disassembler support for XSfmmbase 0.6 and related SiFive matrix multiplication extensions based on the spec here https://www.sifive.com/document-file/xsfmm-matrix-extensions-specification Functionality-wise, this is the same as the Zvma extension proposal that SiFive shared with the Attached Matrix Extension Task Group. The extension names and instruction mnemonics have been changed to use vendor prefixes. Note this is a non-conforming extension as the opcodes used here are in the standard opcode space in OP-V or OP-VE. --------- Co-authored-by: Brandon Wu <brandon.wu@sifive.com>	2025-05-21 08:26:35 -07:00
Iris Shi	1e503d08e1	[RISCV][MC] Add support for Q extension (#139369 ) Closes #130217. https://github.com/riscv/riscv-isa-manual/blob/main/src/q-st-ext.adoc	2025-05-15 10:51:06 +08:00
Jim Lin	2a8960e48b	[RISCV] Add Andes XAndesVDot (Andes Vector Dot Product) extension. (#139849 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. Intrinsics support will be added in a later patch.	2025-05-15 10:14:58 +08:00
Craig Topper	e1b3af6dc4	[RISCV] Add isel patterns to use Zilsd for f64 load/store for Zdinx on RV32. (#139935 )	2025-05-14 12:19:38 -07:00
Sam Elliott	7038d50d62	[RISCV] Xqci Extensions v0.11.0 (#137881 ) This updates all the extensions to their version in the v0.11.0 spec. All changes from this version are already implemented or are not relevant to LLVM. This change also alphabetises the lists of Xqci extensions, to make future checks easier, and removes irrelevant info from the usage docs.	2025-05-13 10:48:38 -07:00
Jim Lin	9e27db0a50	[RISCV] Add Andes XAndesVPackFPH (Andes Vector Packed FP16) extension. (#138827 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. Intrinsics support will be added in a later patch.	2025-05-12 16:58:39 +08:00
Jim Lin	6ba1a62a6c	[RISCV] Add Andes XAndesperf (Andes Performance) extension. (#135110 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. Relocation and fixup for the branch and gp-implied instructions will be added in a later patch.	2025-04-28 17:23:51 +08:00
Sam Elliott	cfc5baf6e6	[RISCV] SiFive CLIC Support (#132481 ) This Change adds support for two SiFive vendor attributes in clang: - "SiFive-CLIC-preemptible" - "SiFive-CLIC-stack-swap" These can be given together, and can be combined with "machine", but cannot be combined with any other interrupt attribute values. These are handled primarily in RISCVFrameLowering: - "SiFive-CLIC-stack-swap" entails swapping `sp` with `sf.mscratchcsw` at function entry and exit, which holds the trap stack pointer. - "SiFive-CLIC-preemptible" entails saving `mcause` and `mepc` before re-enabling interrupts using `mstatus`. To save these, `s0` and `s1` are first spilled to the stack, and then the values are read into these registers. If these registers are used in the function, their values will be spilled a second time onto the stack with the generic callee-saved-register handling. At the end of the function interrupts are disabled again before `mepc` and `mcause` are restored. This Change also adds support for the following two experimental extensions, which only contain CSRs: - XSfsclic - for SiFive's CLIC Supervisor-Mode CSRs - XSfmclic - for SiFive's CLIC Machine-Mode CSRs The latter is needed for interrupt support. The CFI information for this implementation is not correct, but I'd prefer to correct this in a follow-up. While it's unlikely anyone wants to unwind through a handler, the CFI information is also used by debuggers so it would be good to get it right. Co-authored-by: Ana Pazos <apazos@quicinc.com>	2025-04-25 17:12:27 -07:00
Sam Elliott	683c3b8b7e	[RISCV] Allocate Feature Bits for Zilsd/Zclsd/Zcmp (#135197 ) As proposed in https://github.com/riscv-non-isa/riscv-c-api-doc/pull/104 No real compiler-rt implementation, as these are not exposed by Linux.	2025-04-25 11:06:04 -07:00
T-Tie	9c2190eb5d	[RISCV] Add support for Ziccamoc (#136694 ) Support for Ziccamoc is added in this pr. Specification link: https://drive.google.com/file/d/12QKRm92cLcEk8-5J9NI91m0fAQOxqNAq/view --------- Co-authored-by: Tie <Tie@llvm.com>	2025-04-25 13:49:48 +08:00
Liao Chunyu	9e26c797ae	[RISCV] Add smcntrpmf extension (#136556 ) spec: https://github.com/riscvarchive/riscv-smcntrpmf	2025-04-22 12:34:39 +08:00
Djordje Todorovic	d30a5b41fe	[RISCV] Fix xmipscmov extension name (#135647 ) The right name was used in riscv-toolchain-conventions docs.	2025-04-15 23:17:03 +02:00
Pengcheng Wang	90c01f4bad	[RISCV] Add missing bitmask for some extensions (#135599 ) According to: https://github.com/riscv-non-isa/riscv-c-api-doc/blob/main/src/c-api.adoc#extension-bitmask-definitions And we sort the bitmask by group id and then bit position.	2025-04-14 19:18:38 +08:00
quic_hchandel	edef028029	[RISCV] Add Qualcomm uC Xqciio (External Input Output) extension (#132721 ) This extension adds two external input output instructions for non-memory-mapped device. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-28 19:47:29 -07:00
Sam Elliott	6a371c7744	[RISCV] Support .option {no}exact (#122483 ) This implements [the `.option exact` and `.option noexact` proposal](https://github.com/riscv-non-isa/riscv-asm-manual/pull/122) for RISC-V. `.option exact` turns off: - Compression - Branch Relaxation - Linker Relaxation `.option noexact` turns these back on, and is also the default, matching the current behaviour.	2025-03-26 11:14:16 -07:00
Craig Topper	2b82555ef4	[RISCV] Remove experimental from Sdext and Sdtrig which are ratified. (#132529 ) They were ratified in February 2025.	2025-03-24 09:46:51 -07:00
Craig Topper	fb44c54d65	[RISCV] Add missing space to optimized-nf*-segment-load-store description. NFC (#132531 )	2025-03-22 10:18:31 -07:00
Jesse Huang	20b5728b7b	[RISCV] Implement the implications of C extension (#132259 ) Implement the following implications according to the [Zc spec](https://github.com/riscvarchive/riscv-code-size-reduction/blob/main/Zc-specification/Zc.adoc#13-c) > As C defines the same instructions as Zca, Zcf and Zcd, the rule is that: > * C always implies Zca > * C+F implies Zcf (RV32 only) > * C+D implies Zcd	2025-03-22 14:48:52 +08:00
Sudharsan Veeravalli	e7107973b8	Recommit "[RISCV] Add Qualcomm uC Xqcisync (Sync Delay) extension (#132184 )" (#132520 ) With a minor fix for the build failures. Original message: This extension adds nine instructions, eight for non-memory-mapped devices synchronization and delay instruction. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli quic_svs@quicinc.com	2025-03-22 11:07:48 +05:30
Kazu Hirata	fe7776eab8	Revert "[RISCV] Add Qualcomm uC Xqcisync (Sync Delay) extension (#132184 )" This reverts commit 3840f787a21a66686f5d8bf61877d41f3a65f205. Multiple builtbot failures have been reported: https://github.com/llvm/llvm-project/pull/132184	2025-03-21 20:28:11 -07:00
quic_hchandel	3840f787a2	[RISCV] Add Qualcomm uC Xqcisync (Sync Delay) extension (#132184 ) This extension adds nine instructions, eight for non-memory-mapped devices synchronization and delay instruction. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-22 07:57:07 +05:30
Craig Topper	4a2ab3afdb	[RISCV] Remove trailing whitespace. NFC	2025-03-21 09:04:14 -07:00
Kito Cheng	7f8451c868	[RISCV] Use vsetvli instead of vlenb in Prologue/Epilogue (#113756 ) Currently, we use `csrr` with `vlenb` to obtain the `VLEN`, but this is not the only option. We can also use `vsetvli` with `e8`/`m1` to get `VLENMAX`, which is equal to the VLEN. This method is preferable on some microarchitectures and makes it easier to obtain values like `VLEN * 2`, `VLEN * 4`, or `VLEN * 8`, reducing the number of instructions needed to calculate VLEN multiples. However, this approach is NOT always interchangeable, as it changes the state of `VTYPE` and `VL`, which can alter the behavior of vector instructions, potentially causing incorrect code generation if applied after a vsetvli insertion. Therefore, we limit its use to the prologue/epilogue for now, as there are no vector operations within the prologue/epilogue sequence. With further analysis, we may extend this approach beyond the prologue/epilogue in the future, but starting here should be a good first step. This feature is gurded by the `+prefer-vsetvli-over-read-vlenb` feature, which is disabled by default for now.	2025-03-21 17:22:32 +08:00
Craig Topper	eb77061a42	[RISCV] Add assembler support for Zvqdotq. (#132118 ) Based on the 0.0.1 spec here https://github.com/riscv/riscv-dot-product/releases/tag/v0.0.1	2025-03-20 08:38:15 -07:00
quic_hchandel	0744d4926a	[RISCV] Add Qualcomm uC Xqcilb (Long Branch) extension (#131996 ) This extension adds two long branch instructions. The current spec can be found at: https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.7.0 This patch adds assembler only support. Co-authored-by: Sudharsan Veeravalli <quic_svs@quicinc.com>	2025-03-20 11:14:53 +05:30

1 2 3 4 5 ...

281 Commits