llvm-project

Author	SHA1	Message	Date
Piyou Chen	20a3484b15	[RISCV] Add statistic support for VSETVL insertion pass (#78543 ) This patch make vsetvl insertion pass could track the number of inserted/removed vsetvl instruction from `-stats` option.	2024-01-19 08:22:25 +08:00
Philip Reames	de423cfe3d	[RISCV] Prefer vsetivli for VLMAX when VLEN is exactly known (#75509 ) If VLEN is exactly known, we may be able to use the vsetivli encoding instead of the vsetvli a0, zero, <vtype> encoding. This slightly reduces register pressure. This builds on 632f1c5, but reverses course a bit. It turns out to be quite complicated to canonicalize from VLMAX to immediate early because the sentinel value is widely used in tablegen patterns without knowledge of LMUL. Instead, we canonicalize towards the VLMAX representation, and then pick the immediate form during insertion since we have the LMUL information there. Within InsertVSETVLI, this could reasonable fit in a couple places. If reviewers want me to e.g. move it to emission, let me know. Doing so may require a bit of extra code to e.g. handle comparisons of the two forms, but shouldn't be too complicated.	2024-01-17 12:40:00 -08:00
Luke Lau	e8790027b1	[RISCV] Allow vsetvlis with same register AVL in doLocalPostpass (#76801 )	2024-01-11 12:12:46 +07:00
Luke Lau	274f8332b9	[RISCV] Don't attempt PRE if available info is SEW/LMUL ratio only (#77063 )	2024-01-07 14:23:01 +07:00
Philip Reames	4fa9697b47	[RISCV][InsertVSETVLI] Factor out isNonZeroLoadImmediate helper [nfc] Just reducing a bit of code duplication.	2023-12-15 11:20:01 -08:00
Philip Reames	46d1f30882	[RISCV][InsertSETVTLI] Handle large immediates in backwards walk (#75409 ) When doing our backwards walk, we were not handling the case where the AVL was defined by a register whose definition was an ADDI xN, x0, <imm>. Doing so (as we already do in the forward pass) allows us to prune a few more transitions.	2023-12-14 07:36:07 -08:00
Yingwei Zheng	3564c85b0e	[RISCV] Eliminate dead li after emitting VSETVLIs (#65934 ) This patch tracks li instructions that set AVL operands and does DCE after emitting VSETVLIs.	2023-12-13 23:18:48 +08:00
Luke Lau	30e200b81e	[RISCV] Remove forward declaration and unused argument. NFC	2023-12-12 17:51:35 +09:00
Luke Lau	6707b33b80	[RISCV] Don't set AVL if only zeroness is demanded (#74049 ) This refactors the logic in transferBefore so that we're moving in the direction of "keep the existing Info, only change what is needed". For the sake of review there are two commits in this PR: The former is needed to make the latter an NFC commit. Neither introduce any test diffs but the former is not technically NFC, hence why I did not precommit it. - [RISCV] Preserve AVL when previous info is ratio only in transferBefore - [RISCV] Don't change AVL if only zeroness is demanded. NFC	2023-12-12 17:30:18 +09:00
Luke Lau	39445046dc	[RISCV] Remove unecessary early exit in transferBefore (#74040 ) Previously we bailed if we encountered a pseudo without a VL op, i.e. vmv.x.s, which prevented us from preserving VL and VTYPE. It looks like this was copied over from a time whenever this code was operating on the MachineInstrs in place, see https://reviews.llvm.org/D127870 However because we no longer mutate the MIs, we can just get rid of this early exit which allows us to preserve VL and VTYPE when dealing with vmv.x.s.	2023-12-12 17:25:19 +09:00
Craig Topper	4162a9bca4	[RISCV] Cleanup pass initialization. Remove redundant initializations from pass constructors that were already being initialized by LLVMInitializeRISCVTarget().	2023-12-07 18:21:38 -08:00
Luke Lau	cf1a979ccf	[RISCV] Minimally modify incoming state in transferBefore (#72352 ) transferBefore currently takes an incoming state and an instruction, computes the new state needed for the instruction, and then modifies that new state to be more similar to the incoming state. This patch reverses the approach by instead taking the incoming state and modifying only the bits that are demanded by the instruction.	2023-12-01 13:51:18 +08:00
Luke Lau	36239f9418	[RISCV] Move AVL coalescing logic upwards into computeInfoForInstr. NFC (#73909 ) There is an optimisation in transferBefore where if a VSETVLIInfo uses the AVL of a defining vsetvli, it uses that vsetvli's AVL provided VLMAX is the same. This patch moves it out of transferBefore and up into computeInfoForInstr to show how it isn't affected by the other optimisations in transferBefore, and to simplify the control flow by removing an early return. This should make #72352 easier to reason about.	2023-12-01 13:11:45 +08:00
Luke Lau	c0b9269398	[RISCV] Add helper to copy the AVL of another VSETVLIInfo. NFC	2023-11-30 15:19:46 +08:00
Luke Lau	933dd03386	[RISCV] Remove checks that MI's info is valid. NFC It's always guaranteed to be valid since we compute it ourselves from MI. This should simplify an upcoming change in #72352	2023-11-20 13:17:15 +08:00
Luke Lau	69f64dedb0	[RISCV] Use DemandedFields instead of checking for vmv.s.x/vmv.x.s. NFC The property we're explicitly looking for is whether or not MI only cares about VL zeroness and not VL itself, so we can just use DemandedFields for this. This should simplify an upcoming change in #72352	2023-11-20 13:16:54 +08:00
Philip Reames	7ac8486e54	[RISCVInsertVSETVLI] Allow PRE with non-immediate AVLs (#71728 ) Extend our PRE logic to cover non-immediate AVL values. This covers large constant AVLs (which must be materialized in registers), and may help some code written explicitly with intrinsics. Looking at the existing code, I can't entirely figure out why I thought we needed VL == AVL to perform the PRE. My best guess is that I was worried about the VLMAX < VL < 2 * VLMAX case, but the spec explicitly says that vsetvli must be determinist on any particular AVL value. That case was, possibly by accident, covering another legality precondition. Specifically, by only returning true for immediate and VLMAX AVL values, we didn't encounter the case where the AVL was a register and that register wasn't available in the predecessor (e.g. if AVL is a load in the MBB block itself). --------- Co-authored-by: Luke Lau <luke_lau@icloud.com>	2023-11-09 08:03:13 -08:00
Wang Pengcheng	a316f14fdd	[RISCV][NFC] Move getRVVMCOpcode to RISCVInstrInfo (#70637 ) To simplify more code.	2023-10-30 19:03:04 +08:00
Luke Lau	c8e1fbc3cc	[RISCV] Keep same SEW/LMUL ratio if possible in forward transfer (#69788 ) For instructions like vmv.s.x and friends where we don't care about LMUL or the SEW/LMUL ratio, we can change the LMUL in its state so that it has the same SEW/LMUL ratio as the previous state. This allows us to avoid more VL toggles later down the line (i.e. use vsetvli zero, zero, which requires that the SEW/LMUL ratio must be the same) This is an alternative approach to the idea in #69259, but note that they don't catch exactly the same test cases.	2023-10-27 12:16:28 +01:00
Philip Reames	ca8d02d78a	[RISCV] Use a switch instead of a series of if-clauses [nfc] (try 2) This way the compiler can tell us about missing cases if we add a new value to this enum. Amusingly, the first time I landed this, I had indeed forgotten a switch case, and the build bots were quite happy to remind me of such.	2023-10-25 13:44:46 -07:00
Philip Reames	29181bd97a	Revert "[RISCV] Use a switch instead of a series of if-clauses [nfc]" This reverts commit 3c2203ae03ca8a8cf56691d6f03050ccc2420ff6. The buildbots were quick to remind me that I had, in fact, missed a switch case. Oops.	2023-10-25 13:34:10 -07:00
Philip Reames	3c2203ae03	[RISCV] Use a switch instead of a series of if-clauses [nfc] This way the compiler can tell us about missing cases if we add a new value to this enum.	2023-10-25 13:15:01 -07:00
Philip Reames	2732860ddb	[RISCV][InsertVSETVLI] Add Subtarget variable to class [nfc] A bit debateable since we could extract it from the MachineFunction (and thus the MachineInstr), but we have the same pattern for MachineFunction associated structure already for TII and MRI.	2023-10-24 11:58:25 -07:00
Philip Reames	fa7c50d00f	[RISCV] Rename hasFixedResult to willVLBeAVL [nfc]	2023-10-24 11:27:09 -07:00
Philip Reames	17b2935270	Revert "[RISCV][InsertVSETVLI] Make VL preserving vsetvli emission more explicit [nfc]" This reverts commit 20fc8e8df20e165d1c632bc80a0cebce2dc158f7. As pointed out in review of the mentioned follow up patch, this gets the predicate wrong. We need not simply VL being unchanged, but VLMAX being unchanged. Given that the code structure I'd introduced here is simply confusing.	2023-10-23 11:43:14 -07:00
Philip Reames	20fc8e8df2	[RISCV][InsertVSETVLI] Make VL preserving vsetvli emission more explicit [nfc] This just reorganizes the code to make it clear what the existing cases were doing in common. An upcoming change will extend the logic.	2023-10-20 12:04:07 -07:00
Yingwei Zheng	93fde2ea1b	[RISCV] Add a pass to rewrite rd to x0 for non-computational instrs whose return values are unused When AMOs are used to implement parallel reduction operations, typically the return value would be discarded. This patch adds a peephole pass `RISCVDeadRegisterDefinitions`. It rewrites `rd` to `x0` when `rd` is marked as dead. It may improve the register allocation and reduce pipeline hazards on CPUs without register renaming and OOO. Comparison with GCC: https://godbolt.org/z/bKaxnEcec Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D158759	2023-09-20 01:02:19 +08:00
Kito Cheng	af9b25f9db	[RISCV] Optimize floating point scalar move and splat In D158086, we limit all floating point scalar move and splat can't fuse vsetvli with different SEW, and this patch try to relax the constraint as possible by introducing new SEW demand type: SEWGreaterThanOrEqualAndLessThan64, that allow SEW fused with larger SEW, but constraint it can't fused with SEW=64. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D158177	2023-09-06 16:39:30 +08:00
Craig Topper	27d996e9e8	[RISCV] Remove Change field from BlockData in RISCVInsertVSETVLI. NFC In practice, this field is only used a return value from computeVLVTYPEChanges. Add a reference parameter to computeVLVTYPEChanges to return its info. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D158902	2023-08-29 13:47:15 -07:00
Philip Reames	dd0d36d09f	[RISCVInsertVSETVLI] Handle vl-preserve case in backwards rewrite This updates the backwards mutation code to handle the case where the previous vset was in vl-preserving (x0, x0) form, but that VL was never used before the next vset which changes the VL. Since this requires writing both VL operands, eliminate the restriction on removing GPR producing vsetv as well. (The register will now be written by the earlier vsetv.) Differential Revision: https://reviews.llvm.org/D158019	2023-08-21 12:28:28 -07:00
Kito Cheng	0816b3efbf	[RISCV] Check floating point vector instruction with SEW=64 is valid when vsetvl insertion Scalar move and splat instruction are only demand the SEW is greater than its own needs, but floating point vector with SEW=64 is not alwaws valid even SEW=64 is valid, because we have a special configuration: zve64f. So we need to check floating point vector instruction with SEW=64 is valid when compute demand of floating point scalar move and splat instruction. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D158086	2023-08-18 10:31:01 +08:00
Philip Reames	3c2a66973e	[RISCVInsertVSETVLI] Generalize scalar extract (vmv.x.s, and vmx.f.s) hamdling vmv.x.s and vmv.f.s are unconditional. They read the low element of a vector register (not vector group), and function even when VL=0 or VSTART>0. As such, they are don't care with respect to both VL and LMUL. We'd previously had handling in the forward pass only via the NoRegister mechanusm. (The only instructions with SEW but without VL are these extracts.) This patch moves that handling into getDemanded so that the backwards pass benefits as well. Differential Revision: https://reviews.llvm.org/D157991	2023-08-16 07:50:59 -07:00
Philip Reames	b06e52c32f	[RISCVInsertVSETVLI] Default to VL=1 for scalar extracts We were defaulting to VL=0 when we didn't otherwise have a vsetv nearby. Instead, let's use VL=1. VL=0 is very much a cornercase in hardware, and let's avoid if we can. Differential Revision: https://reviews.llvm.org/D158015	2023-08-16 07:35:00 -07:00
Philip Reames	a63bd7e99b	[RISCV] Use NoReg in place of IMPLICIT_DEF for undefined passthru operands In a recent series of refactorings (described here: https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295), I greatly increased the number of IMPLICIT_DEF operands to our vector instructions. This has turned out to have an unexpected negative impact because MachineCSE does not CSE IMPLICIT_DEFs, and thus does not CSE any instruction with an IMPLICIT_DEF operand. SelectionDAG does CSE the same case, but that only covers the same block case, not the cross block case. This lead to the performance regression reported in https://github.com/llvm/llvm-project/issues/64282. This change is a slightly ugly hack to side step the issue. Instead of fixing the root cause (lack of CSE for IMPLICIT_DEF) or undoing the operand changes, we leave the extra operand in place, and use NoReg in place of IMPLICIT_DEF. I then convert back to IMPLICIT_DEF just before register allocation so that ProcessImplicitDefs and TwoAddressInstructions can do the normal transforms to Undef tied registers. We may end up backporting this into the 17.x release branch. Given how late in the release cycle this is landing, that's much less likely now, but still a possibility. Differential Revision: https://reviews.llvm.org/D156909	2023-08-14 12:57:38 -07:00
Philip Reames	403261eafd	[RISCV] Remove legacy TA/TU pseudo distinction for load instructions This change continues with the line of work discussed in https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295. This change targets all the pseudos used in loads (unit, strided, segmented, fault first, and their combinations). As with previous changes in the series, we replace the existing TA and TU forms with a single unified pseudo with a passthru (which may be implicit_def) and a policy operand. One quirk is that I went ahead and treated the unmasked mask load instruction (vlm) the same way. We need the pass thru operand to model tail undefined, but since the instruction is unconditionally agnostic and the instruction has no mask, the policy operand is arguably unneeded. I kept it mostly for consistency sake. Another quirk worth highlighting is that segment loads require a bit of dedicated handling. Surprisingly, we don't have IMPLICIT_DEF nodes of the right types, and attempting to use them results in some odd looking codegen and a few crashes. Instead, I left the REG_SEQUENCE form, and extended InsertVSETVLI to recognize the complex undefs. Arguably, we should probably revisit the handling of undef reg_sequence nodes here, but I'm hoping to side step that in this patch. As before, we see codegen changes (some improvements and some regressions) due to scheduling differences caused by the extra implicit_def instructions. I did have to delete one register allocation regression test as I couldn't figure out how to meaningfully update it. I spent a significant amount of time trying, and finally gave up. Differential Revision: https://reviews.llvm.org/D154141	2023-07-05 13:11:58 -07:00
Craig Topper	354530fe19	[RISCV] Prevent vsetvli insertion from deleting some vsetvli instructions If the result register is used, it is not safe to delete. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D153076	2023-06-15 15:18:47 -07:00
Philip Reames	fc9b26440d	[RISCV][InsertVSETVLI] Treat vmv.v.i as-if it were vmv.s.x when VL=1, and inactive lanes are undefined A vmv.v.i/x splats the immediate to all active lanes. For the active lanes, this is the same as vmv.s.x which inserts one scalar into the low lane. If we can ignore all the inactive lanes (because they are known undefined), then the two are semantically equivalent. We already reason about compatible VL/VTYPE combinations for vmv.s.x, apply the same logic to vmv.v.i. Unlike a vmv.s.x, we do need to be careful not to increase LMUL. A splat instruction is probably linear in LMUL, so restrict this to LMUL1. Differential Revision: https://reviews.llvm.org/D152845	2023-06-15 14:10:04 -07:00
Philip Reames	807adcf4b9	[RISCV][InsertVSETVLI] Rework code structure to make reasoning about undefined lanes explicit [NFC] We already have several places in this code which reason about whether the inactive lanes are defined, and are about to add one more in D151653. Let's go ahead and common the code so that we don't have the same concept repeating in multiply places. Differential Revision: https://reviews.llvm.org/D152844	2023-06-14 09:48:31 -07:00
David Green	2802739dfd	[NFC] Replace ;; with ;	2023-06-11 10:25:24 +01:00
Luke Lau	f3b39ceaf5	[RISCV][InsertVSETVLI] Relax tail policy more often for vmv.s.x If a vm.s.x pseudo has an undef passthru operand, then we're free to use whatever tail policy we want for VL > 1. We previously relaxed the tail policy for this but only when we could also expand the SEW. This patch changes it to relax the tail policy even if the SEW can't be expanded and removes a few more toggles, as well as fully moving the vmv.s.x logic into getDemanded.	2023-05-31 18:18:44 +01:00
Luke Lau	badf11de4a	[RISCV][InsertVSETVLI] Avoid vmv.s.x SEW toggle if at start of block vmv.s.x/vfmv.s.f instructions that only write to the first destination element can use any SEW greater than or equal to its original SEW, provided that it's writing to an implicit_def operand where we can clobber the other lanes. We were already handling this in needVSETVLI, which meant that when scanning the instructions from top to bottom we could detect this and avoid the toggle: vsetivli zero, 4, e64, mf2, ta, ma li a0, 11 vsetivli zero, 1, e8, mf8, ta, ma vmv.s.x v0, a0 -> vsetivli zero, 4, e64, mf2, ta, ma li a0, 11 vmv.s.x v0, a0 The issue that this patch aims to solve is arises when the vmv.s.x is the first vector instruction in the block and doesn't have any prior predecessor info: entry_bb: li a0, 11 ; No previous state here: forced to set VL/VTYPE vsetivli zero, 1, e8, mf8, ta, ma vmv.s.x v0, a0 vsetivli zero, 4, e16, mf2, ta, ma vmerge.vvm v8, v9, v8, v0 doLocalPostpass can work backwards from bottom to top and work out if an earlier vsetvli can be mutated to avoid a toggle. It uses DemandedFields and getDemanded for this, which previously didn't take into account the possibility of going to a larger SEW. A previous patch consolidated the vmv.s.x logic from needVSETVLI logic into getDemanded, and this patch removes the gate around it so that doLocalPostpass can now delete vsetvlis like in the scenario below: entry_bb: li a0, 11 ; Previous vsetivli mutated: second one deleted vsetivli zero, 4, e16, mf2, ta, ma vmv.s.x v0, a0 vmerge.vvm v8, v9, v8, v0 Differential Revision: https://reviews.llvm.org/D151561	2023-05-31 18:18:44 +01:00
Luke Lau	257cc049f9	[RISCV][InsertVSETVLI] Move vmv.s.x SEW check into getDemandedBits. NFC This patch restructures the logic that checks if vmv.s.x's SEW can be expanded into getDemandedBits, so that it can be shared by both the top-to-bottom and bottom-to-top passes. It adds a third option for SEW in DemandedFields, that's weaker than demanded but stronger than not demanded, that states that it the new SEW must be greater than or equal to the current SEW. Note that we now need to take care of the order of operands in areCompatibleVTYPEs as the relation is no longer commutative. A later patch will remove the gating on the bottom-to-top pass (dolocalPostpass) and another one will relax the demands on the tail policy further.	2023-05-31 18:18:44 +01:00
Luke Lau	319adf5de7	Revert "[RISCV][InsertVSETVLI] Avoid vmv.s.x SEW toggle if at start of block" This reverts commit 0ba41dd3806e658e67acb63353fd5540f2bf333c.	2023-05-31 18:14:55 +01:00
Luke Lau	0ba41dd380	[RISCV][InsertVSETVLI] Avoid vmv.s.x SEW toggle if at start of block vmv.s.x and friends that only write to the first destination element can use any SEW greater than or equal to its original SEW, provided that it's writing to an implicit_def operand where we can clobber the other lanes. We were already handling this in needVSETVLI, which meant that when scanning the instructions from top to bottom we could detect this and avoid the toggle: ``` vsetivli zero, 4, e64, mf2, ta, ma li a0, 11 vsetivli zero, 1, e8, mf8, ta, ma vmv.s.x v0, a0 -> vsetivli zero, 4, e64, mf2, ta, ma li a0, 11 vmv.s.x v0, a0 ``` The issue that this patch aims to solve is whenever vmv.s.x arises when the first vector instruction in the block and doesn't have any prior predecessor info: ``` entry_bb: li a0, 11 ; No previous state here: forced to set VL/VTYPE vsetivli zero, 1, e8, mf8, ta, ma vmv.s.x v0, a0 vsetivli zero, 4, e16, mf2, ta, ma vmerge.vvm v8, v9, v8, v0 ``` doLocalPostpass can work backwards from bottom to top and work out if an earlier vsetvli can be mutated to avoid a toggle. It uses DemandedFields and getDemanded for this, which previously didn't take into account the possibility of going to a larger SEW. This patch adds a third option for SEW in DemandedFields, that's weaker than demanded but stronger than not demanded, that states that it the new SEW must be greater than or equal to the current SEW. We can then use this option to move that vmv.s.x specific logic from needVSETVLI into getDemanded, making it available for both phase 2 and 3, i.e. we can now mutate the earlier vsetivli going from bottom to top: ``` entry_bb: li a0, 11 ; Previous vsetivli mutated: second one deleted vsetivli zero, 4, e16, mf2, ta, ma vmv.s.x v0, a0 vmerge.vvm v8, v9, v8, v0 ``` Reviewed By: reames Differential Revision: https://reviews.llvm.org/D151561	2023-05-31 18:14:21 +01:00
Philip Reames	7639a39dd2	[RISCV][InsertVSETVLI] Support constant VLs larger than immediate encoding The immediate field on the vsetivli is fairly limited. For larger vectors, we end up having to materialize a constant in a register. We hadn't plumbed the infrastructure to treat such materialized constants as constants for purpose of vsetvli elimination. I only bothered to handle LI. We could extend this to LUI sequences, but well, 2048 elements is probably enough for all practical fixed length vector codegen. :) The test delta does point out a related problem. At LMUL8, we see increased register allocation pressure, and we should probably either a) address register allocation remat, or b) be less aggressive about eliminating vsetvlis at high lmul. Note that high LMUL code is not generated much by default. Differential Revision: https://reviews.llvm.org/D151212	2023-05-24 10:37:59 -07:00
Philip Reames	020812b64f	Reapply "[RISCV][InsertVSETVLI] Avoid VL toggles for extractelement patterns" The original change had a bug where it allowed SEW mutation. This is wrong in multiple ways, but an easy example is that the slide amount is in units of SEW, and thus that changing SEW changes the slide offset. I'd reverted this in 33314693 intending to more majorly rework the patch because in addition to the bug, I'd noticed a potential oppurtunity to increase scope. After implementing that variant, and realizing it triggered nowhere, I decided to go back to the prior patch with the minimal fix. Note there's no separate test case for the fix. This is because we already had multiple, and I just didn't realize the impact of the original test diff. Adding one more test would have been unlikely to catch that human error. Original commit message.. Noticed this while looking at some SLP output. If we have an extractelement, we're probably using a slidedown into an destination with no contents. Given this, we can allow the slideup to use a larger VL and clobber tail elements of the destination vector. Doing this allows us to avoid vsetvli toggles in many fixed length vector examples. Differential Revision: https://reviews.llvm.org/D148834	2023-05-10 11:51:51 -07:00
Philip Reames	33314693f5	Revert "[RISCV][InsertVSETVLI] Avoid VL toggles for extractelement patterns" This reverts commit 657d20dc75252f0c8415ada5214affccc3c98efe. A correctness problem was reported against the review and the fix warrants re-review.	2023-05-10 10:58:46 -07:00
Philip Reames	657d20dc75	[RISCV][InsertVSETVLI] Avoid VL toggles for extractelement patterns Noticed this while looking at some SLP output. If we have an extractelement, we're probably using a slidedown into an destination with no contents. Given this, we can allow the slideup to use a larger VL and clobber tail elements of the destination vector. Doing this allows us to avoid vsetvli toggles in many fixed length vector examples. Differential Revision: https://reviews.llvm.org/D148834	2023-05-01 18:46:54 -07:00
Craig Topper	5894eec874	[RISCV][WIP] Use vsetvli x0, x0 in more cases. If the AVL is a virtual register defined by a vsetvli with the same vlmax we need and the previous vsetvli we saw in the data flow also has that vlmax, we can use the x0, x0 form when we insert a vsetvli. Not only does this avoid an update of the VL physical register, but it may allow doLocalPostpass to completely remove the inserted vsetvli by rewriting the vtype of the previous vsetvli. Differential Revision: https://reviews.llvm.org/D148735	2023-04-20 13:58:28 -07:00
Craig Topper	0f4c9c016c	[RISCV] Replace RISCV->RISC-V in strings. To be consistent with RISC-V branding guidelines https://riscv.org/about/risc-v-branding-guidelines/ Think we should be using RISC-V where possible. D146449 already updated comments. Strings may have more user impact. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D146451	2023-03-27 09:50:17 -07:00

1 2 3 4

184 Commits