llvm-project

Author	SHA1	Message	Date
Alex Bradbury	b717365216	[MachineScheduler][NFCI] Add Offset and OffsetIsScalable args to shouldClusterMemOps (#73778 ) These are picked up from getMemOperandsWithOffsetWidth but weren't then being passed through to shouldClusterMemOps, which forces backends to collect the information again if they want to use the kind of heuristics typically used for the similar shouldScheduleLoadsNear function (e.g. checking the offset is within 1 cache line). This patch just adds the parameters, but doesn't attempt to use them. There is potential to use them in the current PPC and AArch64 shouldClusterMemOps implementation, and I intend to use the offset in the heuristic for RISC-V. I've left these for future patches in the interest of being as incremental as possible. As noted in the review and in an inline FIXME, an ElementCount-style abstraction may later be used to condense these two parameters to one argument. ElementCount isn't quite suitable as it doesn't support negative offsets.	2023-12-06 15:30:48 +00:00
Alex Bradbury	d6fbd96e5e	[RISCV] Support FrameIndex operands in getMemOperandsWithOffsetWidth / getMemOperandWithOffsetWidth (#73802 ) I noted AArch64 happily accepts a FrameIndex operand as well as a register. This doesn't cause any changes outside of my C++ unit test for the current state of in-tree, but this will cause additional test changes if #73789 is rebased on top of it. Note that the returned Offset doesn't seem at all as meaningful if you have a FrameIndex base, though the approach taken here follows AArch64 (see D54847). This change won't harm the approach taken in shouldClusterMemOps because memOpsHaveSameBasePtr will only return true if the FrameIndex operand is the same for both operations.	2023-12-05 21:26:56 +00:00
Alex Bradbury	85c9c16895	[RISCV] Support load clustering in the MachineScheduler (off by default) (#73754 ) This adds minimal support for load clustering, but disables it by default. The intent is to iterate on the precise heuristic and the question of turning this on by default in a separate PR. Although previous discussion indicates hope that the MachineScheduler would replace most uses of the SelectionDAG scheduler, it does seem most targets aren't using MachineScheduler load clustering right now: PPC+AArch64 seem to just use it to help with paired load/store formation and although AMDGPU uses it for general clustering it also implements ShouldScheduleLoadsNear for the SelectionDAG scheduler's clustering.	2023-11-29 10:01:55 +00:00
Alex Bradbury	9c5003cc0c	[RISCV] Implement RISCVInstrInfo::getMemOperandsWithOffsetWidth (#73681 ) This hook is called by the default implementation of getMemOperandWithOffset and by the load/store clustering code in the MachineScheduler though this isn't enabled by default and is not yet enabled for RISC-V. Only return true for queries on scalar loads/stores for now (this is a conservative starting point, and vector load/store can be handled in a follow-on patch).	2023-11-29 04:48:43 +00:00
Craig Topper	a845061935	[AArch64] Use the same fast math preservation for MachineCombiner reassociation as X86/PowerPC/RISCV. (#72820 ) Don't blindly copy the original flags from the pre-reassociated instrutions. This copied the integer poison flags which are not safe to preserve after reassociation. For the FP flags, I think we should only keep the intersection of the flags. Override setSpecialOperandAttr to do this. Fixes #72777.	2023-11-22 14:17:45 -08:00
Craig Topper	9ae04a77d1	[RISCV] Don't set nsw/nuw/exact flag after MachineCombiner reassociation. This matches what PowerPC and X86 do.	2023-11-19 11:08:28 -08:00
Alex Bradbury	7f28e8ced7	[RISCV] Implement RISCVInstrInfo::isAddImmediate (#72356 ) This hook is called by the target-independent implementation of TargetInstrInfo::describeLoadedValue. I've opted to test it via a C++ unit test, which although fiddly to set up seems the right way to test a function with such clear intended semantics (rather than testing the impact indirectly). isAddImmediate will never recognise ADDIW as an add immediate which I _think_ is conservatively correct, as the caller may not understand its semantics vs ADDI. Note that although the doc comment for isAddImmediate specifies its behaviour solely in terms of physical registers, none of the current in-tree implementations (including this one) bail out on virtual registers (see #72357).	2023-11-16 14:43:31 +00:00
Alex Bradbury	ac378ac493	[RISCV][NFC] Rewrite doc comment for RISCVInstrInfo::getMemOperandWithOffsetWidth Attempt to clarify the expected behaviour.	2023-11-15 14:50:37 +00:00
Craig Topper	e0e0891d74	[RISCV][GISel] Select G_BRCOND and G_ICMP together when possible. This allows us to fold the G_ICMP operands into the conditional branch. This reuses the helper function we have for folding a G_ICMP into G_SELECT.	2023-11-12 15:53:23 -08:00
Wang Pengcheng	e179b125fb	[RISCV][NFC] Pass MCSubtargetInfo instead of FeatureBitset in RISCVMatInt (#71770 ) The use of `hasFeature` is more descriptive and the callers of `RISCVMatInt` have no need to call `getFeatureBits()` any more.	2023-11-09 15:15:23 +08:00
Min-Yih Hsu	1e39575a98	[RISCV] CSE by swapping conditional branches (#71111 ) DAGCombiner, as well as InstCombine, tend to canonicalize GE/LE into GT/LT, namely: ``` X >= C --> X > (C - 1) ``` Which sometime generates off-by-one constants that could have been CSE'd with surrounding constants. Instead of changing such canonicalization, this patch tries to swap those branch conditions post-isel, in the hope of resurfacing more constant CSE opportunities. More specifically, it performs the following optimization: For two constants C0 and C1 from ``` li Y, C0 li Z, C1 ``` To remove redundnat `li Y, C0`, 1. if C1 = C0 + 1 we can turn: (a) blt Y, X -> bge X, Z (b) bge Y, X -> blt X, Z 2. if C1 = C0 - 1 we can turn: (a) blt X, Y -> bge Z, X (b) bge X, Y -> blt Z, X This optimization will be done by PeepholeOptimizer through RISCVInstrInfo::optimizeCondBranch.	2023-11-03 09:03:52 -07:00
Craig Topper	284d136c4a	[RISCV] Teach copyPhysReg to allow copies between GPR<->FPR32/FPR64 (#70525 ) This is needed because GISel emits copies instead of bitcasts like SelectionDAG.	2023-10-30 09:58:51 -07:00
Wang Pengcheng	a316f14fdd	[RISCV][NFC] Move getRVVMCOpcode to RISCVInstrInfo (#70637 ) To simplify more code.	2023-10-30 19:03:04 +08:00
Craig Topper	8363996894	[RISCV] Reduce the number of parameters to copyPhysRegVector. NFC (#70502 ) The Lmul and SubRegIdx can be derived from the opcode. Make NF default to 1.	2023-10-27 13:36:44 -07:00
Craig Topper	b679ec86e3	[RISCV] Use RISCVInstrInfo::movImm to implement most of RISCVPostRAExpandPseudo::expandMovImm (#70389 )	2023-10-27 13:35:56 -07:00
Craig Topper	035c154f4f	[RISCV] Refactor RISCVPostRAExpandPseudo::expandMovImm and RISCVInstrInfo::movImm to prepare for merging. Fix small bug where RISCVPostRAExpandPseudo::expandMovImm set the kill flag on X0.	2023-10-27 13:35:56 -07:00
Craig Topper	c18e78cfe3	[RISCV] Add copyPhysRegVector to extract common vector code out of copyPhysRegVector. (#70497 ) Call this method directly from each vector case with the correct arguments. This allows us to treat each type of copy as its own special case and not pass variables to a common merge point. This is similar to how AArch64 is structured. I think I can reduce the number of operands to this new method, but I'll do that as a follow up.	2023-10-27 12:43:56 -07:00
Craig Topper	124613c8ed	[RISCV] Separate FPR and VR copyPhysReg implementation. (#70492 ) This duplicates the BuildMI from the 3 FP copy types, but separates them from the VR code and gets rid of the IsScalableVector flag. I need to add FPR<->GPR copies for GISel which needs another variation of BuildMI. So it seems cleaner to start enumerating each case as its own if.	2023-10-27 12:25:42 -07:00
Craig Topper	116eb323b1	[RISCV] Correct copyPhysReg for GPRPF64. (#70419 ) GPRF64 represents a pair of registers. We were only copying the even part. We need to copy the odd part too.	2023-10-26 23:54:46 -07:00
Jim Lin	c249e27786	[RISCV] Add missing break in switch-case in convertToThreeAddress function. NFC.	2023-10-25 17:51:17 +08:00
Wang Pengcheng	69ade08b4a	[RISCV][NFC] Fix comments in foldMemoryOperandImpl (#70033 ) I think the TODO is stale now.	2023-10-25 11:13:42 +08:00
Sacha Coppey	776889bc1c	[RISCV] Add Stackmap/Statepoint/Patchpoint support without targets This patch adds stackmap support for RISC-V without targets (i.e. the nop patchable forms). Reviewed By: reames Differential Revision: https://reviews.llvm.org/D123496	2023-10-11 09:18:55 +05:30
Craig Topper	45636ecf2c	[RISCV] Add sink-and-fold support for RISC-V. (#67602 ) This uses the recently introduced sink-and-fold support in MachineSink. https://reviews.llvm.org/D152828 This enables folding ADDI into load/store addresses. Enabling by default will be a separate PR.	2023-10-07 10:38:35 -07:00
Luke Lau	e577e7025d	[RISCV] Move vector pseudo hasAllNBitUsers switch into RISCVInstrInfo. NFC (#67593 ) The handling for vector pseudos in hasAllNBitUsers is duplicated across RISCVISelDAGToDAG and RISCVOptWInstrs. This deduplicates it between the two, with the common denominator between the two call sites being the opcode and SEW: We need to handle extracting these separately since one operates at the SelectionDAG level and the other at the MachineInstr level.	2023-10-03 12:24:11 +01:00
Craig Topper	62f5636838	[RISCV] Don't set KILL flag on X0 in RISCVInstrInfo::movImm. Extracted from #67159.	2023-09-25 13:40:08 -07:00
Craig Topper	bbe3ee061f	[RISCV] Add more instructions for the short forward branch optimization. (#66789 ) This adds the shifts and the immediate forms of the instructions that were already supported. There are still more instructions that can be predicated, but this is the rest of what we had in our downstream.	2023-09-19 10:21:39 -07:00
liqin.weng	700042cd88	[RISCV] Remove debug location to spill/reload instructions Spill/reload instructions are artificially generated by the compiler and have no relation to the original source code. So the best thing to do is not attach any debug location to them (instead of just taking the next debug location we find on following instructions). Refered to https://reviews.llvm.org/rG3e081703c349dd00b8ef6991c2d15964915dd8f4 Reviewed By: asb, kito-cheng, benshi001 Differential Revision: https://reviews.llvm.org/D129173	2023-09-09 16:39:28 +08:00
Craig Topper	e4b2f2d4a6	[RISCV][GISel] Legalize G_PHI and G_BRCOND. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D157818	2023-08-14 10:21:58 -07:00
Craig Topper	da56750f82	[RISCV] Change naming of vector pseudos with scalar FP operand. We need a pseudo for each scalar FP register class. Previously we distinquished the pseudos by naming them with F16, F32, F64, or BF16 in place of the F in the normal instruction name. Because these strings can appear in other parts of the name we had to do things like matching "_VBF16" to "_VF". This patch replaces the F16, F32, F64 strings with FPR16, FPR32, and FPR64. We also use FPR16 for BF16 since that is the scalar register class for bf16. Since the FPR16/32/64 string does not anywhere else in the pseudo names, we can use this to simplify the string replacements. This also allows us to simplify some BF16 related code. Reviewed By: wangpc Differential Revision: https://reviews.llvm.org/D157749	2023-08-12 11:20:47 -07:00
Alex Bradbury	667602793b	[RISCV] Implement support for bf16 select when zfbfmin is enabled These test cases previously caused an error. RISCVInstrInfo::copyPhysReg also needed a tweak in order to account for copying bf16 values in FPR16 registers. Differential Revision: https://reviews.llvm.org/D156883	2023-08-02 20:04:30 +01:00
Craig Topper	f3b4c266e8	[RISCV] Adjust the Zfhmin handling in RISCVInstrInfo::copyPhysReg. Instead of checking '!Zfh && Zhfmin' first, handle Zfh. Then assert that the other case is F+Zfhmin. The F+Zfhmin check will need to be relaxed for bfloat16 support. As it was written before there would be now error to catch that. Instead it would just silently create fsgnj.h instructions.	2023-07-16 20:20:59 -07:00
eopXD	00093667b1	[2/8][RISCV] Add rounding mode control variant for vfwadd, vfwsub Depends on D154628 For the cover letter of the patch-set, please checkout D154628. This is the 2nd patch of the patch-set. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D154629	2023-07-13 00:42:00 -07:00
Craig Topper	1aecb0e000	[RISCV] Clear kill flags when forming FMA instructions in MachineCombiner. If the operands to the mul have other uses we may be extending their live range past a kill flag. Reviewed By: asb, asi-sc Differential Revision: https://reviews.llvm.org/D155046	2023-07-12 08:03:45 -07:00
Philip Reames	5cd41dc62d	[RISCV] Remove legacy TA/TU pseudo distinction for binary instructions This change continues with the line of work discussed in https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295. This change handles most of the binary pseudos. I excluded pseudos which _TIED variants, and those that produce mask results. Both a bit different in functionality, and deserve their own change and review. As with previous changes in the series, we replace the existing TA and TU forms with a single unified pseudo with a passthru (which may be implicit_def) and a policy operand. As before, we see codegen changes (some improvements and some regressions) due to scheduling differences caused by the extra implicit_def instructions. Differential Revision: https://reviews.llvm.org/D154245	2023-07-11 10:21:42 -07:00
Philip Reames	92b5a3405d	[RISCV] Remove legacy TA/TU pseudo distinction for unary instructions This change continues with the line of work discussed in https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295. In D153155, we started removing the legacy distinction between unsuffixed (TA) and _TU pseudos. This patch continues that effort for the unary instruction families. The change consists of a few interacting pieces: * Adding a vector policy operand to VPseudoUnaryNoMaskTU. * Then using VPseudoUnaryNoMaskTU for all cases where VPseudoUnaryNoMask was previously used and deleting the unsuffixed form. * Then renaming VPseudoUnaryNoMaskTU to VPseudoUnaryNoMask, and adjusting the RISCVMaskedPseudo table to use the combined pseudo. * Fixing up two places in C++ code which manually construct VMV_V_* instructions. Normally, I'd try to factor this into a couple of changes, but in this case, the table structure is tied to naming and thus we can't really separate the otherwise NFC bits. As before, we see codegen changes (some improvements and some regressions) due to scheduling differences caused by the extra implicit_def instructions. Differential Revision: https://reviews.llvm.org/D153899	2023-06-29 07:34:14 -07:00
Philip Reames	c6b56cec8b	[RISCV] Check that SEW and policy operands are immediates in verifier This converts a crash (due an assertion inside getImm) into a verifier failure. Much easier to debug when you have malformed instructions.	2023-06-26 11:45:17 -07:00
Craig Topper	b105b3266f	[RISCV] Properly handle partial writes in isConvertibleToVMV_V_V. We were only checking for the previous insructions to write exactly the register or a super register. We ignored writes to a subregister and continued searching for the producing instruction. We need to abort instead. There's another check inside the if body to abort if the registers don't match exactly. So we just need to check for overlap so we enter the if body. Reviewed By: fakepaper56 Differential Revision: https://reviews.llvm.org/D153490	2023-06-25 23:08:47 -07:00
Sami Tolvanen	83835e22c7	[RISCV] Implement KCFI operand bundle lowering With `-fsanitize=kcfi` (Kernel Control-Flow Integrity), Clang emits "kcfi" operand bundles to indirect call instructions. Similarly to the target-specific lowering added in D119296, implement KCFI operand bundle lowering for RISC-V. This patch disables the generic KCFI pass for RISC-V in Clang, and adds the KCFI machine function pass in `RISCVPassConfig::addPreSched` to emit target-specific `KCFI_CHECK` pseudo instructions before calls that have KCFI operand bundles. The machine function pass also bundles the instructions to ensure we emit the checks immediately before the calls, which is not possible with the generic pass. `KCFI_CHECK` instructions are lowered in `RISCVAsmPrinter` to a contiguous code sequence that traps if the expected hash in the operand bundle doesn't match the hash before the target function address. This patch emits an `ebreak` instruction for error handling to match the Linux kernel's `BUG()` implementation. Just like for X86, we also emit trap locations to a `.kcfi_traps` section to support error handling, as we cannot embed additional information to the trap instruction itself. Relands commit 62fa708ceb027713b386c7e0efda994f8bdc27e2 with fixed tests. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D148385	2023-06-23 22:57:56 +00:00
Sami Tolvanen	e809ebeb6c	Revert "[RISCV] Implement KCFI operand bundle lowering" This reverts commit 62fa708ceb027713b386c7e0efda994f8bdc27e2. Reverting to investigate -verify-machineinstrs errors in MIR tests.	2023-06-23 21:42:57 +00:00
Sami Tolvanen	62fa708ceb	[RISCV] Implement KCFI operand bundle lowering With `-fsanitize=kcfi` (Kernel Control-Flow Integrity), Clang emits "kcfi" operand bundles to indirect call instructions. Similarly to the target-specific lowering added in D119296, implement KCFI operand bundle lowering for RISC-V. This patch disables the generic KCFI pass for RISC-V in Clang, and adds the KCFI machine function pass in `RISCVPassConfig::addPreSched` to emit target-specific `KCFI_CHECK` pseudo instructions before calls that have KCFI operand bundles. The machine function pass also bundles the instructions to ensure we emit the checks immediately before the calls, which is not possible with the generic pass. `KCFI_CHECK` instructions are lowered in `RISCVAsmPrinter` to a contiguous code sequence that traps if the expected hash in the operand bundle doesn't match the hash before the target function address. This patch emits an `ebreak` instruction for error handling to match the Linux kernel's `BUG()` implementation. Just like for X86, we also emit trap locations to a `.kcfi_traps` section to support error handling, as we cannot embed additional information to the trap instruction itself. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D148385	2023-06-23 18:25:24 +00:00
Craig Topper	017a24eb1b	[RISCV] Minor refactoring of some code in copyPhysReg. NFC Move some of the vmv.v.i handling into the vmv.v.v if. This reduces the scope of one variable.	2023-06-21 18:02:58 -07:00
Craig Topper	894d047056	[RISCV] Remove RISCVII::hasMergeOp. NFC We can mostly get this from the operand info in MCInstrDesc. The exception is the _TIED pseudos so I've added a new flag for those. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D152313	2023-06-13 10:54:00 -07:00
Craig Topper	0ce8163f18	[RISCV] Use const reference when looping over RISCVMatInt::InstSeq. NFC	2023-06-06 14:27:28 -07:00
Dávid Bolvanský	09515f2c20	[SDAG] Preserve unpredictable metadata, teach X86CmovConversion to respect this metadata Sometimes an developer would like to have more control over cmov vs branch. We have unpredictable metadata in LLVM IR, but currently it is ignored by X86 backend. Propagate this metadata and avoid cmov->branch conversion in X86CmovConversion for cmov with this metadata. Example: ``` int MaxIndex(int n, int a) { int t = 0; for (int i = 1; i < n; i++) { // cmov is converted to branch by X86CmovConversion if (a[i] > a[t]) t = i; } return t; } int MaxIndex2(int n, int a) { int t = 0; for (int i = 1; i < n; i++) { // cmov is preserved if (__builtin_unpredictable(a[i] > a[t])) t = i; } return t; } ``` Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D118118	2023-06-01 20:56:44 +02:00
Shao-Ce SUN	8b90f8e04b	[RISCV][CodeGen] Support Zdinx on RV32 codegen This patch was split from D122918 . Co-Author: @StephenFan @liaolucy @realqhc Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D149743	2023-05-25 14:13:37 +08:00
Qihan Cai	773b0aaa49	[RISCV][CodeGen] Support Zhinx and Zhinxmin This patch was split from D122918. Co-Author: @liaolucy @sunshaoce Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D149811	2023-05-12 18:31:35 +10:00
Jie Fu	814e8d7c5f	[RISCV] Omit the template parameters in getSerializableMachineMemOperandTargetFlags() Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D149927	2023-05-05 14:12:16 +08:00
Craig Topper	b2420c67e4	[RISCV] Restrict valid indices for cm.jalt to be in [32,255]. Reviewed By: jrtc27 Differential Revision: https://reviews.llvm.org/D149901	2023-05-04 18:02:06 -07:00
Shao-Ce SUN	2dc0fa050e	[RISCV][CodeGen] Support Zdinx on RV64 codegen This patch was split from D122918 . Co-Author: @liaolucy @realqhc Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D149665	2023-05-04 09:00:40 +08:00
Shao-Ce SUN	fe558efe71	[RISCV][CodeGen] Support Zfinx codegen This patch was split from D122918 . Co-Author: @liaolucy @realqhc Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D148874	2023-05-03 00:13:38 +08:00

1 2 3 4 5 ...

252 Commits