When using Greedy Register Allocation, there are cases where
early-clobber values are ignored and assigned the same register as
another operand. This is illegal behaviour for these instructions. To
work around this, using pseudo instructions for early-clobber registers
gives them a definition and allows Greedy to assign them a different
register. This then conforms to the ARM Architecture Reference Manual
and matches the defined behaviour.
This patch takes the existing RISC-V patch, makes it target
independent, and then adds support for the ARM architecture. Doing this
ensures early-clobber constraints are followed when targeting ARM.
Making the pass target independent also opens up the possibility of
adding support for other architectures in the future.
This helper function handles common cases where we can determine a
constant value is being defined in a register. Although it looks like
codegen changes are possible due to this being called in
PeepholeOptimizer, my main motivation is to use this in
describeLoadedValue.
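A minimal sketch of the kind of case such a helper can handle (the
helper's actual name and scope aren't shown here; this RISC-V-flavoured
example is hypothetical):
```
// Hypothetical sketch: recognise `ADDI rd, x0, imm`, the canonical way
// a small constant is materialised into a register on RISC-V.
static std::optional<int64_t> getConstantDefinedValue(const MachineInstr &MI) {
  if (MI.getOpcode() == RISCV::ADDI && MI.getOperand(1).isReg() &&
      MI.getOperand(1).getReg() == RISCV::X0 && MI.getOperand(2).isImm())
    return MI.getOperand(2).getImm();
  return std::nullopt;
}
```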
These are picked up from getMemOperandsWithOffsetWidth but weren't then
being passed through to shouldClusterMemOps, which forces backends to
collect the information again if they want to use the kind of heuristics
typically used for the similar shouldScheduleLoadsNear function (e.g.
checking the offset is within 1 cache line).
This patch just adds the parameters, but doesn't attempt to use them.
There is potential to use them in the current PPC and AArch64
shouldClusterMemOps implementation, and I intend to use the offset in
the heuristic for RISC-V. I've left these for future patches in the
interest of being as incremental as possible.
As noted in the review and in an inline FIXME, an ElementCount-style abstraction may later be used to condense these two parameters to one argument. ElementCount isn't quite suitable as it doesn't support negative offsets.
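For reference, the amended hook ends up with roughly this shape (a
sketch; treat the exact parameter list as illustrative):
```
virtual bool shouldClusterMemOps(ArrayRef<const MachineOperand *> BaseOps1,
                                 int64_t Offset1, bool OffsetIsScalable1,
                                 ArrayRef<const MachineOperand *> BaseOps2,
                                 int64_t Offset2, bool OffsetIsScalable2,
                                 unsigned ClusterSize,
                                 unsigned NumBytes) const;
```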
This adds minimal support for load clustering, but disables it by
default. The intent is to iterate on the precise heuristic and the
question of turning this on by default in a separate PR. Although
previous discussion indicates hope that the MachineScheduler would
replace most uses of the SelectionDAG scheduler, it does seem most
targets aren't using MachineScheduler load clustering right now:
PPC+AArch64 seem to just use it to help with paired load/store formation
and although AMDGPU uses it for general clustering it also implements
ShouldScheduleLoadsNear for the SelectionDAG scheduler's clustering.
This hook is called by the default implementation of
getMemOperandWithOffset and by the load/store clustering code in the
MachineScheduler though this isn't enabled by default and is not yet
enabled for RISC-V. Only return true for queries on scalar loads/stores
for now (this is a conservative starting point, and vector load/store
can be handled in a follow-on patch).
Don't blindly copy the original flags from the pre-reassociated
instructions.
This copied the integer poison flags which are not safe to preserve
after reassociation.
For the FP flags, I think we should only keep the intersection of
the flags. Override setSpecialOperandAttr to do this.
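A minimal sketch of the intended flag handling (assuming the two
pre-reassociation instructions are in hand as `OldMI1`/`OldMI2`; names
illustrative):
```
// FP fast-math flags: keep only those common to both original
// instructions. Integer poison flags (nuw/nsw) are not copied at all,
// as they are unsafe to preserve after reassociation.
uint32_t IntersectedFlags = OldMI1.getFlags() & OldMI2.getFlags();
NewMI1.setFlags(IntersectedFlags);
NewMI2.setFlags(IntersectedFlags);
```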
Fixes #72777.
This hook is called by the target-independent implementation of
TargetInstrInfo::describeLoadedValue. I've opted to test it via a C++
unit test, which although fiddly to set up seems the right way to test a
function with such clear intended semantics (rather than testing the
impact indirectly).
isAddImmediate will never recognise ADDIW as an add immediate, which I
_think_ is conservatively correct, as the caller may not understand its
semantics vs ADDI.
Note that although the doc comment for isAddImmediate specifies its
behaviour solely in terms of physical registers, none of the current
in-tree implementations (including this one) bail out on virtual
registers (see #72357).
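A minimal sketch of the implementation's shape (assuming the usual
RegImmPair return type; details illustrative):
```
// Recognise `ADDI rd, rs, imm` defining Reg. ADDIW is deliberately not
// treated as an add immediate (see above).
std::optional<RegImmPair> RISCVInstrInfo::isAddImmediate(const MachineInstr &MI,
                                                         Register Reg) const {
  if (MI.getOpcode() == RISCV::ADDI && MI.getOperand(0).getReg() == Reg &&
      MI.getOperand(1).isReg() && MI.getOperand(2).isImm())
    return RegImmPair{MI.getOperand(1).getReg(), MI.getOperand(2).getImm()};
  return std::nullopt;
}
```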
Both DAGCombiner and InstCombine tend to canonicalize GE/LE into
GT/LT, namely:
```
X >= C --> X > (C - 1)
```
This sometimes generates off-by-one constants that could have been
CSE'd with surrounding constants.
Instead of changing such canonicalization, this patch tries to swap
those branch conditions post-isel, in the hope of resurfacing more
constant CSE opportunities. More specifically, it performs the following
optimization:
For two constants C0 and C1 from
```
li Y, C0
li Z, C1
```
To remove the redundant `li Y, C0`,
1. if C1 = C0 + 1 we can turn:
(a) blt Y, X -> bge X, Z
(b) bge Y, X -> blt X, Z
2. if C1 = C0 - 1 we can turn:
(a) blt X, Y -> bge Z, X
(b) bge X, Y -> blt Z, X
This optimization will be done by PeepholeOptimizer through
RISCVInstrInfo::optimizeCondBranch.
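As a concrete illustration of case 1(a), with C0 = 5 and C1 = 6
(registers and labels illustrative):
```
li   a0, 5              # Y = C0
li   a2, 6              # Z = C1 = C0 + 1
blt  a0, a1, .LBB0_1    # taken iff a1 > 5
# =>
li   a2, 6
bge  a1, a2, .LBB0_1    # taken iff a1 >= 6: same condition, and the
                        # `li a0, 5` can now be removed if a0 is unused
```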
Call this method directly from each vector case with the correct
arguments. This allows us to treat each type of copy as its own
special case and not pass variables to a common merge point. This
is similar to how AArch64 is structured.
I think I can reduce the number of operands to this new method, but
I'll do that as a follow up.
This uses the recently introduced sink-and-fold support in MachineSink.
https://reviews.llvm.org/D152828
This enables folding ADDI into load/store addresses.
Enabling by default will be a separate PR.
The handling for vector pseudos in hasAllNBitUsers is duplicated across
RISCVISelDAGToDAG and RISCVOptWInstrs. This deduplicates it between the
two, with the common denominator between the two call sites being the
opcode and SEW: we need to handle extracting these separately since one
operates at the SelectionDAG level and the other at the MachineInstr
level.
With `-fsanitize=kcfi` (Kernel Control-Flow Integrity), Clang emits
"kcfi" operand bundles to indirect call instructions. Similarly to
the target-specific lowering added in D119296, implement KCFI operand
bundle lowering for RISC-V.
This patch disables the generic KCFI pass for RISC-V in Clang, and
adds the KCFI machine function pass in `RISCVPassConfig::addPreSched`
to emit target-specific `KCFI_CHECK` pseudo instructions before calls
that have KCFI operand bundles. The machine function pass also bundles
the instructions to ensure we emit the checks immediately before the
calls, which is not possible with the generic pass.
`KCFI_CHECK` instructions are lowered in `RISCVAsmPrinter` to a
contiguous code sequence that traps if the expected hash in the
operand bundle doesn't match the hash before the target function
address. This patch emits an `ebreak` instruction for error handling
to match the Linux kernel's `BUG()` implementation. Just like for X86,
we also emit trap locations to a `.kcfi_traps` section to support
error handling, as we cannot embed additional information to the trap
instruction itself.
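The emitted check sequence has roughly the following shape (hedged
sketch: register choice, hash materialisation, and the exact load
offset are illustrative, not the precise lowering):
```
lw    t1, -4(a0)          # load the type hash stored before the target
lui   t2, HASH_HI         # materialise the expected hash, split into
addiw t2, t2, HASH_LO     # upper/lower immediates
beq   t1, t2, .Lpass      # hashes match: fall through to the call
ebreak                    # mismatch: trap; recorded in .kcfi_traps
.Lpass:
```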
Relands commit 62fa708ceb027713b386c7e0efda994f8bdc27e2 with fixed
tests.
Reviewed By: MaskRay
Differential Revision: https://reviews.llvm.org/D148385
This commit implements the two NTLH intrinsic functions.
```
type __riscv_ntl_load (type *ptr, int domain);
void __riscv_ntl_store (type *ptr, type val, int domain);
```
```
enum {
  __RISCV_NTLH_INNERMOST_PRIVATE = 2,
  __RISCV_NTLH_ALL_PRIVATE,
  __RISCV_NTLH_INNERMOST_SHARED,
  __RISCV_NTLH_ALL
};
```
We encode the non-temporal domain into MachineMemOperand flags.
1. Create the RISC-V built-in functions with custom semantic checking.
2. Assume the domain argument is a compile-time constant, and encode it
as LLVM IR metadata (nontemp_node).
3. Encode the domain value as a two-bit MachineMemOperand TargetMMOflag.
4. Select the corresponding NTLH instruction according to the
MachineMemOperand TargetMMOflag.
Currently, scalar types and fixed-length vector types are supported.
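Illustrative usage (hypothetical code):
```
int ntload(int *p) {
  // Hint that the accessed line should only be kept in the innermost
  // private cache level.
  int x = __riscv_ntl_load(p, __RISCV_NTLH_INNERMOST_PRIVATE);
  __riscv_ntl_store(p + 1, x, __RISCV_NTLH_INNERMOST_PRIVATE);
  return x;
}
```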
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D143364
It was only in RISCVInstrInfo because it was used by 2 passes, but those
passes have been merged in D147173.
Reviewed By: asb
Differential Revision: https://reviews.llvm.org/D147174
I think it's good practice to avoid having default ctors unless they're really
valid/useful. For OutlinedFunction the default ctor was used to represent a
bail-out value for getOutliningCandidateInfo(), so I changed the API to return
an optional<OutlinedFunction> instead, which seems a tad cleaner.
Differential Revision: https://reviews.llvm.org/D146375
D145471 added overrides of the other signature to return MemBytes,
but shouldn't have removed these overrides.
These signatures will now call the MemBytes signature and ignore
the MemBytes. This matches X86.
Refer to: https://reviews.llvm.org/D44782
After https://reviews.llvm.org/D130302, LW+SEXT.B can be folded into LB
as a partial reload of a stack slot. This produces incorrect
optimization results from `StackSlotColoring` when the exact number of
bytes loaded from the stack is not given: LB+SW are misinterpreted as a
full reload/restore of the stack slot without the sign extension, so
the SW would be considered a redundant store.
The testcase is copied from llvm/test/CodeGen/X86/pr30821.mir.
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D145471
For in-order cores, MachineCombiner makes better decisions when the
critical path is calculated only for the current basic block and does
not take into account other blocks from the trace.
This patch adds a virtual method to TargetInstrInfo to allow each
target to decide which strategy to use.
Depends on D140541
Reviewed By: spatel
Differential Revision: https://reviews.llvm.org/D140542
The motivation behind this patch is to unify some of the outliner logic across architectures. This looks nicer in general and makes fixing [issues like this](https://reviews.llvm.org/D124707#3483805) easier.
There are some notable changes here:
1. `isMetaInstruction()` is used directly instead of checking for specific meta-instructions like `IMPLICIT_DEF` or `KILL`. This was already done in the RISC-V implementation, but other architectures still did hardcoded checks.
- As an exception to this, CFI instructions are explicitly delegated to the target because RISC-V has different handling for those.
2. `isTargetIndex()` checks are replaced with an assert; none of the architectures supported actually use `MO_TargetIndex` at this point in time.
3. `isCFIIndex()` and `isFI()` checks are also replaced with asserts, since these operands should not exist in [any context](https://reviews.llvm.org/D122635#3447214) at this stage in the pipeline.
Reviewed by: paquette
Differential Revision: https://reviews.llvm.org/D125072
Move to RISCVInstrInfo since we need RISCVSubtarget now.
Instead of asking if only the lower 32 bits are used we can now
ask if the lower N bits are used. This will be needed by a future
patch.
SLLI and ADD are more compressible than SLLIW and ADDW: in compressed form, SLLI/ADD both have a 5-bit register encoding, while SLLIW/ADDW have a 3-bit register encoding and require the destination to also be one of the sources.
We aggressively form ADDW/SLLIW as it helps hasAllWBitUsers in RISCVISelDAGToDAG to not require recursion. So we need a pass to remove excessive -w suffixes.
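For instance, when only the lower 32 bits of the result are consumed
(illustrative):
```
addw a0, a0, a1   # compressible to c.addw only if a0, a1 are in x8-x15
# =>
add  a0, a0, a1   # compressible to c.add across the full register set
```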
Differential Revision: https://reviews.llvm.org/D139948
sifive-7-series can predicate ALU instructions in the shadow of a
branch, not just move instructions.
This patch implements analyzeSelect/optimizeSelect to predicate
these operations. This is based on ARM's implementation which can
predicate using flags and condition codes.
I've restricted it to just the instructions we have test cases for,
but it can be extended in the future.
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D140053
With D134950, targets get notified when a virtual register is created and/or
cloned, and can act on the notification through the delegate callback. AMDGPU
propagates the virtual register flags maintained in the target file itself;
they are useful for identifying a certain type of machine operand while
inserting spill stores and reloads. Since RegAllocFast spills the physical
register itself, there is no way to map it back to its virtual register to
retrieve the flags. This is solved by passing the virtual register as an
additional argument. The argument is unused when the spill interfaces are
called from the greedy allocator or even the PrologEpilogInserter, which can
pass a null register in such cases.
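The spill interface thus gains a trailing virtual-register argument,
roughly (sketch based on the description above):
```
virtual void storeRegToStackSlot(MachineBasicBlock &MBB,
                                 MachineBasicBlock::iterator MI,
                                 Register SrcReg, bool isKill, int FrameIndex,
                                 const TargetRegisterClass *RC,
                                 const TargetRegisterInfo *TRI,
                                 Register VReg) const;
```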
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D138656
Machine combiner supports generic reassociation only of associative and
commutative instructions, for example (A + X) + Y => (X + Y) + A. However, we
can extend this generic support to handle patterns like
(X + A) - Y => (X - Y) + A, where `-` is the inverse of `+`.
This patch adds interface functions to process reassociation patterns of
associative/commutative instructions and their inverse variants with minimal
changes in backends.
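As an illustration of the inverse pattern on RISC-V (registers
illustrative):
```
add a2, a0, a1   # X + A
sub a3, a2, a4   # (X + A) - Y
# =>
sub a2, a0, a4   # X - Y
add a3, a2, a1   # (X - Y) + A
```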
Differential Revision: https://reviews.llvm.org/D136754
To check reassociation correctness for RISCV, we must ensure that the root and
its sibling have equal rounding modes (for floating-point instructions).
`hasReassociableSibling` is a good place to make additional target-dependent
checks.
This patch allows us to enable the default machine combiner mechanism to
gather reassociation candidates on RISCV.
Differential Revision: https://reviews.llvm.org/D138302
This patch adds transformation of fmul+fadd/fsub chains into fused multiply
instructions:
* fmul+fadd->fmadd
* fmul+fsub->fmsub/fnmsub
We will also try to combine these instructions when the fmul has more than one
use and so cannot be deleted. However, removing the dependence between the fmul
and the fadd can still be profitable, and we rely on the machine combiner's
scheduling approximations.
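For example (double precision, registers illustrative):
```
fmul.d  fa3, fa0, fa1        # t = a * b
fadd.d  fa0, fa3, fa2        # t + c
# =>
fmadd.d fa0, fa0, fa1, fa2   # a * b + c in a single instruction
```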
Differential Revision: https://reviews.llvm.org/D136764
This is a refactor for another patch. For now we move the vreg
creation to the caller.
Reviewed By: frasercrmck
Differential Revision: https://reviews.llvm.org/D135008
We can always fold zext.b since it is just andi. The others require
Zba/Zbb.
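For example (illustrative):
```
ld     a0, 8(sp)   # reload the spilled value
zext.b a0, a0      # alias for `andi a0, a0, 255`, always available
# =>
lbu    a0, 8(sp)   # zero-extension folded into the reload
```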
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D130302
We can use lw to load 4 bytes from the stack and sign extend them
instead of loading all 8 bytes.
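For example (illustrative):
```
ld     a0, 8(sp)   # reload all 8 bytes
sext.w a0, a0      # then sign-extend the low 32 bits
# =>
lw     a0, 8(sp)   # lw sign-extends as it loads
```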
Reviewed By: asb
Differential Revision: https://reviews.llvm.org/D129948
Computing a scalable offset needs up to two scratch registers. We add
scavenge spill slots according to the result of `RISCV::isRVVSpill`
and `RVVStackSize`. Since ADDI is not included in `RISCV::isRVVSpill`,
PEI doesn't add scavenge spill slots for scratch registers when using
ADDI to compute scalable stack offsets.
The ADDI instruction has a destination register which can be used as
a scratch register, so one scavenge spill slot is sufficient for
computing scalable stack offsets.
Differential Revision: https://reviews.llvm.org/D128188
These methods don't access any state from RISCVInstrInfo. Make them
free functions in the RISCV namespace.
Reviewed By: frasercrmck
Differential Revision: https://reviews.llvm.org/D127583