llvm-project

Author	SHA1	Message	Date
Wang Pengcheng	f24d9490e5	[RISCV] Match prefetch address with offset (#66072 ) A new ComplexPattern `AddrRegImmLsb00000` is added, which is like `AddrRegImm` except that if the least significant 5 bits isn't all zeros, we will fail back to offset 0.	2023-10-20 14:22:48 +08:00
Shao-Ce SUN	f48dab5237	Add RV64 constraint to SRLIW (#69416 ) Fixes #69408	2023-10-18 15:01:17 +08:00
Luke Lau	e577e7025d	[RISCV] Move vector pseudo hasAllNBitUsers switch into RISCVInstrInfo. NFC (#67593 ) The handling for vector pseudos in hasAllNBitUsers is duplicated across RISCVISelDAGToDAG and RISCVOptWInstrs. This deduplicates it between the two, with the common denominator between the two call sites being the opcode and SEW: We need to handle extracting these separately since one operates at the SelectionDAG level and the other at the MachineInstr level.	2023-10-03 12:24:11 +01:00
Craig Topper	3c0990c188	[RISCV] Generalize the (ADD (SLLI X, 32), X) special case in constant materialization. (#66931 ) We don't have to limit ourselves to a shift amount of 32. We can support other shift amounts that make the upper 32 bits line up.	2023-10-02 13:03:06 -07:00
Alex Bradbury	0b0ed8f76a	[RISCV] Add missing hunk to #67889 to fix test failures Without this, various CodeGen tests fail because a RISCV::FCVT_D_W[_IN32X] machine node is created without the rounding mode operand. The relevant PR was committed as bf94ba39b65d1212ea84d5783b393280e1ce7478	2023-10-01 11:34:57 +01:00
Craig Topper	e6b2525daf	[RISCV] Fix -Wsign-compare warning. NFC	2023-09-27 13:41:06 -07:00
Luke Lau	5ffbdd9ed5	[RISCV] Handle .vx pseudos in hasAllNBitUsers (#67419 ) Vector pseudos with scalar operands only use the lower SEW bits (or less in the case of shifts and clips). This patch accounts for this in hasAllNBitUsers for both SDNodes in RISCVISelDAGToDAG. We also need to handle this in RISCVOptWInstrs otherwise we introduce slliw instructions that are less compressible than their original slli counterpart. This is a reland of aff6ffc8760b99cc3d66dd6e251a4f90040c0ab9 with the refactoring omitted.	2023-09-27 19:53:50 +01:00
Philip Reames	487dd5f1e3	Revert "[RISCV] Handle .vx pseudos in hasAllNBitUsers (#67419 )" This reverts commit aff6ffc8760b99cc3d66dd6e251a4f90040c0ab9. Version landed differs from version reviewed in (stylistic) manner worthy of separate review.	2023-09-27 11:24:49 -07:00
Luke Lau	aff6ffc876	[RISCV] Handle .vx pseudos in hasAllNBitUsers (#67419 ) Vector pseudos with scalar operands only use the lower SEW bits (or less in the case of shifts and clips). This patch accounts for this in hasAllNBitUsers for both SDNodes in RISCVISelDAGToDAG. We also need to handle this in RISCVOptWInstrs otherwise we introduce slliw instructions that are less compressible than their original slli counterpart.	2023-09-27 18:12:29 +01:00
Craig Topper	65eb46877c	[RISCV] Explicitly create IMPLICIT_DEF instead of UNDEF for vectors i… (#67369 ) …n RISCVDAGToDAGISel::Select. UNDEF needs to go through isel itself. All of the nodes have been topologically sorted so that instruction selection precedes from root to entry node. If we create a new node that needs to go through isel, we have to insert it into the correct place in the topological sort. If we don't, it might not get selected at all in some cases. Some targets have a function like X86's insertDAGNode to sort newly created nodes. To avoid introducing such a function on RISC-V, we can directly emit the IMPLICIT_DEF node that UNDEF would get selected to.	2023-09-26 08:37:24 -07:00
Philip Reames	233b6ef66c	[RISCV] Handle EltType > XLEN case in VMV_V_X_VL to VMV_S_X_VL fold I'd guarded this case in D158874 to avoid regressions, and decided to go investigate what was going on. The solution turns out to be a generic splat matching extension to handle INSERT_SUBVECTOR. In theory, we could see these from other sources as well, but for some reason we only seem to see the i64 extract on rv32 case in practice. Not sure why that is to be honest. Differential Revision: https://reviews.llvm.org/D159230	2023-09-22 13:43:43 -07:00
Luke Lau	3510552df6	[RISCV] Check for COPY_TO_REGCLASS in usesAllOnesMask (#67037 ) Sometimes with mask vectors that have been widened, there is a CopyToRegClass node in between the VMSET and the CopyToReg. This is a resurrection of https://reviews.llvm.org/D148524, and is needed to remove the mask operand when it's extracted from a subvector as planned in https://github.com/llvm/llvm-project/pull/66267#discussion_r1331998919	2023-09-22 16:30:43 +01:00
Arthur Eubanks	0a1aa6cda2	[NFC][CodeGen] Change CodeGenOpt::Level/CodeGenFileType into enum classes (#66295 ) This will make it easy for callers to see issues with and fix up calls to createTargetMachine after a future change to the params of TargetMachine. This matches other nearby enums. For downstream users, this should be a fairly straightforward replacement, e.g. s/CodeGenOpt::Aggressive/CodeGenOptLevel::Aggressive or s/CGFT_/CodeGenFileType::	2023-09-14 14:10:14 -07:00
Nick Desaulniers	86735a4353	reland [InlineAsm] wrap ConstraintCode in enum class NFC (#66264 ) reland [InlineAsm] wrap ConstraintCode in enum class NFC (#66003) This reverts commit ee643b706be2b6bef9980b25cc9cc988dab94bb5. Fix up build failures in targets I missed in #66003 Kept as 3 commits for reviewers to see better what's changed. Will squash when merging. - reland [InlineAsm] wrap ConstraintCode in enum class NFC (#66003) - fix all the targets I missed in #66003 - fix off by one found by llvm/test/CodeGen/SystemZ/inline-asm-addr.ll	2023-09-13 13:31:24 -07:00
Reid Kleckner	ee643b706b	Revert "[InlineAsm] wrap ConstraintCode in enum class NFC (#66003 )" This reverts commit 2ca4d136124d151216aac77a0403dcb5c5835bcd. Also revert the followup, "[InlineAsm] fix botched merge conflict resolution" This reverts commit 8b9bf3a9f715ee5dce96eb1194441850c3663da1. There were SystemZ and Mips build errors, too many to fix forward.	2023-09-13 09:58:02 -07:00
Nick Desaulniers	2ca4d13612	[InlineAsm] wrap ConstraintCode in enum class NFC (#66003 ) Similar to commit 2fad6e69851e ("[InlineAsm] wrap Kind in enum class NFC") Fix the TODOs added in commit 93bd428742f9 ("[InlineAsm] refactor InlineAsm class NFC (#65649)")	2023-09-13 08:48:09 -07:00
Craig Topper	319aba645f	[RISCV] Teach MatInt to use (ADD_UW X, (SLLI X, 32)) to materialize some constants. If the high and low 32 bits are the same, we try to use (ADD X, (SLLI X, 32)) but that only works if bit 31 is clear since the low 32 bits will be sign extended. If we have Zba we can use add.uw to zero the sign extended bits. Reviewed By: reames, wangpc Differential Revision: https://reviews.llvm.org/D159253	2023-08-31 20:24:34 -07:00
Philip Reames	079c968eb9	[RISCV] Form vmv.s.f/x from single element splats via DAG combine This re-implements the special casing we had in lowerScalarSplat as a DAG combine. As can be seen in the tests, this ends up triggering in a bunch more cases. The semantically interesting bit of this change is the use of the implicit truncate semantics for when XLEN > SEW. We'd already been doing this for vmv.v.x, but this change extends e.g. the constant matching to make the same assumption about vmv.s.x. Per my reading of the specification, this should be fine, and if anything, is more obviously true of vmv.s.x than vmv.v.x. Differential Revision: https://reviews.llvm.org/D158874	2023-08-30 12:44:36 -07:00
Luke Lau	18c7bf0b85	[RISCV] Refactor selectVSplat. NFCI This patch shares the logic between the various splat ComplexPatterns to help the diff in some upcoming patches. It's worth noting that the uimm splat pattern now takes into account the implicit truncation + sign extend semantics of vmv_v_x_vl, but that doesn't seem to affect the result since it always took the sext value anyway. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D158741	2023-08-29 15:42:23 +01:00
Luke Lau	007b41b393	[RISCV] Don't relax policy to ta when vmerge's VL shrinks during folding When folding a vmerge into its operands, if the resulting VL is smaller than what the vmerge had originally then what was previously in its body then gets moved to the tail. In that case, we can't relax the tail policy to agnostic when the merge operand is undefined, since we need to preserve these elements past the new VL. Fixes https://github.com/llvm/llvm-project/issues/64754 Reviewed By: craig.topper, reames Differential Revision: https://reviews.llvm.org/D158161	2023-08-22 10:39:22 +01:00
Philip Reames	a63bd7e99b	[RISCV] Use NoReg in place of IMPLICIT_DEF for undefined passthru operands In a recent series of refactorings (described here: https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295), I greatly increased the number of IMPLICIT_DEF operands to our vector instructions. This has turned out to have an unexpected negative impact because MachineCSE does not CSE IMPLICIT_DEFs, and thus does not CSE any instruction with an IMPLICIT_DEF operand. SelectionDAG does CSE the same case, but that only covers the same block case, not the cross block case. This lead to the performance regression reported in https://github.com/llvm/llvm-project/issues/64282. This change is a slightly ugly hack to side step the issue. Instead of fixing the root cause (lack of CSE for IMPLICIT_DEF) or undoing the operand changes, we leave the extra operand in place, and use NoReg in place of IMPLICIT_DEF. I then convert back to IMPLICIT_DEF just before register allocation so that ProcessImplicitDefs and TwoAddressInstructions can do the normal transforms to Undef tied registers. We may end up backporting this into the 17.x release branch. Given how late in the release cycle this is landing, that's much less likely now, but still a possibility. Differential Revision: https://reviews.llvm.org/D156909	2023-08-14 12:57:38 -07:00
Craig Topper	a8c502a589	[RISCV] Add bf16 to isFPImmLegal. Part of this test file was stolen from D156895. We should merge them when committing. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D156926	2023-08-03 08:27:38 -07:00
Craig Topper	de7fa3ab9a	[RISCV] Copy memoperands in some of the post isel peepholes. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D156830	2023-08-02 09:16:14 -07:00
Jianjian GUAN	b7408ebbb7	[RISCV] Use x0 in vsetvli when avl is equal to vlmax. We could use x0 form in vsetvli when we already know the vlmax and avl is equal to it. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D156404	2023-07-31 09:49:40 +08:00
Luke Lau	ce8f094da8	[RISCV] Add patterns for vnsrl.vx where shift amount is truncated Similar to D155698 where the shift amount is extended, this patch extends the ComplexPattern to handle the case where the shift amount has been truncated. Truncations are custom lowered to truncate_vector_vl, and in cases like i64 -> i16 they are truncated by one power of two at a time, so we need to unravel nested layers of them. The pattern can also be reused for Zvbb's vwsll.vx in an upcoming patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155928	2023-07-26 20:26:32 +01:00
Luke Lau	33a83c5486	[RISCV] Add SDNode patterns for vrol.[vv,vx] and vror.[vv,vx,vi] These correspond to ROTL/ROTR nodes Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155439	2023-07-21 10:22:46 +01:00
Luke Lau	24628a14c4	[RISCV] Add patterns for vnsr[a,l].wx where shift amount has different type than vector element We're currently only matching scalar shift amounts where the type is the same as the vector element type. But because only the bottom log2(2*SEW) bits are used, only 7 bits will be used at most so we can use any scalar type >= i8. This patch adds patterns for the case above, as well as for when the shift amount type is the same as the widened element type and doesn't need extended. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155698	2023-07-21 10:13:28 +01:00
Simon Pilgrim	73f09814ee	Fix MSVC "'GetVMSetForLMul': not all control paths return a value" warning. NFC.	2023-07-19 18:55:37 +01:00
Luke Lau	efedcbeeb8	[RISCV] Fold ops into vmv.v.v as vmerge with all-ones mask A vmv.v.v shares the same encoding as a vmerge that isn't masked, so we can also fold it into its operands if we treat it as a vmerge with an all-ones mask. We take care here not to actually transform the existing vmv into a vmerge, otherwise things like True.hasOneUse() become inaccurate. Instead this just returns an equivalent list of operands. This is an alternative to D153351. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D155101	2023-07-19 17:24:42 +01:00
Luke Lau	0f277ab361	[RISCV] Fold vmerge into its ops with smaller VL if known Currently when folding vmerge into its operands, we stop if the VLs aren't identical. However since the body of (vmerge (vop)) is the intersection of vmerge and vop's bodies, we can use the smaller of the two VLs if we know it ahead of time. This patch relaxes the constraint on VL if they are both constants, or if either of them are VLMAX. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D155071	2023-07-19 17:24:40 +01:00
Philip Reames	8e024283bd	[RISCV] Minor style cleanups in post ISEL combines	2023-07-18 12:26:36 -07:00
Piyou Chen	7ce4e933ea	[RISCV] Implement prefetch locality by NTLH We add the MemOperand then backend will generate NTLH automatically. ``` __builtin_prefetch(ptr, 0 /* rw==read /, 0 / locality /); => ntl.all + prefetch.r (ptr) __builtin_prefetch(ptr, 0 / rw==read /, 1 / locality /); => ntl.pall + prefetch.r (ptr) __builtin_prefetch(ptr, 0 / rw==read /, 2 / locality /); => ntl.p1 + prefetch.r (ptr) __builtin_prefetch(ptr, 0 / rw==read /, 3 / locality */); => prefetch.r (ptr) ``` Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D154691	2023-07-16 20:32:46 -07:00
Craig Topper	d09109aa1e	[RISCV] Use isScalarInteger instead of isInteger. NFC The type should only be scalar here and the isScalarInteger should be a simpler check.	2023-07-15 22:52:43 -07:00
Philip Reames	b8e29dbe54	[RISCV] Common remaining operand logic in performCombineVMergeAndVOps [nfc] We can share the code for both the unmasked and masked cases, and add a missing consistency assert in the process. This is a subset of Luke's D155063. I'm splitting pieces and landing them in the process of convincing myself all the individual transforms are in fact correct. This is the last major piece.	2023-07-13 11:27:16 -07:00
Philip Reames	844fba2f84	[RISCV] Reason explicitly about mask and rounding mode in performCombineVMergeAndVOps [nfc] This is a subset of Luke's D155063. I'm splitting pieces and landing them in the process of convincing myself all the individual transforms are in fact correct. The code structure here is overly verbose. I'm landing this staging change with the code structure exactly matching the non-masked case to make the following cleanup that commons this all obviously correct.	2023-07-13 11:09:00 -07:00
Philip Reames	c1bb2d0b6c	[RISCV] Common post-mask operand construction in performCombineVMergeAndVOps [nfc] This is a subset of Luke's D155063. I'm splitting pieces and landing them in the process of convincing myself all the individual transforms are in fact correct. This particular change involves a slightly ugly bit of code to match the glue to the mask. I'm staging it this way as I ran into a bit of weirdness when commoning mask operands, and wanted to isolate the complexity.	2023-07-13 10:39:52 -07:00
Philip Reames	f648c9f71e	[RISCV] Tail common repeated code in performCombineVMergeAndVOps [nfc] Very minor change, just making sure each step is obvious and easy to follow. This is a subset of Luke's D155063. I'm splitting pieces and landing them in the process of convincing myself all the individual transforms are in fact correct.	2023-07-13 10:07:09 -07:00
Philip Reames	ca8ef82165	[RISCV] Factor out a dupiicate bit of repeated code in performCombineVMergeAndVOps [nfc] We have the SEW operand access repeating in all paths, common it up to make the code easier to read. This is a subset of Luke's D155063. I'm splitting pieces and landing them in the process of convincing myself all the individual transforms are in fact correct.	2023-07-13 09:59:02 -07:00
Philip Reames	11b986522c	[RISCV] Simplify glue handling logic in performCombineVMergeAndVOps [nfc] This is a subset of Luke's D155063. I'm splitting pieces and landing them in the process of convincing myself all the individual transforms are in fact correct. In this case, we're simplifying based on the assumption that all of our vmerge operands have mask operands. This is a fundemental property of a vmerge.	2023-07-13 08:41:33 -07:00
Luke Lau	ed15e9119b	[RISCV] Don't fold vmerge into ops if fp exception can be raised We are already checking for fp exceptions if VL changes, but I believe we should also be checking for them if the mask changes as well, since that also affects the set of active elements. From the spec: > A vector floating-point exception at any active floating-point element sets > the standard FP exception flags in the fflags register. Inactive elements do > not set FP exception flags. Note that we don't change the mask if IsMasked is true, i.e. True is masked already, since in that case we keep the existing mask. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D154980	2023-07-13 11:42:23 +01:00
eopXD	76482078cd	[RISCV][POC] Model frm control for vfadd Depends on D152879. Specification PR: riscv-non-isa/rvv-intrinsic-doc#226 This patch adds variant of `vfadd` that models the rounding mode control. The added variant has suffix `_rm` appended to differentiate from the existing ones that does not alternate `frm` and uses whatever is inside. The value `7` is used to indicate no rounding mode change. Reusing the semantic from the rounding mode encoding for scalar floating-point instructions. Additional data member `HasFRMRoundModeOp` is added so we can append `_rm` suffix for the fadd variants that models rounding mode control. Additional data member `IsRVVFixedPoint` is added so we can define pseudo instructions with rounding mode operand and distinguish the instructions between fixed-point and floating-point. Reviewed By: craig.topper, kito-cheng Differential Revision: https://reviews.llvm.org/D152996	2023-07-13 00:34:00 -07:00
Philip Reames	b5cbd9628e	[RISCV] Remove legacy TA/TU pseudo distinction of vmerge and carry-in arithmetic operations [NFC[ his change continues with the line of work discussed in https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295. This is analogous to other patches in the series, but with one key difference - the resulting pseudo does not have a policy operand. We could add one for vmerge, but the some of the multiclasses are sufficiently entwined with the mask producing arithmetic instructions that the change delta becomes unmanageable. Note that these instructions are not in the RISCVMaskedPseudo table, and thus the difference doesn't complicate other code. The main value of working incrementally here is that we get to eagerly cleanup the IsTA logic flowing through the post-ISEL combines. Differential Revision: https://reviews.llvm.org/D154645	2023-07-12 15:31:02 -07:00
Philip Reames	95d62344c0	[RISCV] Cleanup dead complexity in RISCVMaskedPseudo after TA/TU merge refactoring [nfc] After D154245 lands, we have greatly simplified the possible configurations for an entry in the RISCVMaskedPseudo table. This change goes through and reworks everything which uses that table to exploit the available simplifications. To justify the correctness here, let me note that we no longer had any use of HasTU=true. We were left with only the HasTu=false, and IsCombined=true\|false cases. The only usage is IsCombined=false was for the comparison operations. At the moment, these operations are the only ones in the table without vector policy operands. Instead of switching on the pseudo value, we can just check the VecPolicy flag instead. It may be worth adding a passthru operand to the comparisons (which is actually needed to represent tail undefined vs tail agnostic), and a vector policy operand (which is strictly unneeded) just for consistency, but we can do that in a follow up patch for some further simplification if desired. Note that we do have a few _TU pseudos left at this point. It's simply that none of them are in the RISCVMaskedPseudo table, and thus don't participate in our post-ISEL transforms. Differential Revision: https://reviews.llvm.org/D154620	2023-07-11 10:32:54 -07:00
Zi Xuan Wu (Zeson)	2ccb2dbc8d	[RISCV] Don't fold RISCVISD::VMV_V_X_VL series node and scalar load to vector load when scalar load is update load We try to fold RISCVISD::VMV_V_X_VL series node + scalar load -> vector load. But if scalar load is indexed load (load update form), it's not profitable to fold because load update node can't be removed after fold. Differential Revision: https://reviews.llvm.org/D152222	2023-07-11 15:56:31 +08:00
Philip Reames	403261eafd	[RISCV] Remove legacy TA/TU pseudo distinction for load instructions This change continues with the line of work discussed in https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295. This change targets all the pseudos used in loads (unit, strided, segmented, fault first, and their combinations). As with previous changes in the series, we replace the existing TA and TU forms with a single unified pseudo with a passthru (which may be implicit_def) and a policy operand. One quirk is that I went ahead and treated the unmasked mask load instruction (vlm) the same way. We need the pass thru operand to model tail undefined, but since the instruction is unconditionally agnostic and the instruction has no mask, the policy operand is arguably unneeded. I kept it mostly for consistency sake. Another quirk worth highlighting is that segment loads require a bit of dedicated handling. Surprisingly, we don't have IMPLICIT_DEF nodes of the right types, and attempting to use them results in some odd looking codegen and a few crashes. Instead, I left the REG_SEQUENCE form, and extended InsertVSETVLI to recognize the complex undefs. Arguably, we should probably revisit the handling of undef reg_sequence nodes here, but I'm hoping to side step that in this patch. As before, we see codegen changes (some improvements and some regressions) due to scheduling differences caused by the extra implicit_def instructions. I did have to delete one register allocation regression test as I couldn't figure out how to meaningfully update it. I spent a significant amount of time trying, and finally gave up. Differential Revision: https://reviews.llvm.org/D154141	2023-07-05 13:11:58 -07:00
Alex Bradbury	7e48c2d85e	[RISCV][NFC] Fix doc comment for RISCVDAGToDAGISel::selectSETCC The doc comment referred to a boolean parameter that has since been replaced with an ISD::CondCode.	2023-07-04 13:30:08 +01:00
Philip Reames	92b5a3405d	[RISCV] Remove legacy TA/TU pseudo distinction for unary instructions This change continues with the line of work discussed in https://discourse.llvm.org/t/riscv-transition-in-vector-pseudo-structure-policy-variants/71295. In D153155, we started removing the legacy distinction between unsuffixed (TA) and _TU pseudos. This patch continues that effort for the unary instruction families. The change consists of a few interacting pieces: * Adding a vector policy operand to VPseudoUnaryNoMaskTU. * Then using VPseudoUnaryNoMaskTU for all cases where VPseudoUnaryNoMask was previously used and deleting the unsuffixed form. * Then renaming VPseudoUnaryNoMaskTU to VPseudoUnaryNoMask, and adjusting the RISCVMaskedPseudo table to use the combined pseudo. * Fixing up two places in C++ code which manually construct VMV_V_* instructions. Normally, I'd try to factor this into a couple of changes, but in this case, the table structure is tied to naming and thus we can't really separate the otherwise NFC bits. As before, we see codegen changes (some improvements and some regressions) due to scheduling differences caused by the extra implicit_def instructions. Differential Revision: https://reviews.llvm.org/D153899	2023-06-29 07:34:14 -07:00
Jie Fu	47a4331cd7	[RISCV] Remove unused variables in RISCVISelDAGToDAG.cpp (NFC) /Users/jiefu/llvm-project/llvm/lib/Target/RISCV/RISCVISelDAGToDAG.cpp:97:33: error: unused variable 'FuncInfo' [-Werror,-Wunused-variable] RISCVMachineFunctionInfo *FuncInfo = ^ /Users/jiefu/llvm-project/llvm/lib/Target/RISCV/RISCVISelDAGToDAG.cpp:106:29: error: unused variable 'TLI' [-Werror,-Wunused-variable] const TargetLowering &TLI = CurDAG->getTargetLoweringInfo(); ^ 2 errors generated	2023-06-29 16:58:12 +08:00
Yunze Zhu	9d22b54d6b	[RISCV] Use temporary stack in expanding SPLAT_VECTOR_SPLIT_I64_VL node There is an issue: https://github.com/llvm/llvm-project/issues/63515 The issue is because when expanding SPLAT_VECTOR_SPLIT_I64_VL node, only memoperand is used to create dependency. However in ScheduleDAGNodes, dependency is checked with chain only, and breaks order of store/load instructions. I think in llvm.bitreverse.nxv2i64 intrinsic SPLAT_VECTOR_SPLIT_I64_VL nodes are parallel processed, so no chain should be add to these nodes. Using temporary in expanding SPLAT_VECTOR_SPLIT_I64_VL node can keep vlse instruction get correct value no matter order of store instructions is changed. Differential Revision: https://reviews.llvm.org/D153743	2023-06-29 16:45:16 +08:00
Philip Reames	49428bad58	[RISCV] Fix a typo in a comment	2023-06-27 13:08:11 -07:00

... 2 3 4 5 6 ...

535 Commits