In D127983, I had flipped from using the computed EEW to using the SEW value pulled from the VSETVLI when checking compatibility. This wasn't intentional, though thankfully it appears to be a non-functional difference. The new code does make an unchecked assumption that the initial SEW operand on the load/store is the EEW. This patch clarifies the assumption, and adds an assert to make sure this remains true.
Differential Revision: https://reviews.llvm.org/D128085
Doing so lets the post-mutation pass leverage the demanded info to rewrite vsetvlis before a store/mask-op in order to eliminate later vsetvlis.
Sorry for the lack of a store test change; every reasonable case I tried to write was already handled through existing logic.
This code should be dead. A simple whole register copy of an IMPLICIT_DEF is itself simply an IMPLICIT_DEF. (This would not be true for freeze, but is for copy.) If we find a case which gets here with a vector operand copy of an IMPLICIT_DEF, we most likely have an earlier missed optimization anyway. (The most recent case of this was e6c7a3a, found by Craig during review of this patch.) There might be others, and if so, we'll revisit them individually as regressions are reported.
Differential Revision: https://reviews.llvm.org/D127996
A splat of the value 0 or -1 as a sign-extended 12-bit immediate always produces the same bit pattern regardless of the etype used to perform the operation. As a result, we can sometimes avoid introducing a vsetvli just for the purposes of a splat.
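To make the bit-pattern claim concrete, here is a constructed pair of splats (the AVL and register choices are invented for illustration); both write the same 64 bits of ones into v8, so the etype of the splat is not observable from the bits written:
  vsetivli zero, 8, e8, m1, ta, mu
  vmv.v.i v8, -1    ; 8 elements x 0xff == 64 bits of all ones
  vsetivli zero, 2, e32, m1, ta, mu
  vmv.v.i v8, -1    ; 2 elements x 0xffffffff == the same 64 bits of all ones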
Looking at the diffs, we don't get a huge amount of immediate value out of this. We mostly push the vsetvli one instruction down, usually in front of a vmerge. We also don't get the corresponding fixed-length vector cases because VL typically changes even though the actual bits written are the same. Both of these are areas I plan to explore in future patches.
Interestingly, this makes a great example of why we need the forward and backward implementations to be consistent. Before we merged the demanded field handling, implementing only the forward direction meant we lost the ability to mutate a prior vsetvli and eliminate a later one entirely. This resulted in practical regressions instead of improvements. It's always nice when practice matches theory. :)
Differential Revision: https://reviews.llvm.org/D128006
This change merges the logic for reasoning about demanded portions of the VTYPE register between the main dataflow algorithm and the backwards mutation post pass. In the process, we get to delete a bunch of now redundant code.
This should be entirely NFC. I included a slight hack (see TODO) to avoid changing behavior in the post pass while being able to use the generalized logic in the prepass. I will fix the TODO in a separate change once this lands.
Differential Revision: https://reviews.llvm.org/D127983
If we're writing to an undef vector (i.e. implicit_def), we can change the value of bits outside the requested write without consequence. This allows us to avoid a VSETVLI just for narrowing the value written.
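As a constructed illustration (the configurations and registers are invented, not taken from this patch's tests): a splat that only needs to define the first four elements of an otherwise undef v16 would normally require a narrowing vsetvli; since the remaining lanes are undef, writing them too is harmless and the toggle can be dropped:
  vsetivli zero, 8, e32, m2, ta, mu
  vadd.vv v12, v8, v10
  vsetivli zero, 4, e32, m2, ta, mu   ; narrows VL just for the write below
  vmv.v.i v16, 0                      ; v16 is otherwise undef
can become:
  vsetivli zero, 8, e32, m2, ta, mu
  vadd.vv v12, v8, v10
  vmv.v.i v16, 0                      ; extra lanes only clobber undef bits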
Differential Revision: https://reviews.llvm.org/D127880
This change just moves some code around, and extracts out a helper function expected to be useful when reusing the demanded field logic in the forward dataflow.
The motivating case, and the only one actually enabled by this patch, is a load or store followed by another op with the same SEW/LMUL ratio.
As an example, consider:
define void @test1(ptr %in, ptr %out) {
entry:
  %0 = load <8 x i16>, ptr %in, align 2
  %1 = sext <8 x i16> %0 to <8 x i32>
  store <8 x i32> %1, ptr %out, align 4
  ret void
}
Without this patch, we get:
  vsetivli zero, 8, e16, mf4, ta, mu
  vle16.v v8, (a0)
  vsetvli zero, zero, e32, mf2, ta, mu
  vsext.vf2 v9, v8
  vse32.v v9, (a1)
  ret
Whereas with the patch we get:
  vsetivli zero, 8, e32, mf2, ta, mu
  vle16.v v8, (a0)
  vsext.vf2 v9, v8
  vse32.v v9, (a1)
  ret
We have rewritten the first vsetvli and thus removed the second one.
As is strongly hinted by the code structure and TODOs, I am planning on commoning this with all (or nearly all?) of the cases from isCompatible used in the forward data flow. This will be done in a series of following changes - some NFC reworks, and some reviewed optimization extensions.
Differential Revision: https://reviews.llvm.org/D127780
The VSETVLIInfo right after a VLEFF/VLSEGFF is currently unknown since these instructions modify VL. An unknown VSETVLIInfo forces a VSET(I)VLI to be inserted before the next vector operation. In fact, the next vector operation after a VLEFF/VLSEGFF may not need an inserted VSET(I)VLI if it uses the same VTYPE and the resulting VL of the VLEFF/VLSEGFF.
Take the below C code as an example,
  vint8m4_t vec_src1 = vle8ff_v_i8m4(str1, &new_vl, vl);
  vbool2_t mask1 = vmseq_vx_i8m4_b2(vec_src1, 0, new_vl);
vsetvli insertion currently adds redundant instructions for this. Assembly result:
  vsetvli a2,a2,e8,m4,ta,mu
  vle8ff.v v28,(a0)
  csrr a3,vl                    ; redundant
  vsetvli zero,a3,e8,m4,ta,mu   ; redundant
  vmseq.vi v25,v28,0
After D126794, VLEFF/VLSEGFF have an output operand holding the value of VL. This patch considers there to be a ghost vsetvli right after each VLEFF/VLSEGFF. The ghost VSET(I)VLI uses the VL output of the VLEFF/VLSEGFF as its AVL and the same VTYPE as the VLEFF/VLSEGFF. The ghost vsetvli must be redundant, and we can use it to compute the VSETVLIInfo right after the VLEFF/VLSEGFF.
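With that VSETVLIInfo known, the vmseq above can be proven compatible, and the two instructions marked redundant in the example disappear. A sketch of the expected assembly:
  vsetvli a2,a2,e8,m4,ta,mu
  vle8ff.v v28,(a0)
  vmseq.vi v25,v28,0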
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D127576
In an effort to make this code easier to read and extend, this splits out helper functions for the transfer function of the data flow. Due to the other results computed during the phases, we can't completely abstract away everything, but we can abstract the actual state transitions.
The motivation here is the following upcoming changes:
* The fault-first load patch - already approved; it will be rebased over this change - adds another case into the transferAfter path.
* An upcoming patch to fold the local prepass back into the main algorithm greatly complicates the transferBefore logic.
Differential Revision: https://reviews.llvm.org/D127761
This is possibly somewhat subjective, but having an explicitly named flag to track the property required and code structure that more closely matches phase 1/2 of the dataflow seems much easier to read.
Differential Revision: https://reviews.llvm.org/D126893
If we defer the mutation of the instruction, we can add the assert discussed in D126921. Once we do that, the API becomes subject to revision - but let's do that in a separate change.
These methods don't access any state from RISCVInstrInfo. Make them
free functions in the RISCV namespace.
Reviewed By: frasercrmck
Differential Revision: https://reviews.llvm.org/D127583
This reverts commit e018e493c1ac514504bbaa1d1396aec025142a31.
There are some problems with this commit; see the related revision: https://reviews.llvm.org/D127477
This patch is a replacement for D125199. Giving PseudoReadVL a vtype raises the worry of computing the same vtype of a VLEFF/VLSEGFF in two different places, DAGToDAG and InsertVSETVLI. A VLEFF/VLSEGFF MI with a VL output can still provide the vtype of the VLEFF/VLSEGFF to the users of its VL.
The patch names the new pseudos by suffixing the original VLEFF/VLSEGFF names with "_VL" and expands them in the RISCVInsertVSETVLI pass.
This patch also reverts commit 4537aae0d57e17c217c192d8977012ba475b130c,
"[RISCV] Make PseudoReadVL have the vtypes of the corresponding VLEFF/VLSEGFF.".
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D126794
The abstract state used in the data flow should not know anything about the instructions which produced the abstract states. Instead, when comparing two states, we can simply use information about the machine instr at that time.
In the old design, basically any use of the instruction flags on the current state (as opposed to a "Require" - aka upcoming - state) would be a bug. We don't seem to actually have any such bugs, but we can make this much more obvious with code structure.
Differential Revision: https://reviews.llvm.org/D126921
Once we've computed the incoming predecessor state, we should use the same compatibility check with knowledge of MI as we did in phase 2 in order to be consistent across all phases.
Differential Revision: https://reviews.llvm.org/D126574
This is a follow up to address a review comment from D124869. When deciding whether to PRE a vsetvli, we can allow non-LMUL1 vsetvlis.
Differential Revision: https://reviews.llvm.org/D126563
Split off from D125021.
We were duplicating logic across different phases. Since we want to
ensure a consistency of logic across phases for correctness, this patch
combines our multiple compatibility checks into one function to better
convey this.
Several methods were made const too.
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D126472
During insertion of VSETVLI, we have two related bits of code which decide whether we can reuse a previous vsetvli result. As was pointed out in the original review, these cases can allow any prior state for which we know that VL is the same for any value of AVL.
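As a constructed example of "VL is the same for any value of AVL" (the registers here are invented): e32/m1 and e16/mf2 share the same SEW/LMUL ratio, so both configurations compute an identical VL for every possible AVL:
  vsetvli a1, a0, e32, m1, ta, mu    ; VL = min(a0, VLEN/32)
  vsetvli a2, a0, e16, mf2, ta, mu   ; same ratio, so the same VL for any a0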
This was originally separated out of a desire for separate tests and review. As it turns out, finding a test case for this has been quite challenging. Most of the cases I tried were already handled through other chains of logic. We do have one correct test change, but that only exercises one of the two changes.
Differential Revision: https://reviews.llvm.org/D126400
be2cb8 fixes the case which triggered the revert. Reapply, and let's see if anything else falls out.
Original commit message:
These asserts are believed to hold after several recent miscompiles have been fixed. If you see an assertion failure on this change, please toggle the default back and make sure you file a bug with a reproducer. We may have as yet uncaught miscompiles lurking in this code.
Differential Revision: https://reviews.llvm.org/D125271
This moves mutation entirely out of the main algorithm.
The immediate trigger is that we hit another case of the same issue I thought we'd fixed in 72925d9. It turns out we hadn't considered the cross block case.
As a brief summary, the issue being fixed is that if we mutate a previous vsetvli in phase 3, there's a possibility that some later use of that vsetvli changes "compatibility". In the cross_block_mutate test, this later vsetvli occurs in another block (and is thus visit order dependent too!). This causes us to fail strict asserts. (To be explicit, the current on-by-default workaround should compensate. It's only when we turn that off that we have problems.)
Now, I want to explicitly call out an alternate workaround. We could leave the mutation in phase 3, and simply restrict it to the case where the previous vsetvli's GPR result is unused. That covers the case we've actually seen. (I'll note that codegen regressions with a simple form of this were significant. We might have to check specifically for the use-outside-block case to keep them reasonable, which complicates the workaround slightly.)
Personally, I'm at the point where I want the mutation pulled out just for robustness' sake. I'm worried there's yet one more form of this bug we haven't thought about.
The other motivation for this change is that it does give us a couple of minor codegen wins. None appear to be hugely significant, but improvements never hurt, right?
Differential Revision: https://reviews.llvm.org/D125270
This is a straightforward extension of the PRE transform introduced in D124869 to handle the VLMAX case.
The test changes here look quite positive. This surprised me until I realized that all the tests are using @llvm.vscale to figure out the VLMAX, not the llvm.riscv.vsetvlmax intrinsic. If they'd used the latter, these would have been full redundancy cases and fully handled by the data flow. I'm not really sure if the use of vscale here is representative or not. If it is, we should probably look at using VSETVLI to lower vscale rather than a raw read of vlenb and some math.
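To sketch that last idea (an illustration, not code from this patch): on RISC-V, vscale equals VLEN/64, which is exactly VLMAX at SEW=64 and LMUL=1, so a single vsetvli could replace the CSR read plus shift:
  csrr a0, vlenb                      ; current lowering: vscale = vlenb >> 3
  srli a0, a0, 3
versus:
  vsetvli a0, zero, e64, m1, ta, mu   ; alternative: VLMAX at e64/m1 equals vscale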
Differential Revision: https://reviews.llvm.org/D126338
When the AVL value does not fit in 5 bits, the register in which this value is stored may be marked as killed before the point where we want to forward it. This patch ensures the kill flags on the register are cleared before forwarding.
Patch by: loralb
Differential Revision: https://reviews.llvm.org/D125971
This patch teaches the VSETVLI insertion pass to perform a very limited form of partial redundancy elimination. The motivating example comes from the fixed length vectorization of a simple loop such as:
  for (unsigned i = 0; i < a_len; i++)
    a[i] += b;
Without this change, the core vector loop and preheader are as follows:
.LBB0_3: # %vector.ph
  andi a1, a6, -8
  addi a4, a0, 16
  mv a5, a1
.LBB0_4: # %vector.body
  # =>This Inner Loop Header: Depth=1
  addi a3, a4, -16
  vsetivli zero, 4, e32, m1, ta, mu
  vle32.v v8, (a3)
  vle32.v v9, (a4)
  vadd.vx v8, v8, a2
  vadd.vx v9, v9, a2
  vse32.v v8, (a3)
  vse32.v v9, (a4)
  addi a5, a5, -8
  addi a4, a4, 32
  bnez a5, .LBB0_4
The key thing to note here is that the execution of the vsetivli only needs to happen once. Since there's no tail folding happening here, the values of the vector configuration registers are invariant through the loop.
After this patch, we hoist the configuration into the preheader and perform it once.
.LBB0_3: # %vector.ph
  andi a1, a6, -8
  vsetivli zero, 4, e32, m1, ta, mu
  addi a4, a0, 16
  mv a5, a1
.LBB0_4: # %vector.body
  # =>This Inner Loop Header: Depth=1
  addi a3, a4, -16
  vle32.v v8, (a3)
  vle32.v v9, (a4)
  vadd.vx v8, v8, a2
  vadd.vx v9, v9, a2
  vse32.v v8, (a3)
  vse32.v v9, (a4)
  addi a5, a5, -8
  addi a4, a4, 32
  bnez a5, .LBB0_4
Differential Revision: https://reviews.llvm.org/D124869
This reverts commit 79a66ec97b4fb8cbc4e0a81ead356caf5507a6ea.
The stronger asserts served their purpose; I stumbled across another bug. Will reapply once this one is also fixed.
The bug appears to be a variant of a previous one:
* We mutate an instruction in one block.
* That mutation changes the phase3 results of another block.
This is very similar to a previous issue, except cross block instead of within a single block.
This patch adds a transform to the local prepass in InsertVSETVLI which canonicalizes an AVL of a register from another vsetvli into an immediate or VLMAX when the VTYPE is the same. In this patch, I chose to be conservative and avoid arbitrary vreg forwarding due to profitability concerns about possibly overlapping live ranges.
This has the effect of eliminating vsetvli instructions in loops which are walking either VLMAX or a constant number of lanes per iteration.
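A constructed example of the canonicalization (the immediates and registers are invented): when the AVL register is defined by a prior vsetvli with the same VTYPE and an immediate AVL, that immediate can be substituted, making the second vsetvli visibly redundant to the main dataflow:
  vsetivli a0, 4, e32, m1, ta, mu
  ...
  vsetvli zero, a0, e32, m1, ta, mu   ; AVL comes from the vsetivli above
becomes:
  vsetivli a0, 4, e32, m1, ta, mu
  ...
  vsetivli zero, 4, e32, m1, ta, mu   ; same VL; now trivially removable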
Differential Revision: https://reviews.llvm.org/D125812
These asserts are believed to hold after several recent miscompiles have been fixed. If you see an assertion failure on this change, please toggle the default back and make sure you file a bug with a reproducer. We may have as yet uncaught miscompiles lurking in this code.
Differential Revision: https://reviews.llvm.org/D125271
With recent fixes to the dataflow in place, we now never pass
Strict=true to isCompatible, so remove the parameter completely.
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D125748
Our current implementation of the InsertVSETVLI dataflow allows phase 3 to arrive at a different block end state than the data flow in phase 1/2 computed. This arises because a block which contains instructions (e.g. loads or stores) which don't consume all the incoming bits of VL/VTYPE can be compatible with multiple incoming states. The algorithm effectively changes the SEW on such instructions, and propagates the prior state forward. As phase 3 uses the block input state for this propagation, but phase 1/2 doesn't, this can result in different block end states.
If we don't correct for it, this discrepancy can result in miscompiles. This was the source of multiple recent bugs. However, by now we have fixes for all known correctness issues.
The basic strategy we use is to insert a compensation vsetvli to bring the block state leaving the block back into consistency with the one computed. This is correct, but results in extra vsetvlis being placed at the end of blocks.
This change adjusts the phase 1/2 algorithm to propagate the incoming block state through the block, allowing the compatibility rules to modify the end state. The algorithm may need to run slightly more iterations, but the end result is consistent with what phase 3 does.
The benefit of doing this is twofold.
First, we reverse some of the code quality regressions introduced by the functional fixes.
Second, we simplify the invariants, and allow the strict assertions to be enabled. Several humans, myself included, have found it quite surprising that this invariant didn't hold already, and arguably that confusion is the cause of several of our recent miscompiles in this code.
The downside to this patch is that the dataflow may require additional iterations to stabilize. In the worst case, we go from O(Edges) to O(Edges + UniquePaths), as the incoming state (and thus the outgoing one) can now change once for each path from the entry block.
Differential Revision: https://reviews.llvm.org/D125232
We've got a lurking problem with our data flow implementation where different phases disagree, resulting in possible miscompiles. D119518 introduced a workaround, but failed to consider blocks which only contain loads/stores compatible with their incoming state.
When I went to rebase and simplify D125232, it turned out that not all of the correctness issues had been fixed yet after all. This is the correctness fix accidentally embedded in the original more complicated version.
Note that the test changes here are mostly regressions. It's worth noting that the simplified version of D125232 exactly reverses all the non-functional diffs in the test caused here. D125232 should be the immediate following commit.
Differential Revision: https://reviews.llvm.org/D125703
We've got a lurking problem with our data flow implementation where different phases disagree, resulting in possible miscompiles. D119518 introduced a workaround, but failed to consider blocks without terminators (e.g. fallthroughs).
I have a deeper rework of the algorithm in flight over in D125232, but this patch is specifically a minimal fix for an active miscompile. That change can be reworked over this once landed.
Differential Revision: https://reviews.llvm.org/D125408
We should consolidate the operand counting and ordering into
RISCVBaseInfo.h and stop spreading it around.
Reviewed By: asb
Differential Revision: https://reviews.llvm.org/D125344