llvm-project

Author	SHA1	Message	Date
Philip Reames	3df27e9741	[RISCV] Minor style cleanup in advance of D141311 [nfc]	2023-01-09 11:31:54 -08:00
Alexey Baturo	35b8bb0ab3	[RISC-V][HWASAN] Don't explicitly load GOT entry to call hwasan mismatch routine Reviewed by: luismarques Differential Revision: https://reviews.llvm.org/D132994	2023-01-09 16:46:28 +03:00
liqinweng	1f8746cc80	[RISCV][CostModel] Add half type support for the cost model of sqrt/fabs 1. Refactor for costs of sqrt/fabs 2. Add half type support for the cost model of sqrt/fabs Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D132908	2023-01-09 12:57:03 +08:00
liqinweng	f3408739da	[RISCV][CostModel] Add cost model for integer abs Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D132999	2023-01-09 11:38:24 +08:00
Benjamin Kramer	b6942a2880	[NFC] Hide implementation details in anonymous namespaces	2023-01-08 17:37:02 +01:00
LiDongjin	4554663bc0	Recommit "[RISCV] Enable the LocalStackSlotAllocation pass support" This includes a fix for the tramp3d failure from the llvm-testsuite that caused the last revert. Hopefully the other failures were the same issue. Original commit message: For RISC-V, load/store instructions (excluding vector load/store) have only a 12-bit immediate operand. If the offset is out of range, a temp register must be used to make up the offset. If several such offsets differ from each other by a small (IsInt<12>) relative offset, the LocalStackSlotAllocation pass can materialize one value in a frame base register and replace each original offset with that register's value plus the relative offset. Co-authored-by: luxufan <luxufan@iscas.ac.cn> Co-authored-by: Craig Topper <craig.topper@sifive.com> Differential Revision: https://reviews.llvm.org/D98101	2023-01-06 09:54:19 -08:00
Alexey Bataev	9b5f62685a	[SLP]Fix cost of the broadcast buildvector/gather. Need to include the cost of the initial insertelement to the cost of the broadcasts. Also, need to adjust the cost of the gather/buildvector if the element is inserted into poison/undef vector. Differential Revision: https://reviews.llvm.org/D140498	2023-01-06 09:25:05 -08:00
Craig Topper	9f087ba05b	[RISCV] Improve 4x and 8x (s/u)int_to_fp. Previously we emitted a 4x or 8x vzext followed by a vfcvt. We can instead use a 2x or 4x vzext followed by a vfwcvt.	2023-01-06 08:39:14 -08:00
Craig Topper	1aa9862df3	[RISCV] Add more XVentanaCondOps patterns. Add patterns with seteq/setne conditions. We don't have instructions for seteq/setne except for comparing with zero and need to emit an ADDI or XOR before a seqz/snez to compare other values. The select ISD node takes a 0/1 value for the condition, but the VT_MASKC(N) instructions check all XLen bits for zero or non-zero. We can use this to avoid the seqz/snez in many cases. This is a pretty ridiculous number of patterns. I wonder if we could use some ComplexPatterns to merge them, but I'd like to do that as a follow up and focus on correctness of the result in this patch. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D140421	2023-01-06 08:29:23 -08:00
Craig Topper	e5a71a41d8	[RISCV] Add support for the vscale_range attribute. This is based on @frasercrmck's D107290. At least some of the clang portion of D107290 has already been committed. This uses vscale_range for min/max vector width unless the command line overrides are used. As a follow up, I plan to add a max or exact VLEN option to clang to control the vscale_range. This will eliminate many of the reasons for users to use the overrides through the -mllvm interface. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D139873	2023-01-06 08:20:37 -08:00
Guillaume Chatelet	87b6b347fc	Revert D141134 "[NFC] Only expose getXXXSize functions in TypeSize" The patch should be discussed further. This reverts commit dd56e1c92b0e6e6be249f2d2dd40894e0417223f.	2023-01-06 15:27:50 +00:00
Guillaume Chatelet	dd56e1c92b	[NFC] Only expose getXXXSize functions in TypeSize Currently 'TypeSize' exposes two functions that serve the same purpose: - getFixedSize / getFixedValue - getKnownMinSize / getKnownMinValue source : `bf82070ea4/llvm/include/llvm/Support/TypeSize.h (L337-L338)` This patch offers to remove one of the two and stick to a single function in the code base. Differential Revision: https://reviews.llvm.org/D141134	2023-01-06 15:24:52 +00:00
Yeting Kuo	5a57ebcc43	[VP][RISCV] Add vp.abs and RISC-V support. RISC-V uses ISD::ABS lower method (abs x) -> (smax_vl x (sub_vl 0, x)) for ISD::VP_ABS. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D141033	2023-01-06 15:18:12 +08:00
Craig Topper	3b2537be76	[RISCV] Rename SDT_RISCVVecCvtX2FOp_VL->SDT_RISCVVecCvtF2XOp_VL. NFC The instruction name is x.f with the destination type first. The template name was intended as "convert F to X". So the F comes first.	2023-01-05 16:37:13 -08:00
Craig Topper	239a174d92	[RISCV] Prevent constant hoisting for or/and/xor that can use bseti/bclri/binvi. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D140928	2023-01-05 11:18:31 -08:00
Kito Cheng	7504e9a193	[RISCV][NFC] Refine the patch of D141061 Just saw Craig's comment after I committed; he suggested a good NFC refinement for that change.	2023-01-06 00:48:24 +08:00
Kito Cheng	05a2ae1b4a	[RISCV][InsertVSETVLI] Using right instruction during mutate AVL of vsetvli Fixing a crash during the vsetvli insertion pass. We have a testcase with 3 vsetvli: 1. vsetivli zero, 2, e8, m4, ta, ma 2. li a1, 32; vsetvli zero, a1, e8, m4, ta, mu 3. vsetivli zero, 2, e8, m4, ta, ma We then try to optimize the 2nd vsetvli: since its only user is vmv.x.s, its AVL operand can be mutated to the AVL operand of the 3rd vsetvli. That propagates the immediate 2 into the 2nd instruction, but it is a vsetvli, not a vsetivli, so it expects a register rather than an immediate value, and the opcode has to be updated when needed. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D141061	2023-01-06 00:44:30 +08:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There were a few similar situations across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Yeting Kuo	1e9e1b9cf8	[VP][RISCV] Add vp.ctlz/cttz and RISC-V support. The patch also adds expandVPCTLZ and expandVPCTTZ to expand vp.ctlz/cttz nodes and the cost model of vp.ctlz/cttz. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140370	2023-01-04 15:15:01 +08:00
Philip Reames	46dee4a3a3	[RISCV][InsertVSETVLI] Split out demanded property for zero/non-zero of VL The scalar move instructions (vmv.s.x and vfmv.s.f) depend solely on whether the VL is 0 or non-zero. By tracking the fact we only demand the zeroness and not the whole VL value, we can allow changing VL over a scalar move. This helps to eliminate vsetvli toggles. Differential Revision: https://reviews.llvm.org/D140157	2023-01-03 14:47:13 -08:00
Philip Reames	6df5464a46	[RISCV] Minor type fix [nfc]	2023-01-03 14:22:38 -08:00
Philip Reames	460c1bd344	[RISCV][InsertVSETVLI] Rewrite scalar insert forward rule in terms of demanded fields This is mostly geared at consolidating logic into one form to reduce code duplication, but also has the effect of being a slight generalization. Since these operations aren't masked, we can ignore the mask policy bit when deciding on compatibility. The previous code was overly strict in checking that both policy bits matched. Note: There's a slight difference from the reviewed version. The reviewed version was based on a local revision which included the isCompatible change to only check AVL if VL is used. I apparently never landed that change, and while functional, the functional change isn't visible without this one. I chose to roll the extra change into this patch. Differential Revision: https://reviews.llvm.org/D140147	2023-01-03 14:19:52 -08:00
Philip Reames	d36936fdb4	[RISCV][InsertVSETVLI] Add debug output capability to DemandedFields [nfc]	2023-01-03 13:56:57 -08:00
jacquesguan	3bbdd9f506	[RISCV] Fix compile warning.	2023-01-03 11:58:18 +08:00
jacquesguan	db3f3243bb	[RISCV] Use vfirst.m to extract the first element from mask vector. This patch uses vfirst.m to extract the first bit of mask. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139512	2023-01-03 11:24:18 +08:00
Yeting Kuo	e2b65ff98d	[RISCV] Use tail agnostic if inserting subvector/element at the end of a vector. The patch tries to make more vslideup nodes use tail agnostic. The idea comes from D125546 authored by Zack Chen. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140669	2022-12-31 11:29:09 +08:00
Craig Topper	a63b724729	[RISCV] Use SUB instead of XOR in lowerShiftLeftParts/lowerShiftRightParts. isel is now capable of turning the SUB into XOR for shift amounts. Though it uses NOT instead of XOR with ShiftSize-1. By using SUB during lowering we enable more DAG combines with other arithmetic on the shift amount.	2022-12-29 17:04:52 -08:00
Craig Topper	7cd725858b	[RISCV] RISCVDAGToDAGISel::selectShiftMask to shift by (sub size-1, X). If the shift amount is (sub C, X) where C is -1 modulo the size of the shift, we can replace the sub with a NOT. We could also use XORI X, size-1, but NOT would work better with c.not from the future Zce extension.	2022-12-29 16:33:18 -08:00
Craig Topper	e50976e569	[RISCV] Teach RISCVDAGToDAGISel::selectShiftMask to bypass adds with constant. If the shift amount is (add X, C) where C is 0 modulo the size of the shift, we can bypass the add. Similar to other targets like AArch64 and X86.	2022-12-29 15:10:36 -08:00
Hsiangkai Wang	af5dd2706c	[RISCV] Add fmin/fmax scalar instructions to isAssociativeAndCommutative Follow-up patch of D140530. We can add FMIN, FMAX to isAssociativeAndCommutative to increase instruction-level parallelism by the existing MachineCombiner pass. Differential Revision: https://reviews.llvm.org/D140602	2022-12-29 11:43:40 +00:00
Hsiangkai Wang	002005e674	[RISCV] Add integer scalar instructions to isAssociativeAndCommutative Inspired by D138107. We can add ADD, AND, OR, XOR, MUL, MIN[U]/MAX[U] to isAssociativeAndCommutative to increase instruction-level parallelism by the existing MachineCombiner pass. Differential Revision: https://reviews.llvm.org/D140530	2022-12-29 11:43:40 +00:00
Yeting Kuo	bd9c0f082b	[RISCV] Add Svpbmt extension support. Spec of Svpbmt: https://github.com/riscv/riscv-isa-manual/blob/master/src/supervisor.tex#L2399 Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D140692	2022-12-28 23:57:54 -08:00
Craig Topper	0e9855c1f2	[RISCV] Add SH1ADD/SH2ADD/SH3ADD to RISCVDAGToDAGISel::hasAllNBitUsers.	2022-12-28 23:38:33 -08:00
Craig Topper	79d6e9c713	[RISCV] Prefer ADDI over ORI if the known bits are disjoint. There is no compressed form of ORI but there is a compressed form for ADDI. This also works for XORI since DAGCombine will turn Xor with disjoint bits in Or. Note: The compressed forms require a simm6 immediate, but I'm doing this for the full simm12 range. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D140674	2022-12-28 19:59:42 -08:00
Craig Topper	6357b63735	[RISCV] Add RISCV::XORI to RISCVDAGToDAGISel::hasAllNBitUsers.	2022-12-28 15:17:41 -08:00
Craig Topper	cdf09ce7e7	[RISCV] Support SRLI in hasAllNBitUsers. We can recursively look through SRLI if the shift amount is less than the demanded bits. We can reduce the demanded bit count by the shift amount and check the users of the SRLI.	2022-12-28 13:10:52 -08:00
Craig Topper	ac51cf1960	[RISCV] Refactor RISCV::hasAllWUsers to hasAllNBitUsers similar to RISCVISelDAGToDAG's version. NFC Move to RISCVInstrInfo since we need RISCVSubtarget now. Instead of asking if only the lower 32 bits are used we can now ask if the lower N bits are used. This will be needed by a future patch.	2022-12-28 12:49:23 -08:00
Craig Topper	1184ede46f	[RISCV] Add const qualifiers to some function arguments. NFC	2022-12-28 11:20:17 -08:00
Hsiangkai Wang	740cb3377d	[RISCV][NFC] Remove redundant setOperationAction. ISD::INSERT_VECTOR_ELT is already set above. Differential Revision: https://reviews.llvm.org/D140716	2022-12-28 09:11:32 +00:00
Jojo R	54752f3ff6	[RISCV] Implement assembler support for XTHeadVdot This patch implements the T-Head vendor extensions (XTHeadVdot), which is documented here, it's based on standard vector extension v1.0: https://github.com/T-head-Semi/thead-extension-spec	2022-12-26 19:05:22 +08:00
Craig Topper	dfec6f7e62	Revert "[RISCV] Enable the LocalStackSlotAllocation pass support." This reverts commit 180397cdded67a8fdf56f92a0b70d32f0dac8af6. This seems to cause llvm-testsuite failures.	2022-12-25 12:57:47 -08:00
Craig Topper	653a9fbd13	[RISCV] Support the short-forward-branch predicated ops in RISCVSExtWRemoval.	2022-12-23 21:39:22 -08:00
Ilya Andreev	550d93ab1d	[RISCV] Combine comparison and logic ops Two comparison operations and a logical operation are combined into a selection using MIN or MAX followed by a single comparison operation. For the optimization to be applied, the following conditions have to be satisfied: 1. The comparison operations must share one common operand. 2. Only signed and unsigned integers are supported. 3. The comparisons have to be the same with respect to the common operand. 4. The comparisons have no users other than the logic operation. 5. Every combination of comparison and AND, OR is supported. It will convert %l0 = %a < %c; %l1 = %b < %c; %res = %l0 or %l1 into %sel = min(%a, %b); %res = %sel < %c. It supports several comparison operations (<, <=, >, >=), signed and unsigned values, and different orders of operands as long as they do not violate the conditions. Differential Revision: https://reviews.llvm.org/D134277	2022-12-23 17:10:21 +03:00
Nitin John Raj	d64d3c5a8f	[RISCV] Add pass to remove W suffix from ADDIW and SLLIW to improve compressibility SLLI and ADD are more compressible than SLLIW and ADDW. SLLI/ADD both have a 5-bit register encoding. SLLIW/ADDW have a 3-bit register encoding. They both require the dest to also be one of the sources. We aggressively form ADDW/SLLIW as it helps hasAllWBitUsers in RISCVISelDAGToDAG to not require recursion. So we need a pass to remove excessive -w suffixes. Differential Revision: https://reviews.llvm.org/D139948	2022-12-22 14:19:26 -08:00
ping.deng	31ec840c61	[RISCV][NFC] Use Arrayref in TargetLowering functions. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D140464	2022-12-22 10:45:27 +08:00
Nick Desaulniers	19a004b468	[llvm][SelectionDAGISel] support -{start\|stop}-{before\|after}= for remaining targets Follow up to the series: 1. https://reviews.llvm.org/D140161 2. https://reviews.llvm.org/D140349 3. https://reviews.llvm.org/D140331 4. https://reviews.llvm.org/D140323 Completes the work from the previous two for remaining targets. This creates the following named passes that can be run via `llc -{start\|stop}-{before\|after}`: - arc-isel - arm-isel - avr-isel - bpf-isel - csky-isel - hexagon-isel - lanai-isel - loongarch-isel - m68k-isel - msp430-isel - mips-isel - nvptx-isel - ppc-codegen - riscv-isel - sparc-isel - systemz-isel - ve-isel - wasm-isel - xcore-isel A nice way to write tests for SelectionDAGISel might be to use a RUN: line like: llc -mtriple=<triple> -start-before=<arch>-isel -stop-after=finalize-isel -o - Fixes: https://github.com/llvm/llvm-project/issues/59538 Reviewed By: asb, zixuan-wu Differential Revision: https://reviews.llvm.org/D140364	2022-12-21 13:25:15 -08:00
Craig Topper	9b227cb1f5	[RISCV] Check the sign bits of the input of RISCVISD::ABSW in computeNumSignBitsForTargetNode. We created a SIGN_EXTEND_INREG when we created the ABSW so the input should have 33 sign bits, but check it to be safe.	2022-12-21 12:56:35 -08:00
Craig Topper	132546d939	[RISCV] Add DAG combine to fold (select C, (add X, Y), Y) -> (add (select C, X, 0), Y). Similar for sub, or, and xor. These are all operations that have 0 as a neutral value. This is based on a similar transform in InstCombine. This allows us to remove some XVentanaCondOps patterns and some code from DAGCombine for RISCVISD::SELECT_CC. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D140465	2022-12-21 10:57:57 -08:00
Matt Arsenault	69e75ae695	CodeGen: Don't lazily construct MachineFunctionInfo This fixes what I consider to be an API flaw I've tripped over multiple times. The point this is constructed isn't well defined, so depending on where this is first called, you can conclude different information based on the MachineFunction. For example, the AMDGPU implementation inspected the MachineFrameInfo on construction for the stack objects and if the frame has calls. This kind of worked in SelectionDAG which visited all allocas up front, but broke in GlobalISel which hasn't visited any of the IR when arguments are lowered. I've run into similar problems before with the MIR parser and trying to make use of other MachineFunction fields, so I think it's best to just categorically disallow dependency on the MachineFunction state in the constructor and to always construct this at the same time as the MachineFunction itself. A missing feature I still could use is a way to access a custom analysis pass on the IR here.	2022-12-21 10:49:32 -05:00
Elena Lepilkina	3a3f725a3c	[RISCV] Omit SRA in case of setlt or setge with zero constant Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140206	2022-12-21 14:19:49 +03:00
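
Note on 550d93ab1d (combine comparison and logic ops): the rewrite described in that message relies on the identity `(a < c) or (b < c)` == `min(a, b) < c`. The sketch below checks it by brute force over a small signed range. It is an illustration only, not code from the patch: the function names are hypothetical, and the AND/MAX pairing is an assumption based on the message's mention of MAX and AND, not something the message spells out.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Original form: two comparisons against the common operand c, joined by OR.
bool orOfLess(int32_t a, int32_t b, int32_t c) { return (a < c) | (b < c); }

// Combined form: one MIN followed by a single comparison.
bool minThenLess(int32_t a, int32_t b, int32_t c) { return std::min(a, b) < c; }

// Assumed AND counterpart: both comparisons must hold.
bool andOfLess(int32_t a, int32_t b, int32_t c) { return (a < c) & (b < c); }

// Assumed combined form for AND: one MAX followed by a single comparison.
bool maxThenLess(int32_t a, int32_t b, int32_t c) { return std::max(a, b) < c; }

// Compares the original and combined forms for every (a, b, c) in [lo, hi].
bool formsAgree(int32_t lo, int32_t hi) {
  for (int32_t a = lo; a <= hi; ++a)
    for (int32_t b = lo; b <= hi; ++b)
      for (int32_t c = lo; c <= hi; ++c) {
        if (orOfLess(a, b, c) != minThenLess(a, b, c)) return false;
        if (andOfLess(a, b, c) != maxThenLess(a, b, c)) return false;
      }
  return true;
}
```

Calling `formsAgree(-8, 8)` exercises both pairings; the same check could be repeated with unsigned types or the other comparison operators the message lists.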

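Note on 7cd725858b and e50976e569 (selectShiftMask): both messages depend on the shift reading only the low bits of its amount. Under that assumption, a shift by `(size-1) - X` matches a shift by `NOT X`, and adding a multiple of the shift size to the amount changes nothing. The sketch below models this for a 64-bit shift. The names and the 64-bit width are assumptions made for illustration and do not come from the patches.

```cpp
#include <cassert>
#include <cstdint>

// Assumed shift size (XLEN = 64); the amount is read modulo this value.
constexpr uint64_t kShiftSize = 64;

// Models a left shift that only reads the low log2(kShiftSize) bits of amt.
uint64_t shl(uint64_t v, uint64_t amt) { return v << (amt & (kShiftSize - 1)); }

// Shift amount written as (sub size-1, X).
uint64_t shlBySub(uint64_t v, uint64_t x) { return shl(v, (kShiftSize - 1) - x); }

// Shift amount written as NOT X.
uint64_t shlByNot(uint64_t v, uint64_t x) { return shl(v, ~x); }

// Shift amount written as (add X, C) with C = k * size, i.e. 0 modulo the size.
uint64_t shlByAdd(uint64_t v, uint64_t x, uint64_t k) {
  return shl(v, x + k * kShiftSize);
}

// Checks both rewrites for a range of amounts, including ones above the size.
bool shiftFormsAgree(uint64_t v) {
  for (uint64_t x = 0; x < 4 * kShiftSize; ++x) {
    if (shlBySub(v, x) != shlByNot(v, x)) return false;
    if (shlByAdd(v, x, 3) != shl(v, x)) return false;
  }
  return true;
}
```

The NOT form works because `~x` equals `-x - 1`, which is congruent to `(size-1) - x` modulo the shift size.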
2744 Commits