llvm-project

Author	SHA1	Message	Date
Yingwei Zheng	0107c8824b	[RISCV][SDAG] Improve codegen of select with constants if zicond is available (#82456 ) This patch uses `add + czero.eqz/nez` to lower select with constants if zicond is available. ``` (select c, c1, c2) -> (add (czero_nez c2 - c1, c), c1) (select c, c1, c2) -> (add (czero_eqz c1 - c2, c), c2) ``` The above code sequence is suggested by [RISCV Optimization Guide](https://riscv-optimization-guide-riseproject-c94355ae3e6872252baa952524.gitlab.io/riscv-optimization-guide.html#_avoid_branches_using_conditional_moves).	2024-02-23 00:18:56 +08:00
Yingwei Zheng	02fad0565f	[RISCV][SDAG] Fold `select c, ~x, x` into `xor -c, x` (#82462 ) This patch lowers select of constants if `TrueV == ~FalseV`. Address the comment in https://github.com/llvm/llvm-project/pull/82456#discussion_r1496881603.	2024-02-21 16:27:43 +08:00
Alex Bradbury	d833b9d677	[RISCV] Graduate Zicond to non-experimental (#79811 ) The Zicond extension was ratified in the last few months, with no changes that affect the LLVM implementation. Although there's surely more tuning that could be done about when to select Zicond or not, there are no known correctness issues. Therefore, we should mark support as non-experimental.	2024-01-29 15:58:54 +00:00
Yeting Kuo	a756a6b97e	[TargetLowering][RISCV] Introduce shouldFoldSelectWithSingleBitTest and RISC-V implement. (#72978 ) DAGCombiner folds (select_cc seteq (and x, y), 0, 0, A) to (and (sra (shl x)) A) where y has a single bit set. Previously, DAGCombiner relies on `shouldAvoidTransformToShift` to decide when to do the combine, but `shouldAvoidTransformToShift` is only about shift cost. This patch introuduces a specific hook to decide when to do the combine and disable the combine when Zicond enabled and AndMask <= 1024.	2023-11-22 08:22:14 +08:00
Philip Reames	8624075105	[RISCV] Strip W suffix from ADDIW (#68425 ) The motivation of this change is simply to reduce test duplication. As can be seen in the (massive) test delta, we have many tests whose output differ only due to the use of addi on rv32 vs addiw on rv64 when the high bits are don't care. As an aside, we don't need to worry about the non-zero immediate restriction on the compressed variants because we're not directly forming the compressed variants. If we happen to get a zero immediate for the ADDI, then either a later optimization will strip the useless instruction or the encoder is responsible for not compressing the instruction.	2023-10-06 10:28:01 -07:00
Mikhail Gudim	0fb3ebb2fc	[RISCV] Generalize 'tryFoldSelectIntOp` to other operations. Currently, only `SUB`, `ADD`, `OR` and `XOR` are covered. This patch adds `AND`, `SHL`, `SRA`, `SRL`. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D155344	2023-08-01 11:27:10 -04:00
Craig Topper	0c055286b2	[RISCV] Use RISCVISD::CZERO_EQZ/CZERO_NEZ for XVentanaCondOps. This makes Zicond and XVentanaCondOps use the same code path. The instructions have identical semantics. Reviewed By: wangpc Differential Revision: https://reviews.llvm.org/D155391	2023-07-18 10:18:02 -07:00
Alex Bradbury	5c5a1a2927	[RISCV] Introduce RISCVISD::CZERO_{EQZ,NEZ} nodes produce them when zicond is present in lowerSELECT This patch is a step towards altering how we handle the emission of condops. Marking ISD::SELECT as legal is a major change in the codegen path, and gives few options for maintaining the old codegen path when it is believed to be better (e.g. a better branchless sequence is possible using non-zicond instructions, or the branch-based sequence is preferable). This removes the existing SelectionDAG patterns and moves the logic into lowerSELECT. Along some small codegen changes you'll note a few minor regressions in the generated code quality - this are due to the fact that by lowering the SELECT node early we miss out on combines that would kick in later when setcc condcodes that aren't natively supported have been expanded (thus exposing opportunities for optimisation by performing logical negation and swapping truev/falsev). I've opted to split out work that addresses these into follow-on patches (especially as zicond is still 'experimental'). matchSetCC is a straight-forward translation from the version in RISCVISelDAGToDAG. Ideally, in the future it can be converted to a helper shared between both files. Differential Revision: https://reviews.llvm.org/D155083	2023-07-14 11:31:27 +01:00
Mikhail Gudim	17e2df6695	[RISCV] Removed the requirement of XLenVT for performSELECTCombine. Reviewed By: Craig Topper Differential Revision: https://reviews.llvm.org/D153044	2023-07-12 16:29:09 -04:00
Alex Bradbury	a755e80ed1	[RISCV] Add codegen for the experimental zicond extension This directly matches the codegen for xventanacondops with vt.maskcn => czero.nez and vt.maskc => czero.eqz. An additional difference is that zicond is available on RV32 in addition to RV64 (xventanacondops is RV64 only). Differential Revision: https://reviews.llvm.org/D147147	2023-03-30 21:05:22 +01:00
Alex Bradbury	ac269d185c	[RISCV][test] Update CHECK lines in condops related tests in preparation for Zicond codegen Prefixes like 'CONDOPS' referring to the xventanacondops extension are going to be confusing once zicond is added to the mix.	2023-03-29 14:13:37 +01:00
Craig Topper	1aa9862df3	[RISCV] Add more XVentanaCondOps patterns. Add patterns with seteq/setne conditions. We don't have instructions for seteq/setne except for comparing with zero and need to emit an ADDI or XOR before a seqz/snez to compare other values. The select ISD node takes a 0/1 value for the condition, but the VT_MASKC(N) instructions check all XLen bits for zero or non-zero. We can use this to avoid the seqz/snez in many cases. This is pretty ridiculous number of patterns. I wonder if we could use some ComplexPatterns to merge them, but I'd like to do that as a follow up and focus on correctness of the result in this patch. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D140421	2023-01-06 08:29:23 -08:00
Craig Topper	132546d939	[RISCV] Add DAG combine to fold (select C, (add X, Y), Y) -> (add (select C, X, 0), Y). Similar for sub, or, and xor. These are all operations that have 0 as a neutral value. This is based on a similar tranform in InstCombine. This allows us to remove some XVentanaCondOps patterns and some code from DAGCombine for RISCVISD::SELECT_CC. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D140465	2022-12-21 10:57:57 -08:00
Craig Topper	51315d8dee	[RISCV] Add more test cases to select.ll. NFC These are test for select (and (x , 0x1) == 0), (z ^ y), y ) and select (and (x , 0x1) == 0), (z \| y), y ) These can be made branchless by using ((x-1) & z ) ^ y.	2022-12-21 10:38:12 -08:00
Philip Reames	828b1c55cb	[RISCV] Match neg (and x, 1) to two shifts to improve codesize The negate operation is never compressible (as the destination and rs1 register must differ). The two shift versions will be equal size if the input GPR is reused, or smaller if this is the only use of the input. For clarity, the operation being performed is (select (low-bit-of x), -1, 0). Differential Revision: https://reviews.llvm.org/D140319	2022-12-19 14:17:26 -08:00
Kautuk Consul	6a907a41f4	[RISCV] Add codegen support for RISCV XVentanaCondOps Extension This patch adds codegen support for part of XVentanaCondOps extension. This extension is designed to reduce the number of branches in the generated RISCV assembly by replacing branches with conditional move instructions as defined by XVentanaCondOps specification. The specification for XVentanaCondOps extension can be found at: https://github.com/ventanamicro/ventana-custom-extensions/releases/download/v1.0.1/ventana-custom-extensions-v1.0.1.pdf Co-authored-by: Mikhail Gudim <mgudim@ventanamicro.com> Differential Revision: https://reviews.llvm.org/D139394	2022-12-19 09:56:06 -08:00
Craig Topper	a8c79121bf	[RISCV] Teach getRegAllocationHints about compressible SRAI/SRLI. Similar to previous patches for ADDI/ADDIW/SLLI/ADD, but restricted to only cases where the register is x8-x15(GPRC reg class). I've restricted it so that we can be precise about whether the resulting instruction would be compressible. Changing the register allocation may make some other instruction not compressible so we should try to be accurate. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D138740	2022-11-30 10:28:57 -08:00
Philip Reames	c60a8d9af3	[RISCV] Add codegen coverage for select idioms which might benefit from XVentanaCondOps	2022-11-14 14:27:31 -08:00
Philip Reames	2b5960028e	[RISCV] Branchless lowering for select (and (x , 0x1) == 0), y, (z ^ y) ) and select (and (x , 0x1) == 0), y, (z \| y) ) This code is directly ported from the X86 backend which applies the same rewrite (along with several others). Planning on looking more closely at the other branchless variants from x86 to see if any are worth porting in future changes. Motivation here is the coremark crc8 routine from https://github.com/eembc/coremark/blob/main/core_util.c#L165. This patch significantly reduces the number of unpredictable branches in the workload. Differential Revision: https://reviews.llvm.org/D134881	2022-09-30 08:24:32 -07:00
Philip Reames	f49887f7fd	[RISCV] Add test coverage for upcoming select lowering optimization Test copied from X86 backend since I'm going to be taking the code from there too.	2022-09-28 16:04:25 -07:00

20 Commits