llvm-project

Author	SHA1	Message	Date
Craig Topper	564e09c778	[RISCV] Use bseti for 2048 in RISCVMatInt when Zbs is enabled. 2048 requires an LUI and ADDI instruction due to ADDI using a signed immediate. It can also be done with C.LI+C.SLLI for better code size. With Zbs we can use a single BSETI to have an instruction. Reorder the checks so that BSETI is checked first, with an extra qualification to prefer a single LUI or ADDI when possible. I'm continuing to think about other ways to structure this code, but this works for now. Fixes PR59362.	2022-12-07 20:14:22 -08:00
Craig Topper	f2ffdbeb9c	[RISCV] Add accessors to RISCVMatInt::Inst. Make fields private. This helps hide that the Imm field doesn't store a full int64_t.	2022-12-07 19:02:01 -08:00
Craig Topper	2c52d516da	Revert "[RISCV] Return InstSeq from generateInstSeqImpl instead of using an output parameter. NFC" This reverts commit d24915207c631b7cf637081f333b41bc5159c700. Thinking about this more this probably chewed up 100+ bytes of stack for each recursive call. So this probably needs more thought. The code simplification wasn't that much.	2022-12-07 12:59:31 -08:00
Craig Topper	938d0d6d7b	[RISCV] Replace uses of hasStdExtC with COrZca. Except MakeCompressible which will need more work. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D139504	2022-12-07 09:34:01 -08:00
Craig Topper	d6cfdf0440	[RISCV] Pass ZB_Undefined to countTrailingZeros/countLeadingZeros. NFC We know the input is not zero so we can simplify the generated code.	2022-12-06 14:57:28 -08:00
Craig Topper	d24915207c	[RISCV] Return InstSeq from generateInstSeqImpl instead of using an output parameter. NFC We should be able to rely on RVO here.	2022-12-06 14:57:27 -08:00
Craig Topper	1806ce9097	[RISCV] Teach RISCVMatInt to prefer li+slli over lui+addi(w) for compressibility. With C extension, li with a 6 bit immediate followed by slli is 4 bytes. The lui+addi(w) sequence is at least 6 bytes. The two sequences probably have similar execution latency. The exception being if the target supports lui+addi(w) macrofusion. Since the execution latency is probably the same I didn't restrict this to C extension. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D139135	2022-12-06 10:31:17 -08:00
Craig Topper	ce66f4d0a6	[RISCV] Restrict when RISCVMatInt will retry SLLI as a last step. NFC The main algorithm will already end with a SLLI when there are 12 or more trailing zeros. We only need to retry when there are less than 12 trailing zeros since the main algorithm will pick an ADDI or ADDIW at the end for those cases.	2022-12-06 09:25:20 -08:00
Craig Topper	dd3fe52492	[RISCV] Remove some RISCVMatInt early exits. These were early exiting if we replaced a sequence with a 2 instruction sequence since that is the best we could do. All the later optimizations only occur if the sequence is more than 2 instructions so this wasn't a functional check. At best it helps the compiler generate better code, but I don't think that was analyzed when it was added. Remove it to simplify the code.	2022-12-05 16:29:16 -08:00
Craig Topper	47ff3042e7	[RISCV] Use findFirstSet instead of countTrailingZeros. NFC findFirstSet is a wrapper around countTrailingZeros so they are equivalent here, but I think findFirstSet more cleary describes the algorithm here.	2022-12-04 18:00:36 -08:00
Craig Topper	c8c1d7afa9	[RISCV] Use emplace_back to shorten lines in RISCVMatInt. NFC A few other minor improvements.	2022-12-04 18:00:27 -08:00
jacquesguan	0fe5f03eeb	[RISCV][NFC] Use nested namespace definations. Since we use C++17 now, we could use nested namespace definations to simplify code. Differential Revision: https://reviews.llvm.org/D131751	2022-08-13 09:56:59 +08:00
Craig Topper	d2ee2c9c8d	[RISCV] Add an operand kind to the opcode/imm returned from RISCVMatInt. Instead of matching opcodes to know the format to emit, use an enum value that we can get from the RISCVMatInt::Inst class. Change the consumers to use fully covered switches so that we get a compiler warning if a new kind is added. With the opcode checks it was easier to forget to update one of the 3 consumers. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D126317	2022-05-24 14:56:29 -07:00
Craig Topper	5c38373125	[RISCV] Improve constant materialization for cases that can use LUI+ADDI instead of LUI+ADDIW. It's possible that we have a constant that isn't simm32 so we can't use LUI+ADDIW, but we can use LUI+ADDI. Because ADDI uses a sign extended constant, it's possible that after subtracting it out, we end up with a simm32 that maps to LUI. This patch detects this case after removing Lo12 and before shifting the value for SLLI. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D124222	2022-04-29 08:58:32 -07:00
Craig Topper	9534811aa8	[RISCV] Teach generateInstSeqImpl to generate BSETI for single bit cases. If the immediate has one bit set, but isn't a simm32 we can try the BSETI instruction from Zbs.	2022-04-21 12:08:34 -07:00
Craig Topper	98b866892d	[RISCV] Add special case to constant materialization to remove trailing zeros first. If there are fewer than 12 trailing zeros, we'll try to use an ADDI at the end of the sequence. If we strip trailing zeros and end the sequence with a SLLI we might find a shorter sequence. Differential Revision: https://reviews.llvm.org/D124148	2022-04-21 09:43:32 -07:00
Craig Topper	186d5c8af5	[RISCV] Make getInstSeqCost handle other Zb* instructions. We haven't been updating this as Zb* instructions have been used for immediate materialization. They will hit the default case and trigger an llvm_unreachable. Instead of trying to list them all, assume instructions that aren't explicitly listed aren't compressible. Spotted while looking at integer materialization for other reasons. I haven't seen a crash from this yet.	2022-04-20 22:08:04 -07:00
Craig Topper	70046438d0	[RISCV] Only try LUI+SHADD+ADDI for int materialization if LUI+ADDI+SHADD failed. There's an assert in LUI+SHADD+ADDI materialization that makes sure the lower 12 bits aren't zero since that case should have been handled as LUI+ADDI+SHADD. But nothing prevented the LUI+SH*ADD+ADDI checks from running after the earlier code handled it. The sequence would be the same length or longer so it wouldn't replace the earlier sequence, but the assert happened before that was checked. The vector holding the sequence also wasn't reset before the second check so that guaranteed the sequence would never be found to be shorter. This patch fixes this by only trying the second expansion when the earlier fails. Fixes PR54812. Reviewed By: benshi001 Differential Revision: https://reviews.llvm.org/D123406	2022-04-09 08:52:15 -07:00
Alex Bradbury	588f121ada	[RISCV][NFC] Make Zb* instruction naming match the convention used elsewhere in the RISC-V backend Where the instruction mnemonic contains a dot, we name the corresponding instruction in the .td file using a _ in the place of the dot. e.g. LR_W rather than LRW. This commit updates RISCVInstrInfoZb.td to follow that convention.	2022-01-28 15:20:37 +00:00
Baoshan Pang	af931a51b9	[RISCV] Materializing constants with 'rori' Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D116574	2022-01-07 15:39:22 -08:00
Ben Shi	4c3d916c4b	[RISCV] Optimize immediate materialisation with SHADD Use LUI+SHADD+ADDI to compose specific immediates. Reviewed By: craig.topper, luismarques Differential Revision: https://reviews.llvm.org/D113568	2021-11-15 23:34:28 +00:00
Ben Shi	97e52e1c35	[RISCV] Optimize immediate materialisation with SLLI.UW in the Zba extension Simplify "LUI+SLLI+ADDI+SLLI" and "LUI+ADDIW+SLLI+ADDI+SLLI" to "LUI+ADDIW+SLLIUW" to reduce total instruction amount. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D111933	2021-10-27 02:48:38 +00:00
Ben Shi	4fe5ab4b00	[RISCV] Optimize immediate materialisation with SHADD Use SH1ADD/SH2ADD/SH3ADD along with LUI+ADDI to compose int323, int325 and int329. Reviewed By: craig.topper, luismarques Differential Revision: https://reviews.llvm.org/D111484	2021-10-15 06:46:41 +00:00
Ben Shi	7e81526126	[RISCV] Optimize immediate materialisation with BSETI/BCLRI Opitimize immediate materialisation in the following way if profitable: 1. Use BCLRI for upper 32 bits if the lower 32 bits are negative int32. 2. Use BSETI for upper 32 bits if the lower 32 bits are positive int32. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D111508	2021-10-14 04:56:47 +00:00
Ben Shi	481db13fec	[RISCV] Optimize immediate materialisation with SLLI.UW Use LUI+SLLI.UW to compose the upper bits instead of LUI+SLLI. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D111705	2021-10-14 02:24:50 +00:00
Ben Shi	787eeb8597	[RISCV] Optimize immediate materialisation with BCLRI Do the following optimization for immediate materialisation: 1. For values in range 0xffffffff 7fffffff ~ 0xffffffff 00000000, first generate the lower 32-bit with Val\|0x80000000 (which is expected be an int32), then emit (BCLRI r, 31). 2. For values in range 0x80000000 ~ 0xffffffff, first generate the lower 32-bit with Val&~0x80000000 (which is expected to be an int32), then emit (BSETI r, 31). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D111532	2021-10-13 00:59:23 +00:00
Jim Lin	f29336104d	[RISCV] Rename prefix `FeatureExt` to `FeatureStdExt` for all sub-extension Rename prefix `FeatureExt` to `FeatureStdExt` for all sub-extension for consistency Reviewed By: HsiangKai, asb Differential Revision: https://reviews.llvm.org/D108187	2021-09-13 16:24:15 +08:00
Alexander Pivovarov	1104e3258b	Fix typo in RISCVMatInt.cpp comments	2021-09-02 18:11:09 -07:00
Craig Topper	81efb82570	[RISCV] Teach RISCVMatInt about cases where it can use LUI+SLLI to replace LUI+ADDI+SLLI for large constants. If we need to shift left anyway we might be able to take advantage of LUI implicitly shifting its immediate left by 12 to cover part of the shift. This allows us to use more bits of the LUI immediate to avoid an ADDI. isDesirableToCommuteWithShift now considers compressed instruction opportunities when deciding if commuting should be allowed. I believe this is the same or similar to one of the optimizations from D79492. Reviewed By: luismarques, arcbbb Differential Revision: https://reviews.llvm.org/D105417	2021-07-20 09:22:06 -07:00
Craig Topper	4dbb788068	[RISCV] Teach constant materialization that it can use zext.w at the end with Zba to reduce number of instructions. If the upper 32 bits are zero and bit 31 is set, we might be able to use zext.w to fill in the zeros after using an lui and/or addi. Most of this patch is plumbing the subtarget features into the constant materialization. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D105509	2021-07-16 09:35:56 -07:00
Craig Topper	d7ffa82a8e	[RISCV] Improve 64-bit integer constant materialization for more cases. For positive constants we try shifting left to remove leading zeros and fill the bottom bits with 1s. We then materialize that constant shift it right. This patch adds a new strategy to try filling the bottom bits with zeros instead. This catches some additional cases.	2021-04-02 10:18:08 -07:00
Craig Topper	d61b40ed27	[RISCV] Improve 64-bit integer materialization for some cases. This adds a new integer materialization strategy mainly targeted at 64-bit constants like 0xffffffff where there are 32 or more trailing ones with leading zeros. We can materialize these by using an addi -1 and srli to restore the leading zeros. This matches what gcc does. I haven't limited to just these cases though. The implementation here takes the constant, shifts out all the leading zeros and shifts ones into the LSBs, creates the new sequence, adds an srli, and checks if this is shorter than our original strategy. I've separated the recursive portion into a standalone function so I could append the new strategy outside of the recursion. Since external users are no longer using the recursive function, I've cleaned up the external interface to return the sequence instead of taking a vector by reference. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D98821	2021-04-01 09:12:52 -07:00
Craig Topper	387d3c2479	[RISCV] Merge Utils library into MCTargetDesc MCTargetDesc includes headers from Utils and Utils includes headers from MCTargetDesc. So from a library layering perspective it makes sense for them to be in the same library. I guess the other option might be to move the tablegen includes from RISCVMCTargetDesc.h to RISCVBaseInfo.h so that RISCVBaseInfo.h didn't need to include RISCVMCTargetDesc.h. Everything else that depends on Utils also depends on MCTargetDesc so having one library seemed simpler. Differential Revision: https://reviews.llvm.org/D93168	2021-01-14 11:47:30 -08:00

33 Commits