llvm-project

Author	SHA1	Message	Date
Rahul Joshi	87e8b53009	[LLVM][TableGen] Change CodeGenDAGPatterns to use const RecordKeeper (#108762 ) Change CodeGenDAGPatterns to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-09-15 10:26:03 -07:00
Rahul Joshi	3786568196	[TableGen] Change CodeGenInstruction record members to const (#107921 ) Change CodeGenInstruction::{TheDef, InfereredFrom} to const pointers. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-09-11 08:52:26 -07:00
Kazu Hirata	7dfaedf861	[TableGen] Avoid repeated hash lookups (NFC) (#108138 )	2024-09-11 06:39:58 -07:00
Brandon Wu	a4c6ebeb20	[MVT][TableGen] Extend Machine Value Type to `uint16_t` (#99657 ) RFC: https://discourse.llvm.org/t/rfc-extend-machine-value-type-from-uint8-t-to-uint16-t/80274 compile-time-tracker: https://llvm-compile-time-tracker.com/compare.php?from=4b9fab591916eec9fd1942f37afe3b137b564089&to=177d28247efe5a4d59a8d8150b4daf01e4f57d74&stat=wall-time Currently 208 out of 256 MVTs are used, it will be run out soon, so ultimately we need to extend the original `MVT::SimpleValueType` from `uint8_t` to `uint16_t` to accomodate more types. The `MatcherTable` uses `unsigned char` for encoding the matcher code, so the extended MVTs are no longer fit into the table, thus we need to use VBR to encode them as we do on others that are wider than 8 bits. The statistics below shows the difference of "Total Array size" of the matcher table that appears in every files: ``` Table Before After Change(%) WebAssemblyGenDAGISel.inc 23576 23775 0.844 NVPTXGenDAGISel.inc 173498 173498 0 RISCVGenDAGISel.inc 2179121 2369929 8.756 AVRGenDAGISel.inc 2754 2754 0 PPCGenDAGISel.inc 163315 163617 0.185 MipsGenDAGISel.inc 47280 47447 0.353 SystemZGenDAGISel.inc 56243 56461 0.388 AArch64GenDAGISel.inc 467893 487830 4.261 MSP430GenDAGISel.inc 8069 8069 0 LoongArchGenDAGISel.inc 78928 79131 0.257 XCoreGenDAGISel.inc 3432 3432 0 BPFGenDAGISel.inc 3733 3733 0 VEGenDAGISel.inc 65174 66456 1.967 LanaiGenDAGISel.inc 2067 2067 0 X86GenDAGISel.inc 628787 636987 1.304 ARMGenDAGISel.inc 170968 171036 0.040 HexagonGenDAGISel.inc 155764 155764 0 SparcGenDAGISel.inc 5762 5798 0.625 AMDGPUGenDAGISel.inc 504356 504463 0.021 R600GenDAGISel.inc 29785 29785 0 ``` The statistics below shows the runtime peak memory usage by compiling a simple C program: `/bin/time -v clang -target $TARGET -O3 -c test.c` ``` int test(int a) { return a * 3; } ``` ``` Target Before(kbytes) After(kbytes) Change(%) wasm64 110172 110088 -0.076 nvptx64 109784 109980 0.179 riscv64 114020 113656 -0.319 avr 110352 110068 -0.257 ppc64 112612 112476 -0.120 mips64 113588 113668 0.070 systemz 110860 110760 -0.090 aarch64 113704 113432 -0.239 msp430 110284 110200 -0.076 loongarch64 111052 110756 -0.267 xcore 108340 108020 -0.295 bpf 110620 110708 0.080 ve 110960 110920 -0.036 lanai 110180 109960 -0.200 x86_64 113640 113304 -0.296 arm64 113540 113172 -0.324 hexagon 114620 114684 0.056 sparc 110412 110136 -0.250 amdgcn 118164 117144 -0.863 r600 111200 110508 -0.622 ```	2024-08-01 01:19:14 +08:00
Brandon Wu	592233a962	[TableGen][SelectionDAG] Make CheckValueTypeMatcher use MVT::SimpleValueType (#99537 ) The original `CheckValueTypeMatcher` stores StringRef as the member variable type, however it's more efficient to use use MVT::SimpleValueType since it prevents string comparison in isEqualImpl, it also reduce the memory consumption in each object.	2024-07-19 12:36:47 +08:00
long.chen	2b2c66c00f	[NFC][llvm] refine generated code format (#90172 )	2024-04-26 15:44:55 +08:00
Pierre van Houtryve	fa3d789df1	[RFC][TableGen] Restructure TableGen Source (#80847 ) Refactor of the llvm-tblgen source into: - a "Basic" library, which contains the bare minimum utilities to build `llvm-min-tablegen` - a "Common" library which contains all of the helpers for TableGen backends. Such helpers can be shared by more than one backend, and even unit tested (e.g. CodeExpander is, maybe we can add more over time) Fixes #80647	2024-03-25 09:40:35 +01:00
Jay Foad	f723260a80	[TableGen] Stop using make_pair and make_tuple. NFC. (#81730 ) These are unnecessary since C++17.	2024-02-14 13:16:20 +00:00
Tomas Matheson	a9e546cc71	[TableGen][NFC] convert TreePatternNode pointers to references (#81134 ) Almost all uses of `*TreePatternNode` expect it to be non-null. There was the occasional check that it wasn't, which I have removed. Making them references makes it clear that they exist. This was attempted in 2018 (1b465767d6ca69f4b7201503f5f21e6125fe049a) for `TreePatternNode::getChild()` but that was reverted.	2024-02-09 13:35:42 +00:00
Pierre van Houtryve	b9079baadd	[NFC] clang-format utils/TableGen (#80973 ) ``` find llvm/utils/TableGen -iname ".h" -o -iname ".cpp" \| xargs clang-format-16 -i ``` Split from #80847	2024-02-09 09:27:04 +01:00
Wang Pengcheng	41fe98a6e7	[TableGen] Use MapVector to remove non-determinism This fixes found non-determinism when `LLVM_REVERSE_ITERATION` option is `ON`. Fixes #79420. Reviewers: ilovepi, MaskRay Reviewed By: MaskRay Pull Request: https://github.com/llvm/llvm-project/pull/79411	2024-01-25 16:16:19 +08:00
Wang Pengcheng	a2af374284	[SelectionDAG] Add space-optimized forms of OPC_CheckPredicate (#77763 ) We record the usage of each `Predicate` and sort them by usage. For the top 8 `Predicate`s, we will emit a `PC_CheckPredicateN` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 61K. This is a recommit of 1a57927, which was reverted in bc98c31. The CI failures occurred when doing expensive checks (with option `LLVM_ENABLE_EXPENSIVE_CHECKS` being ON). The key point here is that we need stable sorting result in the test, but doing expensive checks uncovered the non-determinism of `llvm::sort`. So `llvm::sort` is changed to `llvm::stable_sort` in this revised patch. And we use `llvm::MapVector` to keep insertion order.	2024-01-12 11:38:05 +08:00
Fangrui Song	c230138011	[SelectionDAG,TableGen] Use MapVector after #73310 Otherwise `ComplexPatternList` order can be non-deterministic.	2024-01-11 19:14:49 -08:00
Fangrui Song	c185a66d83	[SelectionDAG,TableGen] Use stable_sort after #73310 to ensure determinism with https://libcxx.llvm.org/DesignDocs/UnspecifiedBehaviorRandomization.html#unspecified-behavior-randomization	2024-01-11 19:01:53 -08:00
Mikhail Goncharov	bc98c3103a	Revert "[SelectionDAG] Add space-optimized forms of OPC_CheckPredicate (#73488 )" This reverts commit 1a5792735aa0bb10e5624a438bcf7fd5091ee265. Test address-space-patfrags.td.test is failing https://lab.llvm.org/buildbot/#/builders/104/builds/15012	2024-01-11 12:25:00 +01:00
Wang Pengcheng	1a5792735a	[SelectionDAG] Add space-optimized forms of OPC_CheckPredicate (#73488 ) We record the usage of each `Predicate` and sort them by usage. For the top 8 `Predicate`s, we will emit a `PC_CheckPredicateN` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 61K.	2024-01-11 15:43:40 +08:00
Wang Pengcheng	5c8d123838	[SelectionDAG] Add space-optimized forms of OPC_CheckPatternPredicate (#73319 ) We record the usage of each `PatternPredicate` and sort them by usage. For the top 8 `PatternPredicate`s, we will emit a `OPC_CheckPatternPredicateN` to save one byte. The old `OPC_CheckPatternPredicate2` is renamed to `OPC_CheckPatternPredicateTwoByte`. Overall this reduces the llc binary size with all in-tree targets by about 93K.	2024-01-11 15:36:21 +08:00
Wang Pengcheng	211abe38d8	[SelectionDAG] Add space-optimized forms of OPC_CheckComplexPat (#73310 ) We record the usage of each `ComplexPat` and sort the `ComplexPat`s by usage. For the top 8 `ComplexPat`s, we will emit a `OPC_CheckComplexPatN` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 89K.	2024-01-11 15:28:12 +08:00
Wang Pengcheng	9348d437f5	[SelectionDAG] Add space-optimized forms of OPC_EmitRegister (#73291 ) The followed byte of `OPC_EmitRegister` is a MVT type, which is usually i32 or i64. We add `OPC_EmitRegisterI32` and `OPC_EmitRegisterI64` so that we can reduce one byte. Overall this reduces the llc binary size with all in-tree targets by about 10K.	2023-12-19 17:31:49 +08:00
Wang Pengcheng	97181bf9a0	[TableGen] Use getSizeInBits (#75157 ) We know the type is scalar type.	2023-12-12 20:40:20 +08:00
Wang Pengcheng	714417455d	[SelectionDAG] Add OPC_MoveSibling (#73643 ) There are a lot of operations to move current node to parent and then move to another child. So `OPC_MoveSibling` and its space-optimized forms are added to do this "move to sibling" operations. These new operations will be generated when optimizing matcher in `ContractNodes`. Currently `MoveParent+MoveChild` will be optimized to `MoveSibling` and sequences `MoveParent+RecordChild+MoveChild` will be transformed into `MoveSibling+RecordNode`. Overall this reduces the llc binary size with all in-tree targets by about 30K.	2023-12-12 17:48:45 +08:00
Wang Pengcheng	0d5f1cc4d0	[SelectionDAG] Add space-optimized forms of OPC_EmitNode/OPC_MorphNodeTo (#73502 ) If there is only one bit set in EmitNodeInfo, then we can encode it implicitly to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 168K.	2023-12-12 17:45:32 +08:00
Wang Pengcheng	6111f5c592	[SelectionDAG] Add instantiated OPC_CheckChildType (#73297 ) The most common type is i32 or i64 so we add `OPC_CheckChildTypeI32` and `OPC_CheckChildTypeI64` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 70K.	2023-12-12 17:31:12 +08:00
Wang Pengcheng	cbf1d58820	[SelectionDAG] Add space-optimized forms of OPC_EmitCopyToReg (#73293 ) These new opcodes implicitly indicate the RecNo. The old `OPC_EmitCopyToReg2` is renamed to `OPC_EmitCopyToRegTwoByte`. Overall this reduces the llc binary size with all in-tree targets by about 33K (most are from RISCV target).	2023-12-12 17:25:33 +08:00
Wang Pengcheng	50c174f99f	[SelectionDAG] Add space-optimized forms of OPC_EmitConvertToTarget (#73286 ) These new opcodes implicitly indicate the RecNo. Overall this reduces the llc binary size with all in-tree targets by about 13K.	2023-12-12 17:13:43 +08:00
Wang Pengcheng	e052c68869	[SelectionDAG] Add instantiated OPC_CheckType (#73283 ) The most common type is i32 or i64 so we add `OPC_CheckTypeI32` and `OPC_CheckTypeI64` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 29K.	2023-12-12 17:12:08 +08:00
wangpc	d0c8d41e10	[TableGen][NFC] Format getOpcodeString and remove unreachable breaks	2023-11-29 14:58:54 +08:00
Wang Pengcheng	2e6c01be0d	[SelectionDAG] Add instantiated OPC_EmitInteger and OPC_EmitStringInteger (#73241 ) These two opcodes are used to be followed by a MVT operand, which is always one of i8/i16/i32/i64. We add instantiated `OPC_EmitInteger` and `OPC_EmitStringInteger` with i8/i16/i32/i64 so that we can reduce one byte. We reserve `OPC_EmitInteger` and `OPC_EmitStringInteger` in case that we may need them someday, though I haven't found one usage after this change. Overall this reduces the llc binary size with all in-tree targets by about 200K.	2023-11-27 11:08:28 +08:00
Craig Topper	a3a7e76893	[SelectionDAG] Add Opc_CheckPatternPredicate2 to support targets with more than 256 predicates. This is an alternative to D156967 where I suggested the author could use a VBR type. This patch takes inspirations from Opc_EmitRegister2 that is used for two byte registers. I'm assuming 1 or 2 byte predicates should be enough so we don't need the fully generality of VBR. This avoids impacting the table size on targets that have more than 128 predicates already like X86. Reviewed By: bogner Differential Revision: https://reviews.llvm.org/D157196	2023-08-05 16:59:45 -07:00
Craig Topper	a8aa43bb1a	[TableGen] Intialize vector with constructor instead of assign. NFC	2023-04-22 22:37:57 -07:00
Craig Topper	44d46c4b3c	[TableGen] Store CodeGenInstruction reference in EmitNodeMatcherCommon. NFC Instead of storing a string containing the instruction name, store a reference to the instruction. We can use that reference to print the instruction name when we emit the table. The only slightly annoying part is that we have to find the CodeGenInstruction for IMPLICIT_DEF. GlobalISel is doing a similar thing.	2023-04-12 20:44:36 -07:00
Craig Topper	ef2d2a11e3	[TableGen] Rename InFlag/OutFlag->InGlue/OutGlue. NFC Flag was renamed to Glue a long time ago, but rename was incomplete.	2023-04-02 14:12:10 -07:00
NAKAMURA Takumi	afde3f549d	llvm-tblgen: Apply IWYU partially	2023-02-17 00:32:46 +09:00
serge-sans-paille	fbbc41f8dd	Cleanup include: TableGen This also includes a few cleanup from Support. Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121331	2022-03-11 11:41:32 +01:00
Duncan P. N. Exon Smith	dcd6162b7f	utils: Remove some no-op raw_string_ostream flush calls, NFC Since 65b13610a5226b84889b923bae884ba395ad084d, raw_string_ostream has been unbuffered by default. Based on an audit of llvm/utils/, this commit removes every call to `raw_string_ostream::flush()` and any call to `raw_string_ostream::str()` whose result is ignored or that doesn't help with clarity. I left behind a few calls to `str()`. In these cases, the underlying std::string was declared pretty far away and never used again, whereas stream recently had its last write. The code is easier to read as-is; the no-op call to `flush()` inside `str()` isn't harmful, and when https://reviews.llvm.org/D115421 lands it'll be gone anyway.	2021-12-10 11:26:08 -08:00
Craig Topper	6430430958	[TableGen] Use sign rotated VBR for OPC_EmitInteger. This allows for a much more efficient encoding for small negative numbers by storing the sign bit first and negating the rest of the bits. This was already being used for OPC_CheckInteger. For every in tree target this affects, the table got smaller. R600GenDAGISel.inc saw the largest reduction of 7K. I did have to add a new opcode for StringIntegers used for register class ids and subregister indices since we don't have the integer value to encode. The enum name is emitted directly into the table. Previously assumed the enum would expand to a positive 7-bit number. We might be able to just shift that right by 1 and assume it is a positive 6 bit number, but that will need more investigation.	2021-05-02 12:40:44 -07:00
Simon Pilgrim	b62928b21e	[TableGen] Avoid repeated TreePredicateFn::getCodeToRunOnSDNode() calls in MatcherTableEmitter::EmitNodePredicatesFunction loop. NFCI.	2021-03-01 15:43:37 +00:00
Craig Topper	61d4d9a5d3	[TableGen][SelectionDAG] Improve efficiency of encoding negative immediates for isel's CheckInteger opcode. CheckInteger uses an int64_t encoded using a variable width encoding that is optimized for encoding a number with a lot of leading zeros. Negative numbers have no leading zeros so use the largest encoding requiring 9 bytes. I believe its most like we want to check for positive and negative numbers near 0. -1 is quite common due to its use in the 'not' idiom. To optimize for this, we can borrow an idea from the bitcode format and move the sign bit to bit 0 with the magnitude stored in the upper bits. This will drastically increase the number of leading zeros for small magnitudes. Then we can run this value through VBR encoding. This gives a small reduction in the table size on all in tree targets except VE where size increased by about 300 bytes due to intrinsic ids now requiring 3 bytes instead of 2. Since the intrinsic enum space is shared by all targets this an unfortunate consquence of where VE is currently located in the range. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D96317	2021-02-18 08:53:17 -08:00
Craig Topper	622611f7e5	[TableGen] Use return value from EmitVBRValue instead of calling GetVBRSize on the same value. Consistently use unsigned for child sizes. NFCI getSize and setSize both use unsigned. So size_t doesn't increase range here and might get truncated if passed to setSize. Also not sure why EmitVBRValue was returning uint64_t, but used an unsigned to supply the value.	2021-02-08 16:34:35 -08:00
Paul C. Anagnostopoulos	9b7b8de6d1	[TableGen] [ISel Matcher Emitter] Rework with two passes: one to size, one to emit Differential Revision: https://reviews.llvm.org/D91632	2020-11-21 10:59:13 -05:00
Jay Foad	d0b8810fe4	[TableGen] Indentation and whitespace fixes in generated code. NFC. Some of these were found by running clang-format over the generated code, although that complains about far more issues than I have fixed here. Differential Revision: https://reviews.llvm.org/D90937	2020-11-06 16:10:57 +00:00
Fangrui Song	7016a4b5c3	llvm-tblgen -gen-dag-isel: Hoist SmallVector TmpBuf	2020-04-25 20:41:04 -07:00
Fangrui Song	58dbd5befd	llvm-tblgen -gen-dag-isel: Reduce lib/Target//GenDAGISel.inc X86GenDAGISel.inc: 22597697 bytes -> 20874981 bytes	2020-04-25 20:02:04 -07:00
Stanislav Mekhanoshin	f8d044bbcf	[TBLGEN] Fix subreg value overflow in DAGISelMatcher Tablegen's DAGISelMatcher emits integers in a VBR format, so if an integer is below 128 it can fit into a single byte, otherwise high bit is set, next byte is used etc. MatcherTable is essentially an unsigned char table. When SelectionDAGISel parses the table it does a reverse translation. In a situation when numeric value of an integer to emit is unknown it can be emitted not as OPC_EmitInteger but as OPC_EmitStringInteger using a symbolic name of the value. In this situation the value should not exceed 127. One of the situations when OPC_EmitStringInteger is used is if we need to emit a subreg into a matcher table. However, number of subregs can exceed 127. Currently last defined subreg for AMDGPU is 192. That results in a silent bug in the ISel with matcher reading from an invalid offset. Fixed this bug to emit actual VBR encoded value for a subregs which value exceeds 127. Differential Revision: https://reviews.llvm.org/D74368	2020-02-12 13:29:57 -08:00
Benjamin Kramer	adcd026838	Make llvm::StringRef to std::string conversions explicit. This is how it should've been and brings it more in line with std::string_view. There should be no functional change here. This is mostly mechanical from a custom clang-tidy check, with a lot of manual fixups. It uncovers a lot of minor inefficiencies. This doesn't actually modify StringRef yet, I'll do that in a follow-up.	2020-01-28 23:25:25 +01:00
Matt Arsenault	542720b2bc	TableGen: Support physical register inputs > 255 This was truncating register value that didn't fit in unsigned char. Switch AMDGPU sendmsg intrinsics to using a tablegen pattern. llvm-svn: 366695	2019-07-22 15:02:34 +00:00
Craig Topper	1a872f2b15	Recommit r355224 "[TableGen][SelectionDAG][X86] Add specific isel matchers for immAllZerosV/immAllOnesV. Remove bitcasts from X86 patterns that are no longer necessary." Includes a fix to emit a CheckOpcode for build_vector when immAllZerosV/immAllOnesV is used as a pattern root. This means it can't be used to look through bitcasts when used as a root, but that's probably ok. This extra CheckOpcode will ensure that the first match in the isel table will be a SwitchOpcode which is needed by the caching optimization in the ISel Matcher. Original commit message: Previously we had build_vector PatFrags that called ISD::isBuildVectorAllZeros/Ones. Internally the ISD::isBuildVectorAllZeros/Ones look through bitcasts, but we aren't able to take advantage of that in isel. Instead of we have to canonicalize the types of the all zeros/ones build_vectors and insert bitcasts. Then we have to pattern match those exact bitcasts. By emitting specific matchers for these 2 nodes, we can make isel look through any bitcasts without needing to explicitly match them. We should also be able to remove the canonicalization to vXi32 from lowering, but I've left that for a follow up. This removes something like 40,000 bytes from the X86 isel table. Differential Revision: https://reviews.llvm.org/D58595 llvm-svn: 355784	2019-03-10 05:21:52 +00:00
Craig Topper	57fd733140	Revert r355224 "[TableGen][SelectionDAG][X86] Add specific isel matchers for immAllZerosV/immAllOnesV. Remove bitcasts from X86 patterns that are no longer necessary." This caused the first matcher in the isel table for many targets to Opc_Scope instead of Opc_SwitchOpcode. This leads to a significant increase in isel match failures. llvm-svn: 355433	2019-03-05 19:18:16 +00:00
Craig Topper	4cfc39179e	[TableGen][SelectionDAG][X86] Add specific isel matchers for immAllZerosV/immAllOnesV. Remove bitcasts from X86 patterns that are no longer necessary. Previously we had build_vector PatFrags that called ISD::isBuildVectorAllZeros/Ones. Internally the ISD::isBuildVectorAllZeros/Ones look through bitcasts, but we aren't able to take advantage of that in isel. Instead of we have to canonicalize the types of the all zeros/ones build_vectors and insert bitcasts. Then we have to pattern match those exact bitcasts. By emitting specific matchers for these 2 nodes, we can make isel look through any bitcasts without needing to explicitly match them. We should also be able to remove the canonicalization to vXi32 from lowering, but I've left that for a follow up. This removes something like 40,000 bytes from the X86 isel table. Differential Revision: https://reviews.llvm.org/D58595 llvm-svn: 355224	2019-03-01 20:18:38 +00:00
Craig Topper	8c9724ea4f	[SelectionDAG] Add a OPC_CheckChild2CondCode to SelectionDAGISel to remove a MoveChild and MoveParent pair. OPC_CheckCondCode is always used as operand 2 of a setcc. And its always surrounded by a MoveChild2 and a MoveParent. By having a dedicated opcode for this case we can reduce the number of bytes needed for this pattern from 4 bytes to 2. This saves ~3000 bytes in the X86 table. llvm-svn: 354763	2019-02-25 03:11:44 +00:00

1 2 3 4

158 Commits