llvm-project

Author	SHA1	Message	Date
Rahul Joshi	6182015698	[NFC][LLVM][TableGen] Adjust pointer increments in DecoderEmitter (#136230 ) - In both `emitTable` and the generated `decodeInstruction` function increment the pointer to the decoder op as a part of the switch statement instead of later on in each case.	2025-04-18 10:08:00 -07:00
Rahul Joshi	c244daec1c	[LLVM][TableGen] Fix Windows failure in DecoderEmitter (#136310 ) - Avoid dereferencing the end() iterator to get the end pointer, instead calculate it explicitly - Fixes a regression introduced in https://github.com/llvm/llvm-project/pull/136220. - The windows build failure shows the following call stack: ``` \| Exception Code: 0x80000003 \| #0 0x00007ff74bc05897 std::_Vector_const_iterator<class std::_Vector_val<struct std::_Simple_types<unsigned char>>>::operator*(void) const C:\Program Files\Microsoft Visual Studio\2022\Professional\VC\Tools\MSVC\14.37.32822\include\vector:52:0 \| #1 0x00007ff74bbd3d64 `anonymous namespace'::DecoderEmitter::emitTable D:\buildbot\llvm-worker\clang-cmake-x86_64-avx512-win\llvm\llvm\utils\TableGen\DecoderEmitter.cpp:852:0 ```	2025-04-18 10:05:40 -07:00
Rahul Joshi	3ed83630b2	[NFC][LLVM][TableGen] Use `decodeULEB128` for `OPC_SoftFail` emission (#136220 ) - Use `decodeULEB128` to decode +ve/-ve mask in OPC_SoftFail case. - Use current `I`/`E` iterators as inputs to `decodeULEB128`.	2025-04-18 05:12:35 -07:00
Rahul Joshi	6c4caae449	[LLVM][TableGen] Move DecoderEmitter output to anonymous namespace (#136214 ) - Move the code generated by DecoderEmitter to anonymous namespace. - Move AMDGPU's usage of this code from header file to .cpp file. Note, we get build errors like "call to function 'decodeInstruction' that is neither visible in the template definition nor found by argument-dependent lookup" if we do not change AMDGPU.	2025-04-18 04:35:05 -07:00
Rahul Joshi	6d8bf3cf3d	Revert "Reapply "[LLVM][TableGen] Parameterize NumToSkip in DecoderEmitter" (#136017 )" (#136068 ) Reverts llvm/llvm-project#136019 Expensive checks tests are failing, so reverting.	2025-04-16 18:24:10 -07:00
Rahul Joshi	8ebdd9d8a1	Reapply "[LLVM][TableGen] Parameterize NumToSkip in DecoderEmitter" (#136017 ) (#136019 ) This reverts commit 7fd0c8acd4659ccd0aef5486afe32c8ddf0f2957, and fixes the assert condition in `patchNumToSkip`.	2025-04-16 15:40:34 -07:00
Rahul Joshi	7fd0c8acd4	Revert "[LLVM][TableGen] Parameterize NumToSkip in DecoderEmitter" (#136017 ) Reverts llvm/llvm-project#135882 Causing assert failures for AArch64 backend	2025-04-16 13:16:32 -07:00
Rahul Joshi	598ec8ce2d	[LLVM][TableGen] Parameterize NumToSkip in DecoderEmitter (#135882 ) - Add command line option `num-to-skip-size` to parameterize the size of `NumToSkip` bytes in the decoder table. Default value will be 2, and targets that need larger size can use 3. - Keep all existing targets, except AArch64, to use size 2, and change AArch64 to use size 3 since it run into the "disassembler decoding table too large" error with size 2. - Following is a rough reduction in size for the decoder tables by switching to size 2. ``` Target Old Size New Size % Reduction ================================================ AArch64 153254 153254 0.00 AMDGPU 471566 412805 12.46 ARC 5724 5061 11.58 ARM 84936 73831 13.07 AVR 1497 1306 12.76 BPF 2172 1927 11.28 CSKY 10064 8692 13.63 Hexagon 47967 41965 12.51 Lanai 1108 982 11.37 LoongArch 24446 21621 11.56 MSP430 4200 3716 11.52 Mips 36330 31415 13.53 PPC 31897 28098 11.91 RISCV 37979 32790 13.66 Sparc 8331 7252 12.95 SystemZ 36722 32248 12.18 VE 48296 42873 11.23 XCore 2590 2316 10.58 Xtensa 3827 3316 13.35 ```	2025-04-16 13:07:58 -07:00
Rahul Joshi	4780658823	[NFC][TableGen] DecoderEmitter optimize scope stack in `Filter::emitTableEntry` (#135693 ) - Create a new stack scope only in the fallthrough case. - For the non-fallthrough cases, any fixup entries will naturally be added to the existing scope without needing to copy them manually. - Verified that the generated `GenDisassembler` files are identical with and without this change.	2025-04-15 07:49:01 -07:00
Rahul Joshi	9ba65cbcb5	[NFC][TableGen] Refactor DecoderEmitter.cpp (#135510 ) - Add helper functions to insert ULEB128 encoded value and NumToSkip. - Use ArrayRef<> instead of const vector references as function arguments. - Return `OpHasCompleteDecoder` by value instead of by reference. - Use range for loops. - Remove {} around single line if/else bodies. - In `emitSoftFailTableEntry`, unconditionally emit the Positive and Negative mask values, instead of explicitly emitting a 0 byte when the mask is not needed.	2025-04-14 14:09:00 -07:00
Craig Topper	40c859a704	[TableGen] Use size returned by encodeULEB128 to simplify some code. NFC (#133750 ) We can use the length to insert all the bytes at once instead of partially decoding them to insert one byte at a time.	2025-03-31 15:58:36 -07:00
Kazu Hirata	2c73711995	[TableGen] Use llvm::append_range (NFC) (#133649 )	2025-03-30 12:21:38 -07:00
Craig Topper	fd21d35178	[TableGen] Reduce the number of vectors passed to getIslands. NFC (#130402 ) Combine the StartBits, EndBits, and FieldVals vectors into a single vector of a struct that contains all 3 pieces of information. Instead of storing EndBits, we store NumBits since that's what the users want. I've removed the BitNo variable as it was easy to construct calculate from StartBit. I've also removed Num in favor of Islands.size().	2025-03-10 21:02:09 -07:00
Craig Topper	f2607df291	[TableGen] Use uint8_t for bit_value_t enum. NFC This reduces the amount of space needed for vectors of bit_value_t and allows the user of memset. Also reorder the enum values so BIT_FALSE is 0 and BIT_TRUE is 1.	2025-03-07 23:22:45 -08:00
Craig Topper	d65719fab3	[TableGen] Use isUInt to simplify some asserts. NFC	2025-03-07 22:51:43 -08:00
Craig Topper	8370ac88af	[TableGen] Remove push_back from loop. NFC We can initialize the vector to the right size and then assign over some entries in the loop.	2025-03-07 22:01:15 -08:00
Craig Topper	f578982490	[TableGen] Remove unnecessary const_cast. NFC	2025-03-07 21:52:29 -08:00
Craig Topper	ff033d1f28	[TableGen] Use reference instead of pointer for FilterChooser in Filter. NFC	2025-03-07 19:11:31 -08:00
Craig Topper	6a42dc694c	[TableGen] Simplify emitULEB128 in DecoderEmitter.cpp. NFC (#130214 ) Instead of returning the number of bytes emitted, just take the iterator by reference so the increments in emitULEB128 will update the copy in the caller. Also pass the iterator by reference to emitNumToSkip so we don't need a separate I += 3 in the caller.	2025-03-07 11:09:34 -08:00
Craig Topper	efb880de11	[TableGen] Fix incorrect comment. NFC	2025-03-03 14:37:37 -08:00
Craig Topper	3ce67a81fa	[TableGen] Remove unnecessary use of utostr to print a byte. NFC We can cast to unsigned instead.	2025-03-03 14:37:37 -08:00
chrisPyr	71f4c7dabe	[NFC]Make file-local cl::opt global variables static (#126486 ) #125983	2025-03-03 13:46:33 +07:00
Craig Topper	4059faf613	[TableGen] Update comment for size of NumToSkip field in DecoderEmitter. NFC NumToSkip is 24 bits. It used to be 16 bits.	2025-02-26 10:12:38 -08:00
Jay Foad	4e8c9d2813	[TableGen] Use std::pair instead of std::make_pair. NFC. (#123174 ) Also use brace initialization and emplace to avoid explicitly constructing std::pair, and the same for std::tuple.	2025-01-16 13:20:41 +00:00
abhishek-kaushik22	943b212d56	[TableGen] Use `std::move` to avoid copy (#123088 )	2025-01-15 22:50:00 +05:30
abhishek-kaushik22	31ce47b5d6	[TableGen] Use `std::move` to avoid copy (#113061 )	2024-11-21 11:48:46 -08:00
Rahul Joshi	62e2c7fb2d	[LLVM][TableGen] Change all `Init` pointers to const (#112705 ) This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-10-18 07:50:22 -07:00
Rahul Joshi	708567ab0b	[LLVM][TableGen] Adopt `indent` for indentation (#109275 ) Adopt `indent` for indentation DAGISelMatcher and DecoderEmitter.	2024-09-20 04:28:01 -07:00
Rahul Joshi	b594b93024	[LLVM][TableGen] Change DisassemblerEmitter to use const RecordKeeper (#109177 ) Change DisassemblerEmitter to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-09-20 04:22:37 -07:00
Rahul Joshi	3e24dd42dd	[NFC] Rename variables to conform to LLVM coding standards (#109166 ) Rename `indent` to `Indent` and `o` to `OS`. Rename `Indentation` to `Indent`. Remove unused argument from `emitPredicateMatch`. Change `Indent` argument to `emitBinaryParser` to by value.	2024-09-19 04:49:12 -07:00
Rahul Joshi	2bb3621faa	[LLVM][TableGen] Change DecoderEmitter to use const RecordKeeper (#109040 ) Change DecoderEmitter to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-09-18 05:35:26 -07:00
Rahul Joshi	bdf02249e7	[TableGen] Change CGIOperandList::OperandInfo::Rec to const pointer (#107858 ) Change CGIOperandList::OperandInfo::Rec and CGIOperandList::TheDef to const pointer. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089	2024-09-09 14:33:21 -07:00
Rahul Joshi	0ceffd362b	[TableGen] Add PrintError family overload that take a print function (#107333 ) Add PrintError and family overload that accepts a print function. This avoids constructing potentially long strings for passing into these print functions.	2024-09-07 05:13:54 -07:00
Max Beck-Jones	a46d60ad32	[NFC] [AArch64] Refactor predicate register class decode functions (#97412 ) In a previous PR #81716, a new decoder function was added to llvm/lib/Target/AArch64/Disassembler/AArch64Disassembler.cpp. During code review it was suggested that, as most of the decoder functions were very similar in structure, that they be refactored into a single, templated function. I have added the refactored function, removed the definitions of the replaced functions, and replaced the references to the replaced functions in AArch64Disassembler.cpp and llvm/lib/Target/AArch64/AArch64RegisterInfo.td. To reduce the number of duplicate references in AArch64RegisterInfo.td, I have also made a small change to llvm/utils/TableGen/DecoderEmitter.cpp.	2024-07-15 16:57:42 +01:00
Piotr Fusik	35bb9f158b	[TableGen][NFC] Use `decodeULEB128AndIncUnsafe` in `decodeInstruction` (#98619 )	2024-07-14 13:27:51 -07:00
Fangrui Song	efad14954c	[Support] Add end/error to decode[US]LEB128AndInc Follow-up to #85739 to encourage error checking. We make `end` mandatory and add decodeULEB128AndIncUnsafe to be used without `end`. Pull Request: https://github.com/llvm/llvm-project/pull/90006	2024-05-08 09:22:30 -07:00
superZWT123	da1d3d8fb9	[TableGen] Introduce a less aggressive suppression for HwMode Decoder… (#86060 ) 1. Remove 'AllModes' and 'DefaultMode' suffixes for DecoderTables under default HwMode. 2. Introduce a less aggressive suppression for HwMode DecoderTable, only reduce necessary tables duplications. This allows encodings under different HwModes to retain the original DecoderNamespace. 3. Change 'suppress-per-hwmode-duplicates' command option from bool type to enum type, allowing users to choose what level of suppression to use.	2024-04-01 17:19:46 +08:00
Pierre van Houtryve	fa3d789df1	[RFC][TableGen] Restructure TableGen Source (#80847 ) Refactor of the llvm-tblgen source into: - a "Basic" library, which contains the bare minimum utilities to build `llvm-min-tablegen` - a "Common" library which contains all of the helpers for TableGen backends. Such helpers can be shared by more than one backend, and even unit tested (e.g. CodeExpander is, maybe we can add more over time) Fixes #80647	2024-03-25 09:40:35 +01:00
Fangrui Song	35bf8e798d	[Support] Add decodeULEB128AndInc/decodeSLEB128AndInc Many decodeULEB128/decodeSLEB128 users need to increment the pointer. Add helpers to simplify this common pattern. We don't add `end` and `error` parameters at present because many users don't need them. Pull Request: https://github.com/llvm/llvm-project/pull/85739	2024-03-19 15:40:23 -07:00
mahesh-attarde	390f28702f	[CodeGen][Tablegen] Fix uninitialized var and shift overflow. (#84896 ) Fix uninitialized var and shift overflow.	2024-03-13 22:03:15 +08:00
Jason Eckhardt	e9492ccae0	[TableGen] DecoderEmitter clean-ups and modernization. (#84832 ) The decoder emitter is showing some signs of age. This patch makes a few kinds of clean-ups: - Use ranged-for more widely, including using enumerate() for those loops maintaining a loop index along with the items. - Reduce the number of arguments to fieldFromInsn (removes an out reference parameter: CodingStandards). The insn_t argument to insnWithID can/should probably be removed soon too since modern C++ allows us to return a local container without a copy. - Use raw strings for the large emitted code segments. This enhances both readability and modifiability.	2024-03-12 16:01:58 -05:00
Jason Eckhardt	6f7e940c2d	[TableGen] More efficiency improvements for encode/decode emission. (#84647 ) DecoderEmitter and CodeEmitterGen perform repeated linear walks over the entire instruction list. This patch eliminates two more such walks. The eliminated traversals visit every instruction merely to determine whether the target has variable length encodings. For a target with variable length encodings, the original any_of will terminate quickly. But all targets other than M68k use fixed length encodings and thus any_of must visit the entire instruction list.	2024-03-11 08:13:33 -05:00
Jason Eckhardt	ad43ea3328	[TableGen] Add support for DefaultMode in per-HwMode encode/decode. (#83029 ) Currently the decoder and encoder emitters will crash if DefaultMode is used within an EncodingByHwMode. As can be done today for RegInfoByHwMode and ValueTypeByHwMode, this patch adds support for this usage in EncodingByHwMode: let EncodingInfos = EncodingByHwMode<[ModeA, DefaultMode], [EncA, EncDefault]>;	2024-02-29 01:47:18 +08:00
Jason Eckhardt	f75c6ed93e	[TableGen] Efficiency improvements for encoding HwMode collection. (#82902 ) Currently the DecoderEmitter spends a fair amount of cycles performing repeated linear walks over the entire instruction list. This patch eliminates one such walk during HwMode collection for EncodingInfos. The eliminated traversal visits every instruction and then every EncodingInfos entry for that instruction merely to collect all referenced HwModes. That information already happens to be present in the HwModeSelects created during the one-time construction of CodeGenHwModes. We instead traverse the HwModeSelects, collecting each one referenced as an encoding select. This set is a small constant in size and does not generally grow with the size of the instruction set.	2024-02-26 12:58:17 +08:00
Jason Eckhardt	05af9c83f3	[TableGen] Suppress per-HwMode duplicate instructions/tables. (#82567 ) Currently, for per-HwMode encoding/decoding, those instructions that do not have a HwMode override are duplicated into the decoder tables for all HwModes. This includes inducing multiple tables for instructions that are otherwise unrelated (e.g., different namespace with no overrides at all). This patch adds support to suppress instruction and table duplicates. TableGen option "-gen-disassembler --suppress-per-hwmode-duplicates" enables the suppression (off by default). For one downstream backend with a complicated ISA and major cross-generation encoding differences, this eliminates ~32000 duplicate table entries at the time of this patch. There are legitimate reasons to suppress or not suppress duplicates. If there are relatively few non-overridden related instructions, it can be convenient to pull them into the per-mode tables (only need to decode the per-mode tables, slightly simpler decode function in disassembler). On the other hand, in some backends, the opposite is true or the size is too large to tolerate any duplication in the first place. We let the user decide which makes sense. This is currently off by default, though there is no reason it couldn't be enabled by default. Any existing backends downstream using the per-HwMode feature will function as before. Turning on the feature requires minor modifications to their disassembler due to more/less tables and naming.	2024-02-22 11:36:10 +08:00
Jason Eckhardt	2ed0aacf97	[TableGen] Fixes for per-HwMode decoding problem (#82201 ) Today, if any instruction uses EncodingInfos/EncodingByHwMode to override the default encoding, the opcode field of the decoder table is generated incorrectly. This causes failed disassemblies and other problems. Specifically, the main correctness issue is that the EncodingID is inadvertently stored in the table rather than the actual opcode. This is caused by having set up the IndexOfInstruction map incorrectly during the loop to populate NumberedEncodings-- which is then propagated around when OpcMap is set up with a bad EncodingIDAndOpcode. Instead, do away with IndexOfInstruction altogether and use opcode value queried from CodeGenTarget::getInstrIntValue to set up OpcMap. This itself exposed another problem where emitTable was using the decoded opcode to index into NumberedEncodings. Instead pass in the EncodingIDAndOpcode vector, and create the reverse mapping from Opcode to EncodingID, which is then used to index NumberedEncodings. This problem is not currently exposed upstream since no in-tree targets yet use the per-HwMode feature. It does show up in at least two downstream targets.	2024-02-19 13:14:22 +08:00
Jay Foad	f723260a80	[TableGen] Stop using make_pair and make_tuple. NFC. (#81730 ) These are unnecessary since C++17.	2024-02-14 13:16:20 +00:00
Pierre van Houtryve	b9079baadd	[NFC] clang-format utils/TableGen (#80973 ) ``` find llvm/utils/TableGen -iname ".h" -o -iname ".cpp" \| xargs clang-format-16 -i ``` Split from #80847	2024-02-09 09:27:04 +01:00
Jason Eckhardt	1442b0e653	[TableGen] Remove redundant buffer copies for ULEB128 decode calls. (#80199 ) This patch removes a couple of redundant buffer copies in emitTable for setting up calls to decodeULEB128. Instead, provide the Table.data buffer directly to the calls-- where decodeULEB128 does its own buffer overflow checking. Factor out 7 explicit loops to emit ULEB128 bytes into emitULEB128. Also factor out 4 copies of 24-bit numtoskip emission into emitNumToSkip. The functionality is already covered by existing unit tests and by virtue of most of the in-tree back-ends exercising the decoder emitter.	2024-02-06 13:23:13 +08:00
Jason Eckhardt	d93f850c6f	[TableGen] Extend OPC_ExtractField/OPC_CheckField start value widths. (#79723 ) Both OPC_ExtractField and OPC_CheckField are currently defined to take an unsigned 8-bit start value. On some architectures with long instruction words, this value can silently overflow, resulting in a bad decoder table. This patch changes each to take a ULE128B-encoded start value instead. Additionally, a range assertion is added for the 8-bit length to prominently notify a user in case that field ever overflows. This problem isn't currently exposed upstream since all in-tree targets use small instruction words (i.e., bitwidth <= 64 bits). It does show up in at least one downstream target with instructions > 64 bits long. Co-authored-by: Jason Eckhardt <jeckhardt@nvidia.com>	2024-01-29 09:22:22 -05:00

1 2

68 Commits