llvm-project

Author	SHA1	Message	Date
Rahul Joshi	dafffe262d	[LLVM][MC][DecoderEmitter] Add support to specialize decoder per bitwidth (#154865 ) This change adds an option to specialize decoders per bitwidth, which can help reduce the (compiled) code size of the decoder code. Current state: Currently, the code generated by the decoder emitter consists of two key functions: `decodeInstruction` which is the entry point into the generated code and `decodeToMCInst` which is invoked when a decode op is reached while traversing through the decoder table. Both functions are templated on `InsnType` which is the raw instruction bits that are supplied to `decodeInstruction`. Several backends call `decodeInstruction` with different `InsnType` types, leading to several template instantiations of these functions in the final code. As an example, AMDGPU instantiates this function with type `DecoderUInt128` type for decoding 96/128-bit instructions, `uint64_t` for decoding 64-bit instructions, and `uint32_t` for decoding 32-bit instructions. Since there is just one `decodeToMCInst` in the generated code, it has code that handles decoding for all instruction sizes. However, the decoders emitted for different instructions sizes rarely have any intersection with each other. That means, in the AMDGPU case, the instantiation with InsnType == DecoderUInt128 has decoder code for 32/64-bit instructions that is never exercised. Conversely, the instantiation with InsnType == uint64_t has decoder code for 128/96/32-bit instructions that is never exercised. This leads to unnecessary dead code in the generated disassembler binary (that the compiler cannot eliminate by itself). New state: With this change, we introduce an option `specialize-decoders-per-bitwidth`. Under this mode, the DecoderEmitter will generate several versions of `decodeToMCInst` function, one for each bitwidth. The code is still templated, but will require backends to specify, for each `InsnType` used, the bitwidth of the instruction that the type is used to represent using a type-trait `InsnBitWidth`. This will enable the templated code to choose the right variant of `decodeToMCInst`. Under this mode, a particular instantiation will only end up instantiating a single variant of `decodeToMCInst` generated and that will include only those decoders that are applicable to a single bitwidth, resulting in elimination of the code duplication through instantiation and a reduction in code size. Additionally, under this mode, decoders are uniqued only within a given bitwidth (as opposed to across all bitwidths without this option), so the decoder index values assigned are smaller, and consume less bytes in their ULEB128 encoding. As a result, the generated decoder tables can also reduce in size. Adopt this feature for the AMDGPU and RISCV backend. In a release build, this results in a net 55% reduction in the .text size of libLLVMAMDGPUDisassembler.so and a 5% reduction in the .rodata size. For RISCV, which today uses a single `uint64_t` type, this results in a 3.7% increase in code size (expected as we instantiate the code 3 times now). Actual measured sizes are as follows: ``` Baseline commit: 72c04bb882ad70230bce309c3013d9cc2c99e9a7 Configuration: Ubuntu clang version 18.1.3, release build with asserts disabled. AMDGPU Before After Change ====================================================== .text 612327 275607 55% reduction .rodata 369728 351336 5% reduction RISCV: ====================================================== .text 47407 49187 3.7% increase .rodata 35768 35839 0.1% increase ```	2025-09-01 13:44:18 -07:00
Rahul Joshi	1f2d461e26	[NFC][MC][DecoderEmitter] Simplify loop to find the best filter (#156237 ) We can just use `max_element` on the array of filters.	2025-08-31 06:23:26 -07:00
Sergei Barannikov	06d758537d	[TableGen][Decoder] Remove special case of single sub-op dag (#156175 ) If a custom operand has MIOperandInfo with >= 2 sub-operands, it is required that either the operand or its sub-operands have a decoder method (depending on usage). Require this for single sub-operand operands as well, since there is no good reason not to. There are no changes in the generated files.	2025-08-31 10:07:44 +03:00
Sergei Barannikov	cacab8a86f	[TableGen][Decoder] Simplify parseFixedLenOperands (NFCI) (#156181 ) Use information from CGIOperandList instead of re-parsing operand dags from scratch.	2025-08-30 14:17:30 +00:00
Sergei Barannikov	ea3a3a825a	[TableGen][Decoder] Cache DecoderNamespace in InstructionEncoding (NFC) (#156059 )	2025-08-29 21:23:08 +03:00
Sergei Barannikov	c9d7d10084	[TableGen][DecoderEmitter] Use StringRef in a few places (NFC) (#156051 )	2025-08-29 16:37:48 +00:00
Sergei Barannikov	ca5d19516b	[TableGen][DecoderEmitter] Simplify emitSoftFailTableEntry (NFC) (#155863 )	2025-08-28 16:20:09 +00:00
Sergei Barannikov	cdc79e32f2	[TableGen][DecoderEmitter] Optimize single-case OPC_ExtractField (#155414 ) OPC_ExtractField followed by a single OPC_FilterValue is equivalent to OPC_CheckField. Optimize this relatively common case.	2025-08-26 19:12:04 +03:00
Sergei Barannikov	156c11200d	[TableGen][DecoderEmitter] Remove no longer needed MaxFilterWidth (NFC) (#155382 ) 11c61581 made the variable redundant. Also remove `Target`, which is apparently unused.	2025-08-26 09:46:26 +00:00
Sergei Barannikov	e49946b27f	[TableGen][DecoderEmitter] Factor out DecoderTableBuilder (#155220 ) Extract the table building methods from FilterChooser into a separate class to relieve it of one of its responsibilities.	2025-08-26 05:10:19 +03:00
Sergei Barannikov	2a586a8118	[TableGen][DecoderEmitter] Remove dead OPC_Fail (#155229 ) It can never be reached. It could be reached if we emitted an opcode that could fall outside the outermost scope, but emission of all such opcodes is guarded by `!isOutermostScope()`. That also means we never add fixups to the outermost scope, so avoid pushing an entry for it onto the stack.	2025-08-25 19:15:35 +03:00
Sergei Barannikov	872a1ed081	[TableGen][DecoderEmitter] Remove PredicateNamespace (NFC) (#155211 ) There is no target named Thumb, so there is no need to make a special case for it. As part of this change, pass CodeGenTarget instead of DecoderEmitter to FilterChooser to remove dependency between the latter two.	2025-08-25 18:45:58 +03:00
Rahul Joshi	11c615818f	[NFCI][MC][DecoderEmitter] Fix BitWidth for fixed length inst encodings (#154934 ) Change `InstructionEncoding` to use `Size` field to derive the BitWidth for fixed length instructions as opposed to the number of bits in the `Inst` field. For some backends, `Inst` has more bits than `Size`, but `Size` is the true size of the instruction. Also add validation that `Inst` has at least `Size * 8` bits and any bits in `Inst` beyond that are either 0 or unset. Verified no change in generated *GenDisassembler.inc files before/after.	2025-08-24 07:04:07 -07:00
Sergei Barannikov	c9106c8298	[TableGen][DecoderEmitter] Add a couple of helper methods (NFC) (#155163 ) Replace push_back with more specific insertOpcode/insertUInt8.	2025-08-24 15:32:05 +03:00
Sergei Barannikov	ec860d1b87	[TableGen][DecoderEmitter] Refactor emitTableEntries (NFCI) (#155100 ) * Inline two small functions so that `emitTableEntries()` calls itself directly rather than through other functions. * Peel the last iteration of the loop as it is special. This should make the code easier to follow.	2025-08-24 12:39:19 +03:00
Sergei Barannikov	6ae0d9591e	[TableGen][DecoderEmitter] Print the size of the decoder tables (#155139 ) So we can see the changes in table sizes after making changes to DecoderEmitter by simply running `grep DecoderTable`. Also, remove an unnecessary terminating 0 from the end of the tables.	2025-08-24 09:09:31 +03:00
Sergei Barannikov	1ab3042318	[TableGen][DecoderEmitter] Fix indentation in generated code (NFC) `MCD::OPC_SoftFail` case in the generated `decodeInstruction()` was overindented, except for the closing brace, which was underindented.	2025-08-24 07:17:34 +03:00
Sergei Barannikov	ee55efc711	[TableGen][DecoderEmitter] Repurpose Filter class (#155065 ) There was a lot of confusion about the responsibilities of Filter and FilterChooser. They created instances of each other and called each other's methods. Some of the methods had similar names and did similar things. This change moves most of the Filter members to FilterChooser and turns Filter into a supplementary class with short lifetime. FilterChooser constructs an array of (candidate) Filters, chooses the best performing one, and applies it to the given set of encodings, creating inferior FilterChoosers as necessary. The Filter array is then destroyed. All responsibility for generating the decoder table now lies with FilterChooser.	2025-08-23 09:01:24 +03:00
Sergei Barannikov	68964f5dad	[TableGen][DecoderEmitter] Small refactoring (NFC) Few changes extracted from #155065 to make it smaller.	2025-08-23 06:39:45 +03:00
Sergei Barannikov	98262e5bfe	[TableGen][DecoderEmitter] Fix broken AdditionalEncoding support (#155057 ) We didn't have tests for AdditionalEncoding and none of the in-tree targets use this functionality, so I inadvertently broke it in #154288.	2025-08-23 02:48:59 +00:00
Sergei Barannikov	8aba413497	[TableGen][DecoderEmitter] Extract a couple of methods (NFC) (#155044 ) Extract `findBestFilter() const` searching for the best filter and move calls to `recurse()` out of it to a single place. Extract `dump()` as well, it is useful for debugging.	2025-08-22 23:21:45 +00:00
Sergei Barannikov	539259d6e3	[TableGen][DecoderEmitter] Remove unused move constructor (NFC) Also delete no-op destructor declaration.	2025-08-23 02:00:43 +03:00
Sergei Barannikov	4028896d4b	[TableGen][DecoderEmitter] Move a function to InstructionEncoding (NFC) (#155038 )	2025-08-22 22:37:15 +00:00
Sergei Barannikov	0d6ca2f969	[TableGen][DecoderEmitter] Fix decoder reading bytes past instruction (#154916 ) See the added test. Before this change the decoder would first read the second byte, despite the fact that there are 1-byte instructions that could match: ``` /* 0 / MCD::OPC_ExtractField, 8, 8, // Inst{15-8} ... / 3 / MCD::OPC_FilterValue, 0, 4, 0, // Skip to: 11 / 7 / MCD::OPC_Decode, 186, 2, 0, // Opcode: I16_0, DecodeIdx: 0 / 11 / MCD::OPC_FilterValue, 1, 4, 0, // Skip to: 19 / 15 / MCD::OPC_Decode, 187, 2, 0, // Opcode: I16_1, DecodeIdx: 0 / 19 / MCD::OPC_FilterValue, 2, 4, 0, // Skip to: 27 / 23 / MCD::OPC_Decode, 188, 2, 0, // Opcode: I16_2, DecodeIdx: 0 / 27 / MCD::OPC_ExtractField, 0, 1, // Inst{0} ... / 30 / MCD::OPC_FilterValue, 0, 4, 0, // Skip to: 38 / 34 / MCD::OPC_Decode, 189, 2, 1, // Opcode: I8_0, DecodeIdx: 1 / 38 / MCD::OPC_FilterValueOrFail, 1, / 40 / MCD::OPC_Decode, 190, 2, 1, // Opcode: I8_1, DecodeIdx: 1 / 44 */ MCD::OPC_Fail, ``` There are no changes in the generated files. The only in-tree target that uses variable length decoder is M68k, which is free of decoding conflicts that could result in the decoder doing OOB access. This also fixes misaligned "Decoding Conflict" dump, prettified example output is shown in the second test.	2025-08-23 00:51:47 +03:00
Sergei Barannikov	6a7ade03d1	[TableGen][DecoderEmitter] Remove redundant variable (NFC) (#154880 ) `NumFiltered` is the number of elements in all vectors in a map. It is ever compared to 1, which is equivalent to checking if the map contains exactly one vector with exactly one element.	2025-08-22 04:42:06 +00:00
Sergei Barannikov	418fb50301	[TableGen][DecoderEmitter] Calculate encoding bits once (#154026 ) Parse the `Inst` and `SoftField` fields once and store them in `InstructionEncoding` so that we don't parse them every time `getMandatoryEncodingBits()` is called.	2025-08-22 05:19:35 +03:00
Rahul Joshi	4eeeb8a01e	[NFC][MC][Decoder] Fix off-by-one indentation in generated code (#154855 )	2025-08-21 17:20:05 -07:00
Sergei Barannikov	c74afaac6c	[TableGen][DecoderEmitter] Use KnownBits for filters/encodings (NFCI) (#154691 ) `KnownBits` is faster and smaller than `std::vector<BitValue>`. It is also more convenient to use.	2025-08-22 01:37:47 +03:00
Sergei Barannikov	33f6b10c17	[TableGen][DecoderEmitter] Resolve a FIXME in emitDecoder (#154649 ) As the FIXME says, we might generate the wrong code to decode an instruction if it had an operand with no encoding bits. An example is M68k's `MOV16ds` that is defined as follows: ``` dag OutOperandList = (outs MxDRD16:$dst); dag InOperandList = (ins SRC:$src); list<Register> Uses = [SR]; string AsmString = "move.w\t$src, $dst" dag Inst = (descend { 0, 1, 0, 0, 0, 0, 0, 0, 1, 1 }, (descend { 0, 0, 0 }, (operand "$dst", 3))); ``` The `$src` operand is not encoded, but what we see in the decoder is: ```C++ tmp = fieldFromInstruction(insn, 0, 3); if (!Check(S, DecodeDR16RegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; } if (!Check(S, DecodeSRCRegisterClass(MI, insn, Address, Decoder))) { return MCDisassembler::Fail; } return S; ``` This calls DecodeSRCRegisterClass passing it `insn` instead of the value of a field that doesn't exist. DecodeSRCRegisterClass has an unconditional llvm_unreachable inside it. New decoder looks like: ```C++ tmp = fieldFromInstruction(insn, 0, 3); if (!Check(S, DecodeDR16RegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; } return S; ``` We're still not disassembling this instruction right, but at least we no longer have to provide a weird operand decoder method that accepts instruction bits instead of operand bits. See #154477 for the origins of the FIXME.	2025-08-21 22:22:16 +00:00
Rahul Joshi	22f8693248	[NFC][MC][Decoder] Extract fixed pieces of decoder code into new header file (#154802 ) Extract fixed functions generated by decoder emitter into a new MCDecoder.h header.	2025-08-21 15:06:43 -07:00
Sergei Barannikov	2421929ca6	[TableGen][DecoderEmitter] Infer encoding's HasCompleteDecoder earlier (NFCI) (#154644 ) If an encoding has a custom decoder, the decoder is assumed to be "complete" (always succeed) if hasCompleteDecoder field is true. We determine this when constructing InstructionEncoding. If the decoder for an encoding is generated, it always succeeds if none of the operand decoders can fail. The latter is determined based on the value of operands' DecoderMethod/hasCompleteDecoder. This happens late, at table construction time, making the code harder to follow. This change moves this logic to the InstructionEncoding constructor.	2025-08-21 21:35:30 +00:00
Sergei Barannikov	b96d5c2452	[TableGen][DecoderEmitter] Outline InstructionEncoding constructor (NFC) (#154673 ) It is going to grow, so it makes sense to move its definition out of class. Instead, inline `populateInstruction()` into it. Also, rename a couple of methods to better convey their meaning.	2025-08-21 06:08:57 +00:00
Sergei Barannikov	46343ca374	[TableGen][DecoderEmitter] Add DecoderMethod to InstructionEncoding (NFC) (#154477 ) We used to abuse Operands list to store instruction encoding's DecoderMethod there. Let's store it in the InstructionEncoding class instead, where it belongs.	2025-08-20 21:59:59 +00:00
Sergei Barannikov	19ac1ff56e	[TableGen][DecoderEmitter] Factor populateFixedLenEncoding (NFC) (#154511 ) Also drop the debug code under `#if 0` and a seemingly outdated comment.	2025-08-20 11:34:59 +00:00
Sergei Barannikov	9ae0bd2c9f	[TableGen][DecoderEmitter] Move Operands to InstructionEncoding (NFCI) (#154456 ) This is where they belong, no need to maintain a separate map keyed by encoding ID. `populateInstruction()` has been made a member of `InstructionEncoding` and is now called from the constructor.	2025-08-20 07:10:34 +03:00
Sergei Barannikov	8666ffdd15	[TableGen][DecoderEmitter] Rename some variables (NFC) And change references to pointers, to make the future diff smaller.	2025-08-20 04:55:07 +03:00
Sergei Barannikov	6462223853	[TableGen] Make ParseOperandName method const (NFC) Also change its name to start with a lowercase letter and update the doxygen comment to conform to the coding standard.	2025-08-20 03:21:15 +03:00
Sergei Barannikov	803edce6f7	[TableGen][DecoderEmitter] Analyze encodings once (#154309 ) Follow-up to #154288. With HwModes involved, we used to analyze the same encoding multiple times (unless `-suppress-per-hwmode-duplicates=O2` is specified). This affected the build time and made the statistics inaccurate. From the point of view of the generated code, this is an NFC.	2025-08-19 23:17:12 +00:00
Sergei Barannikov	07a6323c32	[TableGen][DecoderEmitter] Turn EncodingAndInst into a class (NFC) (#154230 ) The class will get more methods in follow-up patches.	2025-08-20 01:29:26 +03:00
Sergei Barannikov	56ce40bc73	[TableGen][DecoderEmitter] Stop duplicating encodings (NFC) (#154288 ) When HwModes are involved, we can duplicate an instruction encoding that does not belong to any HwMode multiple times. We can do better by mapping HwMode to a list of encoding IDs it contains. (That is, duplicate IDs instead of encodings.) The encodings that were duplicated are still processed multiple times (e.g., we call an expensive populateInstruction() on each instance). This is going to be fixed in subsequent patches.	2025-08-19 09:02:22 +00:00
Sergei Barannikov	cded128009	[TableGen][DecoderEmitter] Extract encoding parsing into a method (NFC) (#154271 ) Call it from the constructor so that we can make `run` method `const`. Turn a couple of related functions into methods as well.	2025-08-19 06:35:59 +00:00
Sergei Barannikov	6c3a0ab51a	[TableGen][DecoderEmitter] Shorten a few variable names (NFC) These "Numbered"-prefixed names were rather confusing than helpful.	2025-08-19 08:05:02 +03:00
Sergei Barannikov	f84ce1e1d0	[TableGen][DecoderEmitter] Extract a couple of loop invariants (NFC)	2025-08-19 07:47:15 +03:00
Sergei Barannikov	c8c2218c00	[TableGen][DecoderEmitter] Synthesize decoder table name in emitTable (#154255 ) Previously, HW mode name was appended to decoder namespace name when enumerating encodings, and then emitTable appended the bit width to it to form the final table name. Let's do this all in one place. A nice side effect is that this allows us to avoid having to deal with std::string. The changes in the tests are caused by the different order of tables.	2025-08-19 06:19:54 +03:00
Sergei Barannikov	61a859bf6f	Use llvm::copy instead of append_range to work around MacOS build failure	2025-08-19 01:43:22 +03:00
Sergei Barannikov	0cd4ae9be0	Reland "[TableGen][DecoderEmitter] Store HW mode ID instead of name (NFC) (#154052 )" (#154212 ) This reverts commit 5612dc533a9222a0f5561b2ba7c897115f26673f. Reland with MacOS build fixed.	2025-08-18 22:28:20 +00:00
Shubham Sandeep Rastogi	5612dc533a	Revert "[TableGen][DecoderEmitter] Store HW mode ID instead of name (NFC) (#154052 )" This reverts commit b20bbd48e8b1966731a284b4208e048e060e97c2. Reverted due to greendragon failures: 20:34:43 In file included from /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/utils/TableGen/DecoderEmitter.cpp:14: 20:34:43 In file included from /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/utils/TableGen/Common/CodeGenHwModes.h:14: 20:34:43 In file included from /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/include/llvm/ADT/DenseMap.h:20: 20:34:43 In file included from /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/include/llvm/ADT/STLExtras.h:21: 20:34:43 In file included from /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/include/llvm/ADT/Hashing.h:53: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/algorithm:1913: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/chrono:746: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/__chrono/convert_to_tm.h:19: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/__chrono/statically_widen.h:17: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/__format/concepts.h:17: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/__format/format_parse_context.h:15: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/string_view:1027: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/functional:515: 20:34:43 In file included from /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/__functional/boyer_moore_searcher.h:26: 20:34:43 /Applications/Xcode-beta.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX14.2.sdk/usr/include/c++/v1/vector:1376:19: error: object of type 'llvm::const_set_bits_iterator_impl<llvm::SmallBitVector>' cannot be assigned because its copy assignment operator is implicitly deleted 20:34:43 __mid = __first; 20:34:43 ^ 20:34:43 /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/utils/TableGen/DecoderEmitter.cpp:2404:13: note: in instantiation of function template specialization 'std::vector<unsigned int>::assign<llvm::const_set_bits_iterator_impl<llvm::SmallBitVector>, 0>' requested here 20:34:43 HwModeIDs.assign(BV.set_bits_begin(), BV.set_bits_end()); 20:34:43 ^ 20:34:43 /Users/ec2-user/jenkins/workspace/llvm.org/as-lldb-cmake/llvm-project/llvm/include/llvm/ADT/BitVector.h:35:21: note: copy assignment operator of 'const_set_bits_iterator_impl<llvm::SmallBitVector>' is implicitly deleted because field 'Parent' is of reference type 'const llvm::SmallBitVector &' 20:34:43 const BitVectorT &Parent; 20:34:43 ^ 20:34:43 1 warning and 1 error generated.	2025-08-18 14:36:54 -07:00
Sergei Barannikov	13dd65096b	[TableGen][DecoderEmitter] Rename some variables for clarity (NFC)	2025-08-19 00:16:56 +03:00
Sergei Barannikov	b20bbd48e8	[TableGen][DecoderEmitter] Store HW mode ID instead of name (NFC) (#154052 ) This simplifies code a bit.	2025-08-18 22:53:09 +03:00
Sergei Barannikov	bad02e38c8	[TableGen][DecoderEmitter] Avoid using a sentinel value (#153986 ) `NO_FIXED_SEGMENTS_SENTINEL` has a value that is actually a valid field encoding and so it cannot be used as a sentinel. Replace the sentinel with a new member variable, `VariableFC`, that contains the value previously stored in `FilterChooserMap` with `NO_FIXED_SEGMENTS_SENTINEL` key.	2025-08-18 08:25:17 +03:00

1 2 3 4

152 Commits