llvm-project

Author	SHA1	Message	Date
Shengchen Kan	381f585198	[X86] Fix Werror X86GenCompressEVEXTables.inc:1627:2: error: extra ';' outside of a function	2024-01-22 12:24:41 +08:00
Shengchen Kan	c2bef33c5e	[X86][NFC] Auto-generate the function to check predicate for EVEX compression	2024-01-22 11:49:24 +08:00
Kazu Hirata	bb6564a1b5	[TableGen] Use StringRef::consume_front (NFC)	2024-01-20 18:57:36 -08:00
Wang Pengcheng	3d90e1fa94	[TableGen] Integrate TableGen-based macro fusion (#73115 ) `Fusion` is inherited from `SubtargetFeature` now. Each definition of `Fusion` will define a `SubtargetFeature` accordingly. Method `getMacroFusions` is added to `TargetSubtargetInfo`, which returns a list of `MacroFusionPredTy` that will be evaluated by MacroFusionMution. `getMacroFusions` will be auto-generated if the target has `Fusion` definitions.	2024-01-19 18:08:09 +08:00
Sergei Barannikov	8e8c954a17	[GISel] Erase the root instruction after emitting all its potential uses (#77494 ) This tries to fix a bug by resolving a few FIXMEs. The bug is that `EraseInstAction` is emitted after emitting the _first_ `BuildMIAction`, which is too early because the erased instruction may still be used by subsequent `BuildMIAction`s (in particular, by `CopyRenderer`). An example of the bug (from `match-table-operand-types.td`): ``` def InstTest0 : GICombineRule< (defs root:$a), (match (G_MUL i32:$x, i32:$b, i32:$c), (G_MUL $a, i32:$b, i32:$x)), (apply (G_ADD i64:$tmp, $b, i32:$c), (G_ADD i8:$a, $b, i64:$tmp))>; GIR_EraseFromParent, /InsnID/0, GIR_BuildMI, /InsnID/1, /Opcode/GIMT_Encode2(TargetOpcode::G_ADD), GIR_Copy, /NewInsnID/1, /OldInsnID/0, /OpIdx/0, // a GIR_Copy, /NewInsnID/1, /OldInsnID/0, /OpIdx/1, // b GIR_AddSimpleTempRegister, /InsnID/1, /TempRegID/0, ``` Here, the root instruction is destroyed before copying its operands ('a' and 'b') to the new instruction. The solution is to emit `EraseInstAction` for the root instruction as the last action in the emission pipeline.	2024-01-13 11:17:41 +03:00
Wang Pengcheng	a2af374284	[SelectionDAG] Add space-optimized forms of OPC_CheckPredicate (#77763 ) We record the usage of each `Predicate` and sort them by usage. For the top 8 `Predicate`s, we will emit a `PC_CheckPredicateN` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 61K. This is a recommit of 1a57927, which was reverted in bc98c31. The CI failures occurred when doing expensive checks (with option `LLVM_ENABLE_EXPENSIVE_CHECKS` being ON). The key point here is that we need stable sorting result in the test, but doing expensive checks uncovered the non-determinism of `llvm::sort`. So `llvm::sort` is changed to `llvm::stable_sort` in this revised patch. And we use `llvm::MapVector` to keep insertion order.	2024-01-12 11:38:05 +08:00
Fangrui Song	c230138011	[SelectionDAG,TableGen] Use MapVector after #73310 Otherwise `ComplexPatternList` order can be non-deterministic.	2024-01-11 19:14:49 -08:00
Fangrui Song	c185a66d83	[SelectionDAG,TableGen] Use stable_sort after #73310 to ensure determinism with https://libcxx.llvm.org/DesignDocs/UnspecifiedBehaviorRandomization.html#unspecified-behavior-randomization	2024-01-11 19:01:53 -08:00
Mikhail Goncharov	bc98c3103a	Revert "[SelectionDAG] Add space-optimized forms of OPC_CheckPredicate (#73488 )" This reverts commit 1a5792735aa0bb10e5624a438bcf7fd5091ee265. Test address-space-patfrags.td.test is failing https://lab.llvm.org/buildbot/#/builders/104/builds/15012	2024-01-11 12:25:00 +01:00
Wang Pengcheng	1a5792735a	[SelectionDAG] Add space-optimized forms of OPC_CheckPredicate (#73488 ) We record the usage of each `Predicate` and sort them by usage. For the top 8 `Predicate`s, we will emit a `PC_CheckPredicateN` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 61K.	2024-01-11 15:43:40 +08:00
Wang Pengcheng	5c8d123838	[SelectionDAG] Add space-optimized forms of OPC_CheckPatternPredicate (#73319 ) We record the usage of each `PatternPredicate` and sort them by usage. For the top 8 `PatternPredicate`s, we will emit a `OPC_CheckPatternPredicateN` to save one byte. The old `OPC_CheckPatternPredicate2` is renamed to `OPC_CheckPatternPredicateTwoByte`. Overall this reduces the llc binary size with all in-tree targets by about 93K.	2024-01-11 15:36:21 +08:00
Wang Pengcheng	211abe38d8	[SelectionDAG] Add space-optimized forms of OPC_CheckComplexPat (#73310 ) We record the usage of each `ComplexPat` and sort the `ComplexPat`s by usage. For the top 8 `ComplexPat`s, we will emit a `OPC_CheckComplexPatN` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 89K.	2024-01-11 15:28:12 +08:00
Aiden Grossman	21a784f24e	[llvm-exegesis] Add tablegen support for validation counters (#76652 ) This patch adds support in the llvm-exegesis tablegen emitter for validation counters. Full support for validation counters in llvm-exegesis will be added in a future patch.	2024-01-10 15:05:58 -08:00
Sergei Barannikov	06286a5532	[GISel] Add RegState::Define to temporary defs in apply patterns (#77425 ) Previously, registers created for temporary defs in apply patterns were rendered as uses, resulting in machine verifier errors.	2024-01-09 17:55:21 +03:00
Alex Bradbury	2d54ec36f7	[SelectionDAG] Add and use SDNode::getAsAPIntVal() helper (#77455 ) This is the logical equivalent for #76710 for APInt and uses the same naming scheme. Converted existing users through: `git grep -l "cast<ConstantSDNode>$.$.getAPIntValueValue" \| xargs sed -E -i 's/cast<ConstantSDNode>$(.*)$->getAPIntValue/\1->getAsAPIntVal/'`	2024-01-09 14:27:07 +00:00
Sergei Barannikov	f92b928b1e	[GISel] Infer the type of an immediate when there is one element in TEC (#77399 ) When there is just one element in the type equivalence class (TEC), `inferNamedOperandType` fails because it does not consider the passed operand as a suitable one. This is incorrect when inferring the type of an (unnamed) immediate operand.	2024-01-09 13:49:10 +03:00
Shengchen Kan	fb72a445c1	[X86] Emit NDD2NonNDD entris in the EVEX comprerssion table, NFCI This patch is a straightfoward change based on the design in #77202. It does not have any effect since we haven't supported compressing ND to non-ND in X86CompressEVEX.cpp.	2024-01-08 19:50:28 +08:00
Shengchen Kan	1c674666fa	[X86] Support EVEX compression for EGPR (#77202 ) Compress promoted instruction (EVEX) to pre-promotion instruction (legacy/VEX) when R16-R31 is not used. Alternative of #77065	2024-01-08 16:50:23 +08:00
Shengchen Kan	61bb3d499a	[X86][NFC] Avoid uselss iterations when emitting EVEX compression table BTW, we relax the condition for EVEX compression from ST.hasAVX512() to ST.hasEGPR() \|\| ST.hasAVX512(). It does not have any effect now b/c no APX instruction is in the EVEX compression table so far. This patch is to extract NFC in #77065 into a separate commit.	2024-01-06 23:53:57 +08:00
Shengchen Kan	4b9bbd3868	[X86][NFC] Refine code in X86CompressEVEXTablesEmitter.cpp 1. Simplify getValueFromBitsInit about cast and return type 2. Remove out-of-date comments and allow memory ops in function object `IsMatch` so that we can reuse it for EVEX2Legacy compression. This patch is to extract NFC in #77065 into a separate commit.	2024-01-06 21:48:19 +08:00
Shengchen Kan	0abf3a93a3	[X86][NFC] Use single table for EVEX compression This patch is to address my review comments in #77065 to simplify the implemention of EVEX2Legacy compression.	2024-01-06 18:55:24 +08:00
Shengchen Kan	04a7ec610e	[X86][NFC] Remove VEX_W1X after 80dbf60	2024-01-06 17:07:39 +08:00
Shengchen Kan	80dbf601d1	[X86][NFC] Remove EVEX2VEXOverride/NotEVEX2VEXConvertible Remove these two classes and put all the entries in X86 EVEX compression tables that need special handling in .def file. PR #77065 tries to add entries that need special handling for APX in .def file. Compared to setting fields in td files, that method looks cleaner. This patch is to unify the addition of manual entries.	2024-01-06 15:43:25 +08:00
Shengchen Kan	a5902a4d24	[X86][NFC] Rename variables/passes for EVEX compression optimization RFC: https://discourse.llvm.org/t/rfc-design-for-apx-feature-egpr-and-ndd-support/73031 APX introduces EGPR, NDD and NF instructions. In addition to compressing EVEX encoded AVX512 instructions into VEX encoding, we also have several more possible optimizations. a. Promoted instruction (EVEX space) -> pre-promotion instruction (legacy space) b. NDD (EVEX space) -> non-NDD (legacy space) c. NF_ND (EVEX space) -> NF (EVEX space) The first two types of compression can usually reduce code size, while the third type of compression can help hardware decode although the instruction length remains unchanged. So we do the renaming for the upcoming APX optimizations. BTW, I clang-format the code in X86CompressEVEX.cpp, X86CompressEVEXTablesEmitter.cpp. This patch also extracts the NFC in #77065 into a separate commit.	2024-01-06 12:41:09 +08:00
Wang Pengcheng	a0e6b7c042	[TableGen] Add a backend to generate MacroFusion predicators (#72222 ) `FusionPredicate` is used to predicate if target instruction matches the requirement. The targets can be firstMI, secondMI or both. The `Fusion` contains a list of `FusionPredicate`. The generated code will be like: ``` bool isNAME(const TargetInstrInfo &TII, const TargetSubtargetInfo &STI, const MachineInstr FirstMI, const MachineInstr &SecondMI) { auto &MRI = SecondMI.getMF()->getRegInfo(); / Predicates / return true; } ``` A boilerplate class called `SimpleFusion` is added. `SimpleFusion` has a predefined structure of predicates and accepts predicate for `firstMI`, predicate for `secondMI` and epilog/prolog as arguments. The generated code for `SimpleFusion` will be like: ``` bool isNAME(const TargetInstrInfo &TII, const TargetSubtargetInfo &STI, const MachineInstr FirstMI, const MachineInstr &SecondMI) { auto &MRI = SecondMI.getMF()->getRegInfo(); /* Prolog / / Predicate for `SecondMI` / / Wildcard / / Predicate for `FirstMI` / / Check One Use / / Tie registers / / Epilog */ return true; } ```	2024-01-05 22:44:04 +08:00
Shengchen Kan	d79ccee8dc	[X86][MC] Support encoding/decoding for APX variant ADD/SUB/ADC/SBB/OR/XOR/NEG/NOT instructions (#76319 ) Four variants: promoted legacy, ND (new data destination), NF (no flags update) and NF_ND (NF + ND). The syntax of NF instructions is aligned with GNU binutils. https://sourceware.org/pipermail/binutils/2023-September/129545.html	2023-12-28 21:22:03 +08:00
Kazu Hirata	1daf2994de	[llvm] Use StringRef::contains (NFC)	2023-12-23 22:21:52 -08:00
Lucas Duarte Prates	b652674dd0	[AsmWriter] Ensure getMnemonic doesn't return invalid pointers (#75783 ) For instructions that don't map to a mnemonic string, the implementation of MCInstPrinter::getMnemonic would return an invalid pointer due to the result of the calculation of the instruction's position in the `AsmStrs` table. This patch fixes the issue by ensuring those cases return a `nullptr` value instead. Fixes #74177.	2023-12-20 10:09:29 +00:00
XinWang10	037c220702	[X86][MC] Support Enc/Dec for EGPR for promoted SHA instruction (#75582 ) R16-R31 was added into GPRs in https://github.com/llvm/llvm-project/pull/70958, This patch supports the encoding/decoding for promoted SHA instruction in EVEX space. RFC: https://discourse.llvm.org/t/rfc-design-for-apx-feature-egpr-and-ndd-support/73031/4	2023-12-20 13:54:50 +08:00
Wang Pengcheng	9348d437f5	[SelectionDAG] Add space-optimized forms of OPC_EmitRegister (#73291 ) The followed byte of `OPC_EmitRegister` is a MVT type, which is usually i32 or i64. We add `OPC_EmitRegisterI32` and `OPC_EmitRegisterI64` so that we can reduce one byte. Overall this reduces the llc binary size with all in-tree targets by about 10K.	2023-12-19 17:31:49 +08:00
Michael Liao	33d5f4314f	[TableGen] AsmParser: Keep consistent naming. NFC	2023-12-18 16:08:43 -05:00
darkbuck	d14ee76181	[GISel][TableGen] Enhance default ops support (#75689 ) - Instead of checking the default ops directly, this change queries DAG default operands collected during patterns reading. It does not only simplify the code but also handle few cases where integer values are converted from convertible types, such as 'bits'. - A test case is added GlobalISelEmitter.td as the regression test of default 'bits' values.	2023-12-17 15:02:10 -05:00
XinWang10	295415e720	[X86][MC] Support Enc/Dec for EGPR for promoted MOVDIR instruction (#74713 ) R16-R31 was added into GPRs in https://github.com/llvm/llvm-project/pull/70958, This patch supports the encoding/decoding for promoted MOVDIR instruction in EVEX space. RFC: https://discourse.llvm.org/t/rfc-design-for-apx-feature-egpr-and-ndd-support/73031/4	2023-12-15 16:03:17 +08:00
Simon Pilgrim	bcee4a9363	[X86] Rename VPERMI2/VPERMT2 to VPERMI2Z/VPERMT2Z (#75192 ) Add missing AVX512 Z prefix to conform to the standard naming convention and simplify matching in X86FoldTablesEmitter::addBroadcastEntry etc.	2023-12-14 09:55:18 +00:00
David Spickett	1e53386690	[llvm][TableGen][Docs] Add tools/resources links This adds a link from the main docs page back to the README where I have previously added a list of useful resources. To that list, I've added a link to my recent llvm blog post.	2023-12-13 09:53:03 +00:00
Pierre van Houtryve	a160536f8d	[TableGen][GlobalISel] Add specialized opcodes (#74823 ) Most users of AddImm and CheckConstantInt only use 1 byte immediates, so I added an opcode variants for those. That way all those instructions save 7 bytes. Also added an opcode for AddTempRegister for the cases where there are no register flags. Space savings: - AMDGPUGenGlobalISel: 470180 bytes to 422564 (-10%) - AArch64GenGlobalISel.inc: 383893 bytes to 374046	2023-12-13 09:09:32 +01:00
Pierre van Houtryve	a110e991c6	[GlobalISel] Change MatchTable entries to 1 byte each (#74429 ) See https://discourse.llvm.org/t/rfc-make-globalisel-match-table-entries-1-byte-instead-of-8/75411 This helps reduce llc's binary size, at the cost of some added complexity to the MatchTable machinery.	2023-12-13 08:48:56 +01:00
Wang Pengcheng	97181bf9a0	[TableGen] Use getSizeInBits (#75157 ) We know the type is scalar type.	2023-12-12 20:40:20 +08:00
wangpc	bbc7f09959	[TableGen][NFC] Remove leading spaces	2023-12-12 18:17:13 +08:00
wangpc	0c2a3d6033	[TableGen][NFC] Format parts of DAGISelMatcher.h/DAGISelMatcherGen.cpp To reduce the diff in #73310	2023-12-12 18:11:38 +08:00
Wang Pengcheng	714417455d	[SelectionDAG] Add OPC_MoveSibling (#73643 ) There are a lot of operations to move current node to parent and then move to another child. So `OPC_MoveSibling` and its space-optimized forms are added to do this "move to sibling" operations. These new operations will be generated when optimizing matcher in `ContractNodes`. Currently `MoveParent+MoveChild` will be optimized to `MoveSibling` and sequences `MoveParent+RecordChild+MoveChild` will be transformed into `MoveSibling+RecordNode`. Overall this reduces the llc binary size with all in-tree targets by about 30K.	2023-12-12 17:48:45 +08:00
Wang Pengcheng	0d5f1cc4d0	[SelectionDAG] Add space-optimized forms of OPC_EmitNode/OPC_MorphNodeTo (#73502 ) If there is only one bit set in EmitNodeInfo, then we can encode it implicitly to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 168K.	2023-12-12 17:45:32 +08:00
Wang Pengcheng	6111f5c592	[SelectionDAG] Add instantiated OPC_CheckChildType (#73297 ) The most common type is i32 or i64 so we add `OPC_CheckChildTypeI32` and `OPC_CheckChildTypeI64` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 70K.	2023-12-12 17:31:12 +08:00
Wang Pengcheng	cbf1d58820	[SelectionDAG] Add space-optimized forms of OPC_EmitCopyToReg (#73293 ) These new opcodes implicitly indicate the RecNo. The old `OPC_EmitCopyToReg2` is renamed to `OPC_EmitCopyToRegTwoByte`. Overall this reduces the llc binary size with all in-tree targets by about 33K (most are from RISCV target).	2023-12-12 17:25:33 +08:00
Wang Pengcheng	50c174f99f	[SelectionDAG] Add space-optimized forms of OPC_EmitConvertToTarget (#73286 ) These new opcodes implicitly indicate the RecNo. Overall this reduces the llc binary size with all in-tree targets by about 13K.	2023-12-12 17:13:43 +08:00
Wang Pengcheng	e052c68869	[SelectionDAG] Add instantiated OPC_CheckType (#73283 ) The most common type is i32 or i64 so we add `OPC_CheckTypeI32` and `OPC_CheckTypeI64` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 29K.	2023-12-12 17:12:08 +08:00
Anatoly Trosinenko	78623b079b	[GISel][TableGen] Fix accidental operand name clashes in patterns (#74492 ) When importing instruction selection patterns into GlobalISel, the operands matched in the "source" DAG are copied into corresponding operands of the "destination" DAG according to their names (such as Rd). If multiple operands in the source DAG share the same name, a GIM_CheckIsSameOperand predicate makes instruction selector check the corresponding operands for equality (at compiler run-time) as part of matching the source pattern. The Def operands of the root node of the destination DAG are handled specially. The operands of the instruction corresponding to the root node are taken and GIM_CheckRegBankForClass predicates are tablegen-erated accordingly. If by coincidence the Def operand in question has the same name as one of the named operands in the pattern, a GIM_CheckIsSameOperand predicate is automatically added that is likely to prevent matching the source of otherwise applicable selection pattern at compiler run-time. This patch mangles the Def operand names taken from the instruction corresponding to the root of the destination DAG (for example, "Rd" becomes "DstI[Rd]") preventing unexpected name clashes with pattern's named operands. The patch consists of three sets of changes: * changes to the GlobalISelEmitter.cpp file are the actual fix * a test case is added to GlobalISelEmitter.td file as a regression test * everything else is the biggest and least interesting part - updates to the existing test cases: renames of the form Rd -> DstI[Rd] inside the inline comments in tablegen-erated code	2023-12-10 13:25:11 +03:00
Kazu Hirata	dc1f208346	[TableGen] Remove unnecessary includes (NFC) Identified with clangd.	2023-12-07 21:03:55 -08:00
Shengchen Kan	f17e766972	[X86][NFC] Clang-format X86DisassemblerTables.cpp for #74713	2023-12-07 20:59:21 +08:00
Pierre van Houtryve	54b6bc42aa	[TableGen][GlobalISel] Emit Comment with MatchTable Size (#74701 )	2023-12-07 09:41:37 +01:00

1 2 3 4 5 ...

5649 Commits