llvm-project

Author	SHA1	Message	Date
Phoebe Wang	fbb9d49506	[X86][APX] Support APX + AMX-MOVRS/AMX-TRANSPOSE (#123267 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/784266	2025-01-17 17:51:42 +08:00
Phoebe Wang	1274bca2ad	[X86][APX] Support APX + MOVRS (#123264 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/784266	2025-01-17 16:06:31 +08:00
Evgenii Kudriashov	a242880371	[TableGen][GlobalISel] Reorder atomic predicate to preserve the order (#121806 ) Since there are no opcodes for atomic loads and stores, unlike in SelectionDAG, we add the `CheckMMOIsNonAtomic` predicate immediately after the opcode predicate to make a logical combination of them. Otherwise, when `IPM_AtomicOrderingMMO` is inserted after `IPM_GenericPredicate`, the patterns without predicates get a higher priority as `IPM_AtomicOrderingMMO` has higher priority than `IPM_GenericPredicate`. This is important to preserve the order of aligned/unaligned patterns on X86 because aligned memory operations have an additional alignment predicate and should be checked first according to their placement in the td file. Closes #121446	2025-01-16 17:06:21 +01:00
Craig Topper	726cfc67b6	[RISCV] Don't convert virtual register Register to MCRegister in isCompressibleInst. (#122843 ) Calling MCRegisterClass::contains with a Register does an implicit conversion from Register to MCRegister. I think MCRegister is only intended to be used for physical registers. We should protect this implicit conversion by checking for physical registers first. While I was here I removed some unnecessary parentheses from the output.	2025-01-13 23:36:09 -08:00
Markus Böck	97ea0aba15	[TableGen] Do not exit in template argument check (#121636 ) The signature of `CheckTemplateArgValues` implements error handling via the `bool` return type, yet always returned false. The single possible error case instead used `PrintFatalError`, which exits the program afterward. This behavior is undesirable: it prevents any further errors from being printed and makes TableGen less usable as a library, as it crashes the entire process (e.g. `tblgen-lsp-server`). This PR therefore fixes the issue by using `Error` instead and returning true if an error occurred. All callers already perform proper error handling. As `llvm-tblgen` exits on error, a test was also added to the LSP to ensure it exits normally despite the error.	2025-01-06 21:06:17 +01:00
Evgenii Kudriashov	2bbdce9a42	[GlobalISel] Support physical register inputs in nested patterns (#121239 ) When importing nested patterns, we create an InsnMatcher for each pattern and miss them if we consider only the top-level InsnMatcher. Iterate PhysRegOperands instead. Change the type of PhysRegOperands from DenseMap to SmallMapVector to have stable generation. Also drop the PhysRegInputs member from InsnMatcher as there are no users of it.	2025-01-05 01:10:25 +01:00
Sergei Barannikov	c56b74315f	[TableGen][GISel] Reuse `importNodeRenderer` for `OperandWithDefaultOps` (#121285 ) This avoids some code duplication (handling `Register`, `zero_reg` and immediate operands).	2025-01-05 00:11:24 +03:00
Phoebe Wang	9cd774d1e4	[X86][NFC] Move "_Int" after "k"/"kz" (#121450 ) Address comment at https://github.com/llvm/llvm-project/pull/121373#discussion_r1900402932	2025-01-02 21:02:19 +08:00
Phoebe Wang	23ec9ee17e	[X86][AVX10.2] Lower fminimum/fmaximum to VMINMAX* (#121373 )	2025-01-02 11:30:26 +08:00
Sergei Barannikov	6cbc64ed92	[TableGen][GISel] Fix IMPLICIT_DEF operand being added as a use (#121283 ) `IMPLICIT_DEF` has one operand that is a def, not a use.	2024-12-29 16:29:55 +03:00
Sergei Barannikov	4a92c27f9d	[TableGen][GISel] Remove check for LLT when emitting renderers (#121144 ) Types used in the destination DAG of a pattern should not matter for GlobalISel. All necessary checks are emitted in the form of matchers when traversing the source DAG. In particular, the check prevented importing patterns containing iPTR in the middle of the destination DAG. This reduces the number of skipped patterns on Mips and RISCV: ``` Mips 1270 -> 1212 (-58) RISCV 42165 -> 42088 (-77) ``` Most of these patterns are for atomic operations.	2024-12-26 17:45:29 +03:00
Sergei Barannikov	6f72d28dd9	[TableGen][GISel] Don't copy dead def from a sub-instruction to the root (#121094 ) Sub-instruction can have a def with the same name as a def in a top-level instruction. Previously this could result in both defs copied to the instruction being built.	2024-12-26 08:36:35 +03:00
Sergei Barannikov	c29536b033	[test] Group GlobalISelEmitter tests under a subdirectory (#121093 ) Remove extra command line arguments while here.	2024-12-25 11:32:37 +03:00
Sergei Barannikov	bda7aadfcd	[TableGen][GISel] Fix importing frameindex node (#120921 ) The existing test case is not representative. Even though TableGen doesn't complain, the code generated from it is invalid and fails verification with the message "Use not jointly dominated by defs.". There is no way to magically transform `frameindex` to `tframeindex` as happens for some other leaf nodes. `frameindex` can only be selected by custom C++ code or by using an `SDNodeXForm`. This patch makes the test representative and fixes the handling of `G_FRAME_INDEX`, which shouldn't have set the operand's name. It also fixes the type of the result of `G_FRAME_INDEX` in order to get the correct type check (`GIM_CheckPointerToAny` instead of `GIM_CheckType` with a scalar LLT argument).	2024-12-23 11:04:40 +03:00
Sergei Barannikov	a7cd660bd7	[TableGen][GISel] Learn to import patterns with optional defs (#120470 ) The number of skipped patterns reduces for ARM from 4278 to 4257. This is the only in-tree target that makes use of OptionalDefOperand. Pull Request: https://github.com/llvm/llvm-project/pull/120470	2024-12-21 05:24:57 +03:00
Sergei Barannikov	d3750412aa	[TableGen][GISel] Improve dead register handling (#120426 ) A dead implicit def wasn't marked as dead if it is also an implicit use. The new approach should also be more straightforward and simplifies future changes for supporting optional defs and physical register defs. Pull Request: https://github.com/llvm/llvm-project/pull/120426	2024-12-18 18:58:37 +03:00
Krzysztof Drewniak	b24caf3d2b	[llvm][TableGen] Add a !initialized predicate to allow testing for ? (#117964 ) There are cases (like in an upcoming patch to MLIR's `Property` class) where the ? value is a useful null value. However, existing predicates make it difficult to test whether the value in a record one is operating on is ? or not. This commit adds the !initialized predicate, which is 1 on concrete, non-? values and 0 on ?. --------- Co-authored-by: Akshat Oke <Akshat.Oke@amd.com>	2024-12-17 20:34:35 -06:00
Sergei Barannikov	cf4375d107	[TableGen][GISel] Extract common function for determining MI's regclass (#120135 ) Add some comments that hopefully clarify a few things. This was supposed to be NFC, but there is a difference in the inferred register class for EXTRACT_SUBREG. Pull Request: https://github.com/llvm/llvm-project/pull/120135	2024-12-17 18:03:22 +03:00
Sergei Barannikov	c9070cce09	[TableGen] Allow empty terminator in SequenceToOffsetTable (#119751 ) Some clients do not want to emit a terminator after each sub-sequence (they have other means of determining the length of sub-sequences). This moves `Term` argument from `emit` method to the constructor and makes it optional. It couldn't be made optional while still on the `emit` method because if the terminator wasn't specified, it has to be taken into account in `layout` method as well. The fact that `layout` method was called is now recorded in a dedicated member variable, `IsLaidOut`. `Entries != 0` can no longer be used to reliably check if `layout` method was called because it may be zero for a different reason: the terminator wasn't specified and all added sequences (if any) were empty. This reduces the size of `LaneMaskLists` and `SubRegIdxLists` a bit and resolves the removed TODO.	2024-12-13 19:55:11 +03:00
David Spickett	1db2d571b5	[llvm][TableGen] Fix misleading error for invalid use of let (#118616 ) Fixes #118490 Point to the value name, otherwise it implies that the part after the '=' is the problem. Before: ``` /tmp/test.td:2:27: error: Value 'FlattenedFeatures' unknown! let FlattenedFeatures = []; ^ ``` After: ``` /tmp/test.td:2:7: error: Value 'FlattenedFeatures' unknown! let FlattenedFeatures = []; ^ ```	2024-12-09 13:21:46 +00:00
Thorsten Schütt	148fdc519c	[GlobalISel] Add G_ABDS and G_ABDU instructions (#118122 ) The DAG has the same instructions: the signed and unsigned absolute difference of its inputs. For AArch64, they map to uabd and sabd for Neon and SVE. The Neon and SVE instructions will require custom patterns. They are pseudo opcodes and are not imported by the IRTranslator. We need combines to create them. PowerPC, ARM, and AArch64 have native instructions. /// i.e. trunc(abs(sext(Op0) - sext(Op1))) becomes abds(Op0, Op1) /// or trunc(abs(zext(Op0) - zext(Op1))) becomes abdu(Op0, Op1) For GlobalISel, we are going to write the combines in MIR patterns. See: llvm/test/CodeGen/AArch64/abd-combine.ll - [ ] combine into abd - [ ] legalize and add td patterns	2024-12-04 12:53:15 +01:00
Sam Elliott	73731d6873	[llvm-tblgen] Increase Coverage Index Size (#118329 )	2024-12-04 09:19:13 +00:00
Mason Remy	0c6457b781	[LLVM][TableGen] Refine overloaded intrinsic suffix check (#117957 ) Previously the check comments indicated that [pi][0-9]+ would match as a type suffix, however the check itself was looking for [pi][0-9]* and hence an 'i' suffix in isolation was being considered as a type suffix despite it not having a bitwidth. This change makes the check consistent with the comment and looks for [pi][0-9]+	2024-12-03 13:33:15 -05:00
Simon Pilgrim	29f11f0a32	[X86] Add missing reg/imm attributes to VRNDSCALES instruction names (#117203 ) More canonicalization of the instruction names to make them predictable - more closely matches the equivalent VRNDSCALEP / VROUND instructions	2024-11-22 17:45:30 +00:00
Pengcheng Wang	4da960b898	[RISCV] Add mvendorid/marchid/mimpid to CPU definitions (#116202 ) We can get this information via `sys_riscv_hwprobe`. This can be used to implement `__builtin_cpu_is`.	2024-11-22 22:58:54 +08:00
Mikhail Goncharov	d1dae1e861	Revert "[RISCV] Add mvendorid/marchid/mimpid to CPU definitions (#116202 )" chain This reverts commit b36fcf4f493ad9d30455e178076d91be99f3a7d8. This reverts commit c11b6b1b8af7454b35eef342162dc2cddf54b4de. This reverts commit 775148f2367600f90d28684549865ee9ea2f11be. multiple bot build breakages, e.g. https://lab.llvm.org/buildbot/#/builders/3/builds/8076	2024-11-22 14:09:13 +01:00
Pengcheng Wang	775148f236	[RISCV] Add mvendorid/marchid/mimpid to CPU definitions (#116202 ) We can get this information via `sys_riscv_hwprobe`. This can be used to implement `__builtin_cpu_is`.	2024-11-22 19:54:45 +08:00
Simon Pilgrim	3a5cf6d99b	[X86] Rename AVX512 VEXTRACT/INSERT??x? to VEXTRACT/INSERT??X? (#116826 ) Use uppercase in the subvector description ("32x2" -> "32X2" etc.) - matches what we already do in VBROADCAST??X?, and we try to use uppercase for all x86 instruction mnemonics anyway (and lowercase just for the arg description suffix).	2024-11-20 08:25:01 +00:00
Simon Pilgrim	7dcefb37a4	[X86] Tidy up AVX512 FPCLASS instruction naming (#116661 ) FPCLASS is a unary instruction with an immediate operand - update the naming to match similar instructions (e.g. VPSHUFD) by only using the source reg/mem and immediate in the instruction name	2024-11-19 11:26:46 +00:00
Simon Pilgrim	d4f2b71c3f	[X86] Fix position of immediate argument in AVX512 VPCMP comparisons (#116646 ) The 'i' arg was being put between the 'm' and 'b' args instead of afterwards like other avx512 instructions (VCMPPS/D, VPERMILPS/D etc.).	2024-11-19 10:00:24 +00:00
Min-Yih Hsu	e8b70e9744	[TableGen] Make `!and` and `!or` short-circuit (#113963 ) The idea is that by preemptively simplifying the result of `!and` and `!or`, we can fold some of the conditional operators, like `!if` or `!cond`, as early as possible.	2024-11-07 10:22:03 -08:00
JaydeepChauhan14	dd98ae358b	Test added for x86-instr-mapping (#115170 )	2024-11-07 19:09:21 +08:00
Sander de Smalen	ae0ab24862	[TableGen] Fix calculation of Lanemask for RCs with artificial subregs. (#114392 ) TableGen builds up a map of "SubRegIdx -> Subclass" where Subclass is the largest class where all registers have SubRegIdx as a sub-register. When SubRegIdx (vis-a-vis the sub-register) is artificial it should still include it in the map. This map is used in various places, including in the calculation of the Lanemask of a register class, which otherwise calculates an incorrect lanemask.	2024-11-04 16:10:50 +00:00
Sander de Smalen	9a211fe7e4	[TableGen] Fix concatenation of subreg and artificial subregs (#114391 ) When CoveredBySubRegs is true and a sub-register consists of two parts; a regular subreg and an artificial subreg, then TableGen should consider only concatenating the non-artificial subregs. For example, S0_S1 is a concatenated subreg from D0_D1, but S0_S1_HI should not be considered.	2024-11-04 15:51:19 +00:00
Mahesh-Attarde	e61a7dc256	[X86][AVX512] Use comx for compare (#113567 ) We added the AVX10.2 COMEF ISA to LLVM, but it does not optimize correctly in the scenario below. Summary Input ``` define i1 @oeq(float %x, float %y) { %1 = fcmp oeq float %x, %y ret i1 %1 } define i1 @une(float %x, float %y) { %1 = fcmp une float %x, %y ret i1 %1 } define i1 @ogt(float %x, float %y) { %1 = fcmp ogt float %x, %y ret i1 %1 } // Prior to AVX10.2, default code generation oeq: # @oeq cmpeqss xmm0, xmm1 movd eax, xmm0 and eax, 1 ret une: # @une cmpneqss xmm0, xmm1 movd eax, xmm0 and eax, 1 ret ogt: # @ogt ucomiss xmm0, xmm1 seta al ret ``` This patch removes `cmpeqss` and `cmpneqss`. For the complete transform, check the unit test. This continues what PR https://github.com/llvm/llvm-project/pull/113098 added. Earlier, legalization and combine expanded the `setcc oeq:ch` node into `and`, `setcc eq`, and `setcc o`. Following suggestions from the community, the new internal transform is: ``` Optimized type-legalized selection DAG: %bb.0 'hoeq:' SelectionDAG has 11 nodes: t0: ch,glue = EntryToken t2: f16,ch = CopyFromReg t0, Register:f16 %0 t4: f16,ch = CopyFromReg t0, Register:f16 %1 t14: i8 = setcc t2, t4, setoeq:ch t10: ch,glue = CopyToReg t0, Register:i8 $al, t14 t11: ch = X86ISD::RET_GLUE t10, TargetConstant:i32<0>, Register:i8 $al, t10:1 Optimized legalized selection DAG: %bb.0 'hoeq:' SelectionDAG has 12 nodes: t0: ch,glue = EntryToken t2: f16,ch = CopyFromReg t0, Register:f16 %0 t4: f16,ch = CopyFromReg t0, Register:f16 %1 t15: i32 = X86ISD::UCOMX t2, t4 t17: i8 = X86ISD::SETCC TargetConstant:i8<4>, t15 t10: ch,glue = CopyToReg t0, Register:i8 $al, t17 t11: ch = X86ISD::RET_GLUE t10, TargetConstant:i32<0>, Register:i8 $al, t10:1 ``` The earlier transform is mentioned here https://github.com/llvm/llvm-project/pull/113098#discussion_r1810307663 --------- Co-authored-by: mattarde <mattarde@intel.com>	2024-10-30 16:17:25 +08:00
Rahul Joshi	a18af41c20	[LLVM] Change error messages to start with lower case (#113748 ) Change LLVM Asm and TableGen Lexer/Parser error messages to begin with lower case.	2024-10-29 12:26:33 -07:00
Freddy Ye	5aa1275d03	[X86] Support SM4 EVEX version intrinsics/instructions. (#113402 ) Ref.: https://cdrdv2.intel.com/v1/dl/getContent/671368	2024-10-28 10:46:16 +08:00
Abhina Sree	9d88543301	[AIX] Use internal lit shell for TableGen instead of a global setting (#113627 ) This is to address the latest lit regressions https://lab.llvm.org/buildbot/#/builders/64/builds/1285 caused by using the internal lit shell. This change will limit using the internal lit shell to TableGen on AIX so we do not hit these regressions.	2024-10-25 13:06:02 -04:00
Daniel Paoliello	7eb8238a32	[TableGen] Handle Windows line endings in x86-fold-tables.td test (#112997 ) The x86-fold-tables.td test has been failing for me and [in CI](https://buildkite.com/llvm-project/github-pull-requests/builds/111277#0192a122-c5c9-4e4e-bc5b-7532fec99ae4) if Git happens to decide to check out the baseline file with Windows line endings. The fix for this is to add the `--strip-trailing-cr` option to diff to normalize the line endings before comparing them.	2024-10-21 09:58:59 -07:00
JL2210	8f6d4913bb	[llvm][TableGen] Count implicit defs as well as explicit ones in the GlobalISel TableGen emitter (#112673 ) `NumDefs` only counts the number of registers in `(outs)`, not any implicit defs specified with `Defs = [...]`. This causes patterns with physical register defs to fail to import here instead of later, where implicit defs are rendered. Add `ImplicitDefs.size()` to count both, and create `DstExpDefs` to count only explicit defs, used later on.	2024-10-18 10:50:44 +01:00
Rahul Joshi	2a0073f6b5	[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts (#110324 ) Check name conflicts between intrinsics caused by mangling suffix. If the base name of an overloaded intrinsic is a proper prefix of another intrinsic, check if the other intrinsic name suffix after the proper prefix can match a mangled type and issue an error if it can.	2024-10-15 08:15:57 -07:00
Ying Yi	e06e493252	Make a tablegen test match-table.td more robust. Some organizations have added operators downstream, and the test match-table.td tends to fail periodically with off-by-n errors (with n being the number of added operators). This patch makes the test more robust and reduces the impact on the merge process.	2024-10-08 13:16:06 +01:00
Rahul Joshi	e31e6f259e	[TableGen] Print assert message inline with assert failure (#111184 ) Print assert message after the "assertion failed" message instead of printing it as a separate note. This makes the assert failure reporting less verbose and also more useful to see the failure message inline with the "assertion failed" message.	2024-10-04 13:21:50 -07:00
Rahul Joshi	04540fac5b	[TableGen] Print record location when record asserts fail (#111029 ) When record assertions fail, print an error message with the record's location, so it's easier to see where the record that caused the assert to fail was instantiated. This is useful when the assert condition in a class depends on a template parameter, so we need to know the context of the definition to determine why the assert failed. Also enhanced the assert.td test to check for these context messages, and also add checks for some assert failures that were missing in the test.	2024-10-04 05:45:45 -07:00
Rahul Joshi	876f661dbe	[LIT] Rename substitution `%basename_s` to `%{s:basename}` (#111062 ) Also added `%{t:stem}` as an alias for `%basename_t` and modified unit test to test these new substitutions.	2024-10-03 18:18:10 -07:00
Rahul Joshi	6f20c3099e	[LIT] Add support for `%basename_s` to get base name of source file (#110993 ) Add support for `%basename_s` pattern in the RUN commands to get the base name of the source file, and adopt it in a TableGen LIT test.	2024-10-03 12:29:11 -07:00
Rahul Joshi	241f93658a	[TableGen] Fix source location for anonymous records (#110935 ) Fix source location for anonymous records to be the one of the locations where that record is instantiated as opposed to the location of the class that was anonymously instantiated. Currently, when a record is anonymously instantiated (via `VarDefInit::instantiate`), we use the location of the class for the record, which is not correct. Instead, pass in the `SMLoc` for the location where the anonymous instantiation happens and use that location when the record is instantiated. If there are multiple anonymous instantiations with the same parameters, the location for the (single) record created will be one of these instantiation locations as opposed to the class location.	2024-10-03 06:16:56 -07:00
Rahul Joshi	0de0354aa8	[LLVM][TableGen] Decrease code size of `Intrinsic::getAttributes` (#110573 ) Decrease code size of the `Intrinsic::getAttributes` function by uniquing the function and argument attributes separately and using the `IntrinsicsToAttributesMap` to store the argument attribute ID in the low 8 bits and the function attribute ID in the upper 8 bits. This reduces the number of cases to handle in the generated switch from 368 to 131, which is a ~2.8x reduction in the number of switch cases. Also eliminate the fixed size array `AS` and the `NumAttrs` variable, and instead call `AttributeList::get` directly from each case, with an inline array of the <index, AttributeSet> pairs.	2024-10-01 09:08:47 -07:00
Stephen Chou	ec61311e77	[LLVM][TableGen] Support type casts of nodes with multiple results (#109728 ) Currently, type casts can only be used to pattern match for intrinsics with a single overloaded return value. For instance: ``` def int_foo : Intrinsic<[llvm_anyint_ty], []>; def : Pat<(i32 (int_foo)), ...>; ``` This patch extends type casts to support matching intrinsics with multiple overloaded return values. As an example, the following defines a pattern that matches only if the overloaded intrinsic call returns an `i16` for the first result and an `i32` for the second result: ``` def int_bar : Intrinsic<[llvm_anyint_ty, llvm_anyint_ty], []>; def : Pat<([i16, i32] (int_bar)), ...>; ```	2024-10-01 11:17:00 +04:00
Rahul Joshi	2f43e65955	[LLVM][TableGen] Check name conflicts between target dep and independent intrinsics (#109826 ) Validate that for target independent intrinsics the second dotted component of their name (after the `llvm.`) does not match any existing target names (for which at least one intrinsic has been defined). Doing so is invalid as LLVM will search for that intrinsic in that target's intrinsic table and not find it, and conclude that it's an unknown intrinsic.	2024-09-25 12:01:17 -07:00
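
A note on the suffix check refined in 0c6457b781 (#117957): the difference between `[pi][0-9]*` and `[pi][0-9]+` can be seen in a small sketch. This is not the TableGen implementation, which is C++; the helper name `looks_like_type_suffix` is hypothetical, and it assumes the pattern is tested against one whole dotted suffix component.

```python
import re

# Hypothetical illustration only; the real check lives in TableGen's C++ sources.
OLD_SUFFIX = re.compile(r"[pi][0-9]*")  # before the fix: digits were optional
NEW_SUFFIX = re.compile(r"[pi][0-9]+")  # after the fix: at least one digit required


def looks_like_type_suffix(component: str, pattern: re.Pattern) -> bool:
    """Return True if the whole component matches the given suffix pattern."""
    return pattern.fullmatch(component) is not None


for component in ("i", "i32", "p0", "foo"):
    print(component,
          looks_like_type_suffix(component, OLD_SUFFIX),
          looks_like_type_suffix(component, NEW_SUFFIX))
```

Under these assumptions a bare `i` is accepted by the old pattern and rejected by the new one, while `i32` and `p0` are accepted by both.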

831 Commits