This is an implementation of the saturating FP-to-int conversions for
GlobalISel. On AArch64 the conversion instructions already work this
way, producing saturating results. LegalizerHelper::lowerFPTOINT_SAT is
ported from SDAG.
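For reference, the semantics being lowered look roughly like this for
the f64 -> i32 case (a minimal behavioral sketch, not the actual
LegalizerHelper code):
```
#include <cmath>
#include <cstdint>

// fptosi.sat behavior: NaN maps to 0, out-of-range values clamp to the
// integer type's limits, in-range values truncate as usual.
int32_t fptosi_sat_i32(double X) {
  if (std::isnan(X))
    return 0;
  if (X <= static_cast<double>(INT32_MIN))
    return INT32_MIN;
  if (X >= static_cast<double>(INT32_MAX))
    return INT32_MAX;
  return static_cast<int32_t>(X);
}
```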
AArch64 has a lot of existing tests for fptosi_sat, covering a wide
range of types. I have tried to make most of them work all at once, but
a few fall back due to other missing features such as f128 handling for
min/max.
Add overloads of PrintError and friends that accept a print function.
This avoids eagerly constructing potentially long strings just to pass
them into these printing routines.
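A minimal sketch of the shape of such an overload (the exact signature
in the patch may differ, and `Rec`/`LongDetails` below are
illustrative):
```
// The message is produced by a callback that writes straight to the
// output stream, so nothing is materialized unless it is printed.
void PrintError(function_ref<void(raw_ostream &OS)> PrintMsg);

// Usage: the potentially long message is streamed, not built as a string.
PrintError([&](raw_ostream &OS) {
  OS << "invalid definition '" << Rec->getName() << "': " << LongDetails;
});
```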
Fail if we see an intrinsic that returns more than the supported number
of return values.
Intrinsics can return only up to a certain number of values, as defined
by the `IIT_RetNumbers` list in `Intrinsics.td`. Currently, if we define
an intrinsic that exceeds this limit, llvm-tblgen crashes. Instead, read
the limit and emit a proper error message when it is exceeded.
The InitUndef pass currently uses target-specific pseudo instructions,
with one pseudo per register class.
Instead, add a generic pseudo instruction, which can be used by all
targets and register classes.
Eliminate unused `isTarget` field in Intrinsic record.
Eliminate `isOverloaded`, `Types` and `TypeSig` fields from the record,
as they are already available through the `TypeInfo` field. Change
intrinsic emitter code to look for this info using fields of the
`TypeInfo` record attached to the `Intrinsic` record.
Fix several intrinsic related unit tests to source the `Intrinsic` class
def from Intrinsics.td as opposed to defining a skeleton in the test.
This eliminates some duplication of information in the Intrinsic class
and reduces the memory allocated for record fields, yielding a ~2%
reduction (though that is not the main goal).
This patch is part of a set of patches that add an `-fextend-lifetimes`
flag to clang, which extends the lifetimes of local variables and
parameters for improved debuggability. In addition to that flag, the
patch series adds a pragma to selectively disable `-fextend-lifetimes`,
and an `-fextend-this-ptr` flag which functions as `-fextend-lifetimes`
for `this` pointers only. All changes and tests in these patches were
written by Wolfgang Pieb (@wolfy1961), while Stephen Tozer (@SLTozer)
has handled review and merging. The extend-lifetimes flag is intended to
eventually be enabled by `-Og`, as discussed in the RFC here:
https://discourse.llvm.org/t/rfc-redefine-og-o1-and-add-a-new-level-of-og/72850
This patch implements a new intrinsic instruction in LLVM,
`llvm.fake.use` in IR and `FAKE_USE` in MIR, that takes a single operand
and has no effect other than "using" its operand, to ensure that its
operand remains live until after the fake use. This patch does not emit
fake uses anywhere; the next patch in this sequence causes them to be
emitted from the clang frontend, such that for each variable (or this) a
fake.use operand is inserted at the end of that variable's scope, using
that variable's value. This patch covers everything post-frontend, which
is largely just the basic plumbing for a new intrinsic/instruction,
along with a few steps to preserve the fake uses through optimizations
(such as moving them ahead of a tail call or translating them through
SROA).
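As a rough illustration of the plumbing (not the frontend code from the
follow-up patch; the intrinsic ID and the overload handling are
assumptions here), emitting a fake use through IRBuilder could look
like:
```
// Sketch: emit a call to llvm.fake.use on V so V stays live up to this
// program point; the call otherwise has no effect.
void emitFakeUse(llvm::IRBuilder<> &Builder, llvm::Module &M,
                 llvm::Value *V) {
  llvm::Function *FakeUse = llvm::Intrinsic::getDeclaration(
      &M, llvm::Intrinsic::fake_use, {V->getType()});
  Builder.CreateCall(FakeUse, {V});
}
```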
Co-authored-by: Stephen Tozer <stephen.tozer@sony.com>
- Use formatv() and raw string literals to simplify emission code.
- Use range-based for loops and structured bindings to simplify loops.
- Use `const` pointers to Records.
- Rename `ComputeFixedEncoding` to `ComputeTypeSignature` to reflect
  what the function actually does, and change it to return a vector.
- Use reverse() and a range-based for loop to pack 8 nibbles into 32 bits.
- Rename some variables to follow LLVM coding standards.
- For function memory effects, print the human-readable effects in a comment.
- Detect invalid macro names specified on the command line and fail if
  one is found.
- Specifically, `-DXYZ=1` for example will now fail instead of being
  silently accepted.
Add a dummy resolver to resolve references outside records. This invokes
Fold() with isFinal to force resolution.
Fixes #102447
Co-authored-by: Akshat Oke <Akshat.Oke@amd.com>
- Extract code for sorting and checking duplicate Records into a helper
function and update `collectProcModels` to use the helper.
- Update `FeatureKeyValues` to:
(a) Remove code for duplicate checks and use the helper.
(b) Trim features with empty names explicitly, to be able to use
the helper.
- Make the sorting deterministic by using record name as a secondary
key for sorting, and re-enable SubtargetFeatureUniqueNames.td test
that was disabled due to the non-determinism of the error messages.
- Change wording of error message when duplicate records are found to
be source code position agnostic, since `First` may not be before
`Second` lexically.
- This test was reported as failing in ASAN builds, with the expected
  error message and note appearing in a non-deterministic order.
- This is due to llvm::sort() being unstable.
- Temporarily disable the test until this is fixed.
- Keep track of the last definition of each feature in a `DenseMap` and
  use it to report a better error message when a duplicate feature is
  found (see the sketch after this list).
- Use a StringMap instead of a std::map in `EmitStageAndOperandCycleData`.
- Add a unit test to check that duplicate names are flagged.
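A minimal sketch of the duplicate-tracking idea from the first bullet
(illustrative, not the actual emitter code):
```
// Remember which record last defined each feature name; on a repeat we
// can point at both the duplicate and the previous definition.
llvm::DenseMap<llvm::StringRef, const llvm::Record *> PreviousDef;
for (const llvm::Record *Feature : Features) {
  llvm::StringRef Name = Feature->getValueAsString("Name");
  auto [It, Inserted] = PreviousDef.try_emplace(Name, Feature);
  if (!Inserted) {
    PrintError(Feature, "duplicate feature name '" + Name.str() + "'");
    PrintNote(It->second->getLoc(), "previous definition is here");
  }
}
```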
We don't really care about the precise label values for specific tests
like this, so just match any number.
The comments are enough to say where we jump in every case, and we have
plenty of tests that check basic match table features (on top of their
being exercised by the fact that GlobalISel CodeGen isn't broken).
RFC:
https://discourse.llvm.org/t/rfc-extend-machine-value-type-from-uint8-t-to-uint16-t/80274
compile-time-tracker:
https://llvm-compile-time-tracker.com/compare.php?from=4b9fab591916eec9fd1942f37afe3b137b564089&to=177d28247efe5a4d59a8d8150b4daf01e4f57d74&stat=wall-time
Currently 208 out of 256 MVTs are used and we will run out soon, so
ultimately we need to extend the original `MVT::SimpleValueType` from
`uint8_t` to `uint16_t` to accommodate more types.
The `MatcherTable` uses `unsigned char` to encode the matcher code, so
the extended MVTs no longer fit into a single table entry; we therefore
encode them with VBR, as we do for other values wider than 8 bits.
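For reference, VBR stores 7 payload bits per byte and uses the high bit
to mark continuation, so values below 128 still occupy a single byte; a
sketch modeled on (but not copied from) the matcher emitter:
```
#include <cstdint>
#include <vector>

// Emit Val in variable-width form: 7 bits per byte, high bit set on
// every byte except the last.
void EmitVBRValue(uint64_t Val, std::vector<uint8_t> &Out) {
  while (Val >= 128) {
    Out.push_back(static_cast<uint8_t>(Val & 127) | 128); // more follows
    Val >>= 7;
  }
  Out.push_back(static_cast<uint8_t>(Val)); // final byte, high bit clear
}
```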
The statistics below show the change in the "Total Array size" of the
matcher table reported in each file:
```
Table                      Before   After    Change(%)
WebAssemblyGenDAGISel.inc  23576    23775    0.844
NVPTXGenDAGISel.inc        173498   173498   0
RISCVGenDAGISel.inc        2179121  2369929  8.756
AVRGenDAGISel.inc          2754     2754     0
PPCGenDAGISel.inc          163315   163617   0.185
MipsGenDAGISel.inc         47280    47447    0.353
SystemZGenDAGISel.inc      56243    56461    0.388
AArch64GenDAGISel.inc      467893   487830   4.261
MSP430GenDAGISel.inc       8069     8069     0
LoongArchGenDAGISel.inc    78928    79131    0.257
XCoreGenDAGISel.inc        3432     3432     0
BPFGenDAGISel.inc          3733     3733     0
VEGenDAGISel.inc           65174    66456    1.967
LanaiGenDAGISel.inc        2067     2067     0
X86GenDAGISel.inc          628787   636987   1.304
ARMGenDAGISel.inc          170968   171036   0.040
HexagonGenDAGISel.inc      155764   155764   0
SparcGenDAGISel.inc        5762     5798     0.625
AMDGPUGenDAGISel.inc       504356   504463   0.021
R600GenDAGISel.inc         29785    29785    0
```
The statistics below show the peak runtime memory usage when compiling a
simple C program:
`/bin/time -v clang -target $TARGET -O3 -c test.c`
```
int test(int a) {
return a * 3;
}
```
```
Target       Before(kbytes)  After(kbytes)  Change(%)
wasm64       110172          110088         -0.076
nvptx64      109784          109980         0.179
riscv64      114020          113656         -0.319
avr          110352          110068         -0.257
ppc64        112612          112476         -0.120
mips64       113588          113668         0.070
systemz      110860          110760         -0.090
aarch64      113704          113432         -0.239
msp430       110284          110200         -0.076
loongarch64  111052          110756         -0.267
xcore        108340          108020         -0.295
bpf          110620          110708         0.080
ve           110960          110920         -0.036
lanai        110180          109960         -0.200
x86_64       113640          113304         -0.296
arm64        113540          113172         -0.324
hexagon      114620          114684         0.056
sparc        110412          110136         -0.250
amdgcn       118164          117144         -0.863
r600         111200          110508         -0.622
```
Generating the mapping from a register class to a register bank is
complex:
- there can be lots of register classes
- the mapping may be ambiguous
- a register class can span several register banks (e.g. a register
  class containing all registers)
- the type information is not enough to decide which register bank to
  map to (e.g. a register class containing floating-point and vector
  registers, where every register can hold an f64 value)
The approach taken here is to encode the register banks in an array
indexed by the ID of the register class. To save space, the entries are
packed into chunks of size 2^n.
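A minimal sketch of the packed lookup under these assumptions (2 bits
per entry and 32-bit chunks are illustrative; the generated table picks
sizes to fit the target):
```
#include <cstdint>

// Each register class ID indexes a small bank code; several codes are
// packed per 32-bit chunk of the table.
constexpr unsigned BitsPerEntry = 2; // assumes at most 4 distinct codes
constexpr unsigned EntriesPerChunk = 32 / BitsPerEntry;

unsigned getRegBankCode(const uint32_t *Table, unsigned RCId) {
  uint32_t Chunk = Table[RCId / EntriesPerChunk];
  unsigned Shift = (RCId % EntriesPerChunk) * BitsPerEntry;
  return (Chunk >> Shift) & ((1u << BitsPerEntry) - 1);
}
```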
Based on https://github.com/riscv-non-isa/riscv-c-api-doc/pull/74.
This patch defines the groupid/bitmask in RISCVFeatures.td and generates
the corresponding table in RISCVTargetParserDef.inc.
The groupid/bitmask of extensions provides an abstraction layer between
the compiler and runtime functions.
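A sketch of what a table entry and its runtime test could look like
(struct and field names are assumptions, not the actual generated
RISCVTargetParserDef.inc contents):
```
#include <cstdint>

// An extension is identified by a group index plus a bit position
// within that group's mask word.
struct ExtensionBitmask {
  const char *Name;
  unsigned GroupID; // which mask word
  unsigned BitPos;  // bit within that word
};

// Runtime side: test the extension's bit in the matching group word.
bool hasExtension(const uint64_t *GroupWords, const ExtensionBitmask &E) {
  return (GroupWords[E.GroupID] >> E.BitPos) & 1;
}
```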
In the RISC-V architecture, multiple vendor-specific Control and Status
Registers (CSRs) share the same encoding. However, the existing lookup
function, which currently returns only a single result, falls short.
During disassembly, it consistently returns the first CSR encountered,
which may not be the correct CSR for the subtarget.
To address this issue, we modify the function definition to return a
range of results. These results can then be iterated upon to identify
the CSR that best fits the subtarget’s feature requirements. The
behavior of this new definition is controlled by a variable named
`ReturnRange`, which defaults to `false`.
Specifically, this patch introduces support for emitting a new lookup
function for the primary key. This function returns a pair of iterators
pointing to the first and last values, providing a comprehensive range
of values that satisfy the query.
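Callers can then filter that range by subtarget features, along these
lines (a sketch; the generated names may differ from
`lookupSysRegByEncoding` and `haveRequiredFeatures`):
```
// Pick the first CSR in the returned range whose feature requirements
// are satisfied by this subtarget.
auto Range = lookupSysRegByEncoding(Encoding); // pair of iterators
for (auto It = Range.first; It != Range.second; ++It)
  if (It->haveRequiredFeatures(STI.getFeatureBits()))
    return &*It;
return nullptr; // no CSR fits this subtarget
```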
Some organizations have added operators downstream, and the 3 tests in
this PR tend to fail with off-by-n errors (with n being the number of
added operators) periodically.
To alleviate this, the proposed change takes advantage of the fact that
the 2nd and 3rd elements in a switch table represent the lower and
upper bounds of the operator id, and that the offset of the first byte
of the encoded operations is a function of these two bounds.
Additionally, the default label offset is now captured in a FileCheck
variable.
This re-applies #94241 after fixing a buildbot failure; see
https://lab.llvm.org/buildbot/#/builders/51/builds/570
According to the standard, `constexpr` variables and `const` variables
initialized with constant expressions can be used in lambdas without
capturing them - see https://en.cppreference.com/w/cpp/language/lambda.
However, MSVC used on buildkite seems to ignore that rule and does not
allow using such uncaptured variables in lambdas: we have "error C3493:
'Mask16' cannot be implicitly captured because no default capture mode
has been specified" - see
https://buildkite.com/llvm-project/github-pull-requests/builds/73238
Explicitly capturing such a variable, however, makes buildbot fail with
"error: lambda capture 'Mask16' is not required to be captured for this
use [-Werror,-Wunused-lambda-capture]" - see
https://lab.llvm.org/buildbot/#/builders/51/builds/570.
Fix both cases by using the `0xffff` value directly instead of giving
it a name.
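To illustrate the conflict (a reconstruction, not the code from the
patch):
```
#include <cstdint>

constexpr uint16_t Mask16 = 0xffff;
// MSVC on buildkite: error C3493, insists on an explicit capture.
auto A = [](unsigned V) { return V & Mask16; };
// Clang on the buildbot: -Wunused-lambda-capture if we do capture it.
auto B = [Mask16](unsigned V) { return V & Mask16; };
// The fix: use the literal, so there is nothing to capture.
auto C = [](unsigned V) { return V & 0xffff; };
```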
Original PR description below.
Depends on #94240.
Define the following pseudos for lowering ptrauth constants in code:
- non-`extern_weak`:
- no GOT load needed: `MOVaddrPAC` - similar to `MOVaddr`, with added
PAC;
- GOT load needed: `LOADgotPAC` - similar to `LOADgot`, with added PAC;
- `extern_weak`: `LOADauthptrstatic` - similar to `LOADgot`, but use a
special stub slot named `sym$auth_ptr$key$disc` filled by dynamic linker
during relocation resolving instead of a GOT slot.
---------
Co-authored-by: Ahmed Bougacha <ahmed@bougacha.org>
Depends on #94240.
Define the following pseudos for lowering ptrauth constants in code:
- non-`extern_weak`:
- no GOT load needed: `MOVaddrPAC` - similar to `MOVaddr`, with added
PAC;
- GOT load needed: `LOADgotPAC` - similar to `LOADgot`, with added PAC;
- `extern_weak`: `LOADauthptrstatic` - similar to `LOADgot`, but use a
special stub slot named `sym$auth_ptr$key$disc` filled by dynamic linker
during relocation resolving instead of a GOT slot.
---------
Co-authored-by: Ahmed Bougacha <ahmed@bougacha.org>
Currently TableGen does not directly detect duplicate synthesized
registers as can happen in this example:
```
def GPR128 : RegisterTuples<[sub0, sub1, sub2, sub3],
                            [(decimate (shl GPR32, 0), 1),
                             (decimate (shl GPR32, 1), 1),
                             (decimate (shl GPR32, 2), 1),
                             (decimate (shl GPR32, 3), 1)]>;

def GPR128_Aligned : RegisterTuples<[sub0, sub1, sub2, sub3],
                                    [(decimate (shl GPR32, 0), 4),
                                     (decimate (shl GPR32, 1), 4),
                                     (decimate (shl GPR32, 2), 4),
                                     (decimate (shl GPR32, 3), 4)]>;
```
TableGen does fail, but with an unrelated and difficult-to-understand
error that happens downstream of tuple expansion:
"error: No SubRegIndex for R0_R1_R2_R3 in R0_R1_R2_R3".
This patch detects the problem directly during expansion and emits an
error pointing the user to the actual issue:
"error: Register tuple redefines register 'R0_R1_R2_R3'".
As far as I can tell, this pull request was not approved, and
did not go through an RFC on discourse.
This reverts commit 89881480030f48f83af668175b70a9798edca2fb.
This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da.
Currently, the behavior of llvm.minnum differs across platforms when
one operand is sNaN. When we compare sNaN vs NUM:
- ARM/AArch64/PowerPC: follow IEEE754-2008's minNum and return qNaN.
- RISC-V/Hexagon: follow IEEE754-2019's minimumNumber and return NUM.
- X86: returns NUM, but does not fully match IEEE754-2019's
  minimumNumber, as +0.0 is not always treated as greater than -0.0.
- MIPS/LoongArch/Generic: return NUM.
- LIBCALL: returns qNaN.
So, let's introduce llvm.minimumnum/llvm.maximumnum, which always follow
IEEE754-2019's minimumNumber/maximumNumber.
Half-fix: #93033
The categories are primarily meant for OpenMP, where the spec assigns a
category to each directive. It's one of declarative, executable,
informational, meta, subsidiary, and utility.
These will be used in clang to avoid listing directives belonging to
certain categories by hand.
---------
Co-authored-by: Valentin Clement (バレンタイン クレメン) <clementval@gmail.com>
This predicate tells GlobalISelEmitter and DAGISelEmitter to check that
the instruction to emit has only one use of its result. This can be used
on a PatFrag instead of defining custom predicates for both emitters
for each record that requires it.
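In GlobalISel terms, the emitted check boils down to something like
this (illustrative, not the actual generated matcher code):
```
// Reject the match if the pattern's result has more than one
// (non-debug) user.
if (!MRI.hasOneNonDBGUse(MI.getOperand(0).getReg()))
  return false;
```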
Combiners that use C++ code in their "apply" pattern use only that
code; they never mix it with MIR patterns, as that has little added
value.
This patch restricts C++ apply code so that if C++ is used, we cannot
use MIR patterns or builtins with it. Adding this restriction allows us
to merge the calls to the match and apply C++ code, which in turn means
MatchData variables can simply live on the stack.
So before, we would have
```
GIM_CheckCxxInsnPredicate // match
GIM_CheckCxxInsnPredicate // apply
GIR_Done
```
This came alongside a massive C++ struct holding the MatchData of all
possible rules (which was a big space/perf issue).
Now we just have
```
GIR_DoneWithCustomAction
```
And the function being run just does
```
unsigned SomeMatchData;
if (match(SomeMatchData))
  apply(SomeMatchData);
```
This approach solves multiple issues in one:
- MatchData handling is greatly simplified and more efficient, "don't
pay for what you don't use"
- We reduce the size of the match table
- Calling C++ code has a certain overhead (we need a switch), and this
overhead is only paid once now.
Handling of C++ code inside PatFrags is unchanged, though; that still
emits a `GIM_CheckCxxInsnPredicate`. This is completely fine, as
PatFrags can't use MatchDatas.