llvm-project

Author	SHA1	Message	Date
Craig Topper	5cf0fb4317	[StackSlotColoring] Ignore non-spill objects in RemoveDeadStores. (#80242 ) The stack slot coloring pass is concerned with optimizing spill slots. If any change is a pass is made over the function to remove stack stores that use the same register and stack slot as an immediately preceding load. The register check is too simple for constant registers like AArch64 and RISC-V's zero register. This register can be used as the result of a load if we want to discard the result, but still have the memory access performed. Like for a volatile or atomic load. If the code sees a load from the zero register followed by a store of the zero register at the same stack slot, the pass mistakenly believes the store isn't needed. Since the main stack coloring optimization is only concerned with spill slots, it seems reasonable that RemoveDeadStores should only be concerned with spills. Since we never generate a reload of x0, this avoids the issue seen by RISC-V. Test case concept is adapted from pr30821.mir from X86. That test had to be updated to mark the stack slot as a spill slot. Fixes #80052.	2024-02-01 13:25:15 -08:00
Anatoly Trosinenko	08fccf8094	[AArch64][PAC] Expand blend(reg, imm) operation in aarch64-pauth pass (#74729 ) In preparation for implementing code generation for more @llvm.ptrauth.* intrinsics, move the expansion of blend(register, small integer) variant of @llvm.ptrauth.blend to the AArch64PointerAuth pass, where most other PAuth-related code generation takes place.	2024-02-01 13:02:39 -08:00
Jiahan Xie	10c2d5ff7c	[RISCV][GISel] RegBank select and instruction select for vector G_ADD, G_SUB (#74114 ) RegisterBank Selection for scalable vector G_ADD and G_SUB by creating new mappings for different types of vector register banks. Then implement Instruction Selection for the same operations by choosing the correct RISC-V vector register class.	2024-02-01 15:06:43 -05:00
Brendan Sweeney	e296cedcd6	[RISCV][MC] MC layer support for the experimental zalasr extension (#79911 ) This PR implements experimental support for the RISC-V Atomic Load-Acquire and Store-Release Extension (Zalasr). It has been approved to be pursued as a fast track extension (https://lists.riscv.org/g/tech-unprivileged/topic/arc_architecture_review/101951698), but has not yet been approved by ARC or ratified. See https://github.com/mehnadnerd/riscv-zalasr for draft spec. --------- Co-authored-by: brs <turtwig@utexas.edu> Co-authored-by: Philip Reames <preames@rivosinc.com>	2024-02-01 10:58:21 -08:00
Fangrui Song	10a55caccf	[RISCV] Support constraint "s" (#80201 ) GCC has supported a generic constraint "s" for a long time (since at least 1992), which references a symbol or label with an optional constant offset. "i" is a superset that also supports a constant integer. GCC's RISC-V port also supports a machine-specific constraint "S", which cannot be used with a preemptible symbol. (We don't bother to check preemptibility.) In PIC code, an external symbol is preemptible by default, making "S" less useful if you want to create an artificial reference for linker garbage collection, or define sections to hold symbol addresses: ``` void fun(); // error: impossible constraint in ‘asm’ for riscv64-linux-gnu-gcc -fpie/-fpic void foo() { asm(".reloc ., BFD_RELOC_NONE, %0" :: "S"(fun)); } // good even if -fpie/-fpic void foo() { asm(".reloc ., BFD_RELOC_NONE, %0" :: "s"(fun)); } ``` This patch adds support for "s". Modify https://reviews.llvm.org/D105254 ("S") to handle multi-depth GEPs (https://reviews.llvm.org/D61560).	2024-02-01 10:18:42 -08:00
Yaxun (Sam) Liu	1f3c30911c	[AMDGPU] Mark PC_ADD_REL_OFFSET rematerializable (#79674 ) Currently machine LICM hoist PC_ADD_REL_OFFSET out of loops, causes register pressure when function calls are deep in loops. This is a main cause of sgpr spill for programs containing large number of function calls in loops. This patch marks PC_ADD_REL_OFFSET as rematerializable, which eliminates sgpr spills due to function calls in loops.	2024-02-01 12:21:19 -05:00
Amy Kwan	2a50921553	[AIX][TLS] Optimize the small local-exec access sequence for non-zero offsets (#71485 ) This patch utilizes the -maix-small-local-exec-tls option to produce a faster, non-TOC-based access sequence for the local-exec TLS model. Specifically, for when the offsets from the TLS variable are non-zero. In particular, this patch produces either a single: - addi/la with a displacement off of R13 plus a non-zero offset for when an address is calculated, or - load or store off of R13 plus a non-zero offset for when an address is calculated and used for further access where R13 is the thread pointer, respectively. In order to produce a single addi or load/store off of the thread pointer with a non-zero offset, this patch also adds the necessary support in the assembly printer when printing these instructions. Specifically: - The non-zero offset is added to the TLS variable address when the address of the TLS variable + it's offset is less than 32KB. - Otherwise, when the address of the TLS variable + its offset is greater than 32KB, the non-zero offset (and a multiple of 64KB) is subtracted from the TLS address. This handling in the assembly printer is necessary to ensure that the TLS address + the non-zero offset is between [-32768, 32768), so that the total displacement can fit within the addi/load/store instructions. This patch is meant to be a follow-up to 3f46e5453d9310b15d974e876f6132e3cf50c4b1 (where the optimization occurs for when the offset is zero).	2024-02-01 09:29:21 -05:00
Quentin Dian	112fba974c	[MIRPrinter] Don't print line break when there is no instructions (NFC) (#80147 ) Per #80143, we can remove the extra line break when there is no instruction.	2024-02-01 22:10:52 +08:00
David Green	5d41788f37	[AArch64] Alter latency of FCSEL under Cortex-A510 (#80178 ) As per the Cortex-A510 software optimization guide, the latency of a fcsel should be 3 not 4. It would previously get the latency from WriteF.	2024-02-01 13:42:14 +00:00
Sander de Smalen	d313614b60	[AArch64] Replace LLVM IR function attributes for PSTATE.ZA. (#79166 ) Since https://github.com/ARM-software/acle/pull/276 the ACLE defines attributes to better describe the use of a given SME state. Previously the attributes merely described the possibility of it being 'shared' or 'preserved', whereas the new attributes have more semantics and also describe how the data flows through the program. For ZT0 we already had to add new LLVM IR attributes: * aarch64_new_zt0 * aarch64_in_zt0 * aarch64_out_zt0 * aarch64_inout_zt0 * aarch64_preserves_zt0 We have now done the same for ZA, such that we add: * aarch64_new_za (previously `aarch64_pstate_za_new`) * aarch64_in_za (more specific variation of `aarch64_pstate_za_shared`) * aarch64_out_za (more specific variation of `aarch64_pstate_za_shared`) * aarch64_inout_za (more specific variation of `aarch64_pstate_za_shared`) * aarch64_preserves_za (previously `aarch64_pstate_za_shared, aarch64_pstate_za_preserved`) This explicitly removes 'pstate' from the name, because with SME2 and the new ACLE attributes there is a difference between "sharing ZA" (sharing the ZA matrix register with the caller) and "sharing PSTATE.ZA" (sharing either the ZA or ZT0 register, both part of PSTATE.ZA with the caller).	2024-02-01 13:37:37 +00:00
Joseph Huber	f956e7fbf1	[AMDGPU] Prefer `s_memtime` for `readcyclecounter` on GFX10 (#80211 ) Summary: The old `s_memtime` instruction was supported until the GFX10 architecture. Although this instruction has a higher latency than the new shader counter, it's much more usable as a processor clock as it is a full 64-bit counter. The new shader counter is only a 20-bit counter, which makes it difficult to use as a standard cycle counter as it will overflow in a few milliseconds. This patch suggests preferring `s_memtime` for this instrinsic if it is still available.	2024-02-01 07:19:57 -06:00
Wang Pengcheng	178719e860	[RISCV][NFC] Simplify calls.ll and autogenerate checks for tail-calls.ll Split out from #78417. Reviewers: topperc, asb, kito-cheng Reviewed By: asb Pull Request: https://github.com/llvm/llvm-project/pull/79248	2024-02-01 20:50:20 +08:00
Simon Pilgrim	ea2984287d	[ARM] Add ctpop codegen tests	2024-02-01 11:42:18 +00:00
Craig Topper	cf401f72e1	[RISCV] Use Zacas for AtomicRMWInst::Nand i32 and XLen. (#80119 ) We don't have an AMO instruction for Nand, so with the A extension we use an LR/SC loop. If we have Zacas we can use a CAS loop instead. According to the Zacas spec, a CAS loop scales to highly parallel systems better than LR/SC.	2024-01-31 15:37:41 -08:00
Congcong Cai	5561beae29	[WebAssembly] avoid to enable explicit disabled feature (#80094 )	2024-02-01 07:26:58 +08:00
Philip Reames	ff53d50742	[RISCV] Improve legalization of e8 m8 VL>256 shuffles (#79330 ) If we can't produce a large enough index vector in i8, we may need to legalize the shuffle (via scalarization - which in turn gets lowered into stack usage). This change makes two related changes: * Deferring legalization until we actually need to generate the vrgather instruction. With the new recursive structure, this only happens when doing the fallback for one of the arms. * Check the actual mask values for something outside of the representable range. Both are covered by recently added tests.	2024-01-31 14:41:15 -08:00
Alex MacLean	5e3ae4c4af	[NVPTX] improve Boolean ISel (#80166 ) Add TableGen patterns to convert more instructions to boolean expressions: - mul -> and/or: i1 multiply instructions currently cannot be selected causing the compiler to crash. See https://github.com/llvm/llvm-project/issues/57404 - select -> and/or: Converting selects to and/or can enable more optimizations. `InstCombine` cannot do this as aggressively due to poison semantics.	2024-01-31 14:37:27 -08:00
Usman Nadeem	1d1432356e	[AArch64][SVE2] Generate urshr rounding shift rights (#78374 ) Add a new node `AArch64ISD::URSHR_I_PRED`. `srl(add(X, 1 << (ShiftValue - 1)), ShiftValue)` is transformed to `urshr`, or to `rshrnb` (as before) if the result it truncated. `uzp1(rshrnb(uunpklo(X),C), rshrnb(uunpkhi(X), C))` is converted to `urshr(X, C)` (tested by the wide_trunc tests). Pattern matching code in `canLowerSRLToRoundingShiftForVT` is taken from prior code in rshrnb. It returns true if the add has NUW or if the number of bits used in the return value allow us to not care about the overflow (tested by rshrnb test cases).	2024-01-31 14:03:58 -08:00
Zaara Syeda	a03a6e9964	[AIX] [XCOFF] Add support for common and local common symbols in the TOC (#79530 ) This patch adds support for common and local symbols in the TOC for AIX. Note that we need to update isVirtualSection so as a common symbol in TOC will have the symbol type XTY_CM and will be initialized when placed in the TOC so sections with this type are no longer virtual. --------- Co-authored-by: Zaara Syeda <syzaara@ca.ibm.com>	2024-01-31 16:34:21 -05:00
David Green	d04ae1b15f	[AArch64] Use DAG->isAddLike in add_and_or_is_add (#79563 ) This allows it to work with disjoint or's as well as computing the known bits.	2024-01-31 16:49:23 +00:00
Rin Dobrescu	2907c63311	Revert "[AArch64] Convert concat(uhadd(a,b), uhadd(c,d)) to uhadd(concat(a,c), concat(b,d))" (#80157 ) Reverts llvm/llvm-project#79464 while figuring out why the tests are failing.	2024-01-31 16:45:25 +00:00
David Green	5d7d89de31	[AArch64] Use add_and_or_is_add for CSINC (#79552 ) Adds or add-like-or's of 1 can both be turned into csinc, which can help fold more instructions into a csinc.	2024-01-31 15:48:31 +00:00
Sjoerd Meijer	8841846050	[AArch64] MI Scheduler LDP combine follow up (#79003 ) This is a follow up of 75d820dcdd86, adding more opcodes to the combine target hook enabling more LDP creation. Patch co-authored by Cameron McInally.	2024-01-31 15:41:32 +00:00
Shimin Cui	1bab570e9b	Move the PowerPC/PPCMergeStringPool work to initializer (#77352 ) Currently, the `PPCMergeStringPool` merges the global variable after the `AsmPrinter` initializer adds the global variables to its symbol list. This is to move the merging work of `PPCMergeStringPool` to its initializer, just like what GlobalMerge does, to avoid adding merged global variables to the `AsmPrinter` symbol lis.	2024-01-31 10:27:07 -05:00
Quentin Dian	b7738e275d	[MIRPrinter] Don't print space when there is no successor (#80143 ) Extra space causes the checks generated by update_mir_test_checks to be unavailable. ``` # NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 4 # RUN: llc -mtriple=x86_64-- -o - %s -run-pass=none -verify-machineinstrs -simplify-mir \| FileCheck %s --- name: foo body: \| ; CHECK-LABEL: name: foo ; CHECK: bb.0: ; CHECK-NEXT: successors: ; CHECK-NEXT: {{ $}} ; CHECK-NEXT: {{ $}} ; CHECK-NEXT: bb.1: ; CHECK-NEXT: RET 0, $eax bb.0: successors: bb.1: RET 0, $eax ... ``` The failure log is as follows: ``` llvm/test/CodeGen/MIR/X86/unreachable-block-print.mir:9:16: error: CHECK-NEXT: is on the same line as previous match ; CHECK-NEXT: {{ $}} ^ <stdin>:21:13: note: 'next' match was here successors: ^ <stdin>:21:13: note: previous match ended here successors: ```	2024-01-31 22:35:41 +08:00
Alfie Richards	de75e5079a	[ARM][NEON] Add constraint to vld2 Odd/Even Pseudo instructions. (#79287 ) This ensures the odd/even pseudo instructions are allocated to the same register range. This fixes #71763	2024-01-31 14:08:02 +00:00
Shengchen Kan	e3c9327bc4	[X86][CodeGen] Set isReMaterializable = 1 for AVX broadcast load Broadcast of a single float should not be any slower than loading 32B using vmovaps. So remat it can help reduce register spill when there is big register pressure.	2024-01-31 20:55:56 +08:00
Rin Dobrescu	cf828aee24	[AArch64] Convert concat(uhadd(a,b), uhadd(c,d)) to uhadd(concat(a,c), concat(b,d)) (#79464 ) We can convert concat(v4i16 uhadd(a,b), v4i16 uhadd(c,d)) to v8i16 uhadd(concat(a,c), concat(b,d)), which can lead to further simplifications.	2024-01-31 12:52:12 +00:00
Simon Pilgrim	648eb7c141	[X86] divrem8_ext.ll - replace X32 check prefixes with X86 We try to only use X32 for gnux32 triple tests.	2024-01-31 12:06:48 +00:00
Simon Pilgrim	1d8c8f1169	[X86] cfguard - replace X32 check prefixes with X86 We try to only use X32 for gnux32 triple tests.	2024-01-31 12:05:32 +00:00
Simon Pilgrim	824d073fb6	[X86] fold-vector-sext - replace X32 check prefixes with X86 We try to only use X32 for gnux32 triple tests.	2024-01-31 12:01:02 +00:00
Simon Pilgrim	ed11f255a8	[X86] divide-by-constant.ll - replace X32 check prefixes with X86 We try to only use X32 for gnux32 triple tests.	2024-01-31 12:01:02 +00:00
Simon Pilgrim	e4af212f96	[X86] divrem.ll - replace X32 check prefixes with X86 We try to only use X32 for gnux32 triple tests.	2024-01-31 12:01:01 +00:00
Simon Pilgrim	a82ca1cd1b	[X86] insertps-from-constantpool.ll - replace X32 check prefixes with X86 and expose address math We try to only use X32 for gnux32 triple tests. Use no_x86_scrub_mem_shuffle so the test shows updated shuffle intermediate and the +4 offset into the constant pool vector entry	2024-01-31 12:01:01 +00:00
Simon Pilgrim	929503ead3	[X86] v2f32.ll - replace X32 check prefixes with X86 (and add common CHECK prefix) We try to only use X32 for gnux32 triple tests.	2024-01-31 11:04:20 +00:00
Simon Pilgrim	00a6817108	[X86] v4f32-immediate.ll - replace X32 check prefixes with X86 We try to only use X32 for gnux32 triple tests.	2024-01-31 11:04:19 +00:00
Simon Pilgrim	8d450b47ba	[X86] mmx-arith.ll - replace X32 check prefixes with X86 + strip cfi noise We try to only use X32 for gnux32 triple tests.	2024-01-31 11:04:19 +00:00
Simon Pilgrim	53b9d479d5	[X86] i256-add - replace i386 triple X32 check prefixes with X86 and add gnux32 triple tests	2024-01-31 11:04:19 +00:00
Vyacheslav Levytskyy	5a07774fe1	[SPIR-V] Improve how lowering of formal arguments in SPIR-V Backend interprets a value of 'kernel_arg_type' (#78730 ) The goal of this PR is to tolerate differences between description of formal arguments by function metadata (represented by "kernel_arg_type") and LLVM actual parameter types. A compiler may use "kernel_arg_type" of function metadata fields to encode detailed type information, whereas LLVM IR may utilize for an actual parameter a more general type, in particular, opaque pointer type. This PR proposes to resolve this by a fallback to LLVM actual parameter types during the lowering of formal function arguments in cases when the type can't be created by string content of "kernel_arg_type", i.e., when "kernel_arg_type" contains a type unknown for the SPIR-V Backend. An example of the issue manifestation is https://github.com/KhronosGroup/SPIRV-LLVM-Translator/blob/main/test/transcoding/KernelArgTypeInOpString.ll, where a compiler generates for the following kernel function detailed `kernel_arg_type` info in a form of `!{!"image_kernel_data", !"myInt", !"struct struct_name"}`, and in LLVM IR same arguments are referred to as `@foo(ptr addrspace(1) %in, i32 %out, ptr addrspace(1) %outData)`. Both definitions are correct, and the resulting LLVM IR is correct, but lowering stage of SPIR-V Backend fails to generate SPIR-V type. ``` typedef int myInt; typedef struct { int width; int height; } image_kernel_data; struct struct_name { int i; int y; }; void kernel foo(__global image_kernel_data* in, __global struct struct_name outData, myInt out) {} ``` ``` define spir_kernel void @foo(ptr addrspace(1) %in, i32 %out, ptr addrspace(1) %outData) ... !kernel_arg_type !7 ... { entry: ret void } ... !7 = !{!"image_kernel_data", !"myInt", !"struct struct_name"} ``` The PR changes a contract of `SPIRVType getArgSPIRVType(...)` in a way that it may return `nullptr` to signal that the metadata string content is not recognized, so corresponding comments are added and a couple of checks for `nullptr` are inserted where appropriate.	2024-01-31 02:58:50 -08:00
Jay Foad	c2c650f62e	[AMDGPU] Stop combining arbitrary offsets into PAL relocs (#80034 ) PAL uses ELF REL (not RELA) relocations which can only store a 32-bit addend in the instruction, even for reloc types like R_AMDGPU_ABS32_HI which require the upper 32 bits of a 64-bit address calculation to be correct. This means that it is not safe to fold an arbitrary offset into a GlobalAddressSDNode, so stop doing that. In practice this is mostly a problem for small negative offsets which do not work as expected because PAL treats the 32-bit addend as unsigned.	2024-01-31 10:28:23 +00:00
Yingwei Zheng	50e80e06d1	[ValueTracking] Merge `cannotBeOrderedLessThanZeroImpl` into `computeKnownFPClass` (#76360 ) This patch merges the logic of `cannotBeOrderedLessThanZeroImpl` into `computeKnownFPClass` to improve the signbit inference. --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2024-01-31 18:26:50 +08:00
Yingwei Zheng	89f87c3876	[RISCV][MC] Add MC layer support for the experimental zabha extension (#80005 ) This patch implements the zabha (Byte and Halfword Atomic Memory Operations) v1.0-rc1 extension. See also https://github.com/riscv/riscv-zabha/blob/v1.0-rc1/zabha.adoc.	2024-01-31 17:06:43 +08:00
Sander de Smalen	dd73666182	[SME] Stop RA from coalescing COPY instructions that transcend beyond smstart/smstop. (#78294 ) This patch introduces a 'COALESCER_BARRIER' which is a pseudo node that expands to a 'nop', but which stops the register allocator from coalescing a COPY node when its use/def crosses a SMSTART or SMSTOP instruction. For example: %0:fpr64 = COPY killed $d0 undef %2.dsub:zpr = COPY %0 // <- Do not coalesce this COPY ADJCALLSTACKDOWN 0, 0 MSRpstatesvcrImm1 1, 0, csr_aarch64_smstartstop, implicit-def dead $d0 $d0 = COPY killed %0 BL @use_f64, csr_aarch64_aapcs If the COPY would be coalesced, that would lead to: $d0 = COPY killed %0 being replaced by: $d0 = COPY killed %2.dsub which means the whole ZPR reg would be live upto the call, causing the MSRpstatesvcrImm1 (smstop) to spill/reload the ZPR register: str q0, [sp] // 16-byte Folded Spill smstop sm ldr z0, [sp] // 16-byte Folded Reload bl use_f64 which would be incorrect for two reasons: 1. The program may load more data than it has allocated. 2. If there are other SVE objects on the stack, the compiler might use the 'mul vl' addressing modes to access the spill location. By disabling the coalescing, we get the desired results: str d0, [sp, #8] // 8-byte Folded Spill smstop sm ldr d0, [sp, #8] // 8-byte Folded Reload bl use_f64	2024-01-31 09:04:13 +00:00
Chia	dc5dca1d01	[RISCV][Isel] Remove redundant vmerge for the scalable vwadd(u).wv (#80079 ) Similar to #78403, but for scalable `vwadd(u).wv`, given that #76785 is recommited. ### Code ``` define <vscale x 8 x i64> @vwadd_wv_mask_v8i32(<vscale x 8 x i32> %x, <vscale x 8 x i64> %y) { %mask = icmp slt <vscale x 8 x i32> %x, shufflevector (<vscale x 8 x i32> insertelement (<vscale x 8 x i32> poison, i32 42, i64 0), <vscale x 8 x i32> poison, <vscale x 8 x i32> zeroinitializer) %a = select <vscale x 8 x i1> %mask, <vscale x 8 x i32> %x, <vscale x 8 x i32> zeroinitializer %sa = sext <vscale x 8 x i32> %a to <vscale x 8 x i64> %ret = add <vscale x 8 x i64> %sa, %y ret <vscale x 8 x i64> %ret } ``` ### Before this patch [Compiler Explorer](https://godbolt.org/z/xsoa5xPrd) ``` vwadd_wv_mask_v8i32: li a0, 42 vsetvli a1, zero, e32, m4, ta, ma vmslt.vx v0, v8, a0 vmv.v.i v12, 0 vmerge.vvm v24, v12, v8, v0 vwadd.wv v8, v16, v24 ret ``` ### After this patch ``` vwadd_wv_mask_v8i32: li a0, 42 vsetvli a1, zero, e32, m4, ta, ma vmslt.vx v0, v8, a0 vsetvli zero, zero, e32, m4, tu, mu vwadd.wv v16, v16, v8, v0.t vmv8r.v v8, v16 ret ```	2024-01-31 17:11:07 +09:00
Changpeng Fang	3564666fe1	[AMDGPU]: Fix type signatures for wmma intrinsics, NFC (#80087 ) Make the wmma intrinsic type signatures to be canonical. We need a type signature as long as the type is not fixed. However, when an argument's type matches a previous argument's type, we do not need the signature for this argument. This patch fixes three general cases: 1. add missing signatures 2. remove signatures for matching arguments 3. reorer the signatures -- return type signature should always appear first	2024-01-30 23:17:35 -08:00
Craig Topper	8a98091162	[RISCV] Use disjoint flag in or_is_add.	2024-01-30 22:12:28 -08:00
Shengchen Kan	8e77390c06	[X86][CodeGen] Support folding memory broadcast in X86InstrInfo::foldMemoryOperandImpl (#79761 )	2024-01-31 12:51:03 +08:00
Oskar Wirga	ff4636a4ab	Refactor recomputeLiveIns to converge on added MachineBasicBlocks (#79940 ) This is a fix for the regression seen in https://github.com/llvm/llvm-project/pull/79498 > Currently, the way that recomputeLiveIns works is that it will recompute the livein registers for that MachineBasicBlock but it matters what order you call recomputeLiveIn which can result in incorrect register allocations down the line. Now we do not recompute the entire CFG but we do ensure that the newly added MBB do reach convergence.	2024-01-30 19:33:04 -08:00
Congcong Cai	c43fda3efc	Revert "[WebAssembly] avoid to use explicit disabled feature" This reverts commit 1a17f2beb9cd1f5bbaa64502ab5c02ff74c199a4.	2024-01-31 11:20:34 +08:00
Congcong Cai	1a17f2beb9	[WebAssembly] avoid to use explicit disabled feature In `CoalesceFeaturesAndStripAtomics`, feature string is converted to FeatureBitset and back to feature string. It will lose information about explicit diasbled features.	2024-01-31 11:14:40 +08:00

1 2 3 4 5 ...

51883 Commits