llvm-project

Author	SHA1	Message	Date
Yolanda Chen	291101aa8e	[WebAssembly] Optimize vector shift using a splat value from outside block The vector shift operation in WebAssembly uses an i32 shift amount type, while the LLVM IR requires binary operator uses the same type of operands. When the shift amount operand is splated from a different block, the splat source will not be exported and the vector shift will be unrolled to scalar shifts. This patch enables the vector shift to identify the splat source value from the other block, and generate expected WebAssembly bytecode when lowering. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D158399	2023-08-25 08:13:27 -07:00
Reid Kleckner	984dc4b9cd	[WebAssembly] Create separation between MC and CodeGen layers Move WebAssemblyUtilities from Utils to the CodeGen library. It primarily deals in MIR layer types, so it really lives in the CodeGen library. Move a variety of other things around to try create better separation. See issue #64166 for more info on layering. Move llvm/include/CodeGen/WasmAddressSpaces.h back to llvm/lib/Target/WebAssembly/Utils. Differential Revision: https://reviews.llvm.org/D156472	2023-08-18 14:08:37 -07:00
Thomas Lively	4f065fcb57	[WebAssembly] Fix incorrect assertion in SIMD reduction codegen The codegen routine introduced in 18077e9fd688 did not account for vectors with more than 16 lanes. Remove the incorrect assertion and bail out of the optimization when encountering this case. Add test cases that previously triggered the assertion. Unfortunately, these test cases now have terrible codegen, but that is at least better than crashing. Fixes #63500. Differential Revision: https://reviews.llvm.org/D154124	2023-06-30 11:30:18 -07:00
xortoast	bb648c9177	[WebAssembly] Add lowering for llvm.rint and llvm.roundeven WebAssembly doesn't expose inexact exceptions, so frint can be mapped to fnearbyint. Likewise, WebAssembly always rounds ties-to-even, so froundeven can be mapped to fnearbyint. Differential Revision: https://reviews.llvm.org/D153451	2023-06-23 14:07:11 -07:00
Paulo Matos	55aeb23fe0	[clang][WebAssembly] Implement support for table types and builtins This commit implements support for WebAssembly table types and respective builtins. Table tables are WebAssembly objects to store reference types. They have a large amount of semantic restrictions including, but not limited to, only being allowed to be declared at the top-level as static arrays of zero-length. Not being arguments or result of functions, not being stored ot memory, etc. This commit introduces the __attribute__((wasm_table)) to attach to arrays of WebAssembly reference types. And the following builtins to manage tables: * ref __builtin_wasm_table_get(table, idx) * void __builtin_wasm_table_set(table, idx, ref) * uint __builtin_wasm_table_size(table) * uint __builtin_wasm_table_grow(table, ref, uint) * void __builtin_wasm_table_fill(table, idx, ref, uint) * void __builtin_wasm_table_copy(table, table, uint, uint, uint) This commit also enables reference-types feature at bleeding-edge. This is joint work with Alex Bradbury (@asb). Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D139010	2023-06-10 15:53:13 +02:00
Caleb Zulawski	18077e9fd6	[WebAssembly] Re-land 8392bf6000ad Correctly handle single-element vectors to fix an assertion failure. Add tests that were missing from the original commit. Differential Revision: D151782	2023-06-09 08:42:27 -07:00
Thomas Lively	100c756d96	Revert "Improve WebAssembly vector bitmask, mask reduction, and extending" This reverts commit 8392bf6000ad039bd0e55383d40a05ddf7b4af13. The commit missed some edge cases that led to crashes. Reverting to resolve downstream breakage while a fix is pending.	2023-06-08 14:36:29 -07:00
Caleb Zulawski	8392bf6000	Improve WebAssembly vector bitmask, mask reduction, and extending This is inspired by a recently filed Rust issue noting poor codegen for vector masks (https://github.com/rust-lang/portable-simd/issues/351). Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D151782	2023-06-07 10:20:22 -07:00
Thomas Lively	72a72315b0	[WebAssembly] Mark @llvm.wasm.shuffle lane indices as immediates This intrinsic is meant to lower directly to the i8x16.shuffle instruction, which takes its lane index arguments as immmediates. The ISel for the intrinsic assumed that the lane index arguments were constants, so bitcode that "incorrectly" used this intrinsic with non-immediate arguments caused an assertion failure in the backend. Avoid the crash by defining the lane index arguments to be immediates, matching the underlying instruction. Update ISel accordingly. This change means that the bitcode that previously caused a crash will now fail to validate. Fixes #55559. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D149898	2023-05-05 08:12:41 -07:00
Peter Rong	3b2476910b	[WASM] Prevent casting `undef` to `CosntantSDNode` WebAssembly tries to cast an `undef` to `CosntantSDNode` during `LowerAccessVectorElement`. These operations will trigger an assertion error in cast. To avoid this issue, we prevent casting, and abort the lowering operation. A unit test is also included. This patch fixes [pr61828](https://github.com/llvm/llvm-project/issues/61828) Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D147198	2023-03-30 20:14:11 -07:00
Peter Rong	51a93828d7	[WASM] Fix legalizer for LowerBUILD_VECTOR. Constants in BUILD_VECTOR may be down cast into a smaller value that fits LaneBits, i.e., the bit width of elements in the vector. This cast didn't consider 2^N where it would be cast into -2^N, which still doesn't fit into LaneBits after casting. This will cause an assertion in later legalization. 2^N should be cast into 0, and this patch reflects such behavior. This patch also includes a test to reflect the fix. This patch fixes [issue 61780](https://github.com/llvm/llvm-project/issues/61780) Related patch: https://reviews.llvm.org/D108669 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D147208	2023-03-30 19:20:04 -07:00
Peter Rong	163d7bb941	[WASM] Precommit WebAssemblyISelLowering.cpp format changes for D147198 Signed-off-by: Peter Rong <PeterRong96@gmail.com>	2023-03-29 22:18:53 -07:00
Heejin Ahn	d91c9aef9b	[WebAssembly] Select call_indirect for alloca calls Currently calling stack locations is selected using `CALL` in ISel, resulting in an invalid code and crashing in AsmPrinter. FastISel correctly selects it will `CALL_INDIRECT`. Fixes the problem reported in D146781. Reviewed By: tlively, HerrCai0907 Differential Revision: https://reviews.llvm.org/D147033	2023-03-29 12:46:58 -07:00
Jun Ma	403926aefe	[WebAssembly] Skip implied bitmask operation in LowerShift This patch skips redundant explicit masks of the shift count since it is implied inside wasm shift instruction. Differential Revision: https://reviews.llvm.org/D144619	2023-03-02 09:37:25 +08:00
Luke Lau	fb6602616c	[WebAssembly] Explicitly add {z,s}ext so extends are selected During DAG legalization, {u,s}itofp instructions on v2i8, v2i16, v4i8 and v4i16 types ended up being legalized into scalar instructions, when they could just be extended to v2i32/v4i32 instead. Fixes https://github.com/llvm/llvm-project/issues/57182 Differential Revision: https://reviews.llvm.org/D140916	2023-01-06 12:28:29 +00:00
Luke Lau	f841ad30d7	[WebAssembly] Replace LOAD_SPLAT with SPLAT_VECTOR Splats were selected by matching on uses of `build_vector` with identical elements, but a while back a target independent node for vector splatting was added. This removes the WebAssembly specific LOAD_SPLAT intrinsic, and instead makes SPLAT_VECTOR legal and adds patterns for splat loads. Differential Revision: https://reviews.llvm.org/D139871	2023-01-04 15:07:47 +00:00
Luke Lau	8ef5da7010	[WebAssembly] Fix crash when selecting 64 bit lane extract operand The tablegen patterns on vector_extract only match i32 constants, but on wasm64 these come in as i64 constants. In certain situations this would cause crashes whenever it couldn't select an extract_vector_elt instruction. Rather than add duplicate patterns for every instruction, this just canonicalizes the constant to be i32 when lowering. Fixes https://github.com/llvm/llvm-project/issues/57577 Differential Revision: https://reviews.llvm.org/D140205	2022-12-19 10:37:19 +00:00
Fangrui Song	b0df70403d	[Target] llvm::Optional => std::optional The updated functions are mostly internal with a few exceptions (virtual functions in TargetInstrInfo.h, TargetRegisterInfo.h). To minimize changes to LLVMCodeGen, GlobalISel files are skipped. https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 22:43:14 +00:00
Kazu Hirata	20cde15415	[Target] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 20:36:06 -08:00
Stanislav Mekhanoshin	bcaf31ec3f	[AMDGPU] Allow finer grain control of an unaligned access speed A target can return if a misaligned access is 'fast' as defined by the target or not. In reality there can be different levels of 'fast' and 'slow'. This patch changes the boolean 'Fast' argument of the allowsMisalignedMemoryAccesses family of functions to an unsigned representing its speed. A target can still define it as it wants and the direct translation of the current code uses 0 and 1 for current false and true. This makes the change an NFC. Subsequent patch will start using an actual value of speed in the load/store vectorizer to compare if a vectorized access going to be not just fast, but not slower than before. Differential Revision: https://reviews.llvm.org/D124217	2022-11-17 09:23:53 -08:00
Paulo Matos	1bd1a44070	[WebAssembly] Use intrinsics for table.get/set instructions Initial table.get/set implementation would match and lower combinations of GEP+load/store to table.get/set instructions. However, this is error prone due to potential combinations of GEP+load/store we don't implement, and load/store optimizations. By changing the code to using intrinsics, we avoid both issues and simplify the code. New builtins implemented: * @llvm.wasm.table.get.externref * @llvm.wasm.table.get.funcref * @llvm.wasm.table.set.externref * @llvm.wasm.table.set.funcref Reviewed By: asb, tlively Differential Revision: https://reviews.llvm.org/D134436	2022-09-27 09:16:30 +02:00
Fanchen Kong	28557e8c98	[WebAssembly] Improve codegen for shuffles with undefined lane indices For undefined lane indices, fill the mask with {0..N} instead of zeros to allow further reduction to word/dword shuffle on the VM. Reviewed By: tlively, penzn Differential Revision: https://reviews.llvm.org/D133473	2022-09-13 16:03:18 -07:00
Kazu Hirata	2833760c57	[Target] Qualify auto in range-based for loops (NFC)	2022-08-28 17:35:09 -07:00
Simon Pilgrim	f9de13232f	[X86] Promote i8/i16 CTTZ (BSF) instructions and remove speculation branch This patch adds a Type operand to the TLI isCheapToSpeculateCttz/isCheapToSpeculateCtlz callbacks, allowing targets to decide whether branches should occur on a type-by-type/legality basis. For X86, this patch proposes to allow CTTZ speculation for i8/i16 types that will lower to promoted i32 BSF instructions by masking the operand above the msb (we already do something similar for i8/i16 TZCNT). This required a minor tweak to CTTZ lowering - if the src operand is known never zero (i.e. due to the promotion masking) we can remove the CMOV zero src handling. Although BSF isn't very fast, most CPUs from the last 20 years don't do that bad a job with it, although there are some annoying passthrough EFLAGS dependencies. Additionally, now that we emit 'REP BSF' in most cases, we are tending towards assuming this will most likely be executed as a TZCNT instruction on any semi-modern CPU. Differential Revision: https://reviews.llvm.org/D132520	2022-08-24 17:28:18 +01:00
Alex Bradbury	104a24ec8b	[WebAssembly] Produce error when encountering unlowerable Wasm global accesses WebAssembly globals are represented as IR globals with the wasm_var address space (AS1). Prior to this patch, a wasm global load that isn't lowerable will produce a failure to select, while a wasm global store will produced incorrect code. This patch ensures we consistently produce a clear error. As noted in the test cases, it's conceivable that a frontend or an optimisation pass could produce similar IR even in the presence of the semantic restrictions on pointers to Wasm globals in the frontend, which is a separate problem to address. Differential Revision: https://reviews.llvm.org/D131387	2022-08-10 10:34:10 +01:00
Andrew Brown	3696a789d2	[WebAssembly] Use `localexec` as default TLS model for non-Emscripten targets Only Emscripten supports dynamic linking with threads. To use thread-local storage for other targets, this change defaults to the `localexec` model. Differential Revision: https://reviews.llvm.org/D130053	2022-07-25 13:25:46 -07:00
Kazu Hirata	e0039b8d6a	Use llvm::less_second (NFC)	2022-06-04 22:48:32 -07:00
Shao-Ce SUN	662b9fa02c	[NFC][CodeGen] Add a setTargetDAGCombine use ArrayRef Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D122557	2022-03-29 09:53:24 +08:00
Heejin Ahn	b8038a916d	[WebAssembly] Disable SimplifyDemandedVectorElts after legalization This fixes a reported bug that caused an infinite loop during the SelectionDAG optimization phase in ISel, by creating an overridable hook in `TargetLowering` that allows us to bail out from running `SimplifyDemandedVectorElts`. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D121869	2022-03-16 20:52:43 -07:00
Shengchen Kan	37b378386e	[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments	2022-03-16 20:25:42 +08:00
serge-sans-paille	989f1c72e0	Cleanup codegen includes This is a (fixed) recommit of https://reviews.llvm.org/D121169 after: 1061034926 before: 1063332844 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121681	2022-03-16 08:43:00 +01:00
Nico Weber	a278250b0f	Revert "Cleanup codegen includes" This reverts commit 7f230feeeac8a67b335f52bd2e900a05c6098f20. Breaks CodeGenCUDA/link-device-bitcode.cu in check-clang, and many LLVM tests, see comments on https://reviews.llvm.org/D121169	2022-03-10 07:59:22 -05:00
serge-sans-paille	7f230feeea	Cleanup codegen includes after: 1061034926 before: 1063332844 Differential Revision: https://reviews.llvm.org/D121169	2022-03-10 10:00:30 +01:00
Jim Lin	d6b0734837	[NFC] Use Register instead of unsigned	2022-01-19 20:17:04 +08:00
Paulo Matos	97ef15ad76	[WebAssembly] Fix reftype load/store match with idx from call Implement support for matching an index from a WebAssembly CALL instruction. Add test. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D115327	2022-01-13 11:04:22 +01:00
Jing Bao	2a4a229d6d	[WebAssembly] Custom optimization for truncate When possible, optimize TRUNCATE to generate Wasm SIMD narrow instructions (i16x8.narrow_i32x4_u, i8x16.narrow_i16x8_u), rather than generate lots of extract_lane and replace_lane. Closes #50350.	2021-12-14 08:42:39 -08:00
Paulo Matos	a96d828510	[WebAssembly] Implementation of intrinsic for ref.null and HeapType removal This patch implements the intrinsic for ref.null. In the process of implementing int_wasm_ref_null_func() and int_wasm_ref_null_extern() intrinsics, it removes the redundant HeapType. This also causes the textual assembler syntax for ref.null to change. Instead of receiving an argument: `func` or `extern`, the instruction mnemonic is either ref.null_func or ref.null_extern, without the need for a further operand. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D114979	2021-12-06 09:46:15 +01:00
Craig Topper	fe1f0de003	[RISCV][WebAssembly][TargetLowering] Allow expandCTLZ/expandCTTZ to rely on CTPOP expansion for vectors. Our fallback expansion for CTLZ/CTTZ relies on CTPOP. If CTPOP isn't legal or custom for a vector type we would scalarize the CTLZ/CTTZ. This is different than CTPOP itself which would use a vector expansion. This patch teaches expandCTLZ/CTTZ to rely on the vector CTPOP expansion instead of scalarizing. To do this I had to add additional checks to make sure the operations used by CTPOP expansions are all supported. Some of the operations were already needed for the CTLZ/CTTZ expansion. This is a huge improvement to the RISCV which doesn't have a scalar ctlz or cttz in the base ISA. For WebAssembly, I've added Custom lowering to keep the scalarizing behavior. I've also extended the scalarizing to CTPOP. Differential Revision: https://reviews.llvm.org/D111919	2021-10-20 07:46:41 -07:00
Paulo Matos	6d0c7bc17d	[WebAssembly] Implementation of table.get/set for reftypes in LLVM IR This change implements new DAG nodes TABLE_GET/TABLE_SET, and lowering methods for load and stores of reference types from IR arrays. These global LLVM IR arrays represent tables at the Wasm level. Differential Revision: https://reviews.llvm.org/D111154	2021-10-20 10:31:31 +02:00
Kazu Hirata	8568ca789e	Use llvm::erase_if (NFC)	2021-10-18 09:33:42 -07:00
Heejin Ahn	9261ee32dc	[WebAssembly] Make EH work with dynamic linking This makes Wasm EH work with dynamic linking. So far we were only able to handle destructors, which do not use any tags or LSDA info. 1. This uses `TargetExternalSymbol` for `GCC_except_tableN` symbols, which points to the address of per-function LSDA info. It is more convenient to use than `MCSymbol` because it can take additional target flags. 2. When lowering `wasm_lsda` intrinsic, if PIC is enabled, make the symbol relative to `__memory_base` and generate the `add` node. If PIC is disabled, continue to use the absolute address. 3. Make tag symbols (`__cpp_exception` and `__c_longjmp`) undefined in the backend, because it is hard to make it work with dynamic linking's loading order. Instead, we make all tag symbols undefined in the LLVM backend and import it from JS. 4. Add support for undefined tags to the linker. Companion patches: - https://github.com/WebAssembly/binaryen/pull/4223 - https://github.com/emscripten-core/emscripten/pull/15266 Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D111388	2021-10-12 23:28:27 -07:00
Paulo Matos	0c7495848a	[WebAssembly] Fix call_indirect on funcrefs The currently implementation of funcrefs is broken since it is putting the funcref itself on the stack before the call_indirect. Instead what should be on the stack is the constant 0, which is the index at which we store the funcref in __funcref_call_table. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D111152	2021-10-06 10:11:53 +02:00
Paulo Matos	91fe069c35	[WebAssembly] De-duplicate WasmAddressSpace and refactor reftype predicates This is a non-functional change to remove the duplicate WasmAddressSpace enum and refactor reftype predicates by moving them to the Utilities source file. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D111144	2021-10-06 09:56:23 +02:00
Sam Clegg	ef8c9135ef	[WebAssembly] Allow import and export of TLS symbols between DSOs We previously had a limitation that TLS variables could not be exported (and therefore could also not be imported). This change removed that limitation. Differential Revision: https://reviews.llvm.org/D108877	2021-09-14 06:47:37 -07:00
Sam Clegg	c32884c482	[WebAssembly] Rename WrapperPIC -> WrapperREL. NFC This ISD node/wrapper represents am address which is relative to a base address and therefore lowers to `i32.const` rather than `global.get`. Use this wrapper type for TLS-relative addresses, paving the way for the non-REL wrapper to be used to external TLS address once those are supported. Differential Revision: https://reviews.llvm.org/D109179	2021-09-02 20:04:34 -04:00
Thomas Lively	fec4749200	[WebAssembly] Lower v2f32 to v2f64 extending loads with promote_low Previously extra wide v4f32 to v4f64 extending loads would be legalized to v2f32 to v2f64 extending loads, which would then be scalarized by legalization. (v2f32 to v2f64 extending loads not produced by legalization were already being emitted correctly.) Instead, mark v2f32 to v2f64 extending loads as legal and explicitly lower them using promote_low. This regresses the addressing modes supported for the extloads not produced by legalization, but that's a fine trade off for now. Differential Revision: https://reviews.llvm.org/D108496	2021-09-01 10:27:42 -07:00
Thomas Lively	977eeb0c38	[WebAssembly] Fix some UB from ca541aa319cd	2021-08-24 19:44:03 -07:00
Thomas Lively	ca541aa319	[WebAssembly] Fix up out-of-range BUILD_VECTOR lane constants Fixes PR51605 in which a DAG combine and legalization sequence generated out-of-range constants in BUILD_VECTOR lanes. In the v16i8 case, the constants were 255, which would be in range if DAG ISel used unsigned constants, but it is out of range because DAG ISel uses signed constants. Differential Revision: https://reviews.llvm.org/D108669	2021-08-24 17:24:03 -07:00
Thomas Lively	be6c49e743	[WebAssembly] Add explicit casts to silence -Wc++11-narrowing	2021-08-19 16:00:07 -07:00
Thomas Lively	fd0557dbf1	[WebAssembly] More convert_low and promote_low codegen The convert_low and promote_low instructions can widen the lower two lanes of a four-lane vector, but we were previously scalarizing patterns that widened lanes besides the low two lanes. The commit adds a shuffle to move the widened lanes into the low lane positions so the convert_low and promote_low instructions can be used instead of scalarizing. Depends on D108266. Differential Revision: https://reviews.llvm.org/D108341	2021-08-19 15:37:12 -07:00

1 2 3 4 5 ...

360 Commits