llvm-project

Author	SHA1	Message	Date
Vladimir Vereschaka	19d681177f	Revert "[MC][TableGen] Expand Opcode field of MCInstrDesc" (#180321 ) Reverts llvm/llvm-project#179652 This PR causes the out-of-memory build failures on many Windows builders.	2026-02-06 21:58:50 -08:00
Demetrius Kanios	4919e0da50	[WebAssembly][FastISel] Make use of `sign-ext` proposals instructions when available (#179855 ) Enables FastISel to use the dedicated sign-extension instructions (rather than shl, shr) when available.	2026-02-06 12:41:39 -08:00
sstipano	13d8870d45	[MC][TableGen] Expand Opcode field of MCInstrDesc (#179652 ) Increase width of Opcode to `int` from `short` to allow more capacity.	2026-02-06 20:21:48 +01:00
Demetrius Kanios	9976e5702f	[WebAssembly][GlobalISel] Part 1 - Setup skeleton (#178796 ) This PR is the first step towards bringing GlobalISel to the Wasm backend. Split from #157161	2026-02-06 18:38:56 +00:00
Derek Schuff	c3db52701e	[MC][Wasm] Emit useful error message when encountering common symbols (#179586 ) We don't currently support common symbols for Wasm, and we currently emit a generic error with a backtrace. Instead, don't crash, and report the names of the offending symbols.	2026-02-06 00:40:25 +00:00
hanbeom	22f53531d7	[WebAssembly] Combine shuffle and signed extend to extend_high (#179166 ) Fold shuffles and bitcasts feeding extend_low_s into extend_high_s. This enables i32x4.dot_i16x8_s selection and removes redundant shuffles. Fixed: https://github.com/llvm/llvm-project/issues/179145	2026-02-03 17:02:53 +09:00
Nicolai Hähnle	6f0b873f1c	[CodeGen] Refactor targets to override the new getTgtMemIntrinsic overload (NFC) (#175844 ) This is a fairly mechanical change. Instead of returning true/false, we either keep the Infos vector empty or push one entry.	2026-02-02 17:40:02 -08:00
Anshul Nigham	85545d4c84	[NewPM] Port MachineDominanceFrontierAnalysis (#177709 )	2026-02-01 22:02:45 -08:00
Demetrius Kanios	95ac9314df	[WebAssembly] Prevent FastISel from trying to select funcref calls (#178742 ) Before, Wasm FastISel treated all indirect calls the same, causing miscompilations at O0 when trying to call a funcref (`call ptr addrspace(20)`), as it would treat the funcref as a normal `ptr` This adds a check so it falls back to ISelDAG when encountering calls outside addrspace 0 (which covers direct calls and indirect calls through normal function pointers). Related: #140933	2026-01-30 12:05:15 -08:00
Damian Heaton	762ba885f9	[LV] Add support for llvm.vector.partial.reduce.fadd (#163975 ) Allows the Loop Vectorizer to generate `llvm.vector.partial.reduce.fadd` intrinsics when sequences which match its requirements are found.	2026-01-28 15:05:34 +00:00
hanbeom	16d8d4b84e	[WebAssembly] Fix crash in ReplaceNodeResults for ANY_EXTEND_VECTOR_INREG (#178374 ) Fixes a crash during type legalization by allowing ISD::ANY_EXTEND_VECTOR_INREG to fall back to default expansion instead of hitting llvm_unreachable. Fixed: #177209	2026-01-28 20:45:04 +09:00
Sam Parker	1e0114c21d	[WebAssembly] Zero and NaN checks for min/max (#177968 ) Custom lower FMINNUM, FMINIMUMNUM, FMAXNUM and FMAXIMUMNUM to generate relaxed_min and relaxed_max when the inputs cannot be NaN or signed zero. Tablegen patterns have also been modified to check the above conditions when trying to match relaxed min/max using the pmin/pmax pattern.	2026-01-28 09:25:41 +00:00
Florian Hahn	b794baf8e7	[TTI] Add VectorInstrContext for context-aware insert/extract costs. (#175982 ) This commit introduces the VectorInstrContext (VIC) infrastructure to improve cost estimates for insert/extracts based on the context instruction in which the insert/extract is used. This is similar to CastContextHint, and allows providing context on how the insert/extract is going to be used before creating IR. This is useful in the LoopVectorizer, where costs need to estimated before creating IR. The new hint currently only replaces an existing check in AArch64, but new uses will be introduced in follow-ups, including https://github.com/llvm/llvm-project/pull/177201. PR: https://github.com/llvm/llvm-project/pull/175982	2026-01-27 16:30:29 +00:00
Anutosh Bhat	2503ffbdf3	[WebAssembly] Fix exception handling initialization order in TargetMachine constructor (#177542 ) The WebAssemblyTargetMachine constructor had an ordering issue where initAsmInfo() was called before basicCheckForEHAndSjLj(). This caused problems in incremental compilation scenarios where: 1. `initAsmInfo()` sets `MCAsmInfo` exception type based on `Options.ExceptionModel` 2. But `Options.ExceptionModel` might still be None at this point 3. `basicCheckForEHAndSjLj()` runs later and updates `Options.ExceptionModel` based on command-line flags like `-wasm-enable-eh` 4. `MCAsmInfo` retains the incorrect exception type (`None` instead of `Wasm`) 5. This prevents WebAssembly exception handling passes from running The fix swaps the order so basicCheckForEHAndSjLj() runs first to establish the correct exception model before initAsmInfo() configures MCAsmInfo based on that model. This enables WebAssembly exception handling to work correctly in clang-repl and other incremental compilation scenarios.	2026-01-24 06:26:02 +00:00
Jameson Nash	d10b2b566a	[NFCI] replace getValueType with new getGlobalSize query (#177186 ) Returns uint64_t to simplify callers. The goal is eventually replace getValueType with this query, which should return the known minimum reference-able size, as provided (instead of a Type) during create. Additionally the common isSized query would be replaced with an isExactKnownSize query to test if that size is an exact definition.	2026-01-22 13:55:53 -05:00
Jameson Nash	2458387ac1	[NFC] replace getValueType with more specific getFunctionType (#177175 ) When trivially valid already, use the more specific method, instead of casting the result of the less specific method.	2026-01-21 10:30:09 -05:00
Matt Arsenault	69df4acf33	WebAssembly: Use LibcallLoweringInfo (#176804 ) Query libcalls through analysis instead of the TargetLowering copy. This could be nicer by parsing the libcall name and checking against the LibcallImpl, but that's probably slower for such a specific case (alternatively, ExternalSymbol should support encoding as a LibcallImpl).	2026-01-20 09:20:49 +01:00
Matt Arsenault	2c9cc88e25	FastISel: Thread LibcallLoweringInfo through (#176799 ) Boilerplate change to prepare to take LibcallLoweringInfo from an analysis. For now, it just sets it from the copy inside of TargetLowering.	2026-01-19 20:44:48 +00:00
Florian Hahn	811fb223af	[WebAssembly] Mark extract.last.active as having invalid cost. Currently the WebAssembly backend crashes when trying to lower some extract.last.active intrinsic calls. Mark their cost as invalid temporarily, to avoid them being introduced by the loop vectorizer after 2abd6d6d7ac (#158088).	2026-01-17 21:21:34 +00:00
Sam Elliott	2042887709	Reland "[NFC][MI] Tidy Up RegState enum use (1/2)" (#176277 ) This Change is to prepare to make RegState into an enum class. It: - Updates documentation to match the order in the code. - Brings the `get<>RegState` functions together and makes them `constexpr`. - Adopts the `get<>RegState` where RegStates were being chosen with ternary operators in backend code. - Introduces `hasRegState` to make querying RegState easier once it is an enum class. - Adopts `hasRegState` where equivalent was done with bitwise arithmetic. - Introduces `RegState::NoFlags`, which will be used for the lack of flags. - Documents that `0x1` is a reserved flag value used to detect if someone is passing `true` instead of flags (due to implicit bool to unsigned conversions). - Updates two calls to `MachineInstrBuilder::addReg` which were passing `false` to the flags operand, to no longer pass a value. - Documents that `getRegState` seems to have forgotten a call to `getEarlyClobberRegState`. This PR relands llvm/llvm-project#176091 (commit 1d616cdca3aba9d22f120888bb6b09b75ca90b92) which was reverted in llvm/llvm-project#176190 (commit 6309cd8668fc2ae589f156b23f86821f4ce5b7ea).	2026-01-16 13:05:06 -08:00
Akshay Deodhar	3860147a7f	[NFC][TargetLowering] Make shouldExpandAtomicRMWInIR and shouldExpandAtomicCmpXchgInIR take a const Instruction pointer (#176073 ) Splits out change from https://github.com/llvm/llvm-project/pull/176015 Changes shouldExpandAtomicRMWInIR to take a constant argument: This is to allow some other TargetLowering constant-argument functions to call it. This change touches several backends. An alternative solution exists, but to me, this seems the "right" way.	2026-01-15 14:22:57 -08:00
Sam Elliott	6309cd8668	Revert "[NFC][MI] Tidy Up RegState enum use (1/2)" (#176190 ) Reverts llvm/llvm-project#176091 Reverting because some compilers were erroring on the call to `Reg.isReg()` (which is not `constexpr`) in a `constexpr` function.	2026-01-15 07:58:05 -08:00
Sam Elliott	1d616cdca3	[NFC][MI] Tidy Up RegState enum use (1/2) (#176091 ) This Change is to prepare to make RegState into an enum class. It: - Updates documentation to match the order in the code. - Brings the `get<>RegState` functions together and makes them `constexpr`. - Adopts the `get<>RegState` where RegStates were being chosen with ternary operators in backend code. - Introduces `hasRegState` to make querying RegState easier once it is an enum class. - Adopts `hasRegState` where equivalent was done with bitwise arithmetic. - Introduces `RegState::NoFlags`, which will be used for the lack of flags. - Documents that `0x1` is a reserved flag value used to detect if someone is passing `true` instead of flags (due to implicit bool to unsigned conversions). - Updates two calls to `MachineInstrBuilder::addReg` which were passing `false` to the flags operand, to no longer pass a value. - Documents that `getRegState` seems to have forgotten a call to `getEarlyClobberRegState`.	2026-01-15 07:47:05 -08:00
Sam Parker	b84ffe040b	[WebAssembly] LoadLane matching with offsets (#176005 )	2026-01-15 08:39:42 +00:00
Austin Jiang	e6cdfb75ac	Fix typos and spelling errors across codebase (#156270 ) Corrected various spelling mistakes such as 'occurred', 'receiver', 'initialized', 'length', and others in comments, variable names, function names, and documentation throughout the project. These changes improve code readability and maintain consistency in naming and documentation. Co-authored-by: Louis Dionne <ldionne.2@gmail.com>	2026-01-13 11:52:46 -05:00
Sam Parker	e5b6833e49	[WebAssembly] vi8 mul cost modelling. (#175177 ) We've already optimised these, so update the cost model to reflect it. And skip the isBeforeLegalize check when lowering i8 muls, because it then misses the cases where, say v32i8, has been type legalised into 2x v16i8. Also explicitly disable memory interleaving for any factor other than two or four.	2026-01-12 09:25:54 +00:00
Trevor Gross	3920bc61ca	[TargetLowering] Change the `softPromoteHalfType` default to `true` (#175149 ) The default `f16` lowering has some issues that result in incorrect float behavior, so over time most targets have switched to use `softPromoteHalfType`. Swap to soft promotion by default and add overrides for SystemZ and AMDGPU, which are the two remaining backends that still depend on this behavior. All basic `f16` op tests now pass on all remaining experimental arches. Fixes: https://github.com/llvm/llvm-project/issues/97981 Fixes: https://github.com/llvm/llvm-project/issues/97975	2026-01-11 12:48:26 +01:00
Derek Schuff	7a22bea512	[WebAssembly] Expand vector frem instructions (#174854 ) Commit `6ad41bcc49` changed how frem is expanded during legalization and it broke WebAssembly but we were missing test coverage. We want to maintain our previous behavior of unrolling vectors and using a libcall to implement scalar frem. I'm not sure why this now has to be different (in ISelLowering) from other libcalls like fsin which work the same way in the end, but this code does accurately describe what we want. Fixes: https://github.com/emscripten-core/emscripten/issues/25991	2026-01-08 16:19:44 -08:00
Trevor Gross	4903c6260c	[WebAssembly] Change `half` to use soft promotion rather than `PromoteFloat` (#152833 ) The default `half` legalization, which Wasm currently uses, does not respect IEEE conventions: for example, casting to bits may invoke a lossy libcall, meaning soft float operations cannot be correctly implemented. Change to the soft promotion legalization which passes `f16` as an `i16` and treats each `half` operation as an individual f16->f32->libcall->f32->f16 sequence. Of note in the test updates are that `from_bits` and `to_bits` are now libcall-free, and that chained operations now round back to `f16` after each step. Fixes the wasm portion of https://github.com/llvm/llvm-project/issues/97981 Fixes the wasm portion of https://github.com/llvm/llvm-project/issues/97975 Fixes: https://github.com/llvm/llvm-project/issues/96437 Fixes: https://github.com/llvm/llvm-project/issues/96438	2026-01-08 15:07:59 +01:00
valadaptive	8da7c05933	[WebAssembly] Fold constant `i8x16.swizzle` and `i8x16.relaxed.swizzle` to `shufflevector` (#169110 ) Resolves #169058. This adds ~~an InstCombine pass~~ a TTI hook to the WebAssembly backend that folds `i8x16.swizzle` and `i8x16.relaxed.swizzle` operations to `shufflevector` operations if their mask operands are constant. This is mainly useful for abstractions over the raw intrinsics--for instance, in architecture-generic SIMD code that may not be able to expose the constant shuffles due to type system limitations. I took most of this from the x86 backend (in particular, `simplifyX86vpermilvar` in `X86InstCombineIntrinsic`), and adapted it for the WebAssembly backend. There wasn't any previous `instCombineIntrinsic` method on the WebAssembly `TargetTransformInfo`, so I added it. Right now, this swizzle optimization is the only one it performs. As I noted in the transform itself, the "relaxed" swizzle actually has stricter preconditions than the non-relaxed one. If a non-negative but still out-of-bounds index is provided, the "relaxed" swizzle can choose between returning 0 and the lane at the index modulo 16. However, it must make the same choice every time, and we don't know which choice the runtime will make, so we can't constant-fold it. The regression tests were mostly generated by Claude and adapted a bit by me (I tried to follow the [InstCombine contributor guide](https://llvm.org/docs/InstCombineContributorGuide.html#tests)). There was previously no WebAssembly subdirectory within the InstCombine tests, so I created that too; as of now, the swizzle fold test is the only file in it. Everything else was written by myself (well, partly copy-pasted from the x86 backend). I'm not sure how to write an Alive2 test for this; I can't find any examples where the input is an arbitrary constant.	2026-01-07 17:39:36 -08:00
hanbeom	1171e30cb0	[WebAssembly] Support v128.load{32,64}_zero for f32 and f64 types (#172291 ) This patch extends the `load_zero` pattern matching to support floating-point vector types (`v4f32` and `v2f64`). Previously, the optimization to generate `v128.load32_zero` and `v128.load64_zero` was only enabled for integer types (`v4i32` and `v2i64`). This change adds the necessary TableGen patterns to correctly match scalar floating-point loads inserted into zero-initialized vectors.	2026-01-08 09:28:14 +09:00
Islam Imad	7ceecfad40	[CodeGen] Fix EVT::changeVectorElementType assertion on simple-to-extended fallback (#173413 ) Fixes #171608	2025-12-28 18:51:18 +00:00
Stefan Weigl-Bosker	da8497ed08	[IR][Verifier] Verification for `target-features` attribute (#173119 ) Fixes https://github.com/llvm/llvm-project/issues/172647 Currently, MC assumes that all `target-feature` flag attributes are well formed and will crash otherwise. This change handles those cases more gracefully.	2025-12-22 11:13:56 +01:00
Rahul Joshi	d921b54d93	[LLVM][Target] Use ListSeparator in lib/Target (#172919 ) Use `ListSeparator` in various places in Target.	2025-12-19 10:19:57 -08:00
Frederik Harwath	6ad41bcc49	[CodeGen] expand-fp: Change frem expansion criterion (#158285 ) The existing condition for checking whether or not to expand an frem instruction in expand-fp is not sufficiently precise. The expansion on other targets than AMDGPU - which is the only intended user right now - is only prevented due to the interaction with the MaxLegalFpConvertBitWidth check. Relying on this is conceptually wrong and limits the use of the pass for other targets and further expansions (e.g. merging with the similar ExpandLargeDivRem pass). Change the expansion criterion to always expand frem of a given type for targets that use "Expand" as the legalization action for the underlying scalar type and use this to exit the pass early for targets which do not require any expansions. This requires to change the frem legalization action for all targets which do not want frem to be expanded in this pass from "Expand" to "LibCall". --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2025-12-16 17:31:26 +01:00
Derek Schuff	6d60d3d7e4	Revert "[WebAssembly] Implement addrspacecast to funcref" (#170785 ) Reverts llvm/llvm-project#166820 There was a failure in the ENABLE_EXPENSIVE_CHECKS configuration.	2025-12-04 17:24:14 -08:00
Demetrius Kanios	d3b9fd0f86	[WebAssembly] Implement addrspacecast to funcref (#166820 ) Adds lowering of `addrspacecast [0 -> 20]` to allow easy conversion of function pointers to Wasm `funcref` When given a constant function pointer, it lowers to a direct `ref.func`. Otherwise it lowers to a `table.get` from `__indirect_function_table` using the provided pointer as the index.	2025-12-04 16:34:42 -08:00
Robert Imschweiler	5c3c0020af	[NFC] Refactor TargetLowering::getTgtMemIntrinsic to take CallBase parameter (#170334 ) cf. https://github.com/llvm/llvm-project/pull/133907#discussion_r2578576548	2025-12-02 19:42:31 +01:00
Jasmine Tang	e0db7f347c	[WebAssembly] Optimize away mask of 63 for sra and srl( zext (and i32 63))) (#170128 ) Follow up to #71844 after shl implementation	2025-12-02 18:23:17 +00:00
Jasmine Tang	edd1856686	[WebAssembly] Optimize away mask of 63 for shl ( zext (and i32 63))) (#152397 ) Fixes https://github.com/llvm/llvm-project/issues/71844	2025-12-01 11:32:46 +00:00
Matt Arsenault	0c2701fe7f	CodeGen: Make all targets override pseudos with pointers (#159881 ) This eliminates the need to have PointerLikeRegClass handling in codegen.	2025-11-26 14:57:14 +00:00
Sam Parker	e44646b795	[WebAssembly] Lower ANY_EXTEND_VECTOR_INREG (#167529 ) Treat it in the same manner of zero_extend_vector_inreg and generate an extend_low_u if possible. This is to try an prevent expensive shuffles from being generated instead. computeKnownBitsForTargetNode has also been updated to specify known zeros on extend_low_u.	2025-11-20 08:57:08 +00:00
Matt Arsenault	a757c4e74e	CodeGen: Add subtarget to TargetLoweringBase constructor (#168620 ) Currently LibcallLoweringInfo is defined inside of TargetLowering, which is owned by the subtarget. Pass in the subtarget so we can construct LibcallLoweringInfo with the subtarget. This is a temporary step that should be revertable in the future, after LibcallLoweringInfo is moved out of TargetLowering.	2025-11-19 19:18:13 +00:00
Jasmine Tang	672757bf55	[WebAssembly] Add patterns for extadd pairwise (#167960 ) Add a few patterns for extadd pairwise.	2025-11-18 02:41:16 -08:00
Hongyu Chen	63e6373efd	[WebAssembly] Truncate extra bits of large elements in BUILD_VECTOR (#167223 ) Fixes https://github.com/llvm/llvm-project/issues/165713 This patch handles out-of-bound vector elements and truncates extra bits.	2025-11-17 10:39:18 +00:00
Craig Topper	3e6442a516	[WebAssembly] Use MCRegister::id(). NFC (#167609 )	2025-11-11 18:31:43 -08:00
Hongyu Chen	9697f4b9e4	[WebAssembly][FastISel] Bail out on meeting non-integer type in selectTrunc (#167165 ) Fixes https://github.com/llvm/llvm-project/issues/165438 With `simd128` enabled, we may meet vector type truncation in FastISel. To respect #138479, this patch merely bails out on non-integer IR types, though I prefer bailing out for all non-simple types as most targets (X86, AArch64) do.	2025-11-12 04:33:41 +08:00
Matt Arsenault	11ab23c33d	CodeGen: Keep reference to TargetRegisterInfo in TargetInstrInfo (#158224 ) Both conceptually belong to the same subtarget, so it should not be necessary to pass in the context TargetRegisterInfo to any TargetInstrInfo member. Add this reference so those superfluous arguments can be removed. Most targets placed their TargetRegisterInfo as a member in TargetInstrInfo. A few had this owned by the TargetSubtargetInfo, so unify all targets to look the same.	2025-11-10 22:40:39 +00:00
serge-sans-paille	898d6fecf6	Remove unused <algorithm> inclusion (#166942 )	2025-11-10 10:30:37 +00:00
Sam Parker	d10a85167a	[WebAssembly] Implement more of getCastInstrCost (#164612 ) Fill out more information for sign and zero extend and add some truncate information; however, the primary change is to int/fp conversions. In particular, fp to (narrow) int appears to be relatively expensive.	2025-11-10 08:07:16 +00:00

1 2 3 4 5 ...

2231 Commits