llvm-project

Author	SHA1	Message	Date
Erick Ochoa Lopez	8dde305150	[mlir][vector] Add finer grained populate methods for multi_reduction (NFC). (#180750 ) Thiese commits add three more populate methods for `vector.multi_reduction`'s lowering patterns: * populateVectorMultiReductionTransformationPatterns * populateVectorMultiReductionFlatteningPatterns * populateVectorMultiReductionUnrollingPatterns These methods have a finer level of granularity and allow users to select between unrolling, flattening, and applying transformations that would set up operations for unrolling and flattening. The previous populateVectorMultiReductionLoweringPatterns method is rewritten in terms of these new methods.	2026-02-10 21:27:23 +00:00
Noah Prisament	e2b7cbf71f	Fix outdated docs with vector.reduce instead of vector.reduction (#178111 ) There is no existing `vector.reduce` op in the vector dialect, but multiple doc strings reference it. This change updates those instances to the correct `vector.reduction` op.	2026-02-02 17:52:04 +00:00
Han-Chung Wang	84c66f4f0d	[mlir][vector] Add assumeAligned mode to vector.store narrow type emulation (#178565 ) The revision adds a new `assumeAligned` mode to the emulation, so downstream projects can use simple path when it meets the requirements. E.g., if the offset is always aligned with container's element type, we can skip the check of front padding sizes. --------- Signed-off-by: hanhanW <hanhan0912@gmail.com>	2026-01-29 18:27:18 +00:00
Jakub Kuderski	9aaf0b89f5	[mlir] Apply clang-tidy check llvm-use-vector-utils. NFC. (#178526 )	2026-01-29 02:19:00 +00:00
Jakub Kuderski	59e44799bd	[mlir] Fix new clang-tidy warning llvm-type-switch-case-types. NFC. (#178487 ) Pre-commiting this before landing the new check in https://github.com/llvm/llvm-project/pull/177892	2026-01-28 19:13:47 +00:00
Artem Kroviakov	0926743e2e	[MLIR][XeGPU] Add uniform values distribution pattern (#176737 )	2026-01-26 21:23:31 +01:00
Prathamesh Tagore	32d46c9033	[mlir][vector] Fix masked load/store emulation for rank-0 memrefs (#173325 ) Added rank‑0 handling to masked load/store emulation by reinterpreting rank‑0 memrefs as 1‑D buffers with a synthetic index, preventing empty‑indices crashes. Fixes https://github.com/llvm/llvm-project/issues/131243	2026-01-05 17:56:09 +00:00
Longsheng Mou	a46cb15b42	[mlir][vector] Fix typo in `vector.contract` mnemonic (NFC) (#173661 )	2025-12-31 09:31:59 +08:00
Prathamesh Tagore	8e90208037	[mlir][vector] Skip vector mask elimination for funcs without body (#173330 )	2025-12-29 22:37:16 +00:00
Artem Kroviakov	aba8ebbda0	[MLIR][Vector] Add distribution pattern for `vector::ConstantMaskOp` (#172268 )	2025-12-16 17:24:13 +01:00
Nishant Patel	71ee84acc4	[MLIR][Vector] Add unroll pattern for vector.constant_mask (#171518 ) This PR adds unrolling for vector.constant_mask op based on the targetShape. Each unrolled vector computes its local mask size in each dimension (d) as: min(max(originalMaskSize[d] - offset[d], 0), unrolledMaskSize[d]).	2025-12-11 13:16:55 -08:00
Men-cotton	94ebcfd16d	[mlir][vector] Fix crash in ReorderCastOpsOnBroadcast with non-vector result (#170985 ) Fixes a crash in `ReorderCastOpsOnBroadcast` by ensuring the cast result is a `VectorType` before applying the pattern. A regression test has been added to mlir/test/Dialect/Vector/vector-sink.mlir. Fixes: #126371	2025-12-09 16:38:02 +00:00
Nishant Patel	7931e2fd52	[MLIR][Vector] Add unroll pattern for vector.create_mask (#169119 ) This PR adds unrolling for vector.create_mask op based on the targetShape. Each unrolled vector computes its local mask size in each dimension (d) as: min(max(originalMaskSize[d] - offset[d], 0), unrolledMaskSize[d]).	2025-12-03 13:38:17 -08:00
Andrzej Warzyński	cfda27d0fb	[mlir][Vector] Add support for scalable vectors to `ScanToArithOps` (#123117 ) Note, scalable reductions dims are left as a TODO.	2025-11-20 13:39:52 +00:00
Nishant Patel	af73aeaa19	[MLIR][Vector] Add unroll pattern for vector.shape_cast (#167738 ) This PR adds pattern for unrolling shape_cast given a targetShape. This PR is a follow up of #164010 which was very general and was using inserts and extracts on each element (which is also LowerVectorShapeCast.cpp is doing). After doing some more research on use cases, we (me and @Jianhui-Li ) realized that the previous version in #164010 is unnecessarily generic and doesn't fit our performance needs. Our use case requires that targetShape is contiguous in both source and result vector. This pattern only applies when contiguous slices can be extracted from the source vector and inserted into the result vector such that each slice remains in vector form with targetShape (and not decompose to scalars). In these cases, the unrolling proceeds as: vector.extract_strided_slice -> vector.shape_cast (on the slice unrolled) -> vector.insert_strided_slice	2025-11-19 16:16:44 -08:00
Ryutaro Okada	7e7ea9c535	[MLIR] Extend vector.scatter to accept tensor as base (#165548 ) This PR makes the following improvements to `vector.scatter` and its lowering pipeline: - In addition to `memref`, accept a ranked `tensor` as the base operand of `vector.scatter`, similar to `vector.transfer_write`. - Implement bufferization support for `vector.scatter`, so that tensor-based scatter ops can be fully lowered to memref-based forms. It's worth to complete the functionality of map_scatter decomposition. Full discussion can be found here: https://github.com/iree-org/iree/issues/21135 --------- Signed-off-by: Ryutaro Okada <1015ryu88@gmail.com>	2025-11-14 19:56:24 +00:00
Charitha Saumya	1e8834ea3a	[mlir][vector][xegpu] Accept uniform values in `getDistributedType` (#163887 ) Uniform values should not be distributed during vector distribution. Example would be a reduction result where reduction happens across lanes. However, current `getDistributedType` does not accept a zero result affine map (i.e. no distributed dims) when describing the distributed dimensions. This result in null type being returned and crashing the vector distribution in some cases. An example case would be a `scf.for` op (about to be distributed) in which one of the for result is a uniform value and it does not have a user outside the warp op. This necessitates querying the `getDistributedType` to figure our the distributed type of this value.	2025-10-22 08:41:41 -07:00
Jakub Kuderski	ae11c5c2c4	[mlir] Switch uses of deprecated .create methods to free function. NFC. (#164635 ) See https://discourse.llvm.org/t/psa-opty-create-now-with-100-more-tab-complete/87339.	2025-10-22 14:51:03 +00:00
James Newling	fe5b72a0e8	[mlir][Vector] Pattern to linearize broadcast (#163845 ) The PR https://github.com/llvm/llvm-project/pull/162167 removed a pattern to linearize vector.splat, without adding the equivalent pattern for vector.broadcast. This PR adds such a pattern, hopefully brining vector.broadcast up to full parity with vector.splat that has now been removed. --------- Signed-off-by: James Newling <james.newling@gmail.com>	2025-10-17 23:31:18 +00:00
Charitha Saumya	f7a5264890	[mlir][vector] Add support for yielding loop bounds in `scf.for` distribution. (#163443 ) In some cases, loop bounds (lower, upper and step) of `scf.for` can come locally from the parent warp op the `scf.for`. Current logic will not yield the loop bounds in the new warp op generated during lowering causing sinked `scf.for` to have non dominating use. In this PR, we have added logic to yield loop bounds by default (treat them as other operands of `scf.for`) which fixes this bug.	2025-10-17 09:07:17 -07:00
Nishant Patel	4ff8f118cc	[MLIR][Vector] Extend elementwise pattern to support unrolling from higher rank to lower rank (#162515 ) This PR enhances the elementwise unrolling pattern to support higher rank to lower rank unroll. The approach is to add leading unit dims to lower rank targetShape to match the rank of original vector (because ExtractStridedSlice requires same rank to extractSlices), extract slice, reshape to targetShape's rank and perform the operation.	2025-10-15 06:51:23 -07:00
Artem Kroviakov	0a71fd1528	[MLIR][Vector] Improve warp distribution robustness (#161647 )	2025-10-15 10:52:24 +02:00
Jakub Kuderski	0820266651	[mlir] Use llvm accumulate wrappers. NFCI. (#162957 ) Use wrappers around `std::accumulate` to make the code more concise and less bug-prone: https://github.com/llvm/llvm-project/pull/162129. With `std::accumulate`, it's the initial value that determines the accumulator type. `llvm::sum_of` and `llvm::product_of` pick the right accumulator type based on the range element type. Found some funny bugs like a local accumulate helper that calculated a sum with initial value of 1 -- we didn't hit the bug because the code was actually dead...	2025-10-11 11:33:18 -04:00
James Newling	ea291d0e8c	[MLIR][Vector] Remove vector.splat (#162167 ) vector.splat has been deprecated (user: please use the very similar vector.broadcast instead) with the last PR landing about 6 weeks ago. The discourse discussion is at https://discourse.llvm.org/t/rfc-mlir-vector-deprecate-then-remove-vector-splat/87143/1 The last PR was #152230 This PR completely removes vector.splat. In addition to removing vector.splat from VectorOps.td, it - Updates the few remaining places where vector::SplatOp is created (now vector::BroadcastOp is created) - Removes temporary patterns where vector.splat is replaced by vector.broadcast The only place 'vector.splat' appears is now the files https://github.com/llvm/llvm-project/blob/main/mlir/utils/tree-sitter-mlir/test/corpus/op.txt and https://github.com/llvm/llvm-project/blob/main/mlir/utils/tree-sitter-mlir/dialect/vector.js --------- Signed-off-by: James Newling <james.newling@gmail.com>	2025-10-10 09:58:18 -07:00
Mehdi Amini	eabfed8690	[MLIR] Apply clang-tidy fixes for modernize-use-bool-literals in VectorEmulateNarrowType.cpp (NFC)	2025-10-03 11:04:58 -07:00
Jakub Kuderski	3960ff6ca0	[mlir][vector] Simplify op rewrite pattern inheriting constructors. NFC. (#161670 ) Use the `Base` type alias from https://github.com/llvm/llvm-project/pull/158433.	2025-10-02 19:07:25 -04:00
jiang1997	d8a8d1fc56	[MLIR][MemRef] Change builders with `int` alignment params to `llvm::MaybeAlign` (#159449 ) Change remaining OpBuilder methods to use `llvm::MaybeAlign` instead of `uint64_t` for alignment parameters. --------- Co-authored-by: Erick Ochoa Lopez <erick.ochoalopez@amd.com>	2025-09-29 09:25:49 -04:00
Erick Ochoa Lopez	8b9c70dcdb	[mlir] Move vector.{to_elements,from_elements} unrolling to `VectorUnroll.cpp` (#159118 ) This PR moves the patterns that unroll vector.to_elements and vector.from_elements into the file with other vector unrolling operations. This PR also adds these unrolling patterns into the `populateVectorUnrollPatterns`. And renames `populateVectorToElementsLoweringPatterns` `populateVectorFromElementsLoweringPatterns` to `populateVectorToElementsUnrollPatterns` `populateVectorFromElementsUnrollPatterns`.	2025-09-18 12:03:54 -04:00
Jan Patrick Lehr	50f81531b4	[MLIR] Fix compilation after #157771 (#159257 ) The original PR broke pretty much all our bots. Apologies for the noise in the previous PR.	2025-09-17 09:13:44 +02:00
Diego Caballero	7bdd88c1e3	[mlir][Vector] Add patterns to lower `vector.shuffle` (#157611 ) This PR adds patterns to lower `vector.shuffle` with inputs with different vector sizes more efficiently. The current LLVM lowering for these cases degenerates to a sequence of `vector.extract` and `vector.insert` operations. With this PR, the smaller input is promoted to larger vector size by introducing an extra `vector.shuffle`.	2025-09-16 18:01:30 -07:00
Nishant Patel	0e5c32bd6d	[MLIR][Vector] Add unrolling pattern for vector StepOp (#157752 ) This PR adds unrolling pattern for vector.step op to VectorUnroll transform.	2025-09-16 15:33:08 -07:00
Alan Li	4a094095a4	[MLIR] Make 1-D memref flattening a prerequisite for vector narrow type emulation (#157771 ) Addresses: https://github.com/llvm/llvm-project/issues/115653 We already have utilities to flatten memrefs into 1-D. This change makes memref flattening a prerequisite for vector narrow type emulation, ensuring that emulation patterns only need to handle 1-D scenarios.	2025-09-16 20:43:20 +00:00
Andrzej Warzyński	1287ed1fa2	[mlir][vector] Use `source` as the source argument name (#158258 ) This patch updates the following ops to use `source` (instead of `vector`) as the name for their source argument: * `vector.extract` * `vector.scalable.extract` * `vector.extract_strided_slice` This change ensures naming consistency with the "builders" for these Ops that already use the name `source` rather than `vector`. It also addresses part of: * https://github.com/llvm/llvm-project/issues/131602 Specifically, it ensures that we use `source` and `dest` for read and write operations, respectively (as opposed to `vector` and `dest`).	2025-09-15 21:18:26 +01:00
Erick Ochoa Lopez	b812e3d61a	[mlir][vector] Add LinearizeVectorToElements (#157740 ) Co-authored-by: James Newling <james.newling@gmail.com>	2025-09-11 13:58:42 -04:00
Erick Ochoa Lopez	9d19250610	[mlir][vector] Add vector.to_elements unrolling (#157142 ) This PR adds support for unrolling `vector.to_element`'s source operand. It transforms ```mlir %0:8 = vector.to_elements %v : vector<2x2x2xf32> ``` to ```mlir %v0 = vector.extract %v[0] : vector<2x2xf32> from vector<2x2x2xf32> %v1 = vector.extract %v[1] : vector<2x2xf32> from vector<2x2x2xf32> %0:4 = vector.to_elements %v0 : vector<2x2xf32> %1:4 = vector.to_elements %v1 : vector<2x2xf32> // %0:8 = %0:4 - %1:4 ``` This pattern will be applied until there are only 1-D vectors left. --------- Signed-off-by: hanhanW <hanhan0912@gmail.com> Co-authored-by: hanhanW <hanhan0912@gmail.com> Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>	2025-09-11 13:56:57 -04:00
Artem Kroviakov	7f007b572d	[MLIR][Vector] Add warp distribution for `scf.if` (#157119 ) This PR adds `scf.if` op distribution to the existing `VectorDistribute` patterns. The logic mostly follows that of `scf.for`: move op outside, wrap each branch with `gpu.warp_execute_on_lane_0`. A notable difference to `scf.for` is that each branch has its own set of escaping values, and `scf.if` itself does not have block arguments.	2025-09-10 13:33:26 -07:00
Erick Ochoa Lopez	c5e6d56f01	[mlir][vector] Propagate alignment when emulating masked{load,stores}. (#155648 ) Propagate alignment from `vector.maskedload` and `vector.maskedstore` to `memref.load` and `memref.store` during `VectorEmulateMaskedLoadStore` pass.	2025-09-05 13:48:40 +00:00
Erick Ochoa Lopez	250d2517b2	[mlir][vector] Propagate alignment in LowerVectorGather. (#155683 ) Alignment is properly propagated when patterns `UnrollGather`, `RemoveStrideFromGatherSource`, or `Gather1DToConditionalLoads` are applied.	2025-09-05 13:43:18 +00:00
Mehdi Amini	8edb5b4fb3	[MLIR] Add LDBG() tracing to VectorTransferOpTransforms.cpp (NFC) Had to debug an issue, more debugging helped here.	2025-09-03 15:32:22 -07:00
Mehdi Amini	9bb9206d8e	[MLIR] Apply clang-tidy fixes for readability-container-size-empty in VectorTransforms.cpp (NFC)	2025-08-31 02:09:27 -07:00
Artem Kroviakov	f2e6ca805d	[MLIR][Vector] Add warp distribution for `vector.step` op (#155425 ) This PR adds a distribution pattern for [`vector.step`](https://mlir.llvm.org/docs/Dialects/Vector/#vectorstep-vectorstepop) op. The result of the step op is a vector containing a sequence `[0,1,...,N-1]`. For the warp distribution, we consider a vector with `N == warp_size` (think SIMD). Distributing it to SIMT, means that each lane is represented by a thread/lane id scalar. More complex cases with the support for warp size multiples (e.g., `[0,1,...,2*N-1]`) require additional layout information to be handled properly. Such support may be added later. The lane id scalar is wrapped into a `vector<1xindex>` to emulate the sequence distribution result. Other than that, the distribution is similar to that of `arith.constant`.	2025-08-28 09:24:01 -07:00
Yang Bai	5fdd3a12e5	[mlir][vector] Follow-up improvements for multi-dimensional vector.from_elements support (#154664 ) This PR is a follow-up to #151175 that supported lowering multi-dimensional `vector.from_elements` op to LLVM by introducing a unrolling pattern. ## Changes ### Add `vector.shape_cast` based flattening pattern for `vector.from_elements` This change introduces a new linearization pattern that uses `vector.shape_cast` to flatten multi-dimensional `vector.from_elements` operations. This provides an alternative approach to the unrolling-based method introduced in #151175. Example: ```mlir // Before %v = vector.from_elements %e0, %e1, %e2, %e3 : vector<2x2xf32> // After %flat = vector.from_elements %e0, %e1, %e2, %e3 : vector<4xf32> %result = vector.shape_cast %flat : vector<4xf32> to vector<2x2xf32> ``` --------- Co-authored-by: Yang Bai <yangb@nvidia.com> Co-authored-by: James Newling <james.newling@gmail.com>	2025-08-27 21:41:06 -07:00
Andrzej Warzyński	613ec4c24c	[mlir][vector] Rename gather/scatter arguments (nfc) (#153640 ) Renames `indices` as `offsets` and `index_vec` as `indices`. This is primarily to make clearer distinction between the arguments.	2025-08-25 17:02:11 +01:00
Adam Siemieniuk	533ddcd989	[mlir][gpu] Warp execute terminator getter (#154729 ) Adds a utility getter to `warp_execute_on_lane_0` which simplifies access to the op's terminator. Uses are refactored to utilize the new terminator getter.	2025-08-22 18:24:23 +02:00
donald chen	5af7263d42	[mlir] add getViewDest method to viewLikeOpInterface (#154524 ) The viewLikeOpInterface abstracts the behavior of an operation view one buffer as another. However, the current interface only includes a "getViewSource" method and lacks a "getViewDest" method. Previously, it was generally assumed that viewLikeOpInterface operations would have only one return value, which was the view dest. This assumption was broken by memref.extract_strided_metadata, and more operations may break these silent conventions in the future. Calling "viewLikeInterface->getResult(0)" may lead to a core dump at runtime. Therefore, we need 'getViewDest' method to standardize our behavior. This patch adds the getViewDest function to viewLikeOpInterface and modifies the usage points of viewLikeOpInterface to standardize its use.	2025-08-21 20:09:52 +08:00
Md Asghar Ahmad Shahid	c24c23d9ab	[NFC][mlir][vector] Handle potential static cast assertion. (#152957 ) In FoldArithToVectorOuterProduct pattern, static cast to vector type causes assertion when a scalar type was encountered. It seems the author meant to have a dyn_cast instead. This NFC patch handles it by using dyn_cast.	2025-08-19 09:27:20 +05:30
Yang Bai	4eb1a07d7d	[mlir][vector] Support multi-dimensional vectors in VectorFromElementsLowering (#151175 ) This patch introduces a new unrolling-based approach for lowering multi-dimensional `vector.from_elements` operations. Implementation Details: 1. New Transform Pattern: Added `UnrollFromElements` that unrolls a N-D(N>=2) from_elements op to a (N-1)-D from_elements op align the outermost dimension. 2. Utility Functions: Added `unrollVectorOp` to reuse the unroll algo of vector.gather for vector.from_elements. 3. Integration: Added the unrolling pattern to the convert-vector-to-llvm pass as a temporal transformation. 4. Use direct LLVM dialect operations instead of intermediate vector.insert operations for efficiency in `VectorFromElementsLowering`. Example: ```mlir // unroll %v = vector.from_elements %e0, %e1, %e2, %e3 : vector<2x2xf32> => %poison_2d = ub.poison : vector<2x2xf32> %vec_1d_0 = vector.from_elements %e0, %e1 : vector<2xf32> %vec_2d_0 = vector.insert %vec_1d_0, %poison_2d [0] : vector<2xf32> into vector<2x2xf32> %vec_1d_1 = vector.from_elements %e2, %e3 : vector<2xf32> %result = vector.insert %vec_1d_1, %vec_2d_0 [1] : vector<2xf32> into vector<2x2xf32> // convert-vector-to-llvm %v = vector.from_elements %e0, %e1, %e2, %e3 : vector<2x2xf32> => %poison_2d = ub.poison : vector<2x2xf32> %poison_2d_cast = builtin.unrealized_conversion_cast %poison_2d : vector<2x2xf32> to !llvm.array<2 x vector<2xf32>> %poison_1d_0 = llvm.mlir.poison : vector<2xf32> %c0_0 = llvm.mlir.constant(0 : i64) : i64 %vec_1d_0_0 = llvm.insertelement %e0, %poison_1d_0[%c0_0 : i64] : vector<2xf32> %c1_0 = llvm.mlir.constant(1 : i64) : i64 %vec_1d_0_1 = llvm.insertelement %e1, %vec_1d_0_0[%c1_0 : i64] : vector<2xf32> %vec_2d_0 = llvm.insertvalue %vec_1d_0_1, %poison_2d_cast[0] : !llvm.array<2 x vector<2xf32>> %poison_1d_1 = llvm.mlir.poison : vector<2xf32> %c0_1 = llvm.mlir.constant(0 : i64) : i64 %vec_1d_1_0 = llvm.insertelement %e2, %poison_1d_1[%c0_1 : i64] : vector<2xf32> %c1_1 = llvm.mlir.constant(1 : i64) : i64 %vec_1d_1_1 = llvm.insertelement %e3, %vec_1d_1_0[%c1_1 : i64] : vector<2xf32> %vec_2d_1 = llvm.insertvalue %vec_1d_1_1, %vec_2d_0[1] : !llvm.array<2 x vector<2xf32>> %result = builtin.unrealized_conversion_cast %vec_2d_1 : !llvm.array<2 x vector<2xf32>> to vector<2x2xf32> ``` --------- Co-authored-by: Nicolas Vasilache <Nico.Vasilache@amd.com> Co-authored-by: Yang Bai <yangb@nvidia.com> Co-authored-by: James Newling <james.newling@gmail.com> Co-authored-by: Diego Caballero <dieg0ca6aller0@gmail.com>	2025-08-18 10:09:12 -07:00
Matthias Springer	21b607adbe	[mlir][SCF] `scf.for`: Add support for unsigned integer comparison (#153379 ) Add a new unit attribute to allow for unsigned integer comparison. Example: ```mlir scf.for unsigned %iv_32 = %lb_32 to %ub_32 step %step_32 : i32 { // body } ``` Discussion: https://discourse.llvm.org/t/scf-should-scf-for-support-unsigned-comparison/84655	2025-08-15 10:59:14 +02:00
James Newling	671eaf84b3	[mlir][vector] Avoid use of vector.splat in transforms (#150279 ) This is part of vector.splat deprecation Reference: https://discourse.llvm.org/t/rfc-mlir-vector-deprecate-then-remove-vector-splat/87143/5 Instead of creating vector::SplatOp, create vector::BroadcastOp	2025-07-31 06:28:01 -07:00
Krzysztof Drewniak	9a46091d4b	[mlir][Vector] Allow elementwise/broadcast swap to handle mixed types (#151274 ) This patch extends the operation that rewrites elementwise operations whose inputs are all broadcast from the same shape to handle mixed-types, such as when the result and input types don't match, or when the inputs have multiple types. PR #150867 failed to check for the possibility of type mismatches when rewriting splat constants. In order to fix that issue, we add support for mixed-type operations more generally.	2025-07-30 09:36:37 -07:00

1 2 3 4 5 ...

585 Commits