llvm-project

Author	SHA1	Message	Date
Ryutaro Okada	7e7ea9c535	[MLIR] Extend vector.scatter to accept tensor as base (#165548 ) This PR makes the following improvements to `vector.scatter` and its lowering pipeline: - In addition to `memref`, accept a ranked `tensor` as the base operand of `vector.scatter`, similar to `vector.transfer_write`. - Implement bufferization support for `vector.scatter`, so that tensor-based scatter ops can be fully lowered to memref-based forms. It's worth to complete the functionality of map_scatter decomposition. Full discussion can be found here: https://github.com/iree-org/iree/issues/21135 --------- Signed-off-by: Ryutaro Okada <1015ryu88@gmail.com>	2025-11-14 19:56:24 +00:00
Lee Wei	993a38fa53	[MLIR][Affine] Extend getVectorReductionOp to support xor/maxnumf/minnumf (#163310 ) This PR extends the `getVectorReductionOp` function, which is used by the affine vectorizer, to also recognize and support `xor/maxnumf/minnumf` reduction operations.	2025-11-04 12:52:51 +08:00
Haocong Lu	37cf1fc62c	[mlir][vector] Fix missed `return` in ExtractStridedSliceOp::fold (#165669 ) Fix missed `return` when folding splat ConstantOp, it could work well probably because of good compatibility of `foldExtractStridedSliceNonSplatConstant`.	2025-10-31 16:43:25 +08:00
Keshav Vinayak Jha	00092f9bdd	[MLIR] [Vector] Added canonicalizer for folding from_elements + transpose (#161841 ) ## Description Adds a new canonicalizer that folds `vector.from_elements(vector.transpose))` => `vector.from_elements`. This canonicalization reorders the input elements for `vector.from_elements`, adjusts the output shape to match the effect of the transpose op and eliminating its need. ## Testing Added a 2D vector lit test that verifies the working of the rewrite. --------- Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>	2025-10-21 14:09:57 +05:30
James Newling	bea77ed52e	[mlir][Vector] Fold vector.step compared to constant (#161615 ) This PR adds a canonicalizer to vector.step that folds vector.step iff the result of the fold is a splat value. An alternative would be to always constant fold it, but that might result in some very large/cumbersome constants. I do wonder if vector.step might be better represented as some sort of attribute in the arith dialect, like %step = arith.constant iota<32> : vector<32xindex>. --------- Signed-off-by: James Newling <james.newling@gmail.com>	2025-10-15 08:40:30 -07:00
Jakub Kuderski	0820266651	[mlir] Use llvm accumulate wrappers. NFCI. (#162957 ) Use wrappers around `std::accumulate` to make the code more concise and less bug-prone: https://github.com/llvm/llvm-project/pull/162129. With `std::accumulate`, it's the initial value that determines the accumulator type. `llvm::sum_of` and `llvm::product_of` pick the right accumulator type based on the range element type. Found some funny bugs like a local accumulate helper that calculated a sum with initial value of 1 -- we didn't hit the bug because the code was actually dead...	2025-10-11 11:33:18 -04:00
James Newling	ea291d0e8c	[MLIR][Vector] Remove vector.splat (#162167 ) vector.splat has been deprecated (user: please use the very similar vector.broadcast instead) with the last PR landing about 6 weeks ago. The discourse discussion is at https://discourse.llvm.org/t/rfc-mlir-vector-deprecate-then-remove-vector-splat/87143/1 The last PR was #152230 This PR completely removes vector.splat. In addition to removing vector.splat from VectorOps.td, it - Updates the few remaining places where vector::SplatOp is created (now vector::BroadcastOp is created) - Removes temporary patterns where vector.splat is replaced by vector.broadcast The only place 'vector.splat' appears is now the files https://github.com/llvm/llvm-project/blob/main/mlir/utils/tree-sitter-mlir/test/corpus/op.txt and https://github.com/llvm/llvm-project/blob/main/mlir/utils/tree-sitter-mlir/dialect/vector.js --------- Signed-off-by: James Newling <james.newling@gmail.com>	2025-10-10 09:58:18 -07:00
Keshav Vinayak Jha	805451faed	[MLIR][Vector] Added ToElementsOp::fold for broadcast->to_elements pattern rewrite. (#160318 ) Adds `::fold` for the new `vector.to_elements` op, folding `broadcast` into `to_elements` or no-op wherever possible. --------- Signed-off-by: keshavvinayak01 <keshavvinayakjha@gmail.com> Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com> Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>	2025-10-08 08:13:46 -04:00
Jakub Kuderski	3960ff6ca0	[mlir][vector] Simplify op rewrite pattern inheriting constructors. NFC. (#161670 ) Use the `Base` type alias from https://github.com/llvm/llvm-project/pull/158433.	2025-10-02 19:07:25 -04:00
Jhalak Patel	ec179f2269	[mlir][vector]: Extend convertIntegerAttr to handle float-to-integer conversion (#159627 ) Fixes #159613 `vector.from_elements` crashes when processing float attributes with integer result types (e.g., `llvm.mlir.constant(0.0 : f8E4M3FN) : i8` from arith-to-llvm lowering): ``` Assertion `newType.getElementType() == curType.getElementType() && "expected the same element type"' failed. ``` ## Implementation - Rename `convertIntegerAttr` → `convertNumericAttr` - Add float-to-integer conversion using `APFloat::convertToInteger()` with exactness assertion - Preserve existing integer-to-integer conversion behavior Only implements conversions that pass LLVM verification. Other patterns (int→float, float→float, bool→int) are rejected by LLVM verifier before reaching this code, as documented in the attached verification failures.	2025-09-25 15:23:49 +08:00
Fabian Mora	077a796c0d	[mlir] Implement a memory-space cast bubbling-down transform (#159454 ) This commit adds functionality to bubble down memory-space casts operations, allowing consumer operations to use the original memory-space rather than first casting to a different memory space. Changes: - Introduce `MemorySpaceCastOpInterface` to handle memory-space cast operations - Create a `MemorySpaceCastConsumerOpInterface` pass that identifies and bubbles down eligible casts - Add implementation for memref and vector operations to handle memory-space cast propagation - Add `bubbleDownCasts` method to relevant operations to support the fusion In particular, in the current implementation only memory-space casts into the default memory-space can be bubbled-down. Example: ```mlir func.func @op_with_cast_sequence(%arg0: memref<4x4xf32, 1>, %arg1: index, %arg2: f32) -> memref<16xf32> { %memspacecast = memref.memory_space_cast %arg0 : memref<4x4xf32, 1> to memref<4x4xf32> %c0 = arith.constant 0 : index %c4 = arith.constant 4 : index %expanded = memref.expand_shape %memspacecast [[0], [1, 2]] output_shape [4, 2, 2] : memref<4x4xf32> into memref<4x2x2xf32> %collapsed = memref.collapse_shape %expanded [[0, 1, 2]] : memref<4x2x2xf32> into memref<16xf32> %loaded = memref.load %collapsed[%c0] : memref<16xf32> %added = arith.addf %loaded, %arg2 : f32 memref.store %added, %collapsed[%c0] : memref<16xf32> %atomic_result = memref.atomic_rmw addf %arg2, %collapsed[%c4] : (f32, memref<16xf32>) -> f32 return %collapsed : memref<16xf32> } // mlir-opt --bubble-down-memory-space-casts func.func @op_with_cast_sequence(%arg0: memref<4x4xf32, 1>, %arg1: index, %arg2: f32) -> memref<16xf32> { %c4 = arith.constant 4 : index %c0 = arith.constant 0 : index %expand_shape = memref.expand_shape %arg0 [[0], [1, 2]] output_shape [4, 2, 2] : memref<4x4xf32, 1> into memref<4x2x2xf32, 1> %collapse_shape = memref.collapse_shape %expand_shape [[0, 1, 2]] : memref<4x2x2xf32, 1> into memref<16xf32, 1> %memspacecast = memref.memory_space_cast %collapse_shape : memref<16xf32, 1> to memref<16xf32> %0 = memref.load %collapse_shape[%c0] : memref<16xf32, 1> %1 = arith.addf %0, %arg2 : f32 memref.store %1, %collapse_shape[%c0] : memref<16xf32, 1> %2 = memref.atomic_rmw addf %arg2, %collapse_shape[%c4] : (f32, memref<16xf32, 1>) -> f32 return %memspacecast : memref<16xf32> } ``` --------- Signed-off-by: Fabian Mora <fabian.mora-cordero@amd.com> Co-authored-by: Mehdi Amini <joker.eph@gmail.com>	2025-09-24 09:11:43 -04:00
Jhalak Patel	339450fa80	[mlir][vector] Fix crash in vector.from_elements folding with poison values (#158528 ) The vector.from_elements constant folding was crashing when poison values were present in the element list. The convertIntegerAttr function was not properly handling poison attributes, leading to assertion failures in dyn_cast operations. This patch refactors convertIntegerAttr to take IntegerAttr directly, moving poison detection to the caller using explicit isa<ub::PoisonAttrInterface> checks. The function signature change provides compile-time type safety while the early poison validation in foldFromElementsToConstant prevents unsafe casting operations. The folding now gracefully aborts when poison attributes are encountered, preventing the crash while preserving correct folding for legitimate mixed-type constants (int/float). Fixes assertion: "dyn_cast on a non-existent value" when processing ub.poison values in vector.from_elements operations.	2025-09-18 17:13:58 -07:00
Andrzej Warzyński	1287ed1fa2	[mlir][vector] Use `source` as the source argument name (#158258 ) This patch updates the following ops to use `source` (instead of `vector`) as the name for their source argument: * `vector.extract` * `vector.scalable.extract` * `vector.extract_strided_slice` This change ensures naming consistency with the "builders" for these Ops that already use the name `source` rather than `vector`. It also addresses part of: * https://github.com/llvm/llvm-project/issues/131602 Specifically, it ensures that we use `source` and `dest` for read and write operations, respectively (as opposed to `vector` and `dest`).	2025-09-15 21:18:26 +01:00
Jakub Kuderski	2ed3f49c49	[mlir] Use free op create functions. NFC. (#157374 ) The builder create methods are deprecated: https://mlir.llvm.org/deprecation/. See https://discourse.llvm.org/t/psa-opty-create-now-with-100-more-tab-complete/87339.	2025-09-07 22:13:20 -04:00
Matthias Springer	93b05dd076	[mlir][vector] Fix crashes in `from_elements` folder + `broadcast` verifier (#155393 ) This PR fixes two crashes / failures. 1. The `vector.broadcast` verifier did not take into account `VectorElementTypeInterface` and was looking for int/index/float types. 2. The `vector.from_elements` folder attempted to create an invalid `DenseElementsAttr`. Only int/float/index/complex types are supported.	2025-08-26 12:49:22 +02:00
Andrzej Warzyński	613ec4c24c	[mlir][vector] Rename gather/scatter arguments (nfc) (#153640 ) Renames `indices` as `offsets` and `index_vec` as `indices`. This is primarily to make clearer distinction between the arguments.	2025-08-25 17:02:11 +01:00
Yang Bai	b4c31dc98d	[mlir][Vector] add vector.insert canonicalization pattern to convert a chain of insertions to vector.from_elements (#142944 ) ## Description This change introduces a new canonicalization pattern for the MLIR Vector dialect that optimizes chains of insertions. The optimization identifies when a vector is completely initialized through a series of vector.insert operations and replaces the entire chain with a single `vector.from_elements` operation. Please be aware that the new pattern doesn't work for poison vectors where only some elements are set, as MLIR doesn't support partial poison vectors for now. New Pattern: InsertChainFullyInitialized * Detects chains of vector.insert operations. * Validates that all insertions are at static positions, and all intermediate insertions have only one use. * Ensures the entire vector is completely initialized. * Replaces the entire chain with a single vector.from_elementts operation. Refactored Helper Function * Extracted `calculateInsertPosition` from `foldDenseElementsAttrDestInsertOp` to avoid code duplication. ## Example ``` // Before: %v1 = vector.insert %c10, %v0[0] : i64 into vector<2xi64> %v2 = vector.insert %c20, %v1[1] : i64 into vector<2xi64> // After: %v2 = vector.from_elements %c10, %c20 : vector<2xi64> ``` It also works for multidimensional vectors. ``` // Before: %v1 = vector.insert %cv0, %v0[0] : vector<3xi64> into vector<2x3xi64> %v2 = vector.insert %cv1, %v1[1] : vector<3xi64> into vector<2x3xi64> // After: %0:3 = vector.to_elements %arg1 : vector<3xi64> %1:3 = vector.to_elements %arg2 : vector<3xi64> %v2 = vector.from_elements %0#0, %0#1, %0#2, %1#0, %1#1, %1#2 : vector<2x3xi64> ``` --------- Co-authored-by: Yang Bai <yangb@nvidia.com> Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>	2025-08-19 13:43:31 +01:00
Matthias Springer	ef2b8805bf	[mlir][vector] Implement `InferTypeOpInterface` on `vector.to_elements` (#153172 ) Just for convenience. This auto-generates an additional builder that infers the result type.	2025-08-12 15:15:30 +02:00
Min-Yih Hsu	b4e8b8ee91	[mlir][vector] Canonicalize broadcast of shape_cast (#150523 ) Fold `broadcast(shape_cast(x))` into `broadcast(x)` if the type of x is compatible with broadcast's result type and the shape_cast only adds or removes ones in the leading dimensions. --------- Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com> Co-authored-by: James Newling <james.newling@gmail.com>	2025-08-08 09:25:32 -07:00
James Newling	a96d8aed98	[mlir][vector] vector.splat and vector.broadcast folding/canonicalizing parity (#150284 ) This PR ensures parity in folding/canonicalizing of vector.broadcast (from a scalar) and vector.splat. This means that by using vector.broadcast instead of vector.splat (which is currently deprecated), there is no loss in optimizations performed. All tests which were previously checking folding/canonicalizing of vector.splat are now done for vector.broadcast. The vector.splat canonicalization tests are now in a separate file, ready for removal when, in the future, we remove vector.splat completely. This PR also adds a canonicalizer to vector.splat to always convert it to vector.broadcast. This is to reduce the 'traffic' through vector.splat. There is a chance that this PR will break downstream users who create/expect for vector.splat. Changing all such logic to work just vector.broadcast instead should fix.	2025-08-01 08:57:38 -07:00
James Newling	0f35244816	[mlir][vector] shape_cast(constant) -> constant fold for non-splats (#145539 ) The folder `shape_cast(splat constant) -> splat constant` was first introduced [here](`36480657d8 (diff-484cea976e0c96459027c951733bf2d22d34c5a0c0de6f577069870ef4588983R2600)`) (Nov 2020). In that commit there is a comment to _Only handle splat for now_. Based on that I assume the intention was to, at a later time, support a general `shape_cast(constant) -> constant` folder. That is what this PR does One minor downside: It is possible with this folder end up with, instead of 1 large constant and 1 shape_cast, 2 large constants: ```mlir func.func @foo() -> (vector<4xi32>, vector<2x2xi32>) { %cst = arith.constant dense<[1, 2, 3, 4]> : vector<4xi32> # 'large' constant 1 %0 = vector.shape_cast %cst : vector<4xi32> to vector<2x2xi32> return %cst, %0 : vector<4xi32>, vector<2x2xi32> } ``` gets folded with this new folder to ```mlir func.func @foo() -> (vector<4xi32>, vector<2x2xi32>) { %cst = arith.constant dense<[1, 2, 3, 4]> : vector<4xi32> # 'large' constant 1 %cst_0 = arith.constant dense<[[1, 2], [3, 4]]> : vector<2x2xi32> # 'large' constant 2 return %cst, %cst_0 : vector<4xi32>, vector<2x2xi32> } ``` Notes on the above case: 1) This only effects the textual IR, the actual values share the same context storage (I've verified this by checking pointer values in the `DenseIntOrFPElementsAttrStorage` [constructor](`da5c442550/mlir/lib/IR/AttributeDetail.h (L59)`)) so no compile-time memory overhead to this folding. At the LLVM IR level the constant is shared, too. 2) This only happens when the pre-folded constant cannot be dead code eliminated (i.e. when it has 2+ uses) which I don't think is common.	2025-07-31 12:12:53 -07:00
Max191	91e0055c7c	[mlir] Implement inferResultRanges for vector.step op (#151536 ) Implements the `inferResultRanges` method from the `InferIntRangeInterface` interface for `vector.step`. The implementation is similar to that of arith.constant, since the exact result values are statically known. Signed-off-by: Max Dawkins <max.dawkins@gmail.com>	2025-07-31 10:35:26 -07:00
Max191	69751196a9	[mlir] Implement inferResultRanges for vector.transpose (#151537 ) Implements the `inferResultRanges` method from the `InferIntRangeInterface` interface for `vector.transpose`. The result ranges simply match the source ranges. Signed-off-by: Max Dawkins <max.dawkins@gmail.com>	2025-07-31 09:32:50 -07:00
Diego Caballero	33465bb2bb	[mlir][Vector] Remove `vector.extractelement` and `vector.insertelement` ops (#149603 ) This PR removes `vector.extractelement` and `vector.insertelement` ops from the code base in favor of the `vector.extract` and `vector.insert` counterparts. See RFC: https://discourse.llvm.org/t/rfc-psa-remove-vector-extractelement-and-vector-insertelement-ops-in-favor-of-vector-extract-and-vector-insert-ops	2025-07-28 11:01:14 -07:00
Maksim Levental	fcbcfe44cf	[mlir][NFC] update `mlir/Dialect` create APIs (32/n) (#150657 ) See https://github.com/llvm/llvm-project/pull/147168 for more info.	2025-07-25 13:50:15 -05:00
Longsheng Mou	f047b735e9	[mlir][NFC] Use `getDefiningOp<OpTy>()` instead of `dyn_cast<OpTy>(getDefiningOp())` (#150428 ) This PR uses `val.getDefiningOp<OpTy>()` to replace `dyn_cast<OpTy>(val.getDefiningOp())` , `dyn_cast_or_null<OpTy>(val.getDefiningOp())` and `dyn_cast_if_present<OpTy>(val.getDefiningOp())`.	2025-07-25 10:35:51 +08:00
Longsheng Mou	8937b61f21	[mlir][vector] Fix cast incompatible type bug in `ShuffleOp::fold` (#150037 ) This PR uses `dyn_cast` instead of `cast` to avoid a crash when the constant attribute is not a `DenseElementsAttr`. Fixes #149325.	2025-07-23 11:37:22 +08:00
Maksim Levental	f904cdd6c3	[mlir][NFC] update `mlir/Dialect` create APIs (24/n) (#149931 ) See https://github.com/llvm/llvm-project/pull/147168 for more info.	2025-07-22 08:16:15 -04:00
James Newling	abce4e9ad0	[mlir][vector] Folder: shape_cast(extract) -> extract (#146368 ) In a later PR more shape_cast ops will appear. Specifically, broadcasts that just prepend ones become shape_cast ops (i.e. volume preserving broadcasts are canonicalized to shape_casts). This PR ensures that broadcast-like shape_cast ops fold at least as well as broadcast ops. This is done by modifying patterns that target broadcast ops, to target 'broadcast-like' ops. No new patterns are added, the patterns that exist are just made to match on shape_casts where appropriate. This PR also includes minor code simplifications: use `isBroadcastableTo` to simplify `ExtractOpFromBroadcast` and simplify how broadcast dims are detected in `foldExtractFromBroadcast`. These are NFC. --------- Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>	2025-07-21 11:12:50 -07:00
Tomás Longeri	5d367080a8	[MLIR][Vector] Fix bug in ExtractStrideSlicesOp canonicalization (#147591 ) The pattern would produce an invalid slice when some dimensions were both sliced and broadcast.	2025-07-16 08:52:35 +01:00
Kazu Hirata	c06d3a7b72	[mlir] Remove unused includes (NFC) (#148769 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-14 22:19:23 -07:00
Kunwar Grover	0227aef688	[mlir][Vector] Add canonicalization for extract_strided_slice(create_mask) (#146745 ) extract_strided_slice(create_mask) can be folded into create_mask by simply subtracting the offsets from the bounds.	2025-07-10 15:43:20 +01:00
lonely eagle	517cda12e5	[mlir][vector] Add foldInsertUseChain folder function to insert op (#147045 ) When the result of an insert op is used by an insert op, and the subsequent insert op is inserted at the same location as the previous insert op, replaces the dest of the subsequent insert op with the dest of the previous insert op.This is because the previous insert op does not affect subsequent insert ops. --------- Co-authored-by: Mehdi Amini <joker.eph@gmail.com> Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>	2025-07-08 22:39:18 +08:00
Jakub Kuderski	6512ca7ddb	[mlir] Add `isStatic`* size check for `ShapedType`s. NFCI. (#147085 ) The motivation is to avoid having to negate `isDynamic*` checks, avoid double negations, and allow for `ShapedType::isStaticDim` to be used in ADT functions without having to wrap it in a lambda performing the negation. Also add the new functions to C and Python bindings.	2025-07-07 14:57:27 -04:00
Kunwar Grover	5eb195fa90	[mlir][Vector] Fold vector.constant_mask to SplatElementsAttr (#146724 ) Adds a folder to vector.constant_mask to fold to SplatElementsAttr when possible	2025-07-04 14:44:36 +01:00
Yang Bai	393a75ebb7	[mlir][Vector] Add constant folding for vector.from_elements operation (#145849 ) ### Summary This PR adds a new folding pattern for vector.from_elements that canonicalizes it to arith.constant when all input operands are constants. ### Implementation Details Leverages FoldAdaptor capabilities: Uses adaptor.getElements() to access pre-computed constant attributes, avoiding redundant pattern matching on operands. ### Example Transformation ``` Before: %c0_i32 = arith.constant 0 : i32 %c1_i32 = arith.constant 1 : i32 %c2_i32 = arith.constant 2 : i32 %c3_i32 = arith.constant 3 : i32 %v = vector.from_elements %c0_i32, %c1_i32, %c2_i32, %c3_i32 : vector<2x2xi32> After: %v = arith.constant dense<[[0, 1], [2, 3]]> : vector<2x2xi32> ``` --------- Co-authored-by: Yang Bai <yangb@nvidia.com>	2025-06-30 20:39:53 -07:00
Fabian Mora	878d3594ed	[mlir][vector] Avoid setting padding by default to `0` in `vector.transfer_read` prefer `ub.poison` (#146088 ) Context: `vector.transfer_read` always requires a padding value. Most of its builders take no `padding` value and assume the safe value of `0`. However, this should be a conscious choice by the API user, as it makes it easy to introduce bugs. For example, I found several occasions while making this patch that the padding value was not getting propagated (`vector.transfer_read` was transformed into another `vector.transfer_read`). These bugs, were always caused because of constructors that don't require specifying padding. Additionally, using `ub.poison` as a possible default value is better, as it indicates the user "doesn't care" about the actual padding value, forcing users to specify the actual padding semantics they want. With that in mind, this patch changes the builders in `vector.transfer_read` to always having a `std::optional<Value> padding` argument. This argument is never optional, but for convenience users can pass `std::nullopt`, padding the transfer read with `ub.poison`. --------- Signed-off-by: Fabian Mora <fabian.mora-cordero@amd.com>	2025-06-30 15:20:42 -04:00
Andrzej Warzyński	e9e25f02e6	[mlir][vector] Restrict vector.insert/vector.extract to disallow 0-d vectors (#121458 ) This patch enforces a restriction in the Vector dialect: the non-indexed operands of `vector.insert` and `vector.extract` must no longer be 0-D vectors. In other words, rank-0 vector types like `vector<f32>` are disallowed as the source or result. EXAMPLES -------- The following are now illegal (note the use of `vector<f32>`): ```mlir %0 = vector.insert %v, %dst[0, 0] : vector<f32> into vector<2x2xf32> %1 = vector.extract %src[0, 0] : vector<f32> from vector<2x2xf32> ``` Instead, use scalars as the source and result types: ```mlir %0 = vector.insert %v, %dst[0, 0] : f32 into vector<2x2xf32> %1 = vector.extract %src[0, 0] : f32 from vector<2x2xf32> ``` Note, this change serves three goals. These are summarised below. ## 1. REDUCED AMBIGUITY By enforcing scalar-only semantics when the result (`vector.extract`) or source (`vector.insert`) are rank-0, we eliminate ambiguity in interpretation. Prior to this patch, both `f32` and `vector<f32>` were accepted. ## 2. MATCH IMPLEMENTATION TO DOCUMENTATION The current behaviour contradicts the documented intent. For example, `vector.extract` states: > Degenerates to an element type if n-k is zero. This patch enforces that intent in code. ## 3. ENSURE SYMMETRY BETWEEN INSERT AND EXTRACT With the stricter semantics in place, it’s natural and consistent to make `vector.insert` behave symmetrically to `vector.extract`, i.e., degenerate the source type to a scalar when n = 0. NOTES FOR REVIEWERS ------------------- 1. Main change is in "VectorOps.cpp", where stricter type checks are implemented. 2. Test updates in "invalid.mlir" and "ops.mlir" are minor cleanups to remove now-illegal examples. 2. Lowering changes in "VectorToSCF.cpp" are the main trade-off: we now require an additional `vector.extract` when a preceding `vector.transfer_read` generates a rank-0 vector. RELATED RFC ----------- * https://discourse.llvm.org/t/rfc-should-we-restrict-the-usage-of-0-d-vectors-in-the-vector-dialect	2025-06-26 09:47:06 +01:00
Andrzej Warzyński	c92b580d63	[mlir][ArmSME] Remove `ConvertIllegalShapeCastOpsToTransposes` (#139706 ) As a follow-up to PR #135841 (see discussion for background), this patch removes the `ConvertIllegalShapeCastOpsToTransposes` pattern from the SME legalization pass. This change unblocks folding for ShapeCastOp involving scalable vectors. Originally, the `ConvertIllegalShapeCastOpsToTransposes` pattern was introduced to rewrite certain `vector.shape_cast` ops that could not be lowered otherwise. Based on local end-to-end testing, this workaround is no longer required, and the pattern can now be safely removed. This patch also removes a special case from `ShapeCastOp::fold`, simplifying the fold logic. As a side effect of removing `ConvertIllegalShapeCastOpsToTransposes`, we lose the mechanism that enabled lowering of certain ops like: ```mlir %res = vector.transfer_read %mem[%a, %b] (...) : memref<?x?xf32>, vector<[4]x1xf32> ``` Previously, such cases were handled by: * Rewriting a nearby `vector.shape_cast` to a `vector.transpose` (via `ConvertIllegalShapeCastOpsToTransposes`) * Then lowering the result with `LiftIllegalVectorTransposeToMemory`. This patch introduces a new dedicated pattern, `LowerColumnTransferReadToLoops`, that directly handles illegal `vector.transfer_read` ops involving leading scalable dimensions.	2025-06-25 16:40:14 +01:00
Nicolas Vasilache	d31ba52563	[mlir][Interface] Factor out common IndexingMapOpInterface behavior in a new generic interface (#145313 ) Refactor the verifiers to make use of the common bits and make `vector.contract` also use this interface. In the process, the confusingly named getStaticShape has disappeared. Note: the verifier for IndexingMapOpInterface is currently called manually from other verifiers as it was unclear how to avoid it taking precedence over more meaningful error messages	2025-06-24 07:56:32 +02:00
Nishant Patel	9c1ce31f54	[mlir][vector] Add unroll patterns for vector.load and vector.store (#143420 ) This PR adds unroll patterns for vector.load and vector.store. This PR is follow up of #137558	2025-06-20 13:50:25 -07:00
Diego Caballero	ff6367b470	[[mlir][Vector] Add simple folders for `vector.from_element`/`vector.to_elements` (#144444 ) This PR adds simple folders to remove no-op sequences of `vector.from_elements` and `vector.to_elements`.	2025-06-20 11:16:46 -07:00
Yang Bai	fe3933da15	[mlir][vector] Support complete folding in single pass for vector.insert/vector.extract (#142124 ) ### Description This patch improves the folding efficiency of `vector.insert` and `vector.extract` operations by not returning early after successfully converting dynamic indices to static indices. This PR also renames the test pass `TestConstantFold` to `TestSingleFold` and adds comprehensive documentation explaining the single-pass folding behavior. ### Motivation Since the `OpBuilder::createOrFold` function only calls `fold` once, the current `fold` methods of `vector.insert` and `vector.extract` may leave the op in a state that can be folded further. For example, consider the following un-folded IR: ``` %v1 = vector.insert %e1, %v0 [0] : f32 into vector<128xf32> %c0 = arith.constant 0 : index %e2 = vector.extract %v1[%c0] : f32 from vector<128xf32> ``` If we use `createOrFold` to create the `vector.extract` op, then the result will be: ``` %v1 = vector.insert %e1, %v0 [127] : f32 into vector<128xf32> %e2 = vector.extract %v1[0] : f32 from vector<128xf32> ``` But this is not the optimal result. `createOrFold` should have returned `%e1`. The reason is that the execution of fold returns immediately after `extractInsertFoldConstantOp`, causing subsequent folding logics to be skipped. --------- Co-authored-by: Yang Bai <yangb@nvidia.com>	2025-06-18 09:26:04 -07:00
Nicolas Vasilache	e4de74ba11	[mlir][Vector] Tighten up application conditions in TransferReadAfter… (#143869 ) …WriteToBroadcast The pattern would previously apply in spurious cases and generate incorrect IR. In the process, we disable the application of this pattern in the case where there is no broadcast; this should be handled separately and may more easily support masking. The case {no-broadcast, yes-transpose} was previously caught by this pattern and arguably could also generate incorrect IR (and was also untested): this case does not apply anymore. The last cast {yes-broadcast, yes-transpose} continues to apply but should arguably be removed from the future because creating transposes as part of canonicalization feels dangerous. There are other patterns that move permutation logic: - either into the transfer, or - outside of the transfer Ideally, this would be target-dependent and not a canonicalization (i.e. does your DMA HW allow transpose on the fly or not) but this is beyond the scope of this PR. Co-authored-by: Nicolas Vasilache <nicolasvasilache@users.noreply.github.com>	2025-06-12 17:11:06 +02:00
Simone Pellegrini	abbbe4a6cd	[mlir][vector] Fix attaching write effects on transfer_write's base (#142940 ) This fixes an issue with `TransferWriteOp`'s implementation of the `MemoryEffectOpInterface` where the write effect was attached to the stored value rather than the base. This had the effect that when asking for the memory effects for the input memref buffer using `getEffectsOnValue(...)`, the function would return no-effects (as the effect would have been attached to the stored value rather than the input buffer).	2025-06-11 12:37:34 +01:00
Chao Chen	def37f7e3a	[mlir][vector] add unroll pattern for broadcast (#142011 ) This PR adds `UnrollBroadcastPattern` to `VectorUnroll` transform. To support this, it also extends `BroadcastOp` definition with `VectorUnrollOpInterface`	2025-06-05 12:42:16 -05:00
James Newling	543446a353	[mli][vector] canonicalize vector.from_elements from ascending extracts (#139819 ) Example: ```mlir %0 = vector.extract %source[0, 0] : i8 from vector<1x2xi8> %1 = vector.extract %source[0, 1] : i8 from vector<1x2xi8> %2 = vector.from_elements %0, %1 : vector<2xi8> ``` becomes ```mlir %2 = vector.shape_cast %source : vector<1x2xi8> to vector<2xi8> ``` It was decided that we should spill canonicalization tests into new files (see [discussion](https://github.com/llvm/llvm-project/pull/135096#pullrequestreview-2760245596)) In view of this I added the new tests to a new file specifically for canonicalization of from_elements. To be consistent in the location of the tests, I moved existing tests `extract_scalar_from_from_element`, `extract_1d_from_from_elements`, `extract_2d_from_from_elements` and `from_elements_to_splat` from `canonicalize.mlir` to `canonicalze/vector-from-elements.mlir`. In addition to moving I changed the LIT variables to all be upper-case for consistency.	2025-06-02 11:15:25 -07:00
Diego Caballero	204eb70af8	[mlir][Vector] Canonicalize empty `vector.mask` into `arith.select` (#140976 ) This PR adds a missing canonicalization for empty `vector.mask` ops with a passthru value. ``` %0 = vector.mask %mask, %passthru { vector.yield %a : vector<8xf32> } : vector<8xi1> -> vector<8xf32> becomes: %0 = arith.select %mask, %a, %passthru : vector<8xf32> ```	2025-05-23 08:29:57 -07:00
Diego Caballero	d6f394e141	[mlir][Vector] Move `vector.mask` canonicalization to folder (#140324 ) This MR moves the canonicalization that elides empty `vector.mask` ops to folders.	2025-05-21 17:25:01 -07:00
Diego Caballero	fd8bc37b45	[mlir][Vector][NFC] Run `extractInsertFoldConstantOp` earlier in the folder (#140814 ) This PR moves `extractInsertFoldConstantOp` earlier in the folder lists of `vector.extract` and `vector.insert`. Many folders require having non-dynamic indices so `extractInsertFoldConstantOp` is a requirement for them to trigger.	2025-05-21 13:57:01 -07:00

1 2 3 4 5 ...

364 Commits