llvm-project

Author	SHA1	Message	Date
Samarth Narang	7bf47e2e84	[mlir][tensor] Guard constant reshape folding (#179077 )	2026-02-05 15:17:47 -05:00
Samarth Narang	4b893d25ea	[mlir][tensor] Emit diagnostics for unranked tensor reshape ops instead of asserting (#179005 ) This PR updates tensor.expand_shape and tensor.collapse_shape ODS definitions to require ranked tensor operands/results by switching from AnyTensor to AnyRankedTensor. Fixes https://github.com/llvm/llvm-project/issues/178228	2026-02-02 16:56:59 -05:00
Ian Wood	1644e7d09b	[mlir][tensor] Fix return type for ExtractSlice canonicalizer (#178118 ) The `OpWithOffsetSizesAndStridesConstantArgumentFolder` pattern for tensor.extract_slice was incorrectly using `inferCanonicalRankReducedResultType()` to compute the result type after folding constant offsets. This function infers a canonical rank reduction (dropping the first N unit dimensions), which breaks when the original operation used a non-canonical rank reduction. This change is similar to the existing correct behavior in `SubViewReturnTypeCanonicalizer` for `memref.subview`. For example, extracting `tensor<1x4xf16>` from `tensor<1x1x8x1xf16>` by dropping dims 0 and 3 would incorrectly produce tensor<4x1xf16> (canonical: drop dims 0 and 1). --------- Signed-off-by: Ian Wood <ianwood@u.northwestern.edu>	2026-01-29 14:41:40 -08:00
Jakub Kuderski	9aaf0b89f5	[mlir] Apply clang-tidy check llvm-use-vector-utils. NFC. (#178526 )	2026-01-29 02:19:00 +00:00
Longsheng Mou	a51eab9492	[mlir][memref] Add a `ViewOp::getMixedSizes` (#176561 ) This PR adds a useful `getMixedSizes` method for memref.view.	2026-01-18 00:02:22 +08:00
Maya Amrami	4ccf926e7f	[mlir] Compose expand of collapse to cast (#172864 ) In some cases `y = expand(collapse(x))` cannot be folded into x, since x and y have different types. In that case, we check if the two types are cast compatible. If they are, it means the two types have compatible shape and layout and y can be folded into cast(x). This causes a change in memref::CastOp::areCastCompatible, where now a dim of size 1 may have different strides.	2026-01-13 11:57:23 +02:00
Nick Kreeger	e289b2e765	[mlir][Utils] Add VerificationUtils (NFC) (#174336 ) Introduces `VerificationUtils` to consolidate common operation verification patterns in MLIR. This initial implementation provides `verifyDynamicDimensionCount()` to reduce code duplication across dialect verifiers. This is an NFC (No Functional Change) refactoring that improves code maintainability by extracting reusable verification logic into a shared utility.	2026-01-12 16:52:44 +00:00
Longsheng Mou	670a68efd1	[mlir][tensor] Preserve encoding in `CollapseShapeOp::build` (#173720 ) This PR updates `CollapseShapeOp::build` so that when the result type is not explicitly provided, the inferred result type preserves the encoding of the source tensor.	2025-12-31 11:30:16 +08:00
Arjun Parmar	8be2c19f88	[MLIR] Fix mlir-opt crash in ReshapeOpsUtils.cpp when collapse_shape index is invalid (#173791 ) This patch fixes a crash occurring in mlir-opt when running collapse_shape with an invalid index configuration. Instead of crashing, an error message is returned to the user. Fixes: #173567 --------- Co-authored-by: Bazinga! <akparmar004>	2025-12-30 11:35:53 +01:00
Andrzej Warzyński	1965523171	[mlir][tensor] Add new builders for insert_slice/extract_slice Ops (nfc) (#169533 ) Adds new builders for `tensor.insert_slice` and `tensor.extract_slice` Ops for which the _offsets_ and the _strides_ are all 0s and 1s, respecitvely. This allows us to write: ```cpp // No offsets and no strides - implicitly set to 0s and 1s, // respectively. tensor::InsertSliceOp::create(rewriter, loc, src, dest, writeSizes); ``` instead of: ```cpp // Strides are initialised explicitly to 1s Attribute oneIdxAttr = rewriter.getIndexAttr(1); SmallVector<OpFoldResult> writeStrides(destRank, oneIdxAttr); // Offsets are initialised explicitly to 0s Attribute zeroIdxAttr = rewriter.getIndexAttr(0); SmallVector<OpFoldResult> writeOffsets(destRank, zeroIdxAttr); tensor::InsertSliceOp::create(rewriter, loc, src, dest, writeOffsets, writeSizes, writeStrides); ```	2025-11-26 10:08:29 +00:00
Andrzej Warzyński	c582688b69	[MLIR][tensor] Simplify ExtractSliceOp::inferResultType (nfc) (#169313 ) The `offsets` and `strides` arguments are neither used nor required - removed them and simplify this hook.	2025-11-25 16:14:51 +00:00
Jakub Kuderski	1fd9c02513	[mlir] Adopt cast function objects. NFC. (#168228 ) These were added in https://github.com/llvm/llvm-project/pull/165803.	2025-11-15 14:51:14 -05:00
Kazu Hirata	c56fdf9e5a	[mlir] Remove unused "using" decls (NFC) (#165652 ) Identified with misc-unused-using-decls.	2025-10-30 07:10:52 -07:00
Andrzej Warzyński	e90f8d84b0	[mlir][linalg] Extend DecomposeOuterUnitDimsPackOpPattern (linalg.pack) (#162666 ) Similarly to #152960, this PR fixes `getTiledOuterDims` for `linalg.pack` by ensuring that the `outer_dims_perm` attributeis properly taken into account. This enables the main change in this PR: relaxing the constraints in * `DecomposeOuterUnitDimsPackOpPattern`. Specifically, the pattern is extended to allow non-unit untiled outer dimensions. For example: ```mlir func.func @example( %src: tensor<2x32x16x8xf32>, %dest: tensor<2x1x16x8x32xf32>) -> tensor<2x1x16x8x32xf32> { %pack = linalg.pack %src inner_dims_pos = [1] inner_tiles = [32] into %dest : tensor<2x32x16x8xf32> -> tensor<2x1x16x8x32xf32> return %pack : tensor<2x1x16x8x32xf32> } ``` decomposes as: ```mlir func.func @example( %src: tensor<2x32x16x8xf32>, %dest: tensor<2x1x16x8x32xf32>) -> tensor<2x1x16x8x32xf32> { %0 = tensor.empty() : tensor<2x16x8x32xf32> %transposed = linalg.transpose ins(%src : tensor<2x32x16x8xf32>) outs(%init : tensor<2x16x8x32xf32>) permutation = [0, 2, 3, 1] %inserted_slice = tensor.insert_slice %transposed into %dest[0, 0, 0, 0, 0] [2, 1, 16, 8, 32] [1, 1, 1, 1, 1] : tensor<2x16x8x32xf32> into tensor<2x1x16x8x32xf32> return %inserted_slice : tensor<2x1x16x8x32xf32> } ``` Importantly, this change makes `DecomposeOuterUnitDimsPackOpPattern` (the decomposition pattern for `linalg.pack`) consistent with the corresponding pattern for `linalg.unpack`: * `DecomposeOuterUnitDimsUnPackOpPattern`. One notable assumption remains: untiled outer dimensions are not permuted. This was already the case but is now explicitly documented. Co-authored by: Max Bartel <bartel@roofline.ai>	2025-10-13 09:48:03 +01:00
Alan Li	b87f1b22a8	[MLIR] Add `InParallelOpInterface` for parallel combining operations (#157736 ) This commit: - Introduces a new `InParallelOpInterface`, along with the `ParallelCombiningOpInterface`, represent the parallel updating operations we have in a parallel loop of `scf.forall`. - Change the name of `ParallelCombiningOpInterface` to `InParallelOpInterface` as the naming was quite confusing. - `ParallelCombiningOpInterface` now is used to generalize operations that insert into shared tensors within parallel combining regions. Previously, only `tensor.parallel_insert_slice` was supported directly in `scf.InParallelOp` regions. - `tensor.parallel_insert_slice` now implements `ParallelCombiningOpInterface`. This change enables future extensions to support additional parallel combining operations beyond `tensor.parallel_insert_slice`, which have different update semantics, so the `in_parallel` region can correctly and safely represent these kinds of operation without potential mistakes such as races. Author credits: @qedawkins	2025-09-12 14:23:00 -07:00
Kazu Hirata	04b0658453	[mlir] Fix warnings This patch fixes: mlir/lib/Dialect/Tensor/IR/TensorOps.cpp:3205:13: error: unused function 'printInferType' [-Werror,-Wunused-function] mlir/lib/Dialect/Tensor/IR/TensorOps.cpp:3210:1: error: unused function 'parseInferType' [-Werror,-Wunused-function]	2025-08-31 07:41:23 -07:00
Mehdi Amini	731f4d904c	[MLIR] Apply clang-tidy fixes for misc-use-internal-linkage in TensorOps.cpp (NFC)	2025-08-31 02:14:18 -07:00
Maksim Levental	8fff238b2c	[mlir][NFC] update `mlir/Dialect` create APIs (23/n) (#149930 ) See https://github.com/llvm/llvm-project/pull/147168 for more info.	2025-07-23 10:16:52 -04:00
Kazu Hirata	5e0de68626	[mlir] Remove unused includes (NFC) (#148119 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-11 11:59:26 -07:00
Jakub Kuderski	6512ca7ddb	[mlir] Add `isStatic`* size check for `ShapedType`s. NFCI. (#147085 ) The motivation is to avoid having to negate `isDynamic*` checks, avoid double negations, and allow for `ShapedType::isStaticDim` to be used in ADT functions without having to wrap it in a lambda performing the negation. Also add the new functions to C and Python bindings.	2025-07-07 14:57:27 -04:00
Markus Böck	6c9be27b52	[mlir][tensor] Fold identity `reshape` of 0d-tensors (#146375 ) Just like 1d-tensors, reshapes of 0d-tensors (aka scalars) are always no-folds as they only have one possible layout. This PR adds logic to the `fold` implementation to optimize these away as is currently implemented for 1d tensors.	2025-07-02 09:09:03 +02:00
Benjamin Kramer	6a5469bb81	[bazel] Fixes for e5a8c51c9dc85a7b463a4570942e3e5e1cb70e0b	2025-06-26 15:06:18 +02:00
Nicolas Vasilache	e5a8c51c9d	[mlir][tensor] Make tensor::PadOp a ReifyRankedShapedTypeOpInterface (#145867 ) Co-authored-by: Fabian Mora <fmora.dev@gmail.com>	2025-06-26 14:40:57 +02:00
MaheshRavishankar	7bc956d3d6	[mlir][PartialReductionTilingInterface] Add support for `ReductionTilingStrategy::PartialReductionOuterParallel` in `tileUsingSCF`. (#143988 ) Following up from https://github.com/llvm/llvm-project/pull/143467, this PR adds support for `ReductionTilingStrategy::PartialReductionOuterParallel` to `tileUsingSCF`. The implementation of `PartialReductionTilingInterface` for `Linalg` ops has been updated to support this strategy as well. This makes the `tileUsingSCF` come on par with `linalg::tileReductionUsingForall` which will be deprecated subsequently. Changes summary - `PartialReductionTilingInterface` changes : - `tileToPartialReduction` method needed to get the induction variables of the generated tile loops. This was needed to keep the generated code similar to `linalg::tileReductionUsingForall`, specifically to create a simplified access for slicing the intermediate partial results tensor when tiled in `num_threads` mode. - `getPartialResultTilePosition` methods needs the induction varialbes for the generated tile loops for the same reason above, and also needs the `tilingStrategy` to be passed in to generate correct code. The tests in `transform-tile-reduction.mlir` testing the `linalg::tileReductionUsingForall` have been moved over to test `scf::tileUsingSCF` with `ReductionTilingStrategy::PartialReductionOuterParallel` strategy. Some of the test that were doing further cyclic distribution of the transformed code from tiling are removed. Those seem like two separate transformation that were merged into one. Ideally that would need to happen when resolving the `scf.forall` rather than during tiling. Please review only the top commit. Depends on https://github.com/llvm/llvm-project/pull/143467 Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>	2025-06-23 12:27:26 -07:00
Kazu Hirata	1cf1c21b84	[mlir] Strip away lambdas (NFC) (#143280 ) We don't need lambdas here.	2025-06-08 01:34:17 -07:00
asraa	c66b72f8ce	[mlir][tensor] remove tensor.insert constant folding out of canonicalization (#142671 ) Follow ups from https://github.com/llvm/llvm-project/pull/142458/ In particular concerns that indiscriminately folding tensor constants can lead to bloating the IR as these can be arbitrarily large. Signed-off-by: Asra Ali <asraa@google.com>	2025-06-05 14:53:33 -07:00
Kazu Hirata	95ce58bc4a	[mlir] Fix a warning This patch fixes: mlir/lib/Dialect/Tensor/IR/TensorOps.cpp:1680:37: error: comparison of integers of different signs: 'int' and 'uint64_t' (aka 'unsigned long') [-Werror,-Wsign-compare]	2025-06-03 10:11:51 -07:00
asraa	34d8275e4f	[mlir][tensor] add tensor insert/extract op folders (#142458 ) Adds a few canonicalizers, folders, and rewrite patterns to tensor ops: * tensor.insert folder: insert into a constant is replaced with a new constant * tensor.extract folder: extract from a parent tensor that was inserted at the same indices is folded into the inserted value * rewrite pattern added that replaces an extract of a collapse shape with an extract of the source tensor (requires static source dimensions) Signed-off-by: Asra Ali <asraa@google.com>	2025-06-03 09:16:03 -07:00
Han-Chung Wang	c39915fa2e	[mlir][NFC] Simplify constant checks with isOneInteger and renamed isZeroInteger. (#139340 ) The revision adds isOneInteger helper, and simplifies the existing code with the two methods. It removes some lambda, which makes code cleaner. For downstream users, you can update the code with the below script. ```bash sed -i "s/isZeroIndex/isZeroInteger/g" */.h sed -i "s/isZeroIndex/isZeroInteger/g" */.cpp ``` --------- Signed-off-by: hanhanW <hanhan0912@gmail.com>	2025-05-20 14:53:02 -07:00
Aaron St George	a0a55df385	[mlir][tensor][NFC] Code cleanup around shape inference support for `tensor.concat` op (#140616 ) Addresses some code review on https://github.com/llvm/llvm-project/pull/140168 that came in after merge.	2025-05-19 18:54:13 -07:00
Aaron St George	da944e0099	[mlir][tensor] Add shape inference support for `tensor.concat` op. (#140168 ) ## description `tensor.concat` requires operands and the result to match on all dimensions except the concatenation dimension. If one operand is already static in those dimensions, the other operands and result type may safely be refined to that same static shape. This PR adds canonicalization patterns to refine `tensor.concat` types and propagate static shapes to other canonicalization patterns through casts. ## example ```mlir %2 = tensor.concat dim(0) %0, %1: (tensor<?x12xi32>, tensor<?x?xi32>) ->tensor<?x12xi32> ``` becomes: ```mlir %cast = tensor.cast %1 : tensor<?x?xi32> to tensor<?x12xi32> %2 = tensor.concat dim(0) %0, %cast : (tensor<?x12xi32>, tensor<?x12xi32>) -> tensor<?x12xi32> ``` --------- Co-authored-by: Ian Wood <ianwood2024@u.northwestern.edu>	2025-05-16 15:06:42 -07:00
Kazu Hirata	50316c1eb8	[mlir] Use llvm::binary_search (NFC) (#139760 )	2025-05-13 15:53:54 -07:00
Kazu Hirata	15f7c6ed70	[mlir] Remove unused local variables (NFC) (#138481 )	2025-05-05 10:08:00 -07:00
Vivek Khandelwal	e377a5d168	[MLIR][Tensor] Remove tensor.dim canonicalization patterns registered on tensor.expand_shape/tensor.collapse_shape (#134219 ) These are problematic because the iterative application that locally resolves the tensor.dim operation introduces intermediate floor_div, which is losing the information about the exact division that was carried out in the original IR, and the iterative algorithm can't converge towards the simplest form. Information loss is not acceptable for canonicalization. Resolving the dimOp can be achieved through resolve-ranked-shaped-type-result-dims and resolve-shaped-type-result-dims passes. --------- Signed-off-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2025-04-11 06:57:34 -04:00
Matthias Springer	5edf127384	[mlir][memref] Verify out-of-bounds access for memref.subview (#133086 ) * Improve the verifier of `memref.subview` to detect out-of-bounds extractions. * Improve the documentation of `memref.subview` to make clear that out-of-bounds extractions are not allowed. Rewrite examples to use the new `strided<>` notation instead of `affine_map` layout maps. Also remove all unrelated operations (`memref.alloc`) from the examples. * Fix various test cases where `memref.subview` ops ran out-of-bounds. * Update canonicalizations patterns to ensure that they do not fold IR if it would generate IR that no longer verifies. Related discussion on Discourse: https://discourse.llvm.org/t/out-of-bounds-semantics-of-memref-subview/85293 This is a re-upload of #131876, which was reverted due to failing GPU tests. These tests were faulty and fixed in #133051.	2025-03-31 10:28:55 -07:00
Karlo Basioli	f6823a0ae1	Revert "[mlir][memref] Verify out-of-bounds access for `memref.subview`" (#132940 ) Reverts llvm/llvm-project#131876 GPU integration tests get broken by this PR. E.x. `mlir/test/Integration/GPU/CUDA/sm90/gemm_f32_f16_f16_128x128x128.mlir`	2025-03-25 14:56:08 +00:00
Matthias Springer	d4304d85f2	[mlir][memref] Verify out-of-bounds access for `memref.subview` (#131876 ) * Improve the verifier of `memref.subview` to detect out-of-bounds extractions. * Improve the documentation of `memref.subview` to make clear that out-of-bounds extractions are not allowed. Rewrite examples to use the new `strided<>` notation instead of `affine_map` layout maps. Also remove all unrelated operations (`memref.alloc`) from the examples. * Fix various test cases where `memref.subview` ops ran out-of-bounds. * Update canonicalizations patterns to ensure that they do not fold IR if it would generate IR that no longer verifies. Related discussion on Discourse: https://discourse.llvm.org/t/out-of-bounds-semantics-of-memref-subview/85293	2025-03-25 11:25:11 +01:00
Matthias Springer	529ee3cf3b	[mlir][tensor] Fix slice canonicalizer for out-of-bounds cases (#132534 ) Since #130487, `tensor.extract_slice` and `tensor.insert_slice` ops that are statically detected to go out of bounds are rejected by the verifier. This commit fixes canonicalization patterns that currently fold dynamically out-of-bounds ops (valid IR) to statically out-of-bounds ops (invalid IR).	2025-03-24 14:39:37 +01:00
Matthias Springer	f304fd0d5c	[mlir][tensor][NFC] Remove dead code `tensor.extract_slice` canonicalization pattern (#131903 ) Folding a cast into an `extract_slice` does not change the result type.	2025-03-19 08:35:40 +01:00
Matthias Springer	418e07b7e6	[mlir][Tensor] Check for out-of-bounds slice in `insert/extract_slice` verifier (#130487 ) Also fix test cases that had invalid ops.	2025-03-12 08:34:21 +01:00
Andrzej Warzyński	a778930f85	[mlir][linalg] Create a dedicated target for `LinalgRelayoutInterface` (#128485 ) Creates an interface target for `LinalgRelayoutInterface`. This is primarily to reduce the dependency of `Tensor` on `Linalg` to the required minimum. For context and rationale, see: * https://github.com/llvm/llvm-project/issues/127668 Note, I also took the liberty of renaming `LinalgRelayoutInterface` as `RelayoutOpInterface` (removed `Linalg`, added `Op`).	2025-02-25 18:48:08 +00:00
Christian Sigg	77410f2a25	[mlir][tensor] Remove unnecessary include. This include introduced an unwanted dependency from tensor to tensor utils.	2025-02-18 07:50:23 +01:00
Christian Sigg	a5e6ccf546	[mlir][bazel] Port `517800e37e` (#127544 ) Introduces a `LinalgInterfaces` target so that `TensorDialect` doesn't need to depend on `LinalgDialect`, which would introduce a circular dependency.	2025-02-18 07:13:59 +01:00
Andrzej Warzyński	517800e37e	[mlir][tensor][linalg] Move Pack/UnPack Ops to Linalg (#123902 ) Moves `PackOp` and `UnPackOp` from the Tensor dialect to Linalg. This change was discussed in the following RFC: * https://discourse.llvm.org/t/rfc-move-tensor-pack-and-tensor-unpack-into-linalg This change involves significant churn but only relocates existing code - no new functionality is added. Note for Downstream Users Downstream users must update references to `PackOp` and `UnPackOp` as follows: * Code: `s/tensor::(Up)PackOp/linalg::(Un)PackOp/g` * Tests: `s/tensor.(un)pack/linalg.(un)pack/g` No other modifications should be required.	2025-02-17 10:44:27 +00:00
Andrzej Warzyński	5586541d22	[mlir][tensor] Make useful Tensor utilities public (#126802 ) 1. Extract the main logic from `foldTensorCastPrecondition` into a dedicated helper hook: `hasFoldableTensorCastOperand`. This allows for reusing the corresponding checks. 2. Rename `getNewOperands` to `getUpdatedOperandsAfterCastOpFolding` for better clarity and documentation of its functionality. 3. These updated hooks will be reused in: * https://github.com/llvm/llvm-project/pull/123902. This PR makes them public. Note: Moving these hooks to `Tensor/Utils` is not feasible because `MLIRTensorUtils` depends on `MLIRTensorDialect` (CMake targets). If these hooks were moved to `Utils`, it would create a dependency of `MLIRTensorDialect` on `MLIRTensorUtils`, leading to a circular dependency.	2025-02-12 23:12:14 +00:00
Andrzej Warzyński	80fd902573	[mlir][tensor] Introduce `TensorRelayoutOpInterface` (#125823 ) The newly introduced `TensorRelayoutOpInterface` is created specifically for `tensor.pack` + `tensor.unpack`. Although the interface is currently empty, it enables us to refactor the logic in `FoldTensorCastProducerOp` within the Tensor dialect as follows: ```cpp // OLD // Reject tensor::PackOp - there's dedicated pattern for that instead. if (!foldTensorCastPrecondition(op) \|\| isa<tensor::PackOp, tensor::UnPackOp>(op)) return failure(); ``` is replaced with: ```cpp // NEW // Reject tensor::PackOp - there's dedicated pattern for that instead. if (!foldTensorCastPrecondition(op) \|\| isa<tensor::RelayoutOpInterface>(op)) return failure(); ``` This will be crucial once `tensor.pack` + `tensor.pack` are replaced with `linalg.pack` + `linalg.unpack` (i.e. moved to Linalg): * https://github.com/llvm/llvm-project/pull/123902, * https://discourse.llvm.org/t/rfc-move-tensor-pack-and-tensor-unpack-into-linalg/. Note that the interface itself will later be moved to the Linalg dialect. This decoupling ensures that the Tensor dialect does not require an understanding of Linalg ops, thus keeping the dependency lightweight. This PR is effectively a preparatory step for moving PackOp and UnpackOp to Linalg. Once that's completed, most CMake changes from this PR will be effectively reverted.	2025-02-06 09:18:13 +00:00
Krzysztof Drewniak	cdc09a118a	[mlir][IntRangeInference] Infer values for {memref,tensor}.dim (#122945 ) Implement the integer range inference niterface for memref.dim and tetnor.dim using shared code. The inference will infer the `dim` of dynamic dimensions to [0, index_max] and take the union of all the dimensions that the `dim` argument could be validly referring to.	2025-01-29 18:43:53 -06:00
MaheshRavishankar	092372da15	[mlir][Tensor] Rework `ReifyRankedShapedTypeInterface` implementation for `tensor.expand_shape` op. (#113501 ) The op carries the output-shape directly. This can be used directly. Also adds a method to get the shape as a `SmallVector<OpFoldResult>`. Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>	2025-01-27 07:05:34 -08:00
Andrzej Warzyński	9f6a1ddb43	[mlir][tensor] Introduce `FoldTensorCastUnPackOp` (#121393 ) This patch specializes `FoldTensorCastProducerOp` for `tensor::UnPackOp` by introducing a dedicated pattern: `FoldTensorCastUnPackOp`. This mirrors a similar update made for `tensor::PackOp` in #114559. Below is the updated rationale tailored to `tensor::UnPackOp`. ISSUE DESCRIPTION Currently, `FoldTensorCastProducerOp` incorrectly folds the following: ```mlir %cast = tensor.cast %dest : tensor<1x1x8x1xi32> to tensor<1x1x?x1xi32> // Note: `%c8` and `?`. %unpack = tensor.unpack %cast inner_dims_pos = [0, 1] inner_tiles = [%c8, 1] into %res : tensor<1x1x?x1xi32> -> tensor<7x?xi32> ``` as: ```mlir // Note: `%c8` and `8`. %unpack = tensor.unpack %cast inner_dims_pos = [0, 1] inner_tiles = [%c8, 1] into %res : tensor<1x1x8x1xi32> -> tensor<7x?xi32> ``` This triggers an Op verification failure because the folder does not update the inner tile sizes in the unpack Op. This patch addresses the issue by ensuring proper handling of inner tile sizes. ADDITIONAL CHANGES * invalid.mlir: Fixed a typo. * TensorOps.cpp: * Removed unnecessary `(void)tileSize`. * Added comments following the discussion in PR #115772. * Made minor updates to `FoldTensorCastPackOp` for consistency with the newly introduced `FoldTensorCastUnPackOp`. * Tensor/canonicalize.mlir: Ensured consistent usage of `test_attr` (e.g., replaced mixed use of `test_attr` and `some_attr`).	2025-01-03 09:09:23 +00:00
Kazu Hirata	fecf1397e3	[Tensor] Migrate away from PointerUnion::{is,get} (NFC) (#120679 ) Note that PointerUnion::{is,get} have been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn_cast() with // isa<T>, cast<T> and the llvm::dyn_cast<T> I'm not touching PointerUnion::dyn_cast for now because it's a bit complicated; we could blindly migrate it to dyn_cast_if_present, but we should probably use dyn_cast when the operand is known to be non-null.	2024-12-20 10:41:54 -08:00

1 2 3 4 5 ...

279 Commits