llvm-project

Author	SHA1	Message	Date
Jakub Kuderski	6512ca7ddb	[mlir] Add `isStatic`* size check for `ShapedType`s. NFCI. (#147085 ) The motivation is to avoid having to negate `isDynamic*` checks, avoid double negations, and allow for `ShapedType::isStaticDim` to be used in ADT functions without having to wrap it in a lambda performing the negation. Also add the new functions to C and Python bindings.	2025-07-07 14:57:27 -04:00
Kazu Hirata	be4cd9f4da	[mlir] Remove unused includes (NFC) (#147206 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-06 19:06:07 -07:00
Mehdi Amini	ff0dcc4614	[MLIR][Linalg] Harden parsing Linalg named ops (#145337 ) This threads through proper error handling / reporting capabilities to avoid hitting llvm_unreachable while parsing linalg ops. Fixes #132755 Fixes #132740 Fixes #129185	2025-06-26 00:22:08 +02:00
Hsiangkai Wang	d16f42d1e2	[mlir][linalg] Constrain the parameters m, r in Winograd ops (#144657 ) We only support fixed set of minimum filtering algorithm for Winograd Conv2D decomposition. Instead of letting users specify any integer, define a fixed set of enumeration values for the parameters of minimum filtering algorithm.	2025-06-25 14:02:07 +01:00
Ian Wood	c005df3c7e	[mlir][linalg] Fix EraseIdentityLinalgOp on fill-like ops (#130000 ) Adds a check to make sure that the linalg op is safe to erase by ensuring that the `linalg.yield` is yielding one of the linalg op's block args. This check already exists for linalg ops with pure tensor semantics. Closes https://github.com/llvm/llvm-project/issues/129414 --------- Signed-off-by: Ian Wood <ianwood2024@u.northwestern.edu>	2025-06-02 12:18:57 -07:00
Han-Chung Wang	c39915fa2e	[mlir][NFC] Simplify constant checks with isOneInteger and renamed isZeroInteger. (#139340 ) The revision adds isOneInteger helper, and simplifies the existing code with the two methods. It removes some lambda, which makes code cleaner. For downstream users, you can update the code with the below script. ```bash sed -i "s/isZeroIndex/isZeroInteger/g" */.h sed -i "s/isZeroIndex/isZeroInteger/g" */.cpp ``` --------- Signed-off-by: hanhanW <hanhan0912@gmail.com>	2025-05-20 14:53:02 -07:00
Kazu Hirata	395f8695da	[mlir] Use llvm::replace (NFC) (#140344 )	2025-05-17 09:09:26 -07:00
Rahul Joshi	b17f3c63de	[NFC][MLIR] Add {} for `else` when `if` body has {} (#139422 )	2025-05-12 10:29:03 -07:00
Md Asghar Ahmad Shahid	d78ff5f6a9	[MLIR][Linalg] Introduce transpose/broadcast semantic to linalg.batch… (#130944 ) …_reduce_matmul. This patch exposes broadcast and transpose semantics on 'batch_reduce_matmul'. This is the last one in continuation of the other two variants of matmul ops. The broadcast and transpose semantic are as follows: Broadcast and Transpose semantics can be applied by specifying the explicit attribute 'indexing_maps' as shown below. This is a list attribute, so must include maps for all arguments if specified. Example Transpose: ``` linalg.batch_reduce_matmul indexing_maps = [ affine_map<(d0, d1, d2, d3) -> (d0, d3, d1)>, // transpose affine_map<(d0, d1, d2, d3) -> (d0, d3, d2)>, affine_map<(d0, d1, d2, d3) -> (d1, d2)> ] ins(%arg0, %arg1 : memref<2x5x3xf32>,memref<2x5x7xf32>) outs(%arg2: memref<3x7xf32>) ``` Example Broadcast: ``` linalg.batch_reduce_matmul indexing_maps = [ affine_map<(d0, d1, d2, d3) -> (d3)>, // broadcast affine_map<(d0, d1, d2, d3) -> (d0, d3, d2)>, affine_map<(d0, d1, d2, d3) -> (d1, d2)> ] ins(%arg0, %arg1 : memref<5xf32>, memref<2x5x7xf32>) outs(%arg2: memref<3x7xf32>) ``` Example Broadcast and Transpose: ``` linalg.batch_reduce_matmul indexing_maps = [ affine_map<(d0, d1, d2, d3) -> (d1, d3)>, // broadcast affine_map<(d0, d1, d2, d3) -> (d0, d2, d3)>, // transpose affine_map<(d0, d1, d2, d3) -> (d1, d2)> ] ins(%arg0, %arg1 : memref<3x5xf32>, memref<2x7x5xf32>) outs(%arg2: memref<3x7xf32>) ``` RFCs and related PR: https://discourse.llvm.org/t/rfc-linalg-opdsl-constant-list-attribute-definition/80149 https://discourse.llvm.org/t/rfc-op-explosion-in-linalg/82863 https://discourse.llvm.org/t/rfc-mlir-linalg-operation-tree/83586 https://github.com/llvm/llvm-project/pull/115319 https://github.com/llvm/llvm-project/pull/122275	2025-05-12 13:29:34 +01:00
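As a rough illustration of how the `indexing_maps` in the commit above select a transposed or broadcast operand, here is a minimal pure-Python sketch. It assumes the iteration space `(d0, d1, d2, d3)` means `(batch, m, n, k)`, which is how the example shapes read. The helper names `element` and `batch_reduce_matmul` and the lambda-based maps are hypothetical stand-ins, not MLIR APIs.

```python
from itertools import product


def element(tensor, index):
    """Follow a tuple of indices into a nested list."""
    for i in index:
        tensor = tensor[i]
    return tensor


def batch_reduce_matmul(a, b, c, map_a, map_b, map_c, bounds):
    """For every point d of the iteration space, do
    c[map_c(d)] += a[map_a(d)] * b[map_b(d)].
    The three maps play the role of the op's indexing_maps (hypothetical helper)."""
    for d in product(*(range(n) for n in bounds)):
        *outer, last = map_c(*d)
        element(c, outer)[last] += element(a, map_a(*d)) * element(b, map_b(*d))
    return c


# Shapes follow the "Example Transpose" case: A is 2x5x3, B is 2x5x7, C is 3x7.
BATCH, M, N, K = 2, 3, 7, 5
a_t = [[[b + 2 * k + 3 * m for m in range(M)] for k in range(K)] for b in range(BATCH)]
b_in = [[[b - k + n for n in range(N)] for k in range(K)] for b in range(BATCH)]
c_out = [[0] * N for _ in range(M)]

batch_reduce_matmul(
    a_t, b_in, c_out,
    map_a=lambda d0, d1, d2, d3: (d0, d3, d1),  # transposed A
    map_b=lambda d0, d1, d2, d3: (d0, d3, d2),
    map_c=lambda d0, d1, d2, d3: (d1, d2),      # batch dimension d0 is reduced
    bounds=(BATCH, M, N, K),
)

# "Example Broadcast" case: A is a length-5 vector indexed only by d3.
a_vec = [k + 1 for k in range(K)]
c_bcast = [[0] * N for _ in range(M)]
batch_reduce_matmul(
    a_vec, b_in, c_bcast,
    map_a=lambda d0, d1, d2, d3: (d3,),         # broadcast A along batch and m
    map_b=lambda d0, d1, d2, d3: (d0, d3, d2),
    map_c=lambda d0, d1, d2, d3: (d1, d2),
    bounds=(BATCH, M, N, K),
)
```

Only the map for the first operand changes between the two calls. The loop nest and the other two maps stay the same, which is the point of expressing these variants through one attribute.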
Kazu Hirata	15f7c6ed70	[mlir] Remove unused local variables (NFC) (#138481 )	2025-05-05 10:08:00 -07:00
Stanley Winata	9bf702337d	[mlir][linalg] fix indexOp folder to work in genericOp build with createOrFold (#137427 ) Currently in torch-mlir, indexOp folder is segfaulting when we call createOrFold in a genericOp builder. (*this)->getParentOp ended up with null which is causing the issue. This is seen in poolSizeCalculator.getPoolSize being called from createAvgPoolValueCountIncludePadFalseCase. link: `80a3dfddd3/lib/Conversion/TorchToLinalg/Pooling.cpp (L918-L921)` --------- Signed-off-by: Stanley Winata <stanley.winata@amd.com> Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>	2025-04-25 18:52:02 -07:00
Jakub Kuderski	f010725e39	[mlir][linalg] Add folder for `linalg.index` (#136640 ) We know that the index of unit dims is always 0.	2025-04-22 11:29:41 -04:00
Jakub Kuderski	c62afbfeda	[mlir][linalg][gpu] Clean up printing. NFC. (#136330 ) * Use `llvm::interleaved` from #135517 to simplify printing * Avoid needless vector allocations	2025-04-18 15:05:27 -04:00
AdityaK	ce7466f66c	NFC: Rewrite auto castIter -> const auto *castIter (#133521 )	2025-04-16 10:30:53 -07:00
Max191	1407f5bee9	[mlir] Canonicalize extract_slice(unpack) (#133777 ) Canonicalizes a chain of `linalg.unpack -> tensor.extract_slice` into a `linalg.unpack` with reduced dest sizes. This will only happen when the unpack op's only user is a non rank-reducing slice with zero offset and unit strides. --------- Signed-off-by: Max Dawkins <max.dawkins@gmail.com> Signed-off-by: Max Dawkins <maxdawkins19@gmail.com> Co-authored-by: Max Dawkins <maxdawkins19@gmail.com>	2025-04-01 14:51:58 -04:00
Kazu Hirata	1cc07a0865	[mlir] Use Set::insert_range (NFC) (#133043 ) We can use Set::insert_range to collapse: for (auto Elem : Range) Set.insert(E); down to: Set.insert_range(Range); In some cases, we can further fold that into the set declaration.	2025-03-26 07:47:02 -07:00
Han-Chung Wang	900be712ce	[mlir][Linalg] Preserve encodings in static shape inference. (#132311 ) Previously, the encodings are unconditionally dropped during the shape inference. The revision adds the support for preserving the encodings in the linalg ops. --------- Signed-off-by: hanhanW <hanhan0912@gmail.com>	2025-03-21 13:36:44 -07:00
Kazu Hirata	3041fa6c7a	[mlir] Use *Set::insert_range (NFC) (#132326 ) DenseSet, SmallPtrSet, SmallSet, SetVector, and StringSet recently gained C++23-style insert_range. This patch replaces: Dest.insert(Src.begin(), Src.end()); with: Dest.insert_range(Src); This patch does not touch custom begin like succ_begin for now.	2025-03-20 22:24:17 -07:00
Benjamin Kramer	02c804dd70	[mlir][linalg] Silence unused variable warning	2025-02-21 12:45:37 +01:00
Javed Absar	6de5d1e46d	[mlir][linalg] Extend elementwise (#124661 ) Implements Linalg elemwise named-op following the proposal and discussions in RFC: https://discourse.llvm.org/t/rfc-extend-linalg-elemwise-named-ops-semantics/83927/1	2025-02-21 10:51:21 +00:00
Md Asghar Ahmad Shahid	760ec2c38e	[MLIR][Linalg] Introduce Python API for linalg.batch_matmul Ops. (#127614 ) As linalg.batch_matmul has been moved into tablegen from OpDSL, its derived python wrapper no longer exists. This patch adds the required python wrapper. Also refactors the BatchmatmulOp printer to make it consistent with its parser.	2025-02-19 14:15:02 +00:00
Andrzej Warzyński	517800e37e	[mlir][tensor][linalg] Move Pack/UnPack Ops to Linalg (#123902 ) Moves `PackOp` and `UnPackOp` from the Tensor dialect to Linalg. This change was discussed in the following RFC: * https://discourse.llvm.org/t/rfc-move-tensor-pack-and-tensor-unpack-into-linalg This change involves significant churn but only relocates existing code - no new functionality is added. Note for Downstream Users Downstream users must update references to `PackOp` and `UnPackOp` as follows: * Code: `s/tensor::(Un)PackOp/linalg::(Un)PackOp/g` * Tests: `s/tensor.(un)pack/linalg.(un)pack/g` No other modifications should be required.	2025-02-17 10:44:27 +00:00
Rolf Morel	f796bc622a	[MLIR][Linalg] Expose linalg.matmul and linalg.contract via Python API (#126377 ) Now that linalg.matmul is in tablegen, "hand write" the Python wrapper that OpDSL used to derive. Similarly, add a Python wrapper for the new linalg.contract op. Required following misc. fixes: 1) make linalg.matmul's parsing and printing consistent w.r.t. whether indexing_maps occurs before or after operands, i.e. per the tests cases it comes _before_. 2) tablegen for linalg.contract did not state it accepted an optional cast attr. 3) In ODS's C++-generating code, expand partial support for `$_builder` access in `Attr::defaultValue` to full support. This enables access to the current `MlirContext` when constructing the default value (as is required when the default value consists of affine maps).	2025-02-10 12:05:13 +00:00
Md Asghar Ahmad Shahid	f2bca9e385	[MLIR][Linalg] Introduce broadcast/transpose semantic to batch_matmul (#122275 ) Goals: 1. To add syntax and semantic to 'batch_matmul' without changing any of the existing syntax expectations for current usage. batch_matmul is still just batch_matmul. 2. Move the definition of batch_matmul from linalg OpDsl to tablegen ODS infra. Scope of this patch: To expose broadcast and transpose semantics on the 'batch_matmul'. The broadcast and transpose semantic are as follows: By default, 'linalg.batch_matmul' behavior will remain as is. Broadcast and Transpose semantics can be applied by specifying the explicit attribute 'indexing_maps' as shown below. This is a list attribute, so the list must include all the maps if specified. Example Transpose: ``` linalg.batch_matmul indexing_maps = [ affine_map< (d0, d1, d2, d3) -> (d0, d3, d1)>, //transpose affine_map< (d0, d1, d2, d3) -> (d0, d3, d2)>, affine_map< (d0, d1, d2, d3) -> (d0, d1, d2)> ] ins (%arg0, %arg1: memref<2x5x3xf32>,memref<2x5x7xf32>) outs (%arg2: memref<2x3x7xf32>) ``` Example Broadcast: ``` linalg.batch_matmul indexing_maps = [ affine_map< (d0, d1, d2, d3) -> (d3)>, //broadcast affine_map< (d0, d1, d2, d3) -> (d0, d3, d2)>, affine_map< (d0, d1, d2, d3) -> (d0, d1, d2)> ] ins (%arg0, %arg1: memref<5xf32>,memref<2x5x7xf32>) outs (%arg2: memref<2x3x7xf32>) ``` Example Broadcast and transpose: ``` linalg.batch_matmul indexing_maps = [ affine_map< (d0, d1, d2, d3) -> (d1, d3)>, //broadcast affine_map< (d0, d1, d2, d3) -> (d0, d2, d3)>, //transpose affine_map< (d0, d1, d2, d3) -> (d0, d1, d2)> ] ins (%arg0, %arg1: memref<3x5xf32>, memref<2x7x5xf32>) outs (%arg2: memref<2x3x7xf32>) ``` RFCs and related PR: https://discourse.llvm.org/t/rfc-linalg-opdsl-constant-list-attribute-definition/80149 https://discourse.llvm.org/t/rfc-op-explosion-in-linalg/82863 https://discourse.llvm.org/t/rfc-mlir-linalg-operation-tree/83586 https://github.com/llvm/llvm-project/pull/115319	2025-02-06 19:08:50 +00:00
erichkeane	cf8c730ce9	Fix LinalgOps build error on older clang versions As reported in the PR #123618, 0d4efa27252cbbea4b56 included a construction of a `FailureOr` object with a `nullptr`, which didn't work in at least clang-10. This patch changes it into a constructor call instead of a brace-init call so that it is unambiguous.	2025-01-31 06:31:44 -08:00
Rolf Morel	0d4efa2725	[MLIR][Linalg] Introduce linalg.contract (#123618 ) A new op that allows for representing arbitrary contractions on operands of arbitrary rank, with arbitrary transposes and arbitrary broadcasts specified through its indexing_maps attribute. Supports the expected lowerings to linalg.generic and to vector.contract. Corresponding RFC is here: https://discourse.llvm.org/t/mlir-rfc-introduce-linalg-contract/83589	2025-01-29 17:28:52 +00:00
Dmitriy Smirnov	f20b8e35b3	[MLIR][Linalg] Fixes for Winograd decomposition and for tiling (#123675 ) The PR addresses issues with the filters of 1 x r and of r x 1 and with the tiling. --------- Signed-off-by: Dmitriy Smirnov <dmitriy.smirnov@arm.com>	2025-01-29 10:38:29 +00:00
Adam Siemieniuk	458542f454	[mlir][linalg] Relax structured op region filler check (#123741 ) Removes assert on output type from structure op region filler to allow more graceful error handling.	2025-01-28 09:55:59 +01:00
Clément Fournier	36c3466aef	[mlir][linalg] Fix neutral elt for softmax (#118952 ) The decomposition of `linalg.softmax` uses `maxnumf`, but the identity element that is used in the generated code is the one for `maximumf`. They are not the same, as the identity for `maxnumf` is `NaN`, while the one of `maximumf` is `-Infty`. This is wrong and prevents the maxnumf from being folded. Related to #114595, which fixed the folder for maxnumf.	2025-01-13 15:21:07 +08:00
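The distinction drawn in the commit above can be checked with a small Python sketch. The functions below model the two operations under the assumption that `maximumf` propagates NaN and `maxnumf` returns the non-NaN operand when exactly one input is NaN. The names `same` and `is_identity` are hypothetical helpers, and this is not the MLIR implementation.

```python
import math


def maximumf(a, b):
    """NaN-propagating maximum (models arith.maximumf-style semantics)."""
    if math.isnan(a) or math.isnan(b):
        return math.nan
    return max(a, b)


def maxnumf(a, b):
    """NaN-ignoring maximum (models arith.maxnumf-style semantics)."""
    if math.isnan(a):
        return b
    if math.isnan(b):
        return a
    return max(a, b)


def same(x, y):
    """Equality that treats NaN as equal to NaN."""
    return (math.isnan(x) and math.isnan(y)) or x == y


def is_identity(op, e, samples):
    """True if op(e, x) and op(x, e) both give back x for every sample x."""
    return all(same(op(e, x), x) and same(op(x, e), x) for x in samples)


SAMPLES = [-math.inf, -1.5, 0.0, 2.0, math.inf, math.nan]

neg_inf_is_identity_for_maximumf = is_identity(maximumf, -math.inf, SAMPLES)
nan_is_identity_for_maxnumf = is_identity(maxnumf, math.nan, SAMPLES)
neg_inf_is_identity_for_maxnumf = is_identity(maxnumf, -math.inf, SAMPLES)
```

Under this model `-inf` fails as an identity for `maxnumf` only on the NaN sample, because `maxnumf(-inf, NaN)` yields `-inf` rather than NaN.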
lorenzo chelini	6e2e4d446c	Revert "[MLIR][Arith] Add denormal attribute to binary/unary operations (#112700 )" This reverts commit 4a7b56e6e7dd0f83c379ad06b6e81450bc691ba6. There is no agreement.	2024-12-10 04:18:20 +01:00
lorenzo chelini	4a7b56e6e7	[MLIR][Arith] Add denormal attribute to binary/unary operations (#112700 ) Add support for denormal in the Arith dialect (binary and unary operations). Denormal are attached to every operation, and they can be of three different kinds: 1) ieee, denormal are preserved and processed as defined by IEEE 754 rules. 2) preserve sign, a mode where denormal numbers are flushed to zero, but the sign of the zero (+0 or -0) is preserved. 3) positive zero, a mode where all denormal numbers are flushed to positive zero (+0), ignoring the sign of the original number. Denormal refers to both the operands and the result. Currently only lowering for ieee is supported.	2024-11-26 11:58:43 +01:00
Chuvak	1fd8d3fea5	[mlir] Fix wrong names in LinalgOps and ScalableValueBoundsConstraintSet (#117227 ) Fix for some mistakes in source code found using PVS Studio. Inspired by: https://pvs-studio.com/en/blog/posts/cpp/1188/ Fixed: - [Bug 2](https://pvs-studio.com/en/blog/posts/cpp/1188/#ID725051E718) - [Bug 3](https://pvs-studio.com/en/blog/posts/cpp/1188/#IDFA2459368E)	2024-11-23 00:34:56 +08:00
Md Asghar Ahmad Shahid	d0d726e56d	Fix GCC build problem with 288f05f related to SmallVector. (#116958 ) Below is the error message for reference. /llvm-project/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp: In static member function 'static llvm::SmallVector<mlir::AffineMap> mlir::linalg::MatmulOp::getDefaultIndexingMaps(mlir::MLIRContext*)': /llvm-project/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp:3468:10: error: could not convert 'indexingMaps' from 'SmallVector<[...],3>' to 'SmallVector<[...],6>' 3468 \| return indexingMaps; \| ^~~~~~~~~~~~ \| \| \| SmallVector<[...],3> Here is the link to the failure. https://lab.llvm.org/buildbot/#/builders/117/builds/3919 ...	2024-11-20 07:40:42 -06:00
Md Asghar Ahmad Shahid	288f05f63e	[NFC][MLIR][Linalg] Refactor linalg.matmul tablegen ODS and related C++ code. (#116377 ) This commit refactors part of the code in preparation for the migration of other matmul variants from OpDSL to ODS. Moves getDefaultIndexingmaps() helper into the MatmulOp class.	2024-11-20 09:26:06 +00:00
Kunwar Grover	db115ba3ef	[mlir][Linalg] Fix non-matmul linalg structured ops (#116412 ) `3ad0148020` broke linalg structured ops other than MatmulOp. The patch: - Changes the printer to hide additional attributes, which weren't hidden before: "indexing_maps". - Changes the build of every linalg structured op to have an indexing map for matmul. These changes combined, hide the problem until you print the operation in its generic form. Reproducer: ```mlir func.func public @bug(%arg0 : tensor<5x10x20xf32>, %arg1 : tensor<5x20x40xf32>, %arg3 : tensor<5x10x40xf32>) -> tensor<5x10x40xf32> { %out = linalg.batch_matmul ins(%arg0, %arg1 : tensor<5x10x20xf32>, tensor<5x20x40xf32>) outs(%arg3 : tensor<5x10x40xf32>) -> tensor<5x10x40xf32> func.return %out : tensor<5x10x40xf32> } ``` Prints fine, with `mlir-opt <file>`, but if you do `mlir-opt --mlir-print-op-generic <file>`: ``` #map = affine_map<(d0, d1, d2) -> (d0, d2)> #map1 = affine_map<(d0, d1, d2) -> (d2, d1)> #map2 = affine_map<(d0, d1, d2) -> (d0, d1)> #map3 = affine_map<(d0, d1, d2, d3) -> (d0, d1, d3)> #map4 = affine_map<(d0, d1, d2, d3) -> (d0, d3, d2)> #map5 = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2)> "builtin.module"() ({ "func.func"() <{function_type = (tensor<5x10x20xf32>, tensor<5x20x40xf32>, tensor<5x10x40xf32>) -> tensor<5x10x40xf32>, sym_name = "bug", sym_visibility = "public"}> ({ ^bb0(%arg0: tensor<5x10x20xf32>, %arg1: tensor<5x20x40xf32>, %arg2: tensor<5x10x40xf32>): %0 = "linalg.batch_matmul"(%arg0, %arg1, %arg2) <{operandSegmentSizes = array<i32: 2, 1>}> ({ ^bb0(%arg3: f32, %arg4: f32, %arg5: f32): %1 = "arith.mulf"(%arg3, %arg4) <{fastmath = #arith.fastmath<none>}> : (f32, f32) -> f32 %2 = "arith.addf"(%arg5, %1) <{fastmath = #arith.fastmath<none>}> : (f32, f32) -> f32 "linalg.yield"(%2) : (f32) -> () }) {indexing_maps = [#map, #map1, #map2], linalg.memoized_indexing_maps = [#map3, #map4, #map5]} : (tensor<5x10x20xf32>, tensor<5x20x40xf32>, tensor<5x10x40xf32>) -> tensor<5x10x40xf32> "func.return"(%0) : (tensor<5x10x40xf32>) -> () }) : () -> () }) : () -> () ``` The batch_matmul operation's builder now always inserts an indexing_map which is unrelated to the operation itself. This was caught when a transformation from one LinalgStructuredOp to another, tried to pass its attributes to the other ops builder and there were multiple indexing_map attributes in the result. This patch fixes this by specializing the builders for MatmulOp with indexing map information.	2024-11-16 08:13:10 +00:00
Md Asghar Ahmad Shahid	3ad0148020	[MLIR][Linalg] Re-land linalg.matmul move to ODS. + Remove/update failing obsolete OpDSL tests. (#115319 ) The earlier PR(https://github.com/llvm/llvm-project/pull/104783) which introduces transpose and broadcast semantic to linalg.matmul was reverted due to two failing OpDSL test for linalg.matmul. Since linalg.matmul is now defined using TableGen ODS instead of Python-based OpDSL, these test started failing and needs to be removed/updated. This commit removes/updates the failing obsolete tests from below files. All other files were part of earlier PR and just cherry picked. "mlir/test/python/integration/dialects/linalg/opsrun.py" "mlir/test/python/integration/dialects/transform.py" --------- Co-authored-by: Renato Golin <rengolin@systemcall.eu>	2024-11-07 14:51:02 +00:00
Emilio Cota	1276ce9e97	Revert "[mlir][linalg] Introduce transpose semantic to 'linalg.matmul' ops. (#104783 )" This reverts commit 03483737a7a2d72a257a5ab6ff01748ad9cf0f75 and 99c8557, which is a fix-up on top of the former. I'm reverting because this commit broke two tests: mlir/test/python/integration/dialects/linalg/opsrun.py mlir/test/python/integration/dialects/transform.py See https://lab.llvm.org/buildbot/#/builders/138/builds/4872 I'm not familiar with the tests, so I'm leaving it to the original author to either remove or adapt the broken tests, as discussed here: https://github.com/llvm/llvm-project/pull/104783#issuecomment-2406390905	2024-10-11 05:22:56 -04:00
Renato Golin	99c8557c17	Fix GCC build problem with 03483737a7a2	2024-10-10 18:52:27 +01:00
Md Asghar Ahmad Shahid	03483737a7	[mlir][linalg] Introduce transpose semantic to 'linalg.matmul' ops. (#104783 ) The main goal of this patch is to extend the semantic of 'linalg.matmul' named op to include per operand transpose semantic while also laying out a way to move ops definition from OpDSL to tablegen. Hence, it is implemented in tablegen. Transpose semantic is as follows. By default 'linalg.matmul' behavior will remain as is. Transpose semantics can be applied on per input operand by specifying the optional permutation attributes (namely 'permutationA' for 1st input and 'permutationB' for 2nd input) for each operand explicitly as needed. By default, no transpose is mandated for any of the input operand. Example: ``` %val = linalg.matmul ins(%arg0, %arg1 : memref<5x3xf32>, memref<5x7xf32>) outs(%arg2: memref<3x7xf32>) permutationA = [1, 0] permutationB = [0, 1] ```	2024-10-10 17:00:58 +01:00
JOE1994	884221eddb	[mlir] Tidy uses of llvm::raw_stream_ostream (NFC) As specified in the docs, 1) raw_string_ostream is always unbuffered and 2) the underlying buffer may be used directly ( 65b13610a5226b84889b923bae884ba395ad084d for further reference ) * Don't call raw_string_ostream::flush(), which is essentially a no-op. * Avoid unneeded calls to raw_string_ostream::str(), to avoid excess indirection.	2024-09-16 23:23:25 -04:00
MaheshRavishankar	d5f0969c96	[mlir][TilingInterface] Avoid looking at operands for getting slices to continue tile + fuse. (#107882 ) Current implementation of `scf::tileConsumerAndFuseProducerUsingSCF` looks at operands of tiled/tiled+fused operations to see if they are produced by `extract_slice` operations to populate the worklist used to continue fusion. This implicit assumption does not always work. Instead make the implementations of `getTiledImplementation` return the slices to use to continue fusion. This is a breaking change - To continue to get the same behavior of `scf::tileConsumerAndFuseProducerUsingSCF`, change all out-of-tree implementation of `TilingInterface::getTiledImplementation` to return the slices to continue fusion on. All in-tree implementations have been adapted to this. - This change touches parts that required a simplification to the `ControlFn` in `scf::SCFTileAndFuseOptions`. It now returns a `std::optional<scf::SCFTileAndFuseOptions::ControlFnResult>` object that should be `std::nullopt` if fusion is not to be performed. Signed-off-by: MaheshRavishankar <mahesh.revishankar@gmail.com>	2024-09-11 22:15:43 -07:00
Kunwar Grover	c9aa55da62	[mlir][Linalg] Add speculation for LinalgStructuredOps (#108032 ) This patch adds speculation behavior for linalg structured ops, allowing them to be hoisted out of loops using LICM.	2024-09-11 09:30:05 +01:00
Longsheng Mou	7063c9427e	[mlir][Linalg] Bugfix for folder of `linalg.transpose` (#102888 ) Folder of linalg transpose only supports tensor type. Fix #102576.	2024-08-21 15:49:10 +08:00
Hsiangkai Wang	c4bf949171	[mlir][linalg] Implement TilingInterface for winograd operators (#96184 ) In order to support arbitrary size input data of conv2d, implement TilingInterface for winograd operations. Before converting winograd operations into nested loops with matrix multiply, tile the input of conv2d into the supported size first. Add a transform operation structured.decompose_winograd_op to decompose winograd operations. Before applying the transform op, use tile_using_for to tile the input data into supported size. The test case shows how to tile and decompose winograd operations.	2024-08-16 16:22:02 +01:00
Ian Wood	4b6f4efabc	[mlir][linalg] Canonicalize non-identity `linalg.generic` ops (#101430 ) Extend `linalg.generic`'s canonicalization patterns to be able to erase ops with non-identity indexing maps but that are still noops.	2024-08-02 09:25:52 -07:00
MaheshRavishankar	d9c85338ea	[mlir][Linalg] Add a pattern to fold concats of fill. (#98995 ) If a concat has all its operands as just fills, and the values match, then the fill could happen on the concatenated values of the `outs` operands. Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>	2024-07-30 14:45:14 -07:00
donald chen	9cc11b98a7	[mlir] [linalg] Add pattern to swap transpose with broadcast (#97063 ) Add a pattern that implements: transpose(broadcast(input)) -> broadcast(transpose(input))	2024-07-23 12:52:25 +08:00
Hsiangkai Wang	7d246e84a4	[mlir][linalg] Implement Conv2D using Winograd Conv2D algorithm (#96181 ) Define high level winograd operators and convert conv_2d_nhwc_fhwc into winograd operators. According to Winograd Conv2D algorithm, we need three transform operators for input, filter, and output transformation. The formula of Winograd Conv2D algorithm is Y = A^T x [(G x g x G^T) @ (B^T x d x B)] x A filter transform: G x g x G^T input transform: B^T x d x B output transform: A^T x y x A The implementation is based on the paper, Fast Algorithm for Convolutional Neural Networks. (https://arxiv.org/abs/1509.09308) Reviewers: stellaraccident, ftynse, Max191, GeorgeARM, cxy-1993, nicolasvasilache, MaheshRavishankar, dcaballe, rengolin Reviewed By: ftynse, Max191, stellaraccident Pull Request: https://github.com/llvm/llvm-project/pull/96181	2024-07-10 07:30:45 +01:00
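The formula quoted in the commit above can be tried out on the smallest common case, F(2x2, 3x3), with a short pure-Python sketch. Two assumptions: the `@` in the formula is read as element-wise multiplication, as in the cited paper, and the constant matrices are the F(2, 3) ones given there. The helper names are hypothetical and none of this is taken from the MLIR implementation.

```python
def matmul(x, y):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def transpose(x):
    return [list(row) for row in zip(*x)]


def hadamard(x, y):
    """Element-wise product of two equally sized matrices."""
    return [[a * b for a, b in zip(rx, ry)] for rx, ry in zip(x, y)]


# Transform matrices for F(2, 3) as given in the paper (assumed here).
B_T = [[1, 0, -1, 0], [0, 1, 1, 0], [0, -1, 1, 0], [0, 1, 0, -1]]
G = [[1, 0, 0], [0.5, 0.5, 0.5], [0.5, -0.5, 0.5], [0, 0, 1]]
A_T = [[1, 1, 1, 0], [0, 1, -1, -1]]


def winograd_2x2_3x3(d, g):
    """Y = A^T [(G g G^T) * (B^T d B)] A for a 4x4 input tile d and a 3x3 filter g."""
    u = matmul(matmul(G, g), transpose(G))          # filter transform
    v = matmul(matmul(B_T, d), transpose(B_T))      # input transform
    m = hadamard(u, v)                              # element-wise product
    return matmul(matmul(A_T, m), transpose(A_T))   # output transform


def direct_2x2_3x3(d, g):
    """Reference: slide the 3x3 filter over the 4x4 tile without flipping it."""
    return [[sum(d[i + p][j + q] * g[p][q] for p in range(3) for q in range(3))
             for j in range(2)] for i in range(2)]


tile = [[4 * r + c for c in range(4)] for r in range(4)]
filt = [[1, 2, 0], [0, -1, 3], [2, 1, 1]]
y_winograd = winograd_2x2_3x3(tile, filt)
y_direct = direct_2x2_3x3(tile, filt)
```

With integer inputs the halves and quarters introduced by `G` are exact in binary floating point, so the two results can be compared for equality here. With general floating-point data a tolerance would be more appropriate.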
Felix Schneider	c65f8d8816	[mlir][linalg] Fix crashes in parser on linalg ops without operands (#97944 ) `parseDstStyleOp` parses both `ins()` and `outs()` optionally. The parsers for `linalg.transpose`, `linalg.broadcast` and `linalg.map` however assume that at least one operand is present in the state, leading to crashes otherwise. This patch adds checks to the parsers which stop them from crashing if no operands were parsed. When the Ops are parsed successfully, the verifiers can work on them. Fix https://github.com/llvm/llvm-project/issues/97857	2024-07-07 18:09:24 +02:00
Prashant Kumar	fa0666876c	[mlir][linalg] Fix numerical issue with softmax (#96090 ) For more info: https://github.com/iree-org/iree/issues/17670#issuecomment-2167591878	2024-06-20 08:08:37 +05:30

528 Commits