llvm-project

Author	SHA1	Message	Date
Rafael Ubal	214d32ccd2	Support for dynamic dimensions in 'tensor.splat' (#74626) This feature had been marked as `TODO` in the `tensor.splat` documentation for a while. This MR includes: - Support for dynamically shaped tensors in the return type of `tensor.splat` with the syntax suggested in the `TODO` comment. - Updated op documentation. - Bufferization support. - Updates in op folders affected by the new feature. - Unit tests for valid/invalid syntax, valid/invalid folding, and lowering through bufferization. - Additional op builders resembling those available in `tensor.empty`.	2023-12-15 13:54:45 +00:00
Matthias Springer	464dfeba44	[mlir][tensor][bufferize] `tensor.empty` bufferizes to an allocation (#68080) Make `tensor.empty` bufferizable, so that the `-empty-tensor-to-alloc-tensor` pass becomes optional. This makes the bufferization easier to use. `tensor.empty` used to be non-bufferizable, so that there were two separate ops, one that can be optimized away (`tensor.empty`) and one that is guaranteed to bufferize to an allocation (`bufferization.alloc_tensor`). With the recent improvements of "empty tensor elimination" this is no longer needed and `bufferization.alloc_tensor` can be phased out.	2023-10-03 16:00:37 +02:00
Matthias Springer	23bd2e96fe	[mlir][Affine] Delete duplicate code: `applyMapToValues` The same functionality is provided by `makeComposedFoldedAffineApply`. Differential Revision: https://reviews.llvm.org/D154199	2023-06-30 14:01:13 +02:00
Matthias Springer	481b254e45	[mlir][tensor][bufferize] Bufferize tensor.splat op The op bufferizes similarly to tensor.generate: it is lowered to a linalg.map, which may then lower to a loop nest that fills the buffer. Differential Revision: https://reviews.llvm.org/D150952	2023-05-22 14:31:39 +02:00
Quentin Colombet	3810f76c50	[mlir][tensor|memref] Harden the checks on dim op Prior to this patch it was possible to use the dim operation on a 0-D memref/tensor. Unless we want to change the semantic of a 0-D shape, this doesn't make sense because, paraphrasing the dim op semantic, this is guaranteed to produce something that is undefined. (The requested index is guaranteed to be equal to or greater than the rank.) Harden the type requirements for the dim op by disallowing 0-D shaped types. This "fixes" llvm.org/PR60195 by rejecting dim op on 0-D shapes instead of crashing during LLVM conversion. Differential Revision: https://reviews.llvm.org/D142445	2023-02-02 11:34:03 +01:00
Matthias Springer	b6ae3f8873	[mlir][tensor][bufferize] Implement getBufferType for CastOp This interface method is used to compute the buffer type of a value during bufferization. It was missing. This interface method is used during loop bufferization. Also fix a bug where a cast from an unranked tensor to a ranked tensor type did not always apply a fully dynamic layout map on the result memref. Differential Revision: https://reviews.llvm.org/D143063	2023-02-01 14:24:10 +01:00
Matthias Springer	be630f07de	[mlir][bufferize] Implement BufferizableOpInterface for tensor.empty The op is not bufferizable but should be analyzable (for `EliminateEmptyTensors`, which uses the bufferization infrastructure). Also improve debugging functionality and error messages. Also adds a missing pass to the sparse pipeline. (tensor.empty should be replaced with bufferization.alloc_tensor, but it sometimes used to work without depending on how the tensor.empty is used. Now we always fail explicitly.)	2022-12-12 14:19:38 +01:00
Emilio Cota	72d76a2403	[mlir][bufferize] lower allocation alignment from 128 to 64 bytes While it is unlikely to matter in practice, there is no reason for this value to be larger than it should be. 64 bytes is the size of a cache line in most machines, and we can fit a full 512-bit vector in it. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D139434	2022-12-07 11:12:46 -05:00
Matthias Springer	09dfb44193	[mlir][tensor][bufferize] Support memory_space for tensor.pad This change adds memory space support to tensor.pad. (tensor.generate and tensor.from_elements do not support memory spaces yet.) The memory space is inferred from the buffer of the source tensor. Instead of lowering tensor.pad to tensor.generate + tensor.insert_slice, it is now lowered to bufferization.alloc_tensor (with the correct memory space) + linalg.map + tensor.insert_slice. Memory space support for the remaining two tensor ops is left for a later point, as this requires some more design discussions. Differential Revision: https://reviews.llvm.org/D136265	2022-10-27 12:29:57 +02:00
Matthias Springer	66baa349c6	[mlir][tensor] Fix build: Add missing line break to test case This should have been part of D136767.	2022-10-27 12:20:05 +02:00
Matthias Springer	c1f0a15c65	[mlir][tensor][bufferize] Lower tensor.generate to linalg.map There is no memref equivalent of tensor.generate. The purpose of this change is to avoid creating scf.parallel loops during bufferization. Differential Revision: https://reviews.llvm.org/D136767	2022-10-27 12:03:13 +02:00
Johannes Reifferscheid	78f4a02aef	Fixes for D133947.	2022-09-16 11:38:30 +02:00
Johannes Reifferscheid	d7c606f5b7	Fix bufferization of collapse_shape of subviews with size 1 dims. Currently, there's an optimization that claims dimensions of size 1 are always contiguous. This is not necessarily the case for subviews. ``` Input: [ [ [0, 1], [2, 3] ], [ [4, 5] [6, 7] ] ] Subview: [ [ [0, 1], ], [ [4, 5] ] ] ``` The old logic treats this subview as contiguous, when it is not. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D134026	2022-09-16 11:32:30 +02:00
Alex Zinenko	46b90a7b5d	[mlir] make remaining memref dialect ops produce strided layouts The three following ops in the memref dialect: transpose, expand_shape, collapse_shape, have been originally designed to operate on memrefs with strided layouts but had to go through the affine map representation as the type did not support anything else. Make these ops produce memref values with StridedLayoutAttr instead now that it is available. Depends On D133938 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133947	2022-09-16 10:56:48 +02:00
Alex Zinenko	2791162b01	[mlir] make memref.subview produce strided layout Memref subview operation has been initially designed to work on memrefs with strided layouts only and has never supported anything else. Port it to use the recently added StridedLayoutAttr instead of extracting the strided form implicitly from affine maps. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133938	2022-09-16 10:56:46 +02:00
Matthias Springer	c37ed7762e	[tensor][bufferize] Use affine.apply instead of arith.addi in PadOp lowering Affine exprs compose better than arith ops. Differential Revision: https://reviews.llvm.org/D132456	2022-08-23 11:46:11 +02:00
Matthias Springer	9ee12f4778	[mlir][tensor][bufferize] Bufferize tensor.pad tensor.pad is lowered to tensor.generate + tensor.insert_slice during bufferization. For best performance with constant padding values, users should vectorize the IR before bufferizing it. This change also relaxes the restriction that no new ops that bufferize to a memory write should be added during bufferization. Since bufferization has been split into two steps a while ago (tensor copy insertion + bufferization), it is reasonable to allow this now. Differential Revision: https://reviews.llvm.org/D132355	2022-08-22 17:00:33 +02:00
Matthias Springer	6c3c5f8069	[mlir][memref] Improve type inference for rank-reducing subviews The result shape of a rank-reducing subview cannot be inferred in the general case. Just the result rank is not enough. The only thing that we can infer is the layout map. This change also improves the bufferization patterns of tensor.extract_slice and tensor.insert_slice to fully support rank-reducing operations. Differential Revision: https://reviews.llvm.org/D129144	2022-07-05 16:49:07 +02:00
Matthias Springer	cc6462a475	[mlir][tensor][bufferize][NFC] Clean up test case Insert -split-input-file flag to make the test cases more stable. Differential Revision: https://reviews.llvm.org/D129143	2022-07-05 16:10:39 +02:00
Benjamin Kramer	6eb0f8e285	[mlir][MemRef] Fix a crash when expanding a scalar shape In this case the reassociation is empty, yielding no strides for the result type. Differential Revision: https://reviews.llvm.org/D127232	2022-06-08 09:37:40 +02:00
Ashay Rane	e287d647c6	[mlir] Add translation from tensor.reshape to memref.reshape This patch augments the `tensor-bufferize` pass by adding a conversion rule to translate ReshapeOp from the `tensor` dialect to the `memref` dialect, in addition to adding a unit test to validate the translation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125031	2022-05-09 17:45:07 +02:00
Yi Zhang	1cddcfdc3c	Fix CollapsedLayoutMap for dim size 1 case This change fixes `CollapsedLayoutMap` for cases where the collapsed dims are size 1. The cases where innermost dims are size 1 and noncontiguous can be represented by the strided form and therefore can be allowed. For such cases, the new stride should be of the next entry in an association whose dimension is not size 1. If the next entry is dynamic, it's not possible to decide which stride to use at compilation time and the stride is set to dynamic. Differential Revision: https://reviews.llvm.org/D124137	2022-04-22 17:48:24 -04:00
Matthias Springer	d820acdde1	[mlir][bufferize][NFC] Use custom walk instead of GreedyPatternRewriter The bufferization driver was previously using a GreedyPatternRewriter. This was problematic because bufferization must traverse ops top-to-bottom. The GreedyPatternRewriter was previously configured via `useTopDownTraversal`, but this was a hack; this API was just meant for performance improvements and should not affect the result of the rewrite. BEGIN_PUBLIC No public commit message needed. END_PUBLIC Differential Revision: https://reviews.llvm.org/D123618	2022-04-22 18:23:09 +09:00
River Riddle	c48e3a13f3	[mlir][NFC] Update textual references of `func` to `func.func` in Tensor/Tosa/Vector tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:29 -07:00
Matthias Springer	d7a9bf9143	[mlir][tensor] Fix verifier and bufferization of collapse_shape Insert a buffer copy unless the dims are guaranteed to be collapsible. In the verifier, accept collapses unless they are guaranteed to be non-collapsible. Differential Revision: https://reviews.llvm.org/D123316	2022-04-08 18:20:40 +09:00
River Riddle	af371f9f98	Reland [GreedPatternRewriter] Preprocess constants while building worklist when not processing top down Reland Note: Adds a fix to properly mark a commutative operation as folded if we change the order of its operands. This was uncovered by the fact that we no longer re-process constants. This avoids accidentally reversing the order of constants during successive application, e.g. when running the canonicalizer. This helps reduce the number of iterations, and also avoids unnecessary changes to input IR. Fixes #51892 Differential Revision: https://reviews.llvm.org/D122692	2022-04-07 11:31:42 -07:00
Nicolas Vasilache	fc8f465a00	[mlir][MemRef] Allow transposed layouts in ExpandShapeOp. https://reviews.llvm.org/D122641 introduced fixes to the ExpandShapeOp verifier but also introduced an artificial layout limitation that prevents the consideration of transposed layouts. This revision fixes the omissions and reimplements the logic using saturated arithmetic which is more idiomatic and avoids leaking internal implementation details. Test cases are added for transposed layouts. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D122845	2022-04-06 04:19:30 -04:00
Matthias Springer	73c0333dee	[mlir][tensor][bufferize] Support 0-d collapse_shape with offset Differential Revision: https://reviews.llvm.org/D122901	2022-04-01 22:30:37 +09:00
Mehdi Amini	ba43d6f85c	Revert "[GreedPatternRewriter] Preprocess constants while building worklist when not processing top down" This reverts commit 59bbc7a0851b6e0054bb3ed47df0958822f08880. This exposes an issue breaking the contract of `applyPatternsAndFoldGreedily` where we "converge" without applying remaining patterns.	2022-04-01 06:16:55 +00:00
River Riddle	59bbc7a085	[GreedPatternRewriter] Preprocess constants while building worklist when not processing top down This avoids accidentally reversing the order of constants during successive application, e.g. when running the canonicalizer. This helps reduce the number of iterations, and also avoids unnecessary changes to input IR. Fixes #51892 Differential Revision: https://reviews.llvm.org/D122692	2022-03-31 12:08:55 -07:00
Matthias Springer	51df62388e	[mlir][tensor] Fix bufferization of CollapseShapeOp / ExpandShapeOp Infer a tighter MemRef type instead of always falling back to the most dynamic MemRef type. This is inefficient and caused op verification errors. Differential Revision: https://reviews.llvm.org/D122649	2022-03-31 17:11:45 +09:00
Matthias Springer	39ec46bd83	[mlir][bufferize] Extract buffer hoisting into separate function This improves the modularity of the bufferization. From now on, all ops that do not implement BufferizableOpInterface are considered hoisting barriers. Previously, all ops that do not implement the interface were not considered barriers and such ops had to be marked as barriers explicitly. This was unsafe because we could've hoisted across unknown ops where it was not safe to hoist. As a side effect, this allows for cleaning up AffineBufferizableOpInterfaceImpl. This build unit is no longer needed and can be deleted. Differential Revision: https://reviews.llvm.org/D121519	2022-03-15 21:25:03 +09:00
Matthias Springer	e6f691615e	[mlir][bufferize] Support tensor.expand_shape and tensor.collapse_shape Differential Revision: https://reviews.llvm.org/D112512	2022-02-15 19:53:49 +09:00
Matthias Springer	daf18108ec	[mlir][tensor] Replace tensor-bufferize with BufferizableOpInterface impl This commit switches the `tensor-bufferize` pass over to BufferizableOpInterface-based bufferization. Differential Revision: https://reviews.llvm.org/D118246	2022-01-27 19:30:45 +09:00
Alexander Belyaev	f77e9f8768	[mlir] Extend `tensor.from_elements` to support N-D case. RFC: https://llvm.discourse.group/t/rfc-extend-tensor-fromelementsop-to-n-d/4715 Differential Revision: https://reviews.llvm.org/D115821	2021-12-16 14:52:41 +01:00
Alexander Belyaev	a82a19c137	[mlir] Add a missing pattern to bufferize tensor.rank. Differential Revision: https://reviews.llvm.org/D115745	2021-12-14 20:04:57 +01:00
Alexander Belyaev	57470abc41	[mlir] Move memref.[tensor_load|buffer_cast|clone] to "bufferization" dialect. https://llvm.discourse.group/t/rfc-dialect-for-bufferization-related-ops/4712 Differential Revision: https://reviews.llvm.org/D114552	2021-11-25 11:50:39 +01:00
River Riddle	015192c634	[mlir:DialectConversion] Restructure how argument/target materializations get invoked The current implementation invokes materializations whenever an input operand does not have a mapping for the desired type, i.e. it requires materialization at the earliest possible point. This conflicts with the goal of dialect conversion (and also the current documentation) which states that a materialization is only required if the materialization is supposed to persist after the conversion process has finished. This revision refactors this such that whenever a target materialization "might" be necessary, we insert an unrealized_conversion_cast to act as a temporary materialization. This allows for deferring the invocation of the user materialization hooks until the end of the conversion process, where we actually have a better sense if it's actually necessary. This has several benefits: * In some cases a target materialization hook is no longer necessary When performing a full conversion, there are some situations where a temporary materialization is necessary. Moving forward, these users won't need to provide any target materializations, as the temporary materializations do not require the user to provide materialization hooks. * getRemappedValue can now handle values that haven't been converted yet Before this commit, it wasn't well supported to get the remapped value of a value that hadn't been converted yet (making it difficult/impossible to convert multiple operations in many situations). This commit updates getRemappedValue to properly handle this case by inserting temporary materializations when necessary. Another code-health related benefit is that with this change we can move a majority of the complexity related to materializations to the end of the conversion process, instead of handling them ad hoc while conversion is happening. Differential Revision: https://reviews.llvm.org/D111620	2021-10-27 02:09:04 +00:00
Mogball	a54f4eae0e	[MLIR] Replace std ops with arith dialect ops Precursor: https://reviews.llvm.org/D110200 Removed redundant ops from the standard dialect that were moved to the `arith` or `math` dialects. Renamed all instances of operations in the codebase and in tests. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D110797	2021-10-13 03:07:03 +00:00
Matthias Springer	e895a670f8	[mlir] Move BufferizeDimOp to Tensor/Transforms/Bufferize.cpp Differential Revision: https://reviews.llvm.org/D105256	2021-07-02 10:05:59 +09:00
Matthias Springer	c0a6318d96	[mlir][tensor] Add tensor.dim operation * Split memref.dim into two operations: memref.dim and tensor.dim. Both ops have the same builder interface and op argument names, so that they can be used with templates in patterns that apply to both tensors and memrefs (e.g., some patterns in Linalg). * Add constant materializer to TensorDialect (needed for folding in affine.apply etc.). * Remove some MemRefDialect dependencies, make some explicit. Differential Revision: https://reviews.llvm.org/D105165	2021-07-01 10:00:19 +09:00
Julian Gross	e2310704d8	[MLIR] Create memref dialect and move dialect-specific ops from std. Create the memref dialect and move dialect-specific ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp AssumeAlignmentOp -> MemRef_AssumeAlignmentOp DeallocOp -> MemRef_DeallocOp DimOp -> MemRef_DimOp MemRefCastOp -> MemRef_CastOp MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp LoadOp -> MemRef_LoadOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp SubViewOp -> MemRef_SubViewOp TransposeOp -> MemRef_TransposeOp TensorLoadOp -> MemRef_TensorLoadOp TensorStoreOp -> MemRef_TensorStoreOp TensorToMemRefOp -> MemRef_BufferCastOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D98041	2021-03-15 11:14:09 +01:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit 8aa6c3765b924d86f623d452777eb76b83bf2787.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Sean Silva	be7352c00d	[mlir][splitting std] move 2 more ops to `tensor` - DynamicTensorFromElementsOp - TensorFromElements Differential Revision: https://reviews.llvm.org/D94994	2021-01-19 13:49:25 -08:00
Sean Silva	129d6e554e	[mlir] Move `std.tensor_cast` -> `tensor.cast`. This is almost entirely mechanical. Differential Revision: https://reviews.llvm.org/D93357	2020-12-17 16:06:56 -08:00
Sean Silva	444822d77a	Revert "Revert "[mlir] Start splitting the `tensor` dialect out of `std`."" This reverts commit 0d48d265db6633e4e575f81f9d3a52139b1dc5ca. This reapplies the following commit, with a fix for CAPI/ir.c: [mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 14:30:50 -08:00
Sean Silva	0d48d265db	Revert "[mlir] Start splitting the `tensor` dialect out of `std`." This reverts commit cab8dda90f48e15ee94b0d55ceac5b6a812e4743. I mistakenly thought that CAPI/ir.c failure was unrelated to this change. Need to debug it.	2020-12-11 14:15:41 -08:00
Sean Silva	cab8dda90f	[mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 13:50:55 -08:00

49 Commits