llvm-project

Author	SHA1	Message	Date
Matthias Springer	9fa6b3504b	[mlir][bufferization] Improve aliasing OpOperand/OpResult property `getAliasingOpOperands`/`getAliasingOpResults` now encodes OpOperand/OpResult, buffer relation and a degree of certainty. E.g.: ``` // aliasingOpOperands(%r) = {(%t, EQUIV, DEFINITE)} // aliasingOpResults(%t) = {(%r, EQUIV, DEFINITE)} %r = tensor.insert %f into %t[%idx] : tensor<?xf32> // aliasingOpOperands(%r) = {(%t0, EQUIV, MAYBE), (%t1, EQUIV, MAYBE)} // aliasingOpResults(%t0) = {(%r, EQUIV, MAYBE)} // aliasingOpResults(%t1) = {(%r, EQUIV, MAYBE)} %r = arith.select %c, %t0, %t1 : tensor<?xf32> ``` `BufferizableOpInterface::bufferRelation` is removed, as it is now part of `getAliasingOpOperands`/`getAliasingOpResults`. This change allows for better analysis, in particular wrt. equivalence. This allows additional optimizations and better error checking (which is sometimes overly conservative). Examples: * EmptyTensorElimination can eliminate `tensor.empty` inside `scf.if` blocks. This requires a modeling of equivalence: It is not a per-OpResult property anymore. Instead, it can be specified for each OpOperand and OpResult. This is important because `tensor.empty` may be eliminated only if all values on the SSA use-def chain to the final consumer (`tensor.insert_slice`) are equivalent. * The detection of "returning allocs from a block" can be improved. (Addresses a TODO in `assertNoAllocsReturned`.) This allows us to bufferize IR such as "yielding a `tensor.extract_slice` result from an `scf.if` branch", which currently fails to bufferize because the alloc detection is too conservative. * Better bufferization of loops. Aliases of the iter_arg can be yielded (even if they are not equivalent) without having to realloc and copy the entire buffer on each iteration. The above-mentioned examples are not yet implemented with this change. This change just improves the BufferizableOpInterface, its implementations and related helper functions, so that better aliasing information is available for each op. Differential Revision: https://reviews.llvm.org/D142129	2023-02-09 11:35:03 +01:00
Matthias Springer	1ac248e485	[mlir][bufferization][NFC] Rename getAliasingOpOperand/getAliasingOpResult * `getAliasingOpOperand` => `getAliasingOpOperands` * `getAliasingOpResult` => `getAliasingOpResults` Also a few minor code cleanups and better documentation. Differential Revision: https://reviews.llvm.org/D142979	2023-02-01 10:07:41 +01:00
Matthias Springer	017be821da	[mlir][vector][bufferize] Fix Windows build failure introduced by D141686	2023-01-31 09:36:20 +01:00
Matthias Springer	199f368e35	[mlir][vector][bufferize] Bufferize vector.mask and vector.yield The masked op can currently not bufferize out-of-place. Such IR would be rejected by the One-Shot Bufferize because it would mean that a new buffer allocation is yielded from a block. Furthermore, only one operation is currently allowed inside `vector.mask`. Differential Revision: https://reviews.llvm.org/D141686	2023-01-31 09:02:27 +01:00
Matthias Springer	bf531f28f5	[mlir][vector][bufferize] Implement DestinationStyleOpInterface on TransferWriteOp This simplifies the BufferizableOpInterface implementation of vector.transfer_write. Differential Revision: https://reviews.llvm.org/D136348	2022-10-27 11:02:19 +02:00
Jerry Wu	66c2b76846	[MLIR] Extend vector.gather to accept tensor as base In addition to memref, accept ranked tensor as the base operand of vector.gather, similar to vector.trasnfer_read. This will allow us to vectorize noncontiguous tensor.extract into vector.gather. Full discussion can be found here: https://github.com/iree-org/iree/issues/9198 Reviewed By: hanchung, dcaballe Differential Revision: https://reviews.llvm.org/D130097	2022-08-09 11:19:16 -07:00
Matthias Springer	a28ce1a42b	[mlir][vector][bufferize] Fix transfer_write dropping mask operand Differential Revision: https://reviews.llvm.org/D129253	2022-07-07 10:02:13 +02:00
Matthias Springer	5d50f51c97	[mlir][bufferization][NFC] Add error handling to getBuffer This is in preparation of adding memory space support. Differential Revision: https://reviews.llvm.org/D128277	2022-06-27 13:48:01 +02:00
Matthias Springer	b55d55ecd9	[mlir][bufferize][NFC] Remove BufferizationState With the recent refactorings, this class is no longer needed. We can use BufferizationOptions in all places were BufferizationState was used. Differential Revision: https://reviews.llvm.org/D127653	2022-06-17 14:04:11 +02:00
Matthias Springer	b3ebe3beed	[mlir][bufferize] Bufferize after TensorCopyInsertion This change changes the bufferization so that it utilizes the new TensorCopyInsertion pass. One-Shot Bufferize no longer calls the One-Shot Analysis. Instead, it relies on the TensorCopyInsertion pass to make the entire IR fully inplacable. The `bufferize` implementations of all ops are simplified; they no longer have to account for out-of-place bufferization decisions. These were already materialized in the IR in the form of `bufferization.alloc_tensor` ops during the TensorCopyInsertion pass. Differential Revision: https://reviews.llvm.org/D127652	2022-06-17 13:29:52 +02:00
Jacques Pienaar	7c38fd605b	[mlir] Flip Vector dialect accessors used to prefixed form. This has been on _Both for a couple of weeks. Flip usages in core with intention to flip flag to _Prefixed in follow up. Needed to add a couple of helper methods in AffineOps and Linalg to facilitate a pure flag flip in follow up as some of these classes are used in templates and so sensitive to Vector dialect changes. Differential Revision: https://reviews.llvm.org/D122151	2022-03-28 11:24:47 -07:00
River Riddle	77eee5795e	[mlir] Refactor DialectRegistry delayed interface support into a general DialectExtension mechanism The current dialect registry allows for attaching delayed interfaces, that are added to attrs/dialects/ops/etc. when the owning dialect gets loaded. This is clunky for quite a few reasons, e.g. each interface type has a separate tracking structure, and is also quite limiting. This commit refactors this delayed mutation of dialect constructs into a more general DialectExtension mechanism. This mechanism is essentially a registration callback that is invoked when a set of dialects have been loaded. This allows for attaching interfaces directly on the loaded constructs, and also allows for loading new dependent dialects. The latter of which is extremely useful as it will now enable dependent dialects to only apply in the contexts in which they are necessary. For example, a dialect dependency can now be conditional on if a user actually needs the interface that relies on it. Differential Revision: https://reviews.llvm.org/D120367	2022-03-16 22:15:25 -07:00
Matthias Springer	9597b16aa9	[mlir][bufferize][NFC] Split BufferizationState into AnalysisState/BufferizationState Differential Revision: https://reviews.llvm.org/D121361	2022-03-15 17:35:47 +09:00
Matthias Springer	585a8a321c	[mlir][bufferize] OpOperands can have multiple aliasing OpResults This makes getAliasingOpResult symmetric to getAliasingOpOperand. The previous implementation was confusing for users and implemented in such a way only because there are currently no bufferizable ops that have multiple aliasing OpResults. Differential Revision: https://reviews.llvm.org/D119259	2022-02-09 20:58:45 +09:00
Matthias Springer	5523c1455a	[mlir][bufferize][NFC] Move vector BufferizableOpInterface impl to vector dialect Differential Revision: https://reviews.llvm.org/D118540	2022-02-01 00:28:28 +09:00

15 Commits