llvm-project

Author	SHA1	Message	Date
Matthias Springer	1b99f3a224	[mlir][bufferize] Treat certain aliasing-only uses like memory reads This fixes an issue in One-Shot Bufferize that could lead to missing buffer copies in the future. This bug cannot currently be triggered because of the order in which ops are analyzed (always bottom-to-top). However, if we consider different traversal orders for the analysis in the future, this bug can cause subtle issues that are difficult to debug. Example: ``` %0 = ... %1 = tensor.insert ... into %0 %2 = tensor.extract_slice %0 tensor.extract %2[...] ``` In case of a top-to-bottom analysis of the above IR, the `tensor.insert` is analyzed before the `tensor.extract_slice`. In that case, the `tensor.insert` will bufferize in-place because %2 is not yet known to become an alias of %0 (and therefore to cause a conflict). With this change, the `tensor.insert` will bufferize out-of-place, regardless of the traversal order. Differential Revision: https://reviews.llvm.org/D135049	2022-10-14 10:40:45 +09:00
Mehdi Amini	0666e50e14	Apply clang-tidy fixes for modernize-use-equals-default in Bufferize.cpp (NFC)	2022-10-10 01:08:27 +00:00
Matthias Springer	1f8ffbd1cc	[mlir][bufferize][NFC] Address review comments of D135420 These changes should have been landed as part of D135420. Differential Revision: https://reviews.llvm.org/D135438	2022-10-07 19:54:08 +09:00
Matthias Springer	2e210034da	[mlir][bufferize] Fix repetitive region conflict detection This fixes a bug where a required buffer copy was not inserted. Not only written aliases, but also read aliases should be taken into account when computing common enclosing repetitive regions. Furthermore, for writing ops, it does not matter where the destination tensor is defined, but where the op itself is located. Differential Revision: https://reviews.llvm.org/D135420	2022-10-07 16:39:03 +09:00
Matthias Springer	f4e8f44811	[mlir][bufferize] Fix enclosing repetitive region computation The wrong function overload was called. Differential Revision: https://reviews.llvm.org/D135342	2022-10-07 10:37:04 +09:00
Matthias Springer	6cdd34b973	[mlir][tensor][bufferize] Bufferize inserts into equivalent tensors in-place Inserting a tensor into an equivalent tensor is a no-op after bufferization. No alloc is needed. Differential Revision: https://reviews.llvm.org/D132662	2022-10-06 15:06:33 +09:00
Matthias Springer	129420df51	[mlir][bufferization][NFC] Move EmptyTensorToAllocTensorPass This change moves the pass from the Linalg dialect to the bufferization dialect. Differential Revision: https://reviews.llvm.org/D135130	2022-10-05 09:57:22 +09:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Alex Zinenko	f096e72ce6	[mlir] switch bufferization to use strided layout attribute Bufferization already makes the assumption that buffers pass function boundaries in the strided form and uses the corresponding affine map layouts. Switch it to use the recently introduced strided layout instead to avoid unnecessary casts when bufferizing further operations to the memref dialect counterparts that now largely rely on the strided layout attribute. Depends On D133947 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133951	2022-09-16 10:56:50 +02:00
Matthias Springer	f7dd9a3206	[mlir][bufferize] Add new debug flag: copy-before-write If this flag is set, the analysis is skipped and buffers are copied before every write. Differential Revision: https://reviews.llvm.org/D133288	2022-09-05 14:41:19 +02:00
Matthias Springer	f7f0c7f7e3	[mlir][bufferize] Add isRepetitiveRegion to BufferizableOpInterface This method allows declaring regions as "repetitive" even if the parent op does not implement the RegionBranchOpInterface. This is needed to support loop-like ops that have parallel semantics but do not branch between regions. Differential Revision: https://reviews.llvm.org/D133113	2022-09-02 14:47:20 +02:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit 2be8af8f0e0780901213b6fd3013a5268ddc3359.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Matthias Springer	123c4b0251	[mlir][SCF][bufferize] Support different iter_arg/init_arg types (scf.for) Even though iter_arg and init_arg of an scf.for loop may have the same tensor type, their bufferized memref types are not necessarily equal. It is sometimes necessary to insert a cast in case of differing layout maps. Differential Revision: https://reviews.llvm.org/D132860	2022-08-30 16:35:32 +02:00
Matthias Springer	111c919665	[mlir][bufferization] Generalize getBufferType This change generalizes getBufferType. This function can be used to predict the buffer type of any tensor value (not just BlockArguments) without changing any IR. It also subsumes getMemorySpace. This is useful for loop bufferization, where the precise buffer type of an iter_arg cannot be known without examining the loop body. Differential Revision: https://reviews.llvm.org/D132859	2022-08-30 16:26:44 +02:00
Johannes Reifferscheid	23dec4a352	Move BufferViewFlowAnalysis to the Bufferization dialect. It's only used from there, and this lets us remove the dependency from Analysis to the Arith dialect. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D132928	2022-08-30 14:25:49 +02:00
Matthias Springer	9ee12f4778	[mlir][tensor][bufferize] Bufferize tensor.pad tensor.pad is lowered to tensor.generate + tensor.insert_slice during bufferization. For best performance with constant padding values, users should vectorize the IR before bufferizing it. This change also relaxes the restriction that no new ops that bufferize to a memory write should be added during bufferization. Since bufferization was split into two steps a while ago (tensor copy insertion + bufferization), it is reasonable to allow this now. Differential Revision: https://reviews.llvm.org/D132355	2022-08-22 17:00:33 +02:00
Kazu Hirata	258531b7ac	Remove redundant initialization of Optional (NFC)	2022-08-20 21:18:28 -07:00
Matthias Springer	3f914d84c3	[mlir][bufferize] Better error handling: Fail if ToMemrefOps are found bufferization.to_memref ops are not supported in One-Shot Analysis. They often trigger a failed assertion that can be confusing. Instead, scan for to_memref ops before running the analysis and immediately abort with a proper error message. Differential Revision: https://reviews.llvm.org/D132027	2022-08-18 11:37:57 +02:00
root	894e8a5446	[MLIR] Add dealloc alias check to bufferization Traverse the cloneOp for aliases to find the alloc op Reviewed By: frgossen, bondhugula Differential Revision: https://reviews.llvm.org/D131797	2022-08-17 19:11:59 -04:00
Matthias Springer	a36348c586	[mlir][bufferize] Fix bug in AllocTensorElimination AllocTensorElimination does not currently support chains where the type is changing. AllocTensorElimination used to generate invalid IR for such inputs. With this commit, AllocTensorElimination no longer applies to such inputs. (It can be extended to support such IR if needed.) Differential Revision: https://reviews.llvm.org/D131880	2022-08-15 11:45:58 +02:00
Kazu Hirata	3a6da9ebcb	[mlir] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-08-14 12:51:59 -07:00
Jeff Niu	58a47508f0	(Reland) [mlir] Switch segment size attributes to DenseI32ArrayAttr This reland includes changes to the Python bindings. Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131801 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D131803	2022-08-12 19:44:52 -04:00
Matthias Springer	bf1b9528ff	[mlir][bufferize] Fix missing copy when bufferizing loops Using a loop init_arg inside of the loop is not supported. This change adds a pre-processing pass that resolves such IR with copies. Differential Revision: https://reviews.llvm.org/D131689	2022-08-12 10:44:55 +02:00
Alex Zinenko	e8e718fa4b	Revert "[mlir] Switch segment size attributes to DenseI32ArrayAttr" This reverts commit 30171e76f0e5ea8037bc4d1450dd3e12af4d9938. Breaks Python tests in MLIR, missing C API and Python changes.	2022-08-12 10:22:47 +02:00
Jeff Niu	30171e76f0	[mlir] Switch segment size attributes to DenseI32ArrayAttr Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131738 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D131702	2022-08-11 20:56:45 -04:00
Matthias Springer	664ffa46bb	[mlir][tensor][bufferize] Fix deallocation of GenerateOp/FromElementsOp Both ops allocate a buffer. There were cases in which the buffer was not deallocated. Differential Revision: https://reviews.llvm.org/D130469	2022-07-25 12:25:06 +02:00
Alex Zinenko	333ee218ce	[mlir] Transform dialect: separate dependent and generated dialects In the Transform dialect extensions, provide the separate mechanism to declare dependent dialects (the dialects the transform IR depends on) and the generated dialects (the dialects the payload IR may be transformed into). This allows the Transform dialect clients that are only constructing the transform IR to avoid loading the dialects relevant for the payload IR along with the Transform dialect itself, thus decreasing the build/link time. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D130289	2022-07-25 09:59:53 +00:00
Kazu Hirata	c730f9a164	Convert for_each to range-based for loops (NFC)	2022-07-23 12:17:27 -07:00
Kazu Hirata	380a1b204c	Use callables directly in any_of, count_if, etc (NFC)	2022-07-23 00:28:31 -07:00
Matthias Springer	27a431f5e9	[mlir][bufferization][NFC] Move sparse_tensor.release to bufferization dialect This op used to belong to the sparse dialect, but there are use cases for dense bufferization as well. (E.g., when a tensor alloc is returned from a function and should be deallocated at the call site.) This change moves the op to the bufferization dialect, which now has an `alloc_tensor` and a `dealloc_tensor` op. Differential Revision: https://reviews.llvm.org/D129985	2022-07-19 09:18:19 +02:00
Kazu Hirata	3b0dce5b8b	Use value_or (NFC)	2022-07-15 19:46:29 -07:00
Jeff Niu	b7f93c2809	[mlir] (NFC) run clang-format on all files	2022-07-14 13:32:13 -07:00
Matthias Springer	74902cc96f	[mlir][linalg][NFC] Cleanup: Drop linalg.inplaceable attribute bufferization.writable is used in most cases instead. All remaining test cases are updated. Some code that is no longer needed is deleted. Differential Revision: https://reviews.llvm.org/D129739	2022-07-14 15:50:03 +02:00
Matthias Springer	c66303c287	[mlir][sparse] Switch to One-Shot Bufferize This change removes the partial bufferization passes from the sparse compilation pipeline and replaces them with One-Shot Bufferize. One-Shot Analysis (and TensorCopyInsertion) is used to resolve all out-of-place bufferizations, dense and sparse. Dense ops are then bufferized with BufferizableOpInterface. Sparse ops are still bufferized in the Sparsification pass. Details: * Dense allocations are automatically deallocated, unless they are yielded from a block. (In that case the alloc would leak.) All test cases are modified accordingly. E.g., some funcs now have an "out" tensor argument that is returned from the function. (That way, the allocation happens at the call site.) * Sparse allocations are not automatically deallocated. They must be "released" manually. (No change, this will be addressed in a future change.) * Sparse tensor copies are not supported yet. (Future change) * Sparsification no longer has to consider inplacability. If necessary, allocations and/or copies are inserted during TensorCopyInsertion. All tensors are inplaceable by the time Sparsification is running. Instead of marking a tensor as "not inplaceable", it can be marked as "not writable", which will trigger an allocation and/or copy during TensorCopyInsertion. Differential Revision: https://reviews.llvm.org/D129356	2022-07-14 09:52:48 +02:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Jacques Pienaar	136d746ec7	[mlir] Flip accessors to prefixed form (NFC) Another mechanical sweep to keep diff small for flip to _Prefixed.	2022-07-10 21:19:11 -07:00
Matthias Springer	fc9b37dd53	[mlir][bufferization] Do not canonicalize to_tensor(to_memref(x)) This is a partial revert of D128615. to_memref(to_tensor(x)) can always be folded to x. But to_tensor(to_memref(x)) cannot be folded in the general case because writes to the intermediary memref may go unnoticed. Differential Revision: https://reviews.llvm.org/D129354	2022-07-09 09:16:52 +02:00
Matthias Springer	606f7c8f7a	[mlir][bufferization][NFC] Move more unknown type conversion logic into BufferizationOptions The `unknownTypeConversion` bufferization option (enum) is now a type converter function option. Some logic of `getMemRefType` is now handled by that function. This change makes type conversion more controllable. Previously, there were only two options when generating memref types for non-bufferizable ops: Static identity layout or fully dynamic layout. With this change, users of One-Shot Bufferize can provide a function with custom logic. Differential Revision: https://reviews.llvm.org/D129273	2022-07-07 13:36:28 +02:00
Matthias Springer	6c3c5f8069	[mlir][memref] Improve type inference for rank-reducing subviews The result shape of a rank-reducing subview cannot be inferred in the general case. Just the result rank is not enough. The only thing that we can infer is the layout map. This change also improves the bufferization patterns of tensor.extract_slice and tensor.insert_slice to fully support rank-reducing operations. Differential Revision: https://reviews.llvm.org/D129144	2022-07-05 16:49:07 +02:00
Nicolas Vasilache	741f8f2bed	[mlir][Tensor][NFC] Better document rank-reducing behavior of ExtractSliceOp and cleanup	2022-06-29 07:37:58 -07:00
Jacques Pienaar	04235d07ad	[mlir] Update flipped accessors (NFC) Follow up with memref flipped and flipping any intermediate changes made.	2022-06-28 13:11:26 -07:00
Matthias Springer	cb47124179	[mlir][bufferize] Improve to_tensor/to_memref folding Differential Revision: https://reviews.llvm.org/D128615	2022-06-27 21:42:39 +02:00
Matthias Springer	c0b0b6a00a	[mlir][bufferize] Infer memory space in all bufferization patterns This change updates all remaining bufferization patterns (except for scf.while) and the remaining bufferization infrastructure to infer the memory space whenever possible instead of falling back to "0". (If a default memory space is set in the bufferization options, we still fall back to that value if the memory space could not be inferred.) Differential Revision: https://reviews.llvm.org/D128423	2022-06-27 16:32:52 +02:00
Matthias Springer	45b995cda4	[mlir][bufferize][NFC] Change signature of allocateTensorForShapedValue Add a failure return value and bufferization options argument. This is to keep a subsequent change smaller. Differential Revision: https://reviews.llvm.org/D128278	2022-06-27 16:00:06 +02:00
Matthias Springer	5d50f51c97	[mlir][bufferization][NFC] Add error handling to getBuffer This is in preparation of adding memory space support. Differential Revision: https://reviews.llvm.org/D128277	2022-06-27 13:48:01 +02:00
Matthias Springer	0d0a94a792	[mlir][bufferization][NFC] Fix typo in AllocTensorOp builders	2022-06-27 13:41:18 +02:00
Matthias Springer	ba9d886db4	[mlir][bufferization][NFC] Bufferize with PostOrder traversal This is useful because the result type of an op can sometimes be inferred from its body (e.g., `scf.if`). This will be utilized in subsequent changes. Also introduces a new `getBufferType` interface method on BufferizableOpInterface. This method is useful for computing a bufferized block argument type with respect to OpOperand types of the parent op. Differential Revision: https://reviews.llvm.org/D128420	2022-06-27 12:42:41 +02:00


175 Commits