llvm-project

Author	SHA1	Message	Date
Alex Zinenko	371366ce27	[mlir][nvgpu] add simple pipelining for shared memory copies Add a simple transform operation to the NVGPU extension that performs software pipelining of copies to shared memory. The functionality is extremely minimalistic in this version and only supports copies from global to shared memory inside an `scf.for` loop with either `vector.transfer` or `nvgpu.device_async_copy` operations when pipelining preconditions are already satisfied in the IR. This is the minimally useful version that uses the more general loop pipeliner in an NVGPU-specific way. Further extensions and orthogonalizations will be necessary. This required a change to the loop pipeliner itself to properly propagate errors should the predicate generator fail. This is loosely inspired from the vesion in IREE, but has less unsafe assumptions and more principled way of communicating decisions. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155223	2023-07-17 14:29:12 +00:00
Lorenzo Chelini	2049b2adfe	[MLIR] Fix compiler warnings (NFC) In `TestTensorTransforms.cpp` `replaced` is nullptr I assumed the intent was to emit the error for the `rootOp`. In `TransformInterfaces.cpp` there were some uninitialized variables. In `NVGPUTransformOps.cpp` `matmulOp` was never used. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D154439	2023-07-05 09:49:57 +02:00
Nicolas Vasilache	13f4e889c5	Revert "Revert "[mlir][Transform] Add support for mma.sync m16n8k16 f16 rewrite." and "[mlir][Transform] Introduce nvgpu transform extensions"" This reverts commit 6506692fe619ef8a1f7c6ea829d9a9eceb31622d. Differential Revision: https://reviews.llvm.org/D153845	2023-06-28 06:50:05 +00:00
Mehdi Amini	6506692fe6	Revert "[mlir][Transform] Add support for mma.sync m16n8k16 f16 rewrite." and "[mlir][Transform] Introduce nvgpu transform extensions" This reverts commit 40deed40ae77ba22f7c72693903752ab6bfeb4e7. and commit 1660f2174d59bc2fd04131dab9ab0b43178bf665. The buildbot is broken, the two tests aren't passing.	2023-06-27 08:46:18 +02:00
Nicolas Vasilache	1660f2174d	[mlir][Transform] Add support for mma.sync m16n8k16 f16 rewrite. This PR adds support for the m16n8k16 f16 case. At this point, the support is mostly mechanical and could be Tablegen'd to all cases. Until then, this can be populated as needed on a case-by-case basis. Depends on: D153420 Differential Revision: https://reviews.llvm.org/D153428	2023-06-26 16:46:42 +00:00
Nicolas Vasilache	40deed40ae	[mlir][Transform] Introduce nvgpu transform extensions Mapping to NVGPU operations such as mma.sync with mixed precision and ldmatrix with transposes and various data types involves complex matchings from low-level IR. This is akin to raising complex patterns after unnecessarily having lost structural information. To avoid such unnecessary complexity, introduce a direct mapping step from a matmul on memrefs to distributed NVGPU vector abstractions. In this context, mapping to specific mma.sync operations is trivial and consists in simply translating the documentation into indexing expressions. Correctness is demonstrated with an end-to-end integration test. Differential Revision: https://reviews.llvm.org/D153420	2023-06-26 16:21:28 +00:00

6 Commits