llvm-project

Author	SHA1	Message	Date
Navdeep Katel	3d35546cd1	Support `transpose` mode for `gpu.subgroup` WMMA ops Add support for loading, computing, and storing `gpu.subgroup` WMMA ops in transpose mode as well. Update the GPU to NVVM lowerings to support `transpose` mode and update integration tests as well. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D139021	2022-12-05 22:37:02 +05:30
Quinn Dawkins	c0321edc26	[mlir][gpu] Adding support for transposed mma_load_matrix Enables transposed gpu.subgroup_mma_load_matrix and updates the lowerings in Vector to GPU and GPU to SPIRV. Needed to enable B transpose matmuls lowering to wmma ops. Taken over from author: stanley-nod <stanley@nod-labs.com> Reviewed By: ThomasRaoux, antiagainst Differential Revision: https://reviews.llvm.org/D138770	2022-11-29 03:35:49 +00:00
Nirvedh Meshram	c441070665	[mlir][spirv] Add conversion from GPU WMMA ops to SPIRV Cooperative matrix Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D136521	2022-10-22 18:29:40 -07:00
River Riddle	10c04f4641	[mlir:GPU][NFC] Update GPU API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
River Riddle	986b5c56ea	[mlir] Flip Async/GPU/OpenACC/OpenMP to use Both accessors This allows for incrementally updating the old API usages without needing to update everything at once. These will be left on Both for a little bit and then flipped to prefixed when all APIs have been updated. Differential Revision: https://reviews.llvm.org/D134386	2022-09-21 17:36:13 -07:00
Jeff Niu	5c5af910fe	[mlir][LLVMIR] "Modernize" Insert/ExtractValueOp This patch "modernizes" the LLVM `insertvalue` and `extractvalue` operations to use DenseI64ArrayAttr, since they only require an array of indices and previously there was confusion about whether to use i32 or i64 arrays, and to use assembly format. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D131537	2022-08-10 12:51:11 -04:00
Jeff Niu	0af643f3ce	[mlir][LLVMIR] (NFC) Add convenience builders for ConstantOp And clean up some of the user code	2022-08-09 15:34:36 -04:00
Thomas Raoux	a6f2c2291e	[mlir][GPUToNVVM] Fix bug in mma elementwise lowering The maxf implementation of wmma elementwise op was incorrect as the operands of the select to check for Nan were swapped. Differential Revision: https://reviews.llvm.org/D127879	2022-06-15 17:23:17 +00:00
Mogball	d7ef488bb6	[mlir][gpu] Move GPU headers into IR/ and Transforms/ Depends on D127350 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D127352	2022-06-09 22:49:03 +00:00
Tres Popp	b4e0507ce0	Rename PatternRewriteSet::insert to add insert is soft deprecated, so remove all references so it's less likely to be used and can be easily removed in the future. Differential Revision: https://reviews.llvm.org/D120021	2022-02-18 12:18:41 +01:00
River Riddle	38abdddf6f	[mlir][NFC] Update AMX/LLVM/NVVM/X86 vector operations to use `hasVerifier` instead of `verifier` The verifier field is deprecated, and slated for removal. Differential Revision: https://reviews.llvm.org/D118819	2022-02-02 13:34:29 -08:00
Mehdi Amini	be0a7e9f27	Adjust "end namespace" comment in MLIR to match new agree'd coding style See D115115 and this mailing list discussion: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154199.html Differential Revision: https://reviews.llvm.org/D115309	2021-12-08 06:05:26 +00:00
Thomas Raoux	e7969240dc	[mlir][VectorToGPU] Support more cases in conversion to MMA ops Support load with broadcast, elementwise divf op and remove the hardcoded restriction on the vector size. Picking the right size should be enfored by user and will fail conversion to llvm/spirv if it is not supported. Differential Revision: https://reviews.llvm.org/D113618	2021-11-11 13:10:38 -08:00
thomasraoux	f309939d06	[mlir][nvvm] Remove special case ptr arithmetic lowering in gpu to nvvm Use existing helper instead of handling only a subset of indices lowering arithmetic. Also relax the restriction on the memref rank for the GPU mma ops as we can now support any rank. Differential Revision: https://reviews.llvm.org/D113383	2021-11-10 10:00:12 -08:00
thomasraoux	d88cc07943	[mlir][gpuTonvvm] Remove hardcoded values in MMAType to llvm struct Also relax the types allowed in GPU wmma ops Differential Revision: https://reviews.llvm.org/D112969	2021-11-02 08:12:27 -07:00
thomasraoux	8a992b20db	[mlir][gpu] Add basic support to do elementwise ops on mma matrix type In order to support fusion with mma matrix type we need to be able to execute elementwise operations on them. This add an op to be able to support some basic elementwise operations. This is a is not a full solution as it only supports a limited scope or operations. Ideally we would want to be able to fuse with more kind of operations. Differential Revision: https://reviews.llvm.org/D112857	2021-11-01 11:51:19 -07:00
thomasraoux	77eafb8430	[mlir][nvvm] Generalize wmma ops to handle more types and shapes wmma intrinsics have a large number of combinations, ideally we want to be able to target all the different variants. To avoid a combinatorial explosion in the number of mlir op we use attributes to represent the different variation of load/store/mma ops. We also can generate with tablegen helpers to know which combinations are available. Using this we can avoid having too hardcode a path for specific shapes and can support more types. This patch also adds boiler plates for tf32 op support. Differential Revision: https://reviews.llvm.org/D112689	2021-11-01 10:27:26 -07:00
thomasraoux	eacd6e1ebe	[mlir][GPUtoNVVM] Relax restriction on wmma op lowering Allow lowering of wmma ops with 64bits indexes. Change the default version of the test to use default layout. Differential Revision: https://reviews.llvm.org/D112479	2021-10-27 21:31:55 -07:00
River Riddle	ef976337f5	[mlir:OpConversion] Remove the remaing usages of the deprecated matchAndRewrite methods This commits updates the remaining usages of the ArrayRef<Value> based matchAndRewrite/rewrite methods in favor of the new OpAdaptor overload. Differential Revision: https://reviews.llvm.org/D110360	2021-09-24 17:51:41 +00:00
Alex Zinenko	75e5f0aac9	[mlir] factor memref-to-llvm lowering out of std-to-llvm After the MemRef has been split out of the Standard dialect, the conversion to the LLVM dialect remained as a huge monolithic pass. This is undesirable for the same complexity management reasons as having a huge Standard dialect itself, and is even more confusing given the existence of a separate dialect. Extract the conversion of the MemRef dialect operations to LLVM into a separate library and a separate conversion pass. Reviewed By: herhut, silvas Differential Revision: https://reviews.llvm.org/D105625	2021-07-09 14:49:52 +02:00
thomasraoux	0298f2cfb1	[mlir] Fix wrong type in WmmaConstantOpToNVVMLowering InsertElement takes a scalar integer attribute not an array of integer. Differential Revision: https://reviews.llvm.org/D105174	2021-06-30 09:10:02 -07:00
thomasraoux	428a62f65f	[mlir][gpu] Add op to create MMA constant matrix This allow creating a matrix with all elements set to a given value. This is needed to be able to implement a simple dot op. Differential Revision: https://reviews.llvm.org/D103870	2021-06-10 08:34:04 -07:00
thomasraoux	9b496c2373	[mlir][gpu][NFC] Simplify conversion of MMA type to NVVM Consolidate the type conversion in a single function to make it simpler to use. This allow to re-use the type conversion for up coming ops. Differential Revision: https://reviews.llvm.org/D103868	2021-06-09 09:33:38 -07:00
thomasraoux	b44007bec2	[mlir][gpu] Relax restriction on MMA store op to allow chain of mma ops. In order to allow large matmul operations using the MMA ops we need to chain operations this is not possible unless "DOp" and "COp" type have matching layout so remove the "DOp" layout and force accumulator and result type to match. Added a test for the case where the MMA value is accumulated. Differential Revision: https://reviews.llvm.org/D103023	2021-05-27 09:13:51 -07:00
Navdeep Kumar	eaaf7a6a09	[MLIR][GPU][NVVM] Add conversion of warp synchronous matrix-multiply accumulate GPU ops Add conversion of warp synchronous matrix-multiply accumulate GPU ops Add conversion of warp synchronous matrix-multiply accumulate GPU ops to NVVM ops. The following conversions are added :- 1.) subgroup_mma_load_matrix -> wmma.m16n16k16.load.[a,b,c]..row.stride 2.) subgroup_mma_store_matrix -> wmma.m16n16k16.store.d.[f16,f32].row.stride 3.) subgroup_mma_compute -> wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32] Reviewed By: bondhugula, ftynse Differential Revision: https://reviews.llvm.org/D95331	2021-05-21 21:20:33 +05:30

25 Commits