llvm-project

History

Manish Gupta 114ba722c1 [mlir][NVGPU] Handle native mma.sync and ldmatrix(x4) sizes

This patch handles native `mma.sync` sizes and enables issuing `ldmatrix` on
largest possible tiles for matrixB. It requires handling
`vector.extract_strided_slice` from vector to ngpu lowering.

Differential Revision: https://reviews.llvm.org/D135749

2022-10-19 17:10:21 -07:00

[mlir][nvgpu] Use TableGen TypeDef for NVGPU dialect types

2022-09-23 19:46:03 -06:00

Transforms

[mlir][arith] Change dialect name from Arithmetic to Arith

2022-09-29 11:23:28 -04:00

Utils

[mlir][NVGPU] Handle native mma.sync and ldmatrix(x4) sizes

2022-10-19 17:10:21 -07:00

CMakeLists.txt

[mlir][nvgpu] NFC - move NVGPU conversion helpers to NvGpu utils library

2022-10-05 20:21:27 -06:00