This patch enhances `MemRefType::areTrailingDimsContiguous` to also
handle memrefs with dynamic dimensions.
The implementation itself is based on a new member function
`MemRefType::getMaxCollapsableTrailingDims` that returns the maximum
number of trailing dimensions that can be collapsed: trivially all
dimensions for memrefs with an identity layout, or otherwise determined by
examining the memref strides and stopping at the first discontiguous or
statically unknown stride.
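To make the stride walk concrete, here is a minimal hypothetical C++ sketch of that check (the function name, the dynamic-value sentinel, and the exact handling of dynamic sizes are assumptions, not the actual MLIR implementation):
```
#include <cstdint>
#include <limits>
#include <vector>

// Sentinel standing in for a statically unknown (dynamic) size or stride.
constexpr int64_t kDynamic = std::numeric_limits<int64_t>::min();

// Count how many trailing dimensions are contiguous: walk the strides from
// the innermost dimension outwards and stop at the first stride that is
// dynamic or does not equal the product of the inner sizes (the row-major
// stride).
int64_t countContiguousTrailingDims(const std::vector<int64_t> &sizes,
                                    const std::vector<int64_t> &strides) {
  int64_t count = 0;
  int64_t expectedStride = 1;
  for (int64_t i = static_cast<int64_t>(strides.size()) - 1; i >= 0; --i) {
    if (strides[i] == kDynamic || strides[i] != expectedStride)
      break;
    ++count;
    if (sizes[i] == kDynamic)
      break; // the next expected stride cannot be computed statically
    expectedStride *= sizes[i];
  }
  return count;
}
```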
In `int64_t r = strides.size() - 2`, the subtraction is performed in the
unsigned type returned by `strides.size()`, so when `strides.size()` is 1
it wraps around instead of yielding -1. On 32-bit systems, where that type
is 32 bits wide, the wrapped value is then stored in `r` as a large
positive number rather than -1.
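A sketch of one possible fix (not necessarily the exact patch that landed): perform the subtraction in a signed 64-bit type so the index can legitimately become negative.
```
#include <cstdint>
#include <vector>

void walkStridePairs(const std::vector<int64_t> &strides) {
  // Cast before subtracting so that a size of 1 yields r == -1 and the loop
  // simply does not execute, instead of wrapping around to a huge value.
  for (int64_t r = static_cast<int64_t>(strides.size()) - 2; r >= 0; --r) {
    // ... compare strides[r] against strides[r + 1] ...
  }
}
```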
Adds checks in `isPermutationVector` for indices that are out of bounds
and removes the assert.
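For illustration, a bounds-checked permutation check could look like the following minimal sketch (hypothetical code, not the exact MLIR utility):
```
#include <cstdint>
#include <vector>

// A vector is a valid permutation of [0, n) iff every entry is in range and
// no entry repeats; out-of-range entries now return false instead of
// tripping an assert.
bool isPermutationVector(const std::vector<int64_t> &interchange) {
  std::vector<bool> seen(interchange.size(), false);
  for (int64_t idx : interchange) {
    if (idx < 0 || static_cast<size_t>(idx) >= interchange.size())
      return false; // out of bounds
    if (seen[idx])
      return false; // duplicate entry
    seen[idx] = true;
  }
  return true;
}
```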
Signed-off-by: Ian Wood <ianwood2024@u.northwestern.edu>
It looks like the affine map generated to compute the indices of the
collapsed dimensions used the wrong dim size. For indices `[idx0][idx1]`
we computed the collapsed index as `idx0*size0 + idx1` instead of
`idx0*size1 + idx1`. This led to correctness issues in convolution tests
when enabling this transformation internally.
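As a concrete illustration of the fix (a sketch with made-up names, assuming two collapsed dimensions of sizes size0 x size1 in row-major order; size0 only bounds idx0 and does not appear in the formula):
```
#include <cstdint>

// The stride of idx0 in the collapsed index is the size of the inner
// dimension (size1), not the size of its own dimension (size0).
int64_t collapseIndex(int64_t idx0, int64_t idx1, int64_t size1) {
  return idx0 * size1 + idx1;
}
```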
Pack/unpack ops can be simplified to reshape ops if `outer_dims_perm` is
an identity permutation. The revision adds an `isIdentityPermutation`
method to IndexingUtils.
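A minimal sketch of what such a helper amounts to (hypothetical code; the real utility presumably operates on ArrayRef):
```
#include <cstdint>
#include <vector>

// A permutation is the identity iff permutation[i] == i for every i.
bool isIdentityPermutation(const std::vector<int64_t> &permutation) {
  for (size_t i = 0; i < permutation.size(); ++i)
    if (permutation[i] != static_cast<int64_t>(i))
      return false;
  return true;
}
```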
This change refactors some of the utilities used to unroll larger vector
computations into smaller vector computations. In fact, the indexing
computations used here are rather generic and are useful in other dialects or
downstream projects. Therefore, a utility for iterating over all possible tile
offsets for a particular pair of static (shape, tiled shape) is introduced in
IndexingUtils and replaces the existing computations in the vector unrolling
transformations. This builds off of the refactoring of IndexingUtils introduced
in 203fad476b7e.
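To illustrate the kind of indexing computation being factored out (a hypothetical, dependency-free sketch rather than the actual IndexingUtils API): enumerate, in row-major order, the offset of every tile of a given tile shape within a static shape.
```
#include <cstdint>
#include <functional>
#include <vector>

// Invoke `callback` with the offset of each tile of `tileShape` within
// `shape`, innermost dimension varying fastest. Partial boundary tiles are
// visited as well because the tile counts use ceiling division.
void forEachTileOffset(
    const std::vector<int64_t> &shape, const std::vector<int64_t> &tileShape,
    const std::function<void(const std::vector<int64_t> &)> &callback) {
  size_t rank = shape.size();
  if (rank == 0)
    return;
  std::vector<int64_t> numTiles(rank);
  for (size_t d = 0; d < rank; ++d)
    numTiles[d] = (shape[d] + tileShape[d] - 1) / tileShape[d];

  std::vector<int64_t> tileIndex(rank, 0);
  while (true) {
    std::vector<int64_t> offset(rank);
    for (size_t d = 0; d < rank; ++d)
      offset[d] = tileIndex[d] * tileShape[d];
    callback(offset);
    // Advance the multi-dimensional tile index like an odometer.
    size_t d = rank;
    while (d > 0) {
      --d;
      if (++tileIndex[d] < numTiles[d])
        break;
      tileIndex[d] = 0;
      if (d == 0)
        return; // wrapped past the outermost dimension: all tiles visited
    }
  }
}
```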
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D150000
Some GPU backends (e.g. SPIR-V) lower memrefs to bare pointers, so the lowering fails for dynamically sized/strided memrefs.
This pass extracts sizes and strides via `memref.extract_strided_metadata` outside the `gpu.launch` body, does the index/offset calculation explicitly, and then reconstructs the memrefs via `memref.reinterpret_cast`.
`memref.reinterpret_cast` is then lowered via https://reviews.llvm.org/D155011
Differential Revision: https://reviews.llvm.org/D155247
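The explicit index/offset calculation presumably amounts to the usual strided-addressing formula; as a plain C++ sketch of that arithmetic (not the pass itself, names assumed):
```
#include <cstdint>
#include <vector>

// Linear offset of a multi-dimensional access into a strided memref:
// baseOffset + sum_i(indices[i] * strides[i]).
int64_t linearOffset(int64_t baseOffset, const std::vector<int64_t> &indices,
                     const std::vector<int64_t> &strides) {
  int64_t offset = baseOffset;
  for (size_t i = 0; i < indices.size(); ++i)
    offset += indices[i] * strides[i];
  return offset;
}
```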
This revision adds support for directly lowering a linalg.copy on buffers between global and shared memory to a TMA async load plus synchronization operations.
This uses the recently introduced Hopper NVVM and NVGPU abstractions to connect things end to end.
Differential Revision: https://reviews.llvm.org/D157087
This patch recognizes when tensor.pack/unpack operations are simple
tensor.pad/unpad (a.k.a. tensor.extract_slice) and lowers them to a simpler
sequence of operations.
For pack, instead of doing:
```
pad
expand_shape
transpose
```
we do
```
pad
insert_slice
```
For unpack, instead of doing:
```
transpose
collapse_shape
extract_slice
```
we do
```
extract_slice
```
Note: returning nullptr for the transform dialect is fine. The related
handles are just ignored by the following transformation.
Differential Revision: https://reviews.llvm.org/D148159
This patch mechanically replaces None with std::nullopt where the
compiler would warn if None were deprecated. The intent is to reduce
the amount of manual work required in migrating from Optional to
std::optional.
This is part of an effort to migrate from llvm::Optional to
std::optional:
https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716
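As a trivial illustration of the kind of replacement involved (a made-up snippet, not a call site from the patch):
```
#include <optional>

// Anywhere `None` denoted "no value", it is now spelled `std::nullopt`.
std::optional<int> lookupAlignment(bool hasAlignment, int alignment) {
  if (!hasAlignment)
    return std::nullopt; // previously written as: return None;
  return alignment;
}
```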
This revision refactors and cleans up a bunch of infra related to vector, shapes and indexing into more reusable APIs.
Differential Revision: https://reviews.llvm.org/D138501
This reduces the dependencies of the MLIRVector target and makes the dialect consistent with other dialects.
Differential Revision: https://reviews.llvm.org/D118533