llvm-project

Author	SHA1	Message	Date
Jack Frankland	5a221c39b6	[mlir][memref]: Fold ExpandShape into TransferRead (#176786 ) Add support for folding `memref.expand_shape` ops into `vector.transfer_read` ops when the permutation map is a non-minor-identity. In the case that the permutation map indexes into expanded dimensions that would be contiguous within the original source shape then it is safe to make this transformation. Signed-off-by: Jack Frankland <jack.frankland@arm.com>	2026-02-02 09:30:02 +00:00
Krzysztof Drewniak	003b28d031	[mlir] Move affine's FoldMemRefAliasOps into its own pass (#172548 ) I'm planning to introduce an interface that'll allow FoldMemRefAliasOps to not know about dialects like NVVM or GPU. To do this, however, I need to get the `affine` ops (which need special handling in order to handle their implicit affine maps) into a separate pass, analogously to how `amdgpu` ops have these patterns under their dialect and ton under `memref`. This commit also changes the expand/collapse_shape index resolvers to return `void`, since they never actually failed and to make it clearer that they modify IR. (Note: An LLM did the initial refactoring and test movement, I've reviewed the results and edited them some.)	2026-01-02 10:13:42 -08:00
Jack Frankland	e575539541	[milr][memref]: Fold expand_shape + transfer_read (#167679 ) Extend the load of a expand shape rewrite pattern to support folding a `memref.expand_shape` and `vector.transfer_read` when the permutation map on `vector.transfer_read` is a minor identity. --------- Signed-off-by: Jack Frankland <jack.frankland@arm.com>	2025-11-24 12:58:34 +00:00
Jakub Kuderski	8bab6c4e8c	[mlir] Simplify unreachable type switch cases. NFC. (#162032 ) Use `DefaultUnreachable` from https://github.com/llvm/llvm-project/pull/161970.	2025-10-06 09:23:25 -04:00
Alan Li	1c3e4e994b	Reapply "[AMDGPU] fold `memref.subview/expand_shape/collapse_shape` into `amdgpu.gather_to_lds`" (#150334 ) This is a reapply of patch #149851. The reapply also fixes a CMake/Bazel build issue, which was the reason of the revert. (Thanks @rupprecht ) Original patch (#149851) message: ----- This PR adds a new optimization pass to fold `memref.subview/expand_shape/collapse_shape` ops into consumer `amdgpu.gather_to_lds` operations. * Implements a new pass `AmdgpuFoldMemRefOpsPass` with pattern `FoldMemRefOpsIntoGatherToLDSOp` * Adds corresponding folding tests	2025-07-24 09:23:15 -04:00
Alan Li	9cb5c00bf7	Revert "[AMDGPU] fold `memref.subview/expand_shape/collapse_shape` in… (#150256 ) …to `amdgpu.gather_to_lds` (#149851)" This reverts commit dbc63f1e3724b6f2348c431dc1216537d9c042e8. Having build deps issue.	2025-07-23 12:50:26 -04:00
Alan Li	dbc63f1e37	[AMDGPU] fold `memref.subview/expand_shape/collapse_shape` into `amdgpu.gather_to_lds` (#149851 ) This PR adds a new optimization pass to fold `memref.subview/expand_shape/collapse_shape` ops into consumer `amdgpu.gather_to_lds` operations. * Implements a new pass `AmdgpuFoldMemRefOpsPass` with pattern `FoldMemRefOpsIntoGatherToLDSOp` * Adds corresponding folding tests --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-23 11:22:41 -04:00
Kazu Hirata	d5def016b6	[llvm] Remove unused includes (NFC) (#148342 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-07-12 11:28:55 -07:00
Kazu Hirata	25348394bb	[mlir] Fix a warning This patch fixes: mlir/lib/Dialect/MemRef/Transforms/FoldMemRefAliasOps.cpp:106:14: error: unused variable 'sourceType' [-Werror,-Wunused-variable]	2025-05-13 13:43:10 -07:00
Krzysztof Drewniak	a891163e50	[mlir][MemRef] Use specialized index ops to fold expand/collapse_shape (#138930 ) This PR updates the FoldMemRefAliasOps to use `affine.linearize_index` and `affine.delinearize_index` to perform the index computations needed to fold a `memref.expand_shape` or `memref.collapse_shape` into its consumers, respectively. This also loosens some limitations of the pass: 1. The existing `output_shape` argument to `memref.expand_shape` is now used, eliminating the need to re-infer this shape or call `memref.dim`. 2. Because we're using `affine.delinearize_index`, the restriction that each group in a `memref.collapse_shape` can only have one dynamic dimension is removed.	2025-05-13 13:28:53 -05:00
Andrzej Warzyński	c45cc3e420	[mlir][vector] Standardize `base` Naming Across Vector Ops (NFC) (#137859 ) [mlir][vector] Standardize base Naming Across Vector Ops (NFC) This change standardizes the naming convention for the argument representing the value to read from or write to in Vector ops that interface with Tensors or MemRefs. Specifically, it ensures that all such ops use the name `base` (i.e., the base address or location to which offsets are applied). Updated operations: * `vector.transfer_read`, * `vector.transfer_write`. For reference, these ops already use `base`: * `vector.load`, `vector.store`, `vector.scatter`, `vector.gather`, `vector.expandload`, `vector.compressstore`, `vector.maskedstore`, `vector.maskedload`. This is a non-functional change (NFC) and does not alter the semantics of these operations. However, it does require users of the XFer ops to switch from `op.getSource()` to `op.getBase()`. To ease the transition, this PR temporarily adds a `getSource()` interface method for compatibility. This is intended for downstream use only and should not be relied on upstream. The method will be removed prior to the LLVM 21 release. Implements #131602	2025-05-12 09:44:50 +01:00
Kazu Hirata	15f7c6ed70	[mlir] Remove unused local variables (NFC) (#138481 )	2025-05-05 10:08:00 -07:00
lorenzo chelini	8502ba1eb4	[MLIR][NFC] Retire let constructor for MemRef (#134788 ) let constructor is legacy (do not use in tree!) since the tableGen backend emits most of the glue logic to build a pass. Note: The following constructor has been retired: ```cpp std::unique_ptr<Pass> createExpandReallocPass(bool emitDeallocs = true); ``` To update your codebase, replace it with the new options-based API: ```cpp memref::ExpandReallocPassOptions expandAllocPassOptions{ /emitDeallocs=/false}; pm.addPass(memref::createExpandReallocPass(expandAllocPassOptions)); ```	2025-04-23 16:50:00 +02:00
Jacques Pienaar	09dfc5713d	[mlir] Enable decoupling two kinds of greedy behavior. (#104649 ) The greedy rewriter is used in many different flows and it has a lot of convenience (work list management, debugging actions, tracing, etc). But it combines two kinds of greedy behavior 1) how ops are matched, 2) folding wherever it can. These are independent forms of greedy and leads to inefficiency. E.g., cases where one need to create different phases in lowering and is required to applying patterns in specific order split across different passes. Using the driver one ends up needlessly retrying folding/having multiple rounds of folding attempts, where one final run would have sufficed. Of course folks can locally avoid this behavior by just building their own, but this is also a common requested feature that folks keep on working around locally in suboptimal ways. For downstream users, there should be no behavioral change. Updating from the deprecated should just be a find and replace (e.g., `find ./ -type f -exec sed -i 's\|applyPatternsAndFoldGreedily\|applyPatternsGreedily\|g' {} \;` variety) as the API arguments hasn't changed between the two.	2024-12-20 08:15:48 -08:00
Kunwar Grover	57e4360836	[mlir][memref] Add memref alias folders for expand/collapse_shape for vector load/store (#95223 ) This patch adds adds patterns to fold memref alias for expand_shape/collapse_shape feeding into vector.load/vector.store and vector.maskedload/vector.maskedstore	2024-06-12 15:36:16 +01:00
tyb0807	baa5beecc0	[NFC] Make NVGPU casing consistent (#91903 )	2024-05-13 09:08:04 +02:00
Prathamesh Tagore	6ed8434edc	[mlir][fold-memref-alias-ops] Add support for folding memref.expand_shape involving dynamic dims (#89093 ) `fold-memref-alias-ops` bails out in presence of dynamic shapes in `memref.expand_shape` op. Handle this case.	2024-05-08 07:24:43 -07:00
Max191	dae3c44ce6	[mlir] Add `vector.store/maskedstore` of `memref.subview` memref alias folding (#72184 ) Fixes https://github.com/openxla/iree/issues/15575	2023-11-14 14:24:54 -08:00
Quinn Dawkins	48f980c535	[mlir][memref] Add memref alias folding for masked transfers (#71476 ) The contents of a mask on a masked transfer are unaffected by the particular region of memory being read/stored to, so just forward the mask in subview folding patterns.	2023-11-07 08:56:54 -05:00
tyb0807	5aa2c65abd	[mlir][MemRef] Add subview folding pattern for vector.maskedload (#71380 ) This is required for fixing https://github.com/openxla/iree/issues/15031	2023-11-06 20:08:30 +01:00
Felix Schneider	f32b3e1caa	[mlir][memref] Fix index delinearization for CollapseShapeOp folding (#68833 ) The `resolveSourceIndicesCollapseShape` method is used to compute indices into the source `MemRef` of a `CollapseShapeOp` from the collapsed indices. This method didn't check for dynamic sizes of the source shape which led to a crash. Fix https://github.com/llvm/llvm-project/issues/68483	2023-10-12 07:12:43 +02:00
Hanhan Wang	f6897c37a2	[mlir][MemRef] Bail out for unsupported cases in FoldMemRefAliasOps pass The pass uses `computeSuffixProduct` method which only allows static shapes. This revision adds an early-exit for dynamic cases to avoid crash. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D157668	2023-08-11 14:52:53 -07:00
Guray Ozen	5ec360c589	[mlir] Enable folding memref alias for`vector.load` This work enables folding memref alias pass for`vector.load` Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D151447	2023-05-25 17:07:20 +02:00
Guray Ozen	46c32afbc5	[mlir] Enable folding memref alias for `ldmatrix` Folding mechanism does not recognize `ldmatrix` op. This work helps pass to recognize the op and fold the memref aliases. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D151412	2023-05-25 13:10:17 +02:00
Tres Popp	5550c82189	[mlir] Move casting calls from methods to function calls The MLIR classes Type/Attribute/Operation/Op/Value support cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast functionality in addition to defining methods with the same name. This change begins the migration of uses of the method to the corresponding function call as has been decided as more consistent. Note that there still exist classes that only define methods directly, such as AffineExpr, and this does not include work currently to support a functional cast/isa call. Caveats include: - This clang-tidy script probably has more problems. - This only touches C++ code, so nothing that is being generated. Context: - https://mlir.llvm.org/deprecation/ at "Use the free function variants for dyn_cast/cast/isa/…" - Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443 Implementation: This first patch was created with the following steps. The intention is to only do automated changes at first, so I waste less time if it's reverted, and so the first mass change is more clear as an example to other teams that will need to follow similar steps. Steps are described per line, as comments are removed by git: 0. Retrieve the change from the following to build clang-tidy with an additional check: https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check 1. Build clang-tidy 2. Run clang-tidy over your entire codebase while disabling all checks and enabling the one relevant one. Run on all header files also. 3. Delete .inc files that were also modified, so the next build rebuilds them to a pure state. 4. Some changes have been deleted for the following reasons: - Some files had a variable also named cast - Some files had not included a header file that defines the cast functions - Some files are definitions of the classes that have the casting methods, so the code still refers to the method instead of the function without adding a prefix or removing the method declaration at the same time. ``` ninja -C $BUILD_DIR clang-tidy run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-,misc-cast-functions'\ -header-filter=mlir/ mlir/ -fix rm -rf $BUILD_DIR/tools/mlir/*/.inc git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\ mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\ mlir/lib/**/IR/\ mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\ mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\ mlir/test/lib/Dialect/Test/TestTypes.cpp\ mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\ mlir/test/lib/Dialect/Test/TestAttributes.cpp\ mlir/unittests/TableGen/EnumsGenTest.cpp\ mlir/test/python/lib/PythonTestCAPI.cpp\ mlir/include/mlir/IR/ ``` Differential Revision: https://reviews.llvm.org/D150123	2023-05-12 11:21:25 +02:00
Matthias Springer	4c48f016ef	[mlir][Affine][NFC] Wrap dialect in "affine" namespace This cleanup aligns the affine dialect with all the other dialects. Differential Revision: https://reviews.llvm.org/D148687	2023-04-20 11:19:21 +09:00
Manish Gupta	fc5c1a7676	[mlir][Memref] Fold nvgpu device cp.async on src memref to dst memref Differential Revision: https://reviews.llvm.org/D148161	2023-04-20 01:09:44 +00:00
Nicolas Vasilache	33468a51db	[mlir][Tensor] Add support for insert_slice in FoldTensorSubsetOps Differential Revision: https://reviews.llvm.org/D148334	2023-04-14 09:34:11 -07:00
Quentin Colombet	faafd26c4d	[mlir][MemRef] Move transform related functions in Transforms.h NFC	2023-03-28 15:20:19 +02:00
Nicolas Vasilache	4dc72d47ce	[mlir][Tensor] Add a FoldTensorSubsetOps pass and patterns These patterns follow FoldMemRefAliasOps which is further refactored for reuse. In the process, fix FoldMemRefAliasOps handling of strides for vector.transfer ops which was previously incorrect. These opt-in patterns generalize the existing canonicalizations on vector.transfer ops. In the future the blanket canonicalizations will be retired. They are kept for now to minimize porting disruptions. Differential Revision: https://reviews.llvm.org/D146624	2023-03-23 04:03:27 -07:00
Nicolas Vasilache	829446cb45	[mlir][memref] Use folded composed affine apply ops in FoldMemRefAliasOps Creating maximally folded and composd affine.apply operation during FoldMemRefAliasOps composes better with other transformations without having to interleave canonicalization passes. Differential Revision: https://reviews.llvm.org/D146515	2023-03-21 22:17:36 -07:00
Lei Zhang	59e4fbfcd0	[mlir][memref] Fold subview into GPU subgroup MMA load/store ops This commits adds support for folding subview into GPU subgroup MMA load/store ops. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D146150	2023-03-15 17:49:32 +00:00
Nicolas Vasilache	203fad476b	[mlir][DialectUtils] Cleanup IndexingUtils and provide more affine variants while reusing implementations Differential Revision: https://reviews.llvm.org/D145784	2023-03-14 03:44:59 -07:00
Guray Ozen	1cb91b421e	[mlir] Add nontemporal field to memref.load/store and convey to llvm.load/store `llvm.load` op has nonTemporal field which is missing for `memref.load` and `memref.store`. This revision first adds nonTemporal field to memref's load/store op, then it lowers the field to llvm.load/store ops. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D142616	2023-02-03 14:03:38 +01:00
Matthias Springer	a2b837ab04	[mlir] GreedyPatternRewriteDriver: Entry point takes single region The rewrite driver is typically applied to a single region or all regions of the same op. There is no longer an overload to apply the rewrite driver to a list of regions. This simplifies the rewrite driver implementation because the scope is now a single region as opposed to a list of regions. Note: This change is not NFC because `config.maxIterations` and `config.maxNumRewrites` is now counted for each region separately. Furthermore, worklist filtering (`scope`) is now applied to each region separately. Differential Revision: https://reviews.llvm.org/D142611	2023-01-27 11:23:04 +01:00
Matthias Springer	ccb8a4e3f3	[mlir][memref] Fold subview(subview(x)) Folding of rank-reduced subviews is also supported. Differential Revision: https://reviews.llvm.org/D140110	2022-12-15 17:50:12 +01:00
River Riddle	c692a11e69	[mlir] Flip Async/GPU/MemRef/OpenACC/OpenMP/PDL dialects to prefixed This flips all of the remaining dialects to prefixed except for linalg, which will be done in a followup. Differential Revision: https://reviews.llvm.org/D134995	2022-09-30 16:55:30 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Nicolas Vasilache	b7d47ed1da	[mlir][memref] Add support for 0-D transfer / subview fold. The 0-d case simply forwards the indexing from the source memref and works out of the box. Differential Revision: https://reviews.llvm.org/D133536	2022-09-08 15:25:05 -07:00
Mehdi Amini	2fe37d1c7e	Apply clang-tidy fixes for performance-unnecessary-value-param in FoldMemRefAliasOps.cpp (NFC)	2022-09-05 12:34:46 +00:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit 2be8af8f0e0780901213b6fd3013a5268ddc3359.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Arnab Dutta	1b002d2768	Fold memref.expand_shape and memref.collapse_shape ops Fold memref.expand_shape and memref.collapse_shape ops into their memref/affine load/store ops. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D128986	2022-08-28 06:56:06 +05:30

44 Commits