llvm-project

Author	SHA1	Message	Date
Govind Malasani	95e0ae9fa7	[MLIR][SparseTensor] Loop ordering strategy infrastructure (flag) (#154656 ) As discussed before, this PR adds the basic infrastructure/boiler plate for loop ordering strategies to be implemented. If this looks ok, I wanted to also mention some of the heuristics that I would implement next, if they sound reasonable to you guys: - Parallel first : prioritize parallel loops over reduction loops - Dense outer : prioritize the most dense loops first - Sparse outer : the opposite, potentially useful in some cases? There is another that I am considering, stride/memory aware, which would prioritize loops with better stride patterns (like sequential or linear). Not sure how well this carries over to Sparse Tensor though. Are there any ideas/heuristics that I should definitely try to implement? As we discussed, I will try to incrementally add heuristics. Sorry for the delay on my end, and thank you so much for the feedback! --------- Co-authored-by: Aart Bik <ajcbik@google.com>	2025-10-06 17:29:48 +00:00
Matthias Springer	2b5b3cf60d	[mlir][sparse_tensor] Migrate `SparseIterationToScf.cpp` to dialect conversion (#121054 ) Use the regular dialect conversion driver instead of the 1:N dialect conversion driver. The 1:N dialect conversion driver will be removed soon.	2024-12-27 09:13:15 +01:00
Jacques Pienaar	09dfc5713d	[mlir] Enable decoupling two kinds of greedy behavior. (#104649 ) The greedy rewriter is used in many different flows and it has a lot of convenience (work list management, debugging actions, tracing, etc). But it combines two kinds of greedy behavior 1) how ops are matched, 2) folding wherever it can. These are independent forms of greedy and leads to inefficiency. E.g., cases where one need to create different phases in lowering and is required to applying patterns in specific order split across different passes. Using the driver one ends up needlessly retrying folding/having multiple rounds of folding attempts, where one final run would have sufficed. Of course folks can locally avoid this behavior by just building their own, but this is also a common requested feature that folks keep on working around locally in suboptimal ways. For downstream users, there should be no behavioral change. Updating from the deprecated should just be a find and replace (e.g., `find ./ -type f -exec sed -i 's\|applyPatternsAndFoldGreedily\|applyPatternsGreedily\|g' {} \;` variety) as the API arguments hasn't changed between the two.	2024-12-20 08:15:48 -08:00
Peiming Liu	c42bbda425	[mlir][sparse] implement lowering rules for ExtractIterSpaceOp. (#89143 ) DO NOT MERGE until https://github.com/llvm/llvm-project/pull/89003	2024-06-12 10:49:12 -07:00
Peiming Liu	99835922ca	[mlir][sparse] remove sparse encoding propagation pass. (#93593 )	2024-05-28 11:23:15 -07:00
Peiming Liu	ad1083dce4	[mlir][sparse] introduce new pass to propagate sparse encodings. (#92052 )	2024-05-13 17:29:01 -07:00
Aart Bik	5122a2c232	[mlir][sparse] allow for direct-out passing of sparse tensor buffers (#88327 ) In order to support various external frameworks (JAX vs PyTorch) we need a bit more flexibility in [dis]assembling external buffers to and from sparse tensors in MLIR land. This PR adds a direct-out option that avoids the rigid pre-allocated for copy-out semantics. Note that over time, we expect the [dis]assemble operations to converge into something that supports all sorts of external frameworks. Until then, this option helps in experimenting with different options.	2024-04-11 10:07:24 -07:00
Matthias Springer	a4c470555b	[mlir][linalg] Fix builder API usage in `RegionBuilderHelper` (#87451 ) Operations must be created with the supplied builder. Otherwise, the dialect conversion / greedy pattern rewrite driver can break. This commit fixes a crash in the dialect conversion: ``` within split at llvm-project/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-invalid.mlir:1 offset :8:8: error: failed to legalize operation 'tosa.add' %0 = tosa.add %1, %arg2 : (tensor<10x10xf32>, tensor<xf32>) -> tensor<xf32> ^ within split at llvm-project/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-invalid.mlir:1 offset :8:8: note: see current operation: %9 = "tosa.add"(%8, %arg2) : (tensor<10x10xf32>, tensor<xf32>) -> tensor<xf32> mlir-opt: llvm-project/mlir/include/mlir/IR/UseDefLists.h:198: mlir::IRObjectWithUseList<mlir::OpOperand>::~IRObjectWithUseList() [OperandType = mlir::OpOperand]: Assertion `use_empty() && "Cannot destroy a value that still has uses!"' failed. ``` This commit is the proper fix for #87297 (which was reverted).	2024-04-04 11:17:59 +09:00
Peiming Liu	4a653b4df5	[mlir][sparse] Support pretty print to debug sparse iteration. (#80207 )	2024-02-01 15:28:36 -08:00
Aart Bik	33b463ad99	[mlir][sparse] external entry method wrapper for sparse tensors (#80326 ) Similar to the emit_c_interface, this pull request adds a pass that converts public entry methods that use sparse tensors as input parameters and/or output return values into wrapper functions that [dis]assemble the individual tensors that constitute the actual storage used externally into MLIR sparse tensors. This pass can be used to prepare the public entry methods of a program that is compiled by the MLIR sparsifier to interface with an external runtime, e.g., when passing sparse tensors as numpy arrays from and to Python. Note that eventual bufferization decisions (e.g. who [de]allocates the underlying memory) should be resolved in agreement with the external runtime (Python, PyTorch, JAX, etc.)	2024-02-01 13:32:52 -08:00
Aart Bik	5f32bcfbae	[mlir][sparse][gpu] re-enable all GPU libgen tests (#72185 ) Previous change no longer properly used the GPU libgen pass (even though most tests still passed falling back to CPU). This revision puts the proper pass order into place. Also bit of a cleanup of CPU codegen vs. libgen setup.	2023-11-14 09:06:15 -08:00
Peiming Liu	269685545e	[mlir][sparse] remove filter-loop based algorithm support to handle a… (#71840 ) …ffine subscript expressions.	2023-11-13 11:36:49 -08:00
Aart Bik	af8428c0d9	[mlir][sparse] unify support of (dis)assemble between direct IR/lib path (#71880 ) Note that the (dis)assemble operations still make some simplfying assumptions (e.g. trailing 2-D COO in AoS format) but now at least both the direct IR and support library path behave exactly the same. Generalizing the ops is still TBD.	2023-11-13 10:05:00 -08:00
Aart Bik	5ef446790f	[mlir][sparse][gpu] cleanup GPUDataTransferStrategy (#71615 ) The flag seems to be doing practically the same thing for zero cost and pinned dma. In addition, the register host is not truly the right zero cost mechanism according to Thomas. So we are simplifying the setup for now, until we have a better definition for what to implement and test. https://github.com/llvm/llvm-project/issues/64316	2023-11-08 09:45:11 -08:00
Peiming Liu	f82bee1367	[mlir][sparse] split post-sparsification-rewriting into two passes. (#70727 )	2023-10-30 15:22:21 -07:00
Peiming Liu	6a93da9900	[mlir][sparse] add ReinterpretMapScopeOption for the pass (#70486 )	2023-10-27 14:14:09 -07:00
Aart Bik	7cfac1bedd	[mlir][sparse] add boilterplate code for a new reintepret map pass (#70393 ) The interesting stuff is of course still coming ;-)	2023-10-26 17:57:46 -07:00
Peiming Liu	f248d0b28d	[mlir][sparse] implement sparse_tensor.reorder_coo (#68916 ) As a side effect of the change, it also unifies the convertOp implementation between lib/codegen path.	2023-10-12 13:22:45 -07:00
Peiming Liu	0637440002	[mlir][sparse] introduce a pass to stage complex sparse operations in… (#68436 ) …to simple steps	2023-10-06 14:23:18 -07:00
Peiming Liu	0083f8338c	[mlir][sparse] renaming sparse_tensor.sort_coo to sparse_tensor.sort (#68161 ) Rationale: the operation does not always sort COO tensors (also used for sparse_tensor.compress for example).	2023-10-03 16:28:25 -07:00
Peiming Liu	bfa3bc4378	[mlir][sparse] unifies sparse_tensor.sort_coo/sort into one operation. (#66722 ) The use cases of the two operations are largely overlapped, let's simplify it and only use one of them.	2023-09-19 17:02:32 -07:00
Peiming Liu	098f46dce3	[sparse] allow unpack op to return 0-ranked tensor type. (#66269 ) Many frontends canonicalize scalar into 0-ranked tensor, it change will hopefully make the operation easier to use for those cases.	2023-09-13 11:33:01 -07:00
K-Wu	cfa82f7783	[mlir][sparse][gpu] introduce flag that controls host to device copy strategies (regular dma default) Differential Revision: https://reviews.llvm.org/D155352	2023-08-01 22:30:40 +00:00
Alex Zinenko	4a6b31b8d8	[mlir] NFC: untangle SCF Patterns.h and Transforms.h These two headers both contained a strange mix of definitions related to both patterns and non-pattern transforms. Put patterns and "populate" functions into Patterns.h and standalone transforms into Transforms.h. Depends On: D155223 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155454	2023-07-18 11:27:36 +00:00
Aart Bik	ee42e23614	[mlir][sparse][gpu] first implementation of the GPU libgen approach The sparse compiler now has two prototype strategies for GPU acceleration: * CUDA codegen: this converts sparsified code to CUDA threads * CUDA libgen: this converts pre-sparsified code to cuSPARSE library calls This revision introduces the first steps required for the second approach. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D150170	2023-05-15 08:49:38 -07:00
Aart Bik	19466ebc7f	[mlir][sparse][gpu] a first prototype sparse GPU code generator This implements a proof-of-concept GPU code generator to the sparse compiler pipeline, currently only capable of generating CUDA threads for outermost parallel loops. The objective, obviously, is to grow this concept to a full blown GPU code generator, capable of the right combinaton of code generation as well as exploiting idiomatic kernels or vector specific libraries (think cuSparse). Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D147483	2023-04-05 11:32:06 -07:00
Peiming Liu	c44d307c55	[mlir][sparse] add create-sparse-deallocs options to match the create-deallocs in BufferizationOption. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D147010	2023-03-27 23:18:32 +00:00
Peiming Liu	ee928fcde2	[mlir][sparse] add new sparisification option for dependent index reduction-based codegen Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D142927	2023-03-16 20:10:58 +00:00
Aart Bik	14c0317fef	[mlir][sparse] clean vectorization bail-out for VL=0 Fixes https://github.com/llvm/llvm-project/issues/59970 Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D142290	2023-01-23 12:12:05 -08:00
Peiming Liu	083ddffe47	[mlir][sparse] introduce sparse_tensor::StorageSpecifierToLLVM pass Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140122	2022-12-22 22:45:15 +00:00
bixia1	ea4be70cea	[mlir][sparse] Fix problems in creating complex zero for initialization. Reviewed By: aartbik, wrengr Differential Revision: https://reviews.llvm.org/D139591	2022-12-08 07:49:27 -08:00
Aart Bik	99b3849d89	[mlir][sparse] introduce vectorization pass for sparse loops This brings back previous SIMD functionality, but in a separate pass. The idea is to improve this new pass incrementally, going beyond for-loops to while-loops for co-iteration as welll (masking), while introducing new abstractions to make the lowering more progressive. The separation of sparsification and vectorization is a very good first step on this journey. Also brings back ArmSVE support Still to be fine-tuned: + use of "index" in SIMD loop (viz. a[i] = i) + check that all ops really have SIMD support + check all forms of reductions + chain reduction SIMD values Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D138236	2022-11-21 16:12:12 -08:00
bixia1	f81f0cb75a	[mlir][sparse] Split SparseTensorRewrite into PreSparsificationRewrite and PostSparsificationRewrite. Reviewed By: aartbik, wrengr Differential Revision: https://reviews.llvm.org/D138153	2022-11-17 07:13:55 -08:00
Aart Bik	d1da6f23a6	[mlir][sparse] avoid single default parameters in pass constructors Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D138054	2022-11-15 13:07:54 -08:00
bixia1	4f729d5a70	[mlir][sparse] Add rewriting rules for sparse_tensor.sort_coo. Refactor the rewriting of sparse_tensor.sort to support the implementation of sparse_tensor.sort_coo. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D137522	2022-11-14 08:48:53 -08:00
bixia1	7276b643f8	[mlir][sparse] Add option enable-buffer-initialization to the sparse-tensor-codegen pass. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D137733	2022-11-10 07:23:33 -08:00
bixia1	5618d2bea9	[mlir][sparse] Add option enable-buffer-initialization to initialize the memory buffers for sparse tensors to support debugging. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D137592	2022-11-08 09:54:33 -08:00
Peiming Liu	26eb2c6b42	[mlir][sparse] remove vector support in sparsification Sparse compiler used to generate vectorized code for sparse tensors computation, but it should really be delegated to other vectorization passes for better progressive lowering. https://discourse.llvm.org/t/rfc-structured-codegen-beyond-rectangular-arrays/64707 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D136183	2022-10-19 18:11:29 +00:00
bixia1	c1864ab953	[mlir][sparse] Add options to sparse-tensor-rewrite to disable rewriting rules for operators foreach and convert. This is to help simplify FileCheck tests for sparse-tensor-rewrite. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D136093	2022-10-18 09:27:32 -07:00
Aart Bik	779dcd2ecc	[mlir][sparse] move sparse tensor rewriting into its own pass Makes individual testing and debugging easier. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D135319	2022-10-05 14:52:55 -07:00
bixia1	654bbbde55	[mlir][sparse] Move the implementation of sparse_tensor.push_back to the buffer rewriter. Reviewed By: aartbik, Peiming Differential Revision: https://reviews.llvm.org/D134777	2022-09-29 15:06:00 -07:00
bixia1	062e515b70	[mlir][sparse] Add rewrite rule for the sort operator. Add sparse-buffer-rewrite pass to rewrite sparse primitives on buffers to MLIR implementation. Add sparse rewrite rule for the sort operator. Add FileCheck test and integration test. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D134627	2022-09-29 11:38:19 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Aart Bik	4d06861950	[mlir][sparse] add "sort" to the compress op codegen This revision also adds convenience methods to test the dim level type/property (with the codegen being first client) Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D134776	2022-09-28 10:41:40 -07:00
Aart Bik	30b550f14c	[mlir][sparse] add loop simplification to sparsification pass Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D134090	2022-09-16 16:29:47 -07:00
Peiming Liu	eb65327fe9	[mlir][sparse] Add new option (enable-runtime-library) to sparse compiler pipeline Add new option (enable-runtime-library) to sparse compiler pipeline, it allows us to decide whether we need to rewrite operations (e.g., concatenate, reshape) within sparsification (when using codegen) or convert them after sparsification (when using runtime library). Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D133597	2022-09-09 20:54:47 +00:00
bixia1	8a583bd53d	[mlir][sparse] Add codegen for expand op. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D133454	2022-09-08 14:06:01 -07:00
Peiming Liu	edca72f5bc	[mlir][sparse] Refactoring: remove dependence on tuple type when lowering sparse tensors. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D133390	2022-09-07 17:53:48 +00:00
Peiming Liu	4c46a5d54d	[mlir][sparse] Refactoring: renaming StorageNewOp to StorageOp To address comment in https://reviews.llvm.org/D133241 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D133363	2022-09-06 17:02:25 +00:00
Aart Bik	0c7abd3924	[mlir][sparse] codegen for sparse alloc Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D133241	2022-09-06 09:37:54 -07:00

1 2 3

102 Commits