llvm-project

Author	SHA1	Message	Date
wren romano	72455b314f	[mlir][sparse] Fixing -Wunused-variable in Sparsification.cpp Reviewed By: aartbik, Peiming Differential Revision: https://reviews.llvm.org/D146474	2023-03-20 16:53:19 -07:00
wren romano	1f58ae8066	[mlir][sparse] Making `TensorExp::Kind` a nested enum-class This improves namespacing, and follows the pattern used for "Kind" enums elsewhere in MLIR. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D146086	2023-03-20 16:12:31 -07:00
Peiming Liu	1328bb6ef1	[mlir][sparse] extend loop emitter and optimize lattices with the awareness of slice based iteration Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D142929	2023-03-20 22:19:57 +00:00
Peiming Liu	d03805f2ee	[mlir][sparse] add merger/topo sort support for slice-based affine sparse index codegen Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D142928	2023-03-20 21:24:10 +00:00
Peiming Liu	ee928fcde2	[mlir][sparse] add new sparisification option for dependent index reduction-based codegen Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D142927	2023-03-16 20:10:58 +00:00
Jakub Kuderski	a0a76804c4	[ADT] Allow `llvm::enumerate` to enumerate over multiple ranges This does not work by a mere composition of `enumerate` and `zip_equal`, because C++17 does not allow for recursive expansion of structured bindings. This implementation uses `zippy` to manage the iteratees and adds the stream of indices as the first zipped range. Because we have an upfront assertion that all input ranges are of the same length, we only need to check if the second range has ended during iteration. As a consequence of using `zippy`, `enumerate` will now follow the reference and lifetime semantics of the `zip*` family of functions. The main difference is that `enumerate` exposes each tuple of references through a new tuple-like type `enumerate_result`, with the familiar `.index()` and `.value()` member functions. Because the `enumerate_result` returned on dereference is a temporary, enumeration result can no longer be used through an lvalue ref. Reviewed By: dblaikie, zero9178 Differential Revision: https://reviews.llvm.org/D144503	2023-03-15 19:34:22 -04:00
bixia1	abb05014f9	[mlir][sparse] Modify the pivot selection method for quick sort. Previously, we choose the median of three values. We now choose the median of five values when the number of values being sorted exceed a threshold (currently 100). This is similar to std::sort. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145534	2023-03-15 13:53:00 -07:00
wren romano	b60de1dfcc	[mlir][sparse] Updating `Merger::foreachTensorLoopId` to take `LatPointId` Since all callsites of `foreachTensorLoopId` would simply look up the `LatPointId` to extract its `BitVector`, it's cleaner to let the `Merger` handle that instead. This seems to better capture the intent of the `foreachTensorLoopId` method, and improves decoupling (since it removes a place that leaks the implementation detail that we use `BitVector`). Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D146082	2023-03-15 12:27:47 -07:00
Jakub Kuderski	8c258fda1f	[ADT][mlir][NFCI] Do not use non-const lvalue-refs with enumerate Replace references to enumerate results with either result_pairs (reference wrapper type) or structured bindings. I did not use structured bindings everywhere as it wasn't clear to me it would improve readability. This is in preparation to the switch to zip semantics which won't support non-const lvalue reference to elements: https://reviews.llvm.org/D144503. I chose to use values instead of const lvalue-refs because MLIR is biased towards avoiding `const` local variables. This won't degrade performance because currently `result_pair` is cheap to copy (size_t + iterator), and in the future, the enumerator iterator dereference will return temporaries anyway. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D146006	2023-03-15 10:43:56 -04:00
bixia1	2ef416273f	[mlir][sparse] Improve sort operation by generating inlined code to compare values. Previously, we generate function calls to compare values for sorting. It turns out that the compiler doesn't inline those function calls. We now directly generate inlined code. Also, modify the code for comparing values to use less number of branches. This improves all sort implementation in general. For arabic-2005.mtx CSR, the improvement is around 25%. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145442	2023-03-14 15:14:49 -07:00
wren romano	b8cf7af909	[mlir][sparse] Cleaning up names in {Merger,LoopEmitter,CodegenEnv}.{h,cpp} This change does a bunch of renaming to clear up confusions in these files. In particular, this change: * Renames variables and methods to clarify the "dim"/"lvl" distinction, and changes them to use the `Dimension`/`Level` types as appropriate. * Introduces new typedefs * `ExprId`, `LatPointId`, `LatSetId`: to clarify the interning design of the Merger. * `LoopId`, `LoopOrd`: to clarify the distinction between arbitrary names for loop-variables, vs numeric identifiers based on the actual order of loop generation. * `TensorId` * (Future CLs will change these from typedefs to structs/classes, so that the typechecker can help avoid mixups.) * Updates documentation to match the new terminology * Adds additional assertions * Adds `const` to local variables along the way Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145756	2023-03-14 11:50:56 -07:00
bixia1	f6424d11cb	[mlir][sparse] Improve quick sort by using a loop to sort the bigger partition. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145440	2023-03-10 20:43:08 -08:00
Peiming Liu	6db397a8d4	[mlir][sparse] support dynamic sparse tensor slices. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D141532	2023-03-10 23:12:41 +00:00
Peiming Liu	8237cac612	[mlir][sparse] extend storage specifier operations for slices. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D141641	2023-03-10 18:58:47 +00:00
Peiming Liu	ab99b5d1f6	[mlir][sparse] deduplicate non-unique coordinates unconditionally Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145621	2023-03-09 21:59:57 +00:00
Peiming Liu	41089f86e3	[mlir][sparse] fix bugs when convert coo to coo but with different dim ordering Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145723	2023-03-09 20:55:03 +00:00
Peiming Liu	4fa3cc6eb4	[mlir][sparse] deduplicate non-unique coordinates when coiterating collapsed COO tensors. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145532	2023-03-09 18:15:12 +00:00
wren romano	115c7beda7	[mlir][sparse] Making SortMask into an enum-class This helps to reduce the confusion from using `unsigned` everywhere. Depends On D145606 Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D145611	2023-03-08 15:25:42 -08:00
Peiming Liu	55270f56d2	[mlir][sparse] fix a bug in unpack op that used wrong compare predicate. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145603	2023-03-08 19:52:09 +00:00
wren romano	7a68225428	[mlir][sparse] Cleaning up code style for genCast Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145432	2023-03-07 14:43:40 -08:00
Peiming Liu	cc009334eb	[mlir][sparse] deduplicate non-unique coordinates when coiterating COO tensors Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145518	2023-03-07 21:52:38 +00:00
wren romano	84cd51bb97	[mlir][sparse] Renaming "pointer/index" to "position/coordinate" The old "pointer/index" names often cause confusion since these names clash with names of unrelated things in MLIR; so this change rectifies this by changing everything to use "position/coordinate" terminology instead. In addition to the basic terminology, there have also been various conventions for making certain distinctions like: (1) the overall storage for coordinates in the sparse-tensor, vs the particular collection of coordinates of a given element; and (2) particular coordinates given as a `Value` or `TypedValue<MemRefType>`, vs particular coordinates given as `ValueRange` or similar. I have striven to maintain these distinctions as follows: * "p/c" are used for individual position/coordinate values, when there is no risk of confusion. (Just like we use "d/l" to abbreviate "dim/lvl".) * "pos/crd" are used for individual position/coordinate values, when a longer name is helpful to avoid ambiguity or to form compound names (e.g., "parentPos"). (Just like we use "dim/lvl" when we need a longer form of "d/l".) I have also used these forms for a handful of compound names where the old name had been using a three-letter form previously, even though a longer form would be more appropriate. I've avoided renaming these to use a longer form purely for expediency sake, since changing them would require a cascade of other renamings. They should be updated to follow the new naming scheme, but that can be done in future patches. * "coords" is used for the complete collection of crd values associated with a single element. In the runtime library this includes both `std::vector` and raw pointer representations. In the compiler, this is used specifically for buffer variables with C++ type `Value`, `TypedValue<MemRefType>`, etc. The bare form "coords" is discouraged, since it fails to make the dim/lvl distinction; so the compound names "dimCoords/lvlCoords" should be used instead. (Though there may exist a rare few cases where is is appropriate to be intentionally ambiguous about what coordinate-space the coords live in; in which case the bare "coords" is appropriate.) There is seldom the need for the pos variant of this notion. In most circumstances we use the term "cursor", since the same buffer is reused for a 'moving' pos-collection. * "dcvs/lcvs" is used in the compiler as the `ValueRange` analogue of "dimCoords/lvlCoords". (The "vs" stands for "`Value`s".) I haven't found the need for it, but "pvs" would be the obvious name for a pos-`ValueRange`. The old "ind"-vs-"ivs" naming scheme does not seem to have been sustained in more recent code, which instead prefers other mnemonics (e.g., adding "Buf" to the end of the names for `TypeValue<MemRefType>`). I have cleaned up a lot of these to follow the "coords"-vs-"cvs" naming scheme, though haven't done an exhaustive cleanup. * "positions/coordinates" are used for larger collections of pos/crd values; in particular, these are used when referring to the complete sparse-tensor storage components. I also prefer to use these unabbreviated names in the documentation, unless there is some specific reason why using the abbreviated forms helps resolve ambiguity. In addition to making this terminology change, this change also does some cleanup along the way: * correcting the dim/lvl terminology in certain places. * adding `const` when it requires no other code changes. * miscellaneous cleanup that was entailed in order to make the proper distinctions. Most of these are in CodegenUtils.{h,cpp} Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144773	2023-03-06 12:23:33 -08:00
Matthias Springer	42c31d8302	[mlir][IR] Clean up mergeBlockBefore and mergeBlocks * `RewriterBase::mergeBlocks` is simplified: it is implemented in terms of `mergeBlockBefore`. * The signature of `mergeBlockBefore` is consistent with other API (such as `inlineRegionBefore`): an overload for a `Block::iterator` is added. * Additional safety checks are added to `mergeBlockBefore`: detect cases where the resulting IR could be invalid (no more `dropAllUses`) or partly unreachable (likely a case of incorrect API usage). * Rename `mergeBlockBefore` to `inlineBlockBefore`. Differential Revision: https://reviews.llvm.org/D144969	2023-03-06 13:46:08 +01:00
Matthias Springer	ae9e1d1df4	[mlir][SparseTensor] Fix incorrect API usage in RewritePatterns Incorrect API usage was detected by D144552. Differential Revision: https://reviews.llvm.org/D145166	2023-03-02 17:59:57 +01:00
Peiming Liu	b60cf8c972	[mlir][sparse] support coiteration with fused reshape tensor Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145091	2023-03-01 20:55:46 +00:00
Peiming Liu	fc126022e8	[mlir][sparse] fuse collapse_shape on sparse tensor with GenericOp. Instead of always materializing a new sparse tensor after reshape, this patch tries to fuses the reshape (currently only on COO) with GenericOp and coiterates with the reshaped tensors without allocating a new sparse tensor. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D145016	2023-03-01 19:05:48 +00:00
bixia1	2c81d43241	[mlir][sparse] Improve the implementation of sparse_tensor.new for the codegen path. Rewrite a NewOp into a NewOp of a sorted COO tensor and a ConvertOp for converting the sorted COO tensor to the destination tensor type. Codegen a NewOp of a sorted COO tensor to use the new bulk reader API and sort the elements only when the input is not sorted. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144504	2023-03-01 07:29:49 -08:00
Peiming Liu	849529ba8a	[mlir][sparse] fix performance bug in matmul with a sparse rhs due to suboptimal iteration graphs. While dense tensors support random accesses, it is critical to visit them in a row-major order for better cache locality. However, we previously consider dense inputs and outputs together when computing constraints for building iteration graph, it could lead us to less efficient iteration graphs. This patch adds a new `SortMask::kIncludeDenseInput` to treat dense inputs/outputs separately when building iteration graph, thus increasing the chance for use to construct a better iteration graph. A more fine-grained approach is to treat each input separately. Note, related to: https://github.com/llvm/llvm-project/issues/51651 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144932	2023-02-28 21:02:17 +00:00
Kohei Yamaguchi	9a29d87538	[mlir][sparse] Add checking parent op of SortOp Fix crash with segmentation fault caused by setting a parent operator that is not func::FuncOp with sparse_tensor SortOp. fixes https://github.com/llvm/llvm-project/issues/59988 Reviewed By: aartbik, wrengr Differential Revision: https://reviews.llvm.org/D143874	2023-02-27 16:37:02 +01:00
Peiming Liu	85dbb3fc4b	[mlir][sparse] support sparse tensor element type conversion in codegen path Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144578	2023-02-23 17:49:50 +00:00
Peiming Liu	44ff23d5e4	[mlir][sparse] unconditionally use IndexType for sparse_tensor.specifier Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144574	2023-02-22 20:21:34 +00:00
Peiming Liu	b7d86f3f1c	[mlir][sparse] revert optimization for dense->csc conversion. Eliminates the sort seems make the whole conversion slower (probably because loop rotation leads to bad locality). Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144517	2023-02-21 21:34:01 +00:00
Peiming Liu	9e8d9316ce	[mlir][sparse] allow foreach operation to generate out-of-order loop on non-annotated tensor. No need for a temp COO and sort even when converting dense -> CSC, we can instead rotate the loop to yield a ordered coordinates at beginning. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144213	2023-02-16 23:23:20 +00:00
bixia1	c2e248c6ae	[mlir][sparse] Remove the expansion of symmetric MTX in the sparse tensor storage. We will support symmetric MTX without expanding the data in the sparse tensor storage. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144059	2023-02-16 13:02:17 -08:00
Peiming Liu	e2e83f4c8f	[mlir][sparse] support coiteration over sparse tensor slices Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140736	2023-02-15 23:52:22 +00:00
wren romano	ae7942e296	[mlir][sparse] adding `SparseTensorType::get{Pointer,Index}Type` methods Depends On D143800 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143946	2023-02-15 14:37:55 -08:00
wren romano	d950bdc73e	[mlir][sparse] misc code cleanup * Flattening/simplifying some nested conditionals * const-ifying some local variables Depends On D143800 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143949	2023-02-15 13:29:00 -08:00
wren romano	bb4fc6b6d6	[mlir][sparse] Adding `SparseTensorType::{operator==, hasSameDimToLvlMap}` Depends On D143800 Reviewed By: aartbik, Peiming Differential Revision: https://reviews.llvm.org/D144052	2023-02-15 12:05:29 -08:00
Jie Fu	71251e8d4f	[mlir] Fix -Wsign-compare in SparseTensorRewriting.cpp and Sparsification.cpp (NFC) /home/jiefu/llvm-project/mlir/lib/Dialect/SparseTensor/Transforms/Sparsification.cpp:279:33: error: comparison of integers of different signs: 'int64_t' (aka 'long') and 'const mlir::sparse_tensor::Level' (aka 'const unsigned long') [-Werror,-Wsign-compare] assert(env.op().getRank(&t) == lvlRank); ~~~~~~~~~~~~~~~~~~~~ ^ ~~~~~~~ /usr/include/assert.h:93:27: note: expanded from macro 'assert' (static_cast <bool> (expr) \ ^~~~ 1 error generated. /home/jiefu/llvm-project/mlir/lib/Dialect/SparseTensor/Transforms/SparseTensorRewriting.cpp:788:29: error: comparison of integers of different signs: 'int64_t' (aka 'long') and 'const mlir::sparse_tensor::Dimension' (aka 'const unsigned long') [-Werror,-Wsign-compare] assert(srcRTT.getRank() == dimRank); ~~~~~~~~~~~~~~~~ ^ ~~~~~~~ /usr/include/assert.h:93:27: note: expanded from macro 'assert' (static_cast <bool> (expr) \ ^~~~ /home/jiefu/llvm-project/mlir/lib/Dialect/SparseTensor/Transforms/SparseTensorRewriting.cpp:810:31: error: comparison of integers of different signs: 'int64_t' (aka 'long') and 'const mlir::sparse_tensor::Dimension' (aka 'const unsigned long') [-Werror,-Wsign-compare] assert(srcRTT.getRank() == dimRank); ~~~~~~~~~~~~~~~~ ^ ~~~~~~~ /usr/include/assert.h:93:27: note: expanded from macro 'assert' (static_cast <bool> (expr) \ ^~~~ 2 errors generated.	2023-02-15 11:58:08 +08:00
wren romano	f708a549b8	[mlir][sparse] Factoring out SparseTensorType class This change adds a new `SparseTensorType` class for making the "dim" vs "lvl" distinction more overt, and for abstracting over the differences between sparse-tensors and dense-tensors. In addition, this change also adds new type aliases `Dimension`, `Level`, and `FieldIndex` to make code more self-documenting. Although the diff is very large, the majority of the changes are mechanical in nature (e.g., changing types to use the new aliases, updating variable names to match, etc). Along the way I also made many variables `const` when they could be; the majority of which required only adding the keyword. A few places had conditional definitions of these variables, requiring actual code changes; however, that was only done when the overall change was extremely local and easy to extract. All these changes are included in the current patch only because it would be too onerous to split them off into a separate patch. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143800	2023-02-14 19:17:19 -08:00
Peiming Liu	81cb70e46e	[mlir][sparse] fix a bug in UnpackOp converter. UnpackOp Converter used to create reallocOp unconditionally, but it might cause issue when the requested memory size is smaller than the actually storage. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D144065	2023-02-15 02:36:00 +00:00
Peiming Liu	ce9ce66b8d	[mlir][sparse] fix a memory leakage when converting from a tensor slice Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143929	2023-02-13 22:44:12 +00:00
Peiming Liu	dc6427d687	[mlir][sparse] implement lowering rules for sparse_tensor::unpack Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143672	2023-02-11 01:05:46 +00:00
Jim Kitchen	81d0d2b2a0	[mlir][sparse] Sparse reduction in lex order no longer produces dense output Previously, when performing a reduction on a sparse tensor, the result would be different depending on iteration order. For expanded access pattern, an empty row would contribute no entry in the output. For lex ordering, the identity would end up in the output. This code changes that behavior and keeps track of whether any entries were actually reduced in lex ordering, making the output consistent between the two iteration styles. Differential Revision: https://reviews.llvm.org/D142050	2023-02-10 13:09:28 -06:00
Matthias Springer	9fa6b3504b	[mlir][bufferization] Improve aliasing OpOperand/OpResult property `getAliasingOpOperands`/`getAliasingOpResults` now encodes OpOperand/OpResult, buffer relation and a degree of certainty. E.g.: ``` // aliasingOpOperands(%r) = {(%t, EQUIV, DEFINITE)} // aliasingOpResults(%t) = {(%r, EQUIV, DEFINITE)} %r = tensor.insert %f into %t[%idx] : tensor<?xf32> // aliasingOpOperands(%r) = {(%t0, EQUIV, MAYBE), (%t1, EQUIV, MAYBE)} // aliasingOpResults(%t0) = {(%r, EQUIV, MAYBE)} // aliasingOpResults(%t1) = {(%r, EQUIV, MAYBE)} %r = arith.select %c, %t0, %t1 : tensor<?xf32> ``` `BufferizableOpInterface::bufferRelation` is removed, as it is now part of `getAliasingOpOperands`/`getAliasingOpResults`. This change allows for better analysis, in particular wrt. equivalence. This allows additional optimizations and better error checking (which is sometimes overly conservative). Examples: * EmptyTensorElimination can eliminate `tensor.empty` inside `scf.if` blocks. This requires a modeling of equivalence: It is not a per-OpResult property anymore. Instead, it can be specified for each OpOperand and OpResult. This is important because `tensor.empty` may be eliminated only if all values on the SSA use-def chain to the final consumer (`tensor.insert_slice`) are equivalent. * The detection of "returning allocs from a block" can be improved. (Addresses a TODO in `assertNoAllocsReturned`.) This allows us to bufferize IR such as "yielding a `tensor.extract_slice` result from an `scf.if` branch", which currently fails to bufferize because the alloc detection is too conservative. * Better bufferization of loops. Aliases of the iter_arg can be yielded (even if they are not equivalent) without having to realloc and copy the entire buffer on each iteration. The above-mentioned examples are not yet implemented with this change. This change just improves the BufferizableOpInterface, its implementations and related helper functions, so that better aliasing information is available for each op. Differential Revision: https://reviews.llvm.org/D142129	2023-02-09 11:35:03 +01:00
bixia1	a150766880	[mlir][sparse] Implement hybrid quick sort for sparse_tensor.sort. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143227	2023-02-08 14:06:31 -08:00
Peiming Liu	0352690421	[mlir][sparse] make foreach operation support sparse tensor slices. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140713	2023-02-08 18:58:35 +00:00
Peiming Liu	7a8edea69d	[mlir][sparse] fix bug when packing tensor with 32 bit pointer width. Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D143450	2023-02-07 02:00:40 +00:00
Aart Bik	3bd82f30dc	[mlir][sparse] compute allocation size_hint This adds the hint to a number of tensor allocations in codegens, shaving off quite some time from e.g. reading in sparse matrices due to zero-reallocation scheme. Note that we can probably provide hints on all allocations, and refine the heuristics that use them for general tensors. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D143309	2023-02-06 14:08:53 -08:00
Peiming Liu	7fef8d69cc	[mlir][sparse] implement bufferizableOpInterface for sparse_tensor.pack operation Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D143281	2023-02-03 23:55:59 +00:00

1 2 3 4 5 ...

481 Commits