llvm-project

Author	SHA1	Message	Date
Matthias Springer	3ace685105	[mlir][Transforms] Support 1:N mappings in `ConversionValueMapping` (#116524 ) This commit updates the internal `ConversionValueMapping` data structure in the dialect conversion driver to support 1:N replacements. This is the last major commit for adding 1:N support to the dialect conversion driver. Since #116470, the infrastructure already supports 1:N replacements. But the `ConversionValueMapping` still stored 1:1 value mappings. To that end, the driver inserted temporary argument materializations (converting N SSA values into 1 value). This is no longer the case. Argument materializations are now entirely gone. (They will be deleted from the type converter after some time, when we delete the old 1:N dialect conversion driver.) Note for LLVM integration: Replace all occurrences of `addArgumentMaterialization` (except for 1:N dialect conversion passes) with `addSourceMaterialization`. --------- Co-authored-by: Markus Böck <markus.boeck02@gmail.com>	2025-01-03 16:11:56 +01:00
Matthias Springer	2b5b3cf60d	[mlir][sparse_tensor] Migrate `SparseIterationToScf.cpp` to dialect conversion (#121054 ) Use the regular dialect conversion driver instead of the 1:N dialect conversion driver. The 1:N dialect conversion driver will be removed soon.	2024-12-27 09:13:15 +01:00
Kazu Hirata	129f1001c3	[Dialect] Migrate away from PointerUnion::{is,get} (NFC) (#120818 ) Note that PointerUnion::{is,get} have been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn_cast() with // isa<T>, cast<T> and the llvm::dyn_cast<T> I'm not touching PointerUnion::dyn_cast for now because it's a bit complicated; we could blindly migrate it to dyn_cast_if_present, but we should probably use dyn_cast when the operand is known to be non-null.	2024-12-21 08:17:51 -08:00
Jacques Pienaar	09dfc5713d	[mlir] Enable decoupling two kinds of greedy behavior. (#104649 ) The greedy rewriter is used in many different flows and it has a lot of convenience (work list management, debugging actions, tracing, etc). But it combines two kinds of greedy behavior 1) how ops are matched, 2) folding wherever it can. These are independent forms of greedy and leads to inefficiency. E.g., cases where one need to create different phases in lowering and is required to applying patterns in specific order split across different passes. Using the driver one ends up needlessly retrying folding/having multiple rounds of folding attempts, where one final run would have sufficed. Of course folks can locally avoid this behavior by just building their own, but this is also a common requested feature that folks keep on working around locally in suboptimal ways. For downstream users, there should be no behavioral change. Updating from the deprecated should just be a find and replace (e.g., `find ./ -type f -exec sed -i 's\|applyPatternsAndFoldGreedily\|applyPatternsGreedily\|g' {} \;` variety) as the API arguments hasn't changed between the two.	2024-12-20 08:15:48 -08:00
Matthias Springer	eb6c4197d5	[mlir][CF] Split `cf-to-llvm` from `func-to-llvm` (#120580 ) Do not run `cf-to-llvm` as part of `func-to-llvm`. This commit fixes https://github.com/llvm/llvm-project/issues/70982. This commit changes the way how `func.func` ops are lowered to LLVM. Previously, the signature of the entire region (i.e., entry block and all other blocks in the `func.func` op) was converted as part of the `func.func` lowering pattern. Now, only the entry block is converted. The remaining block signatures are converted together with `cf.br` and `cf.cond_br` as part of `cf-to-llvm`. All unstructured control flow is not converted as part of a single pass (`cf-to-llvm`). `func-to-llvm` no longer deals with unstructured control flow. Also add more test cases for control flow dialect ops. Note: This PR is in preparation of #120431, which adds an additional GPU-specific lowering for `cf.assert`. This was a problem because `cf.assert` used to be converted as part of `func-to-llvm`. Note for LLVM integration: If you see failures, add `-convert-cf-to-llvm` to your pass pipeline.	2024-12-20 13:46:45 +01:00
Matthias Springer	b03a09e74f	[mlir] Fix integration tests after #120548 (#120706 ) This should have been part of #120548.	2024-12-20 11:03:33 +01:00
Matthias Springer	9df63b2651	[mlir][Transforms] Add 1:N `matchAndRewrite` overload (#116470 ) This commit adds a new `matchAndRewrite` overload to `ConversionPattern` to support 1:N replacements. This is the first of two main PRs that merge the 1:1 and 1:N dialect conversion drivers. The existing `matchAndRewrite` function supports only 1:1 replacements, as can be seen from the `ArrayRef<Value>` parameter. ```c++ LogicalResult ConversionPattern::matchAndRewrite( Operation op, ArrayRef<Value> operands /adaptor values/, ConversionPatternRewriter &rewriter) const; ``` This commit adds a `matchAndRewrite` overload that is called by the dialect conversion driver. By default, this new overload dispatches to the original 1:1 `matchAndRewrite` implementation. Existing `ConversionPattern`s do not need to be changed as long as there are no 1:N type conversions or value replacements. ```c++ LogicalResult ConversionPattern::matchAndRewrite( Operation op, ArrayRef<ValueRange> operands /adaptor values/, ConversionPatternRewriter &rewriter) const { // Note: getOneToOneAdaptorOperands produces a fatal error if at least one // ValueRange has 0 or more than 1 value. return matchAndRewrite(op, getOneToOneAdaptorOperands(operands), rewriter); } ``` The `ConversionValueMapping`, which keeps track of value replacements and materializations, still does not support 1:N replacements. We still rely on argument materializations to convert N replacement values back into a single value. The `ConversionValueMapping` will be generalized to 1:N mappings in the second main PR. Before handing the adaptor values to a `ConversionPattern`, all argument materializations are "unpacked". The `ConversionPattern` receives N replacement values and does not see any argument materializations. This implementation strategy allows us to use the 1:N infrastructure/API in `ConversionPattern`s even though some functionality is still missing in the driver. This strategy was chosen to keep the sizes of the PRs smaller and to make it easier for downstream users to adapt to API changes. This commit also updates the the "decompose call graphs" transformation and the "sparse tensor codegen" transformation to use the new 1:N `ConversionPattern` API. Note for LLVM conversion: If you are using a type converter with 1:N type conversion rules or if your patterns are performing 1:N replacements (via `replaceOpWithMultiple` or `applySignatureConversion`), conversion pattern applications will start failing (fatal LLVM error) with this error message: `pattern 'name' does not support 1:N conversion`. The name of the failing pattern is shown in the error message. These patterns must be updated to the new 1:N `matchAndRewrite` API.	2024-11-30 09:27:47 +09:00
Matthias Springer	cbc7802233	[mlir][bufferization] Remove `finalizing-bufferize` pass (#114154 ) The dialect conversion-based bufferization passes have been migrated to One-Shot Bufferize about two years ago. To clean up the code base, this commit removes the `finalizing-bufferize` pass, one of the few remaining parts of the old infrastructure. Most bufferization passes have already been removed. Note for LLVM integration: If you depend on this pass, migrate to One-Shot Bufferize or copy the pass to your codebase. Depends on #114152.	2024-11-21 10:51:23 +09:00
Matthias Springer	204234a69c	[mlir][SparseTensor][NFC] Pass tensor type to descriptor helper (#116468 ) `getDescriptorFromTensorTuple` and `getMutDescriptorFromTensorTuple` extract the tensor type from an `unrealized_conversion_cast` op that serves as a workaround for missing 1:N dialect conversion support. This commit changes these functions so that they explicitly receive the tensor type as a function argument. This is in preparation of merging the 1:1 and 1:N conversion drivers. The conversion patterns in this file will soon start receiving multiple SSA values (`ValueRange`) from their adaptors (instead of a single value that is the result of `unrealized_conversion_cast`). It will no longer be possible to take the tensor type from the `unrealized_conversion_cast` op. The `unrealized_conversion_cast` workaround will disappear entirely.	2024-11-19 09:27:51 +09:00
Matthias Springer	aed4356252	[mlir][Transforms] Dialect Conversion: Add `replaceOpWithMultiple` (#115816 ) This commit adds a new function `ConversionPatternRewriter::replaceOpWithMultiple`. This function is similar to `replaceOp`, but it accepts multiple `ValueRange` replacements, one per op result. Note: This function is not an overload of `replaceOp` because of ambiguous overload resolution that would make the API difficult to use. This commit aligns "block signature conversions" with "op replacements": both support 1:N replacements now. Due to incomplete 1:N support in the dialect conversion driver, an argument materialization is inserted when an SSA value is replaced with multiple values; same as block signature conversions already work around the problem. These argument materializations are going to be removed in a subsequent commit that adds full 1:N support. The purpose of this PR is to add missing features gradually in small increments. This commit also updates two MLIR transformations that have their custom workarounds around missing 1:N support. These can already start using `replaceOpWithMultiple`. Co-authored-by: Markus Böck <markus.boeck02@gmail.com>	2024-11-14 10:27:58 +09:00
Manupa Karunaratne	a6e72f9392	[MLIR][Vector] Add Lowering for vector.step (#113655 ) Currently, the lowering for vector.step lives under a folder. This is not ideal if we want to do transformation on it and defer the materizaliztion of the constants much later. This commits adds a rewrite pattern that could be used by using `transform.structured.vectorize_children_and_apply_patterns` transform dialect operation. Moreover, the rewriter of vector.step is also now used in -convert-vector-to-llvm pass where it handles scalable and non-scalable types as LLVM expects it. As a consequence of removing the vector.step lowering as its folder, linalg vectorization will keep vector.step intact.	2024-11-01 16:38:36 +00:00
Matthias Springer	f18c3e4e73	[mlir][Transforms] Dialect Conversion: Simplify materialization fn result type (#113031 ) This commit simplifies the result type of materialization functions. Previously: `std::optional<Value>` Now: `Value` The previous implementation allowed 3 possible return values: - Non-null value: The materialization function produced a valid materialization. - `std::nullopt`: The materialization function failed, but another materialization can be attempted. - `Value()`: The materialization failed and so should the dialect conversion. (Previously: Dialect conversion can roll back.) This commit removes the last variant. It is not particularly useful because the dialect conversion will fail anyway if all other materialization functions produced `std::nullopt`. Furthermore, in contrast to type conversions, at least one materialization callback is expected to succeed. In case of a failing type conversion, the current dialect conversion can roll back and try a different pattern. This also used to be the case for materializations, but that functionality was removed with #107109: failed materializations can no longer trigger a rollback. (They can just make the entire dialect conversion fail without rollback.) With this in mind, it is even less useful to have an additional error state for materialization functions. This commit is in preparation of merging the 1:1 and 1:N type converters. Target materializations will have to return multiple values instead of a single one. With this commit, we can keep the API simple: `SmallVector<Value>` instead of `std::optional<SmallVector<Value>>`. Note for LLVM integration: All 1:1 materializations should return `Value` instead of `std::optional<Value>`. Instead of `std::nullopt` return `Value()`.	2024-10-23 07:29:17 -07:00
Kazu Hirata	af6e1881e0	[mlir] Avoid repeated map lookups (NFC) (#113122 )	2024-10-21 06:52:24 -07:00
Kazu Hirata	2077fb80ff	[mlir] Avoid repeated map lookups (NFC) (#113074 )	2024-10-20 10:42:28 -07:00
Matthias Springer	77f8297c6f	[mlir][sparse] Improve sparse tensor type constraints (#112133 ) Sparse tensors are always ranked tensors. Encodings cannot be attached to unranked tensors. Change the type constraint to `RankedTensorOf`, so that we generate `TypedValue<RankedTensorType>` instead of `TypedValue<TensorType>`. This removes the need for type casting in some cases. Also improve the verifiers (missing `return` statements) and switch a few other `AnyTensor` to `AnyRankedTensor`. This commit is in preparation of a dialect conversion commit that required fixes in the sparse dialect.	2024-10-13 21:12:38 +02:00
Matthias Springer	206fad0e21	[mlir][NFC] Mark type converter in `populate...` functions as `const` (#111250 ) This commit marks the type converter in `populate...` functions as `const`. This is useful for debugging. Patterns already take a `const` type converter. However, some `populate...` functions do not only add new patterns, but also add additional type conversion rules. That makes it difficult to find the place where a type conversion was added in the code base. With this change, all `populate...` functions that only populate pattern now have a `const` type converter. Programmers can then conclude from the function signature that these functions do not register any new type conversion rules. Also some minor cleanups around the 1:N dialect conversion infrastructure, which did not always pass the type converter as a `const` object internally.	2024-10-05 21:32:40 +02:00
Matthias Springer	49df12c01e	[mlir][NFC] Minor cleanup around `ModuleOp` usage (#110498 ) Use `moduleOp.getBody()` instead of `moduleOp.getBodyRegion().front()`.	2024-09-30 21:20:48 +02:00
Longsheng Mou	129ade21bd	[mlir][sparse] Replace `getSparseTensorType` with `tryGetSparseTensorType` (#109435 ) This PR fixes a bug in `SparseTensorDimOpRewriter` when `tensor.dim` has an unranked tensor type. To prevent crashes, we now use `tryGetSparseTensorType` instead of `getSparseTensorType`. Fixes #107807.	2024-09-30 09:16:55 +08:00
JOE1994	884221eddb	[mlir] Tidy uses of llvm::raw_stream_ostream (NFC) As specified in the docs, 1) raw_string_ostream is always unbuffered and 2) the underlying buffer may be used directly ( 65b13610a5226b84889b923bae884ba395ad084d for further reference ) * Don't call raw_string_ostream::flush(), which is essentially a no-op. * Avoid unneeded calls to raw_string_ostream::str(), to avoid excess indirection.	2024-09-16 23:23:25 -04:00
Aart Bik	0e34dbb4f4	[mlir][sparse] fix bug with all-dense assembler (#108615 ) When only all-dense "sparse" tensors occur in a function prototype, the assembler would skip the method conversion purely based on input/output counts. It should rewrite based on the presence of any annotation, however.	2024-09-13 17:24:48 -07:00
Yinying Li	8f0c014b12	[mlir][sparse] add parallelization options to mini pipeline (#104233 )	2024-08-30 17:47:37 -04:00
Christopher Bate	8bf69ceb00	Reapply "[mlir] NFC: fix dependence of (Tensor\|Linalg\|MemRef\|Complex) dialects on LLVM Dialect and LLVM Core in CMake build (#104832 )" (#105703 ) Reapply the commit 43b508566799751aa180f1eaaafc5be693f2f1ae with additional fixes for building with BUILD_SHARED_LIBS=ON.	2024-08-28 22:34:14 -06:00
Hugo Trachino	cb9267f055	[mlir][vector] Rename LowerVectorToLLVM to ConvertVectorToLLVM (NFC) (#104785 ) There was some inconsistency with ConvertVectorToLLVM Pass builder, files and option names. This patch aims to move all occurences to ConvertVectorToLLVM.	2024-08-27 09:13:45 +01:00
Peiming Liu	b48ef8d8d4	[mlir][sparse] unify block arguments order between iterate/coiterate operations. (#105567 )	2024-08-23 14:49:00 -07:00
Peiming Liu	7186704204	[mlir][sparse] refactoring sparse_tensor.iterate lowering pattern implementation. (#105566 )	2024-08-23 11:21:44 -07:00
Peiming Liu	f607102a0d	[mlir][sparse] partially support lowering sparse coiteration loops to scf.while/for. (#105565 )	2024-08-23 10:47:44 -07:00
Peiming Liu	c44202574f	[mlir][sparse] support sparsification to coiterate operations. (#102546 )	2024-08-20 11:13:38 -07:00
Matthias Springer	7359a6b799	[mlir][ODS] Verify type constraints in Types and Attributes (#102326 ) When a type/attribute is defined in TableGen, a type constraint can be used for parameters, but the type constraint verification was missing. Example: ``` def TestTypeVerification : Test_Type<"TestTypeVerification"> { let parameters = (ins AnyTypeOf<[I16, I32]>:$param); // ... } ``` No verification code was generated to ensure that `$param` is I16 or I32. When type constraints a present, a new method will generated for types and attributes: `verifyInvariantsImpl`. (The naming is similar to op verifiers.) The user-provided verifier is called `verify` (no change). There is now a new entry point to type/attribute verification: `verifyInvariants`. This function calls both `verifyInvariantsImpl` and `verify`. If neither of those two verifications are present, the `verifyInvariants` function is not generated. When a type/attribute is not defined in TableGen, but a verifier is needed, users can implement the `verifyInvariants` function. (This function was previously called `verify`.) Note for LLVM integration: If you have an attribute/type that is not defined in TableGen (i.e., just C++), you have to rename the verification function from `verify` to `verifyInvariants`. (Most attributes/types have no verification, in which case there is nothing to do.) Depends on #102657.	2024-08-09 22:04:40 +02:00
Nikhil Kalra	84cc1865ef	[mlir] Support DialectRegistry extension comparison (#101119 ) `PassManager::run` loads the dependent dialects for each pass into the current context prior to invoking the individual passes. If the dependent dialect is already loaded into the context, this should be a no-op. However, if there are extensions registered in the `DialectRegistry`, the dependent dialects are unconditionally registered into the context. This poses a problem for dynamic pass pipelines, however, because they will likely be executing while the context is in an immutable state (because of the parent pass pipeline being run). To solve this, we'll update the extension registration API on `DialectRegistry` to require a type ID for each extension that is registered. Then, instead of unconditionally registered dialects into a context if extensions are present, we'll check against the extension type IDs already present in the context's internal `DialectRegistry`. The context will only be marked as dirty if there are net-new extension types present in the `DialectRegistry` populated by `PassManager::getDependentDialects`. Note: this PR removes the `addExtension` overload that utilizes `std::function` as the parameter. This is because `std::function` is copyable and potentially allocates memory for the contained function so we can't use the function pointer as the unique type ID for the extension. Downstream changes required: - Existing `DialectExtension` subclasses will need a type ID to be registered for each subclass. More details on how to register a type ID can be found here: `8b68e06731/mlir/include/mlir/Support/TypeID.h (L30)` - Existing uses of the `std::function` overload of `addExtension` will need to be refactored into dedicated `DialectExtension` classes with associated type IDs. The attached `std::function` can either be inlined into or called directly from `DialectExtension::apply`. --------- Co-authored-by: Mehdi Amini <joker.eph@gmail.com>	2024-08-06 01:32:36 +02:00
Kazu Hirata	5262865aac	[mlir] Construct SmallVector with ArrayRef (NFC) (#101896 )	2024-08-04 11:43:05 -07:00
Peiming Liu	785a24f156	[mlir][sparse] introduce `sparse_tensor.coiterate` operation. (#101100 ) This PR introduces `sparse_tensor.coiterate` operation, which represents a loop that traverses multiple sparse iteration space.	2024-07-31 15:14:27 -07:00
Peiming Liu	951a363097	[mlir][sparse] implement `sparse_tensor.extract_value` operation. (#101220 )	2024-07-31 14:47:29 -07:00
Peiming Liu	12189f8005	[mlir][sparse] introduce `sparse_tensor.extract_value` operation. (#101219 )	2024-07-30 16:26:18 -07:00
Cullen Rhodes	074414fdbd	[mlir][sparse] Use vector.step for index vector generation (#97692 )	2024-07-08 08:29:49 +01:00
Peiming Liu	1ba2768c63	[mlir][sparse] expose emit strategy option to mini pipeline (#96238 )	2024-06-20 14:23:36 -07:00
Peiming Liu	a02010b3e9	[mlir][sparse] support sparsifying sparse kernels to sparse-iterator-based loop (#95858 )	2024-06-17 16:50:12 -07:00
Peiming Liu	d6cc35f7f6	Reapply "[mlir][sparse] implement lowering rules for IterateOp." (#95836 )	2024-06-17 13:29:53 -07:00
Shivam Gupta	bba5951b6f	[MLIR] Fix an assert that contains a mistake in conditional operator (#95668 ) This is described in (N2) https://pvs-studio.com/en/blog/posts/cpp/1126/ so caught by the PVS Studio analyzer. Warning message - V502 Perhaps the '?:' operator works in a different way than it was expected. The '?:' operator has a lower priority than the '+' operator. LoopEmitter.cpp 983 V502 Perhaps the '?:' operator works in a different way than it was expected. The '?:' operator has a lower priority than the '+' operator. LoopEmitter.cpp 1039 The assert should be assert(bArgs.size() == reduc.size() + (needsUniv ? 1 : 0)); since + has higher precedence and ? has lower. This further can be reduce to assert(aArgs.size() == reduc.size() + needsUniv); because needUniv is a bool value which is implicitly converted to 0 or	2024-06-18 01:09:30 +05:30
Peiming Liu	996905d815	Revert "[mlir][sparse] implement lowering rules for IterateOp." (#95826 ) Reverts llvm/llvm-project#95286	2024-06-17 11:35:23 -07:00
Peiming Liu	3a2e442893	[mlir][sparse] implement lowering rules for IterateOp. (#95286 )	2024-06-17 10:45:50 -07:00
Peiming Liu	a43d79af78	[mlir][sparse] add canonicalization patterns for IterateOp. (#95569 )	2024-06-14 10:31:29 -07:00
Peiming Liu	c42bbda425	[mlir][sparse] implement lowering rules for ExtractIterSpaceOp. (#89143 ) DO NOT MERGE until https://github.com/llvm/llvm-project/pull/89003	2024-06-12 10:49:12 -07:00
Aart Bik	438a7d4c98	[mlir][sparse] expose optimization flags to mini pipeline (#95158 ) Some of the options only fed into the full sparse pipeline. However, some backends prefer to use the sparse minipipeline. This change exposes some important optimization flags to the pass as well. This prepares some SIMDization of PyTorch sparsified code.	2024-06-11 14:20:58 -07:00
Peiming Liu	c6d85baf9f	[mlir][sparse] implement sparse space collapse pass. (#89003 )	2024-06-11 12:10:54 -07:00
Peiming Liu	d4749765fc	[mlir][sparse] fix missing cmake dependencies. (#95034 ) Fix https://github.com/llvm/llvm-project/pull/88955	2024-06-10 13:44:07 -07:00
Peiming Liu	e276cf0831	[mlir][sparse] introduce `sparse_tensor.iterate` operation (#88955 ) A `sparse_tensor.iterate` iterates over a sparse iteration space extracted from `sparse_tensor.extract_iteration_space` operation introduced in https://github.com/llvm/llvm-project/pull/88554.	2024-06-10 10:20:24 -07:00
Kazu Hirata	197c3a3efc	Use llvm::less_first (NFC) (#94136 )	2024-06-02 07:45:50 -07:00
Peiming Liu	99835922ca	[mlir][sparse] remove sparse encoding propagation pass. (#93593 )	2024-05-28 11:23:15 -07:00
Kazu Hirata	2f52bbeb6f	[mlir] Use operator==(StringRef, StringRef) (NFC) (#92706 )	2024-05-19 15:20:46 -07:00
Peiming Liu	fb8f492a1c	[mlir][sparse] clone a empty sparse tensor when fuse convert into pro… (#92158 ) …ducer.	2024-05-14 13:26:49 -07:00

1 2 3 4 5 ...

1023 Commits