llvm-project

Author	SHA1	Message	Date
Nicolas Vasilache	973e133b76	[mlir][Linalg] Improve region support in Linalg ops. This revision takes advantage of the newly extended `ref` directive in assembly format to allow better region handling for LinalgOps. Specifically, FillOp and CopyOp now build their regions explicitly which allows retiring older behavior that relied on specific op knowledge in both lowering to loops and vectorization. Differential Revision: https://reviews.llvm.org/D96598	2021-02-12 14:51:03 +00:00
Hanhan Wang	9325b8da17	[mlir][Linalg] Add conv ops with TF definition. The dimension order of a filter in tensorflow is [filter_height, filter_width, in_channels, out_channels], which is different from current definition. The current definition follows TOSA spec. Add TF version conv ops to .tc, so we do not have to insert a transpose op around a conv op. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96038	2021-02-10 22:59:38 -08:00
Nicolas Vasilache	bb69de3f41	[mlir][Linalg] Add a vectorization pattern for linalg::PadTensorOp The new pattern is exercised from the TestLinalgTransforms pass. Differential Revision: https://reviews.llvm.org/D96410	2021-02-10 14:13:49 +00:00
Nicolas Vasilache	0fcbbde2c7	[mlir][Linalg] NFC - Refactor vectorization to be more composable Differential Revision: https://reviews.llvm.org/D96116	2021-02-05 12:03:14 +00:00
Mehdi Amini	215441fcb7	Remove dead code from Linalg vectorization to fix GCC warning (NFC)	2021-02-04 17:37:25 +00:00
Nicolas Vasilache	e4a503a26d	[mlir][Linalg] Introduce a ContractionOpInterface This revision takes advantage of recent extensions to vectorization to refactor contraction detection into a bona fide Linalg interface. The mlir-linalg-ods-gen parser is extended to support adding such interfaces. The detection that was originally enabling vectorization is refactored to serve as both a test on a generic LinalgOp as well as to verify ops that declare to conform to that interface. This is plugged through Linalg transforms and strategies but it quickly becomes evident that the complexity and rigidity of the C++ class based templating does not pay for itself. Therefore, this revision changes the API for vectorization patterns to get rid of templates as much as possible. Variadic templates are relegated to the internals of LinalgTransformationFilter as much as possible and away from the user-facing APIs. It is expected other patterns / transformations will follow the same path and drop as much C++ templating as possible from the class definition. Differential revision: https://reviews.llvm.org/D95973	2021-02-04 16:53:24 +00:00
Nicolas Vasilache	f245b7ad36	[mlir][Linalg] Generalize the definition of a Linalg contraction. This revision defines a Linalg contraction in general terms: 1. Has 2 input and 1 output shapes. 2. Has at least one reduction dimension. 3. Has only projected permutation indexing maps. 4. its body computes `u5(u1(c) + u2(u3(a) * u4(b)))` on some field (AddOpType, MulOpType), where u1, u2, u3, u4 and u5 represent scalar unary operations that may change the type (e.g. for mixed-precision). As a consequence, when vectorization of such an op occurs, the only special behavior is that the (unique) MulOpType is vectorized into a `vector.contract`. All other ops are handled in a generic fashion. In the future, we may wish to allow more input arguments and elementwise and constant operations that do not involve the reduction dimension(s). A test is added to demonstrate the proper vectorization of matmul_i8_i8_i32. Differential revision: https://reviews.llvm.org/D95939	2021-02-04 07:50:44 +00:00
Benjamin Kramer	94f540cc7c	[mlir][Linalg] Fix unused variable warning in Release builds. NFC.	2021-02-02 12:59:41 +01:00
Nicolas Vasilache	0a2a260aab	[mlir][Linalg] Refactor Linalg vectorization for better reuse and extensibility. This revision unifies Linalg vectorization and paves the way for vectorization of Linalg ops with mixed-precision operations. The new algorithm traverses the ops in the linalg block in order and avoids recursion. It uses a BlockAndValueMapping to keep track of vectorized operations. The revision makes the following modifications but is otherwise NFC: 1. vector.transfer_read are created eagerly and may appear in a different order than the original order. 2. a more progressive vectorization to vector.contract results in only the multiply operation being converted to `vector.contract %a, %b, %zero`, where `%zero` is a constant of the proper type. Later vector canonicalizations are assumed to rewrite vector.contract %a, %b, %zero + add to a proper accumulate form. Differential revision: https://reviews.llvm.org/D95797	2021-02-02 11:31:09 +00:00
Nicolas Vasilache	299cc5da6d	[mlir][Linalg] Further improve codegen strategy and add a linalg.matmul_i8_i8_i32 This revision adds a layer of SFINAE to the composable codegen strategy so it does not have to require statically defined ops but instead can also be used with OpInterfaces, Operation* and an op name string. A linalg.matmul_i8_i8_i32 is added to the .tc spec to demonstrate how all this works end to end. Differential Revision: https://reviews.llvm.org/D95600	2021-01-28 13:02:42 +00:00
Nicolas Vasilache	d0c9fb1b8e	[mlir][Linalg] Improve codegen strategy This revision improves the usage of the codegen strategy by adding a few flags that make it easier to control for the CLI. Usage of ModuleOp is replaced by FuncOp as this created issues in multi-threaded mode. A simple benchmarking capability is added for linalg.matmul as well as linalg.matmul_column_major. This latter op is also added to linalg. Now obsolete linalg integration tests that also take too long are deleted. Correctness checks are still missing at this point. Differential revision: https://reviews.llvm.org/D95531	2021-01-28 10:59:16 +00:00
Thomas Raoux	cf216670a0	[mlir][linalg] Add vectorization for linalg on tensor ops Support vectorization of linalg ops using tensor inputs/outputs. Differential Revision: https://reviews.llvm.org/D93890	2020-12-29 09:02:23 -08:00
nicolasvasilache	b7ae1d3d2b	[mlir][Linalg] Revisit the Linalg on tensors abstraction This revision drops init_tensor arguments from Linalg on tensors and instead uniformizes the output buffers and output tensors to be consistent. This significantly simplifies the usage of Linalg on tensors and is a stepping stone for its evolution towards a mixed tensor and shape abstraction discussed in https://llvm.discourse.group/t/linalg-and-shapes/2421/19. Differential Revision: https://reviews.llvm.org/D93469	2020-12-21 12:29:10 -08:00
Thomas Raoux	26c8f9081b	[mlir][vector] Extend Transfer read/write ops to support tensor types. Transfer_ops can now work on both buffers and tensors. Right now, lowering of the tensor case is not supported yet. Differential Revision: https://reviews.llvm.org/D93500	2020-12-21 08:55:04 -08:00
Thomas Raoux	8955e9f6b7	[mlir][linalg] Fix bug in elementwise vectorization Fix a bug that caused the wrong vector size to be picked for broadcasting when the source vectors have different ranks. Differential Revision: https://reviews.llvm.org/D93118	2020-12-14 10:44:36 -08:00
Thomas Raoux	c503dc1b8a	[mlir][linalg] Add vectorization for element-wise linalg ops Add support for vectorization for linalg.generic representing element-wise ops. Those are converted to transfer_read + vector ops + transfer_write. Also re-organize the vectorization tests to be together. Implementation derived from the work of @burmako, @agrue and @fedelebron. Differential Revision: https://reviews.llvm.org/D92540	2020-12-03 15:31:13 -08:00
Thomas Raoux	29d1fba7b5	[mlir][vector] Make linalg FillOp vectorization use Transfer op Differential Revision: https://reviews.llvm.org/D90474	2020-11-03 14:35:26 -08:00
Jakub Lichman	0b17d4754a	[mlir][Linalg] Tile sizes for Conv ops vectorization added as pass arguments Current setup for conv op vectorization does not enable user to specify tile sizes as well as dimensions for vectorization. In this commit we change that by adding tile sizes as pass arguments. Every dimension with corresponding tile size > 1 is automatically vectorized. Differential Revision: https://reviews.llvm.org/D88533	2020-09-30 11:31:28 +00:00
Jakub Lichman	347d59b16c	[mlir][Linalg] Convolution tiling added to ConvOp vectorization pass ConvOp vectorization supports now only convolutions of static shapes with dimensions of size either 3(vectorized) or 1(not) as underlying vectors have to be of static shape as well. In this commit we add support for convolutions of any size as well as dynamic shapes by leveraging existing matmul infrastructure for tiling of both input and kernel to sizes accepted by the previous version of ConvOp vectorization. In the future this pass can be extended to take "tiling mask" as a user input which will enable vectorization of user specified dimensions. Differential Revision: https://reviews.llvm.org/D87676	2020-09-17 09:39:41 +00:00
Eugene Burmako	5638df1950	Introduce linalg.vecmat This patch adds a new named structured op to accompany linalg.matmul and linalg.matvec. We needed it for our codegen, so I figured it would be useful to add it to Linalg. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D87292	2020-09-10 18:48:14 +02:00
Jakub Lichman	fea175b59f	[mlir][Linalg] Small refactoring of ConvOpVectorization This commit addresses comments that were requested on D86619 after it was landed. Differential Revision: https://reviews.llvm.org/D87354	2020-09-10 07:05:30 +00:00
Jakub Lichman	83d82d1fb1	[mlir] Fix of broken build on windows caused by using uint	2020-09-08 09:42:25 +00:00
Jakub Lichman	67b37f571c	[mlir] Conv ops vectorization pass In this commit a new way of convolution ops lowering is introduced. The conv op vectorization pass lowers linalg convolution ops into vector contractions. This lowering is possible when conv op is first tiled by 1 along specific dimensions which transforms it into dot product between input and kernel subview memory buffers. This pass converts such conv op into vector contraction and does all necessary vector transfers that make it work. Differential Revision: https://reviews.llvm.org/D86619	2020-09-08 08:47:42 +00:00
Frederik Gossen	136eb79a88	[MLIR][Standard] Add `dynamic_tensor_from_elements` operation With `dynamic_tensor_from_elements` tensor values of dynamic size can be created. The body of the operation essentially maps the index space to tensor elements. Declare SCF operations in the `scf` namespace to avoid name clash with the new `std.yield` operation. Resolve ambiguities between `linalg/shape/std/scf.yield` operations. Differential Revision: https://reviews.llvm.org/D86276	2020-09-07 11:44:43 +00:00
River Riddle	d289a97f91	[mlir][PDL] Add a PDL Interpreter Dialect The PDL Interpreter dialect provides a lower level abstraction compared to the PDL dialect, and is targeted towards low level optimization and interpreter code generation. The dialect operations encapsulates low-level pattern match and rewrite "primitives", such as navigating the IR (Operation::getOperand), creating new operations (OpBuilder::create), etc. Many of the operations within this dialect also fuse branching control flow with some form of a predicate comparison operation. This type of fusion reduces the amount of work that an interpreter must do when executing. An example of this representation is shown below: ```mlir // The following high level PDL pattern: pdl.pattern : benefit(1) { %resultType = pdl.type %inputOperand = pdl.input %root, %results = pdl.operation "foo.op"(%inputOperand) -> %resultType pdl.rewrite %root { pdl.replace %root with (%inputOperand) } } // May be represented in the interpreter dialect as follows: module { func @matcher(%arg0: !pdl.operation) { pdl_interp.check_operation_name of %arg0 is "foo.op" -> ^bb2, ^bb1 ^bb1: pdl_interp.return ^bb2: pdl_interp.check_operand_count of %arg0 is 1 -> ^bb3, ^bb1 ^bb3: pdl_interp.check_result_count of %arg0 is 1 -> ^bb4, ^bb1 ^bb4: %0 = pdl_interp.get_operand 0 of %arg0 pdl_interp.is_not_null %0 : !pdl.value -> ^bb5, ^bb1 ^bb5: %1 = pdl_interp.get_result 0 of %arg0 pdl_interp.is_not_null %1 : !pdl.value -> ^bb6, ^bb1 ^bb6: pdl_interp.record_match @rewriters::@rewriter(%0, %arg0 : !pdl.value, !pdl.operation) : benefit(1), loc([%arg0]), root("foo.op") -> ^bb1 } module @rewriters { func @rewriter(%arg0: !pdl.value, %arg1: !pdl.operation) { pdl_interp.replace %arg1 with(%arg0) pdl_interp.return } } } ``` Differential Revision: https://reviews.llvm.org/D84579	2020-08-26 05:22:27 -07:00
Rahul Joshi	706d992ced	[NFC] Add getArgumentTypes() to Region - Add getArgumentTypes() to Region (missed from before) - Adopt Region argument API in `hasMultiplyAddBody` - Fix 2 typos in comments Differential Revision: https://reviews.llvm.org/D84807	2020-07-28 18:27:42 -07:00
Thomas Raoux	a1b9fb220f	[mlir][linalg] Add vectorization transform for CopyOp CopyOp get vectorized to vector.transfer_read followed by vector.transfer_write Differential Revision: https://reviews.llvm.org/D83739	2020-07-22 12:40:42 -07:00
Benjamin Kramer	bf561dd2eb	[mlir][Vector] Vectorize integer matmuls The underlying infrastructure supports this already, just add the pattern matching for linalg.generic. Differential Revision: https://reviews.llvm.org/D84335	2020-07-22 19:39:56 +02:00
Nicolas Vasilache	512da70be7	[mlir][Vector] Degrade masking information when forwarding linalg.copy to vector.transfer Summary: linalg.copy + linalg.fill can be used to create a padded local buffer. The `masked` attribute is only valid on this padded buffer. When forwarding to vector.transfer ops, the attribute must be reset conservatively. Differential Revision: https://reviews.llvm.org/D83782	2020-07-15 02:32:45 -04:00
Nicolas Vasilache	56c638b5c1	[mlir][Linalg] Generalize Vectorization of Linalg contractions This revision adds support for vectorizing named and generic contraction ops to vector.contract. Cases in which the memref is 0-D are special cased to emit std.load/std.store instead of vector.transfer. Relevant tests are added. Differential revision: https://reviews.llvm.org/D83307	2020-07-10 10:28:34 -04:00
River Riddle	9db53a1827	[mlir][NFC] Remove usernames and google bug numbers from TODO comments. These were largely leftover from when MLIR was a google project, and don't really follow LLVM guidelines.	2020-07-07 01:40:52 -07:00
Rahul Joshi	d891d738d9	[MLIR][NFC] Adopt variadic isa<> Differential Revision: https://reviews.llvm.org/D82489	2020-06-24 17:02:44 -07:00
Nicolas Vasilache	9534192c3b	[mlir][Linalg] Make contraction vectorization use vector transfers This revision replaces the load + vector.type_cast by appropriate vector transfer operations. These play more nicely with other vector abstractions and canonicalization patterns and lower to load/store with or without masks when appropriate. Differential Revision: https://reviews.llvm.org/D80809	2020-05-29 15:03:46 -04:00
Nicolas Vasilache	1ee114322c	[mlir][Linalg][Vector] Add forwarding patterns between linalg.copy and vector.transfer This revision adds custom rewrites for patterns that arise during linalg structured ops vectorization. These patterns allow the composition of linalg promotion, vectorization and removal of redundant copies. The patterns are voluntarily limited and restrictive atm. More robust behavior will be implemented once more powerful side effect modeling and analyses are available on view/subview. On the transfer_read side, the following pattern is rewritten: ``` %alloc = ... [optional] %view = std.view %alloc ... %subView = subview %allocOrView ... [optional] linalg.fill(%allocOrView, %cst) ... ... linalg.copy(%in, %subView) ... vector.transfer_read %allocOrView[...], %cst ... ``` into ``` [unchanged] %alloc = ... [unchanged] [optional] %view = std.view %alloc ... [unchanged] [unchanged] %subView = subview %allocOrView ... ... vector.transfer_read %in[...], %cst ... ``` On the transfer_write side, the following pattern is rewritten: ``` %alloc = ... [optional] %view = std.view %alloc ... %subView = subview %allocOrView... ... vector.transfer_write %..., %allocOrView[...] linalg.copy(%subView, %out) ``` Differential Revision: https://reviews.llvm.org/D80728	2020-05-29 08:08:34 -04:00
Nicolas Vasilache	307cfdf533	[mlir][Linalg] Mostly NFC - Refactor Linalg patterns and transformations. Linalg transformations are currently exposed as DRRs. Unfortunately RewriterGen does not play well with the line of work on named linalg ops which require variadic operands and results. Additionally, DRR is arguably not the right abstraction to expose compositions of such patterns that don't rely on SSA use-def semantics. This revision abandons DRRs and exposes manually written C++ patterns. Refactorings and cleanups are performed to uniformize APIs. This refactoring will allow replacing the currently manually specified Linalg named ops. A collateral victim of this refactoring is the `tileAndFuse` DRR, and the one associated test, which will be revived at a later time. Lastly, the following 2 tests do not add value and are altered: - a dot_perm tile + interchange test does not test anything new and is removed - a dot tile + lower to loops does not need 2-D tiling and is trimmed.	2020-05-04 11:17:37 -04:00

35 Commits