llvm-project

Author	SHA1	Message	Date
Matthias Springer	5eee80ce5e	[mlir][memref] Add runtime verification for memref::CastOp Verify unranked -> ranked casts and casts of dynamic sizes/offset/strides to static ones. Differential Revision: https://reviews.llvm.org/D138671	2023-01-06 14:38:56 +01:00
Matthias Springer	e7790fbed3	[mlir] Add `test-convergence` option to Canonicalizer tests This new option is set to `false` by default. It should be set only in Canonicalizer tests to detect faulty canonicalization patterns, i.e., patterns that prevent the canonicalizer from converging. The canonicalizer should always converge on the small unit tests that we have in `canonicalize.mlir`. Two faulty canonicalization patterns were detected and fixed with this change. Differential Revision: https://reviews.llvm.org/D140873	2023-01-04 12:02:21 +01:00
Matthias Springer	108b08f2a9	[mlir] Add RuntimeVerifiableOpInterface and transform Static op verification cannot detect cases where an op is valid at compile time but may be invalid at runtime. An example of such an op is `memref::ExpandShapeOp`. Invalid at compile time: `memref.expand_shape %m [[0, 1]] : memref<11xf32> into memref<2x5xf32>` Valid at compile time (because we do not know any better): `memref.expand_shape %m [[0, 1]] : memref<?xf32> into memref<?x5xf32>`. This op may or may not be valid at runtime depending on the runtime shape of `%m`. Invalid runtime ops such as the one above are hard to debug because they can crash the program execution at a seemingly unrelated position or (even worse) compute an invalid result without crashing. This revision adds a new op interface `RuntimeVerifiableOpInterface` that can be implemented by ops that provide additional runtime verification. Such runtime verification can be computationally expensive, so it is only generated on an opt-in basis by running `-generate-runtime-verification`. A simple runtime verifier for `memref::ExpandShapeOp` is provided as an example. Differential Revision: https://reviews.llvm.org/D138576	2022-12-21 10:57:14 +01:00
Matthias Springer	ccb8a4e3f3	[mlir][memref] Fold subview(subview(x)) Folding of rank-reduced subviews is also supported. Differential Revision: https://reviews.llvm.org/D140110	2022-12-15 17:50:12 +01:00
Matthias Springer	17f36648e6	[mlir][memref] Fold no-op subview(subview(x)) ops Differential Revision: https://reviews.llvm.org/D140008	2022-12-14 12:47:00 +01:00
Quentin Colombet	64f99842a6	[mlir][ExpandStridedMetadata] Handle collapse_shape of dim of size 1 gracefully Collapsing dimensions of size 1 with random strides (a.k.a. non-contiguous w.r.t. collapsed dimensions) is a grey area that we'd like to clean up. (See https://reviews.llvm.org/D136483#3909856) That said, the implementation in `memref-to-llvm` currently skips dimensions of size 1 when computing the stride of a group. While longer term we may want to clean that up, for now this patch matches that behavior, at least in the static case. For the dynamic case, this patch sticks to `min(group strides)`. However, if we want to handle the dynamic cases correctly while allowing non-truly-contiguous dynamic sizes of 1, we would need to `if-then-else` every dynamic size. In other words `min(stride_i, for all i in group and dim_i != 1)`. I didn't implement that in this patch since `memref-to-llvm` is technically broken in the general case for this. (It currently would only produce something sensible for row major tensors.) Differential Revision: https://reviews.llvm.org/D139329	2022-12-08 07:32:01 +00:00
Hanhan Wang	0a1569a400	[mlir][NFC] Remove trailing whitespaces from `.td` and `.mlir` files. This is generated by running ``` sed --in-place 's/[[:space:]]\+$//' mlir/**/*.td sed --in-place 's/[[:space:]]\+$//' mlir/**/*.mlir ``` Reviewed By: rriddle, dcaballe Differential Revision: https://reviews.llvm.org/D138866	2022-11-28 15:26:30 -08:00
Matthias Springer	f2d91a7ae1	[mlir][utils] Fix invalid reshapes in ComposeCollapseOfExpandOp Do not generate CollapseShapeOps/ExpandShapeOps that have the same source and result shape. Generate casts instead. Such reshapes became invalid with D138498. Differential Revision: https://reviews.llvm.org/D138557	2022-11-23 13:52:00 +01:00
Matthias Springer	b9745ad812	[mlir][tensor/memref] Disallow Collapse/ExpandShapeOps that do not reduce/increase the rank CollapseShapeOp/ExpandShapeOp that do not change the rank (or increase/reduce it) are invalid. Differential Revision: https://reviews.llvm.org/D138498	2022-11-23 09:19:35 +01:00
Quentin Colombet	8b97b4e7ee	[mlir][MemRef] NFC rename simplify-extract-strided-metadata This pass has outgrown its original goal and is now going to be used to expand certain memref operations before lowering. Reflect that in the name. The pass is now called expand-strided-metadata. NFC Differential Revision: https://reviews.llvm.org/D138448	2022-11-21 22:43:15 +00:00
Quentin Colombet	d665448a7f	[mlir][MemRef] Change the anchor point of a reshapeLikeOp pattern Essentially, this patch changes the anchor point of the `extract_strided_metadata(reshapeLikeOp)` pattern from `extract_strided_metadata` to `reshapeLikeOp`. In detail, this means that instead of replacing: ``` base, offset, sizes, strides = extract_strided_metadata(reshapeLikeOp(src)) ``` With ``` base, offset = extract_strided_metadata(src) sizes = <some math> strides = <some math> ``` We replace only the reshapeLikeOp part and connect it back with a reinterpret_cast: ``` val = reshapeLikeOp(src) ``` => ``` base, offset, ... = extract_strided_metadata(src) sizes = <some math> strides = <some math> val = reinterpret_cast base, offset, sizes, strides ``` Differential Revision: https://reviews.llvm.org/D136386	2022-11-14 18:56:35 +00:00
Quentin Colombet	41783666e4	[mlir][MemRef] Change the anchor point of a subview pattern Essentially, this patch changes the anchor point of the `extract_strided_metadata(subview)` pattern from `extract_strided_metadata` to `subview`. In detail, this means that instead of replacing: ``` base, offset, sizes, strides = extract_strided_metadata(subview(src)) ``` With ``` base, ... = extract_strided_metadata(src) offset = <some math> sizes = subSizes strides = <some math> ``` We replace only the subview part and connect it back with a reinterpret_cast: ``` val = subview(src) ``` => ``` base, ... = extract_strided_metadata(src) offset = <some math> sizes = subSizes strides = <some math> val = reinterpret_cast base, offset, sizes, strides ``` Differential Revision: https://reviews.llvm.org/D135839	2022-11-14 18:43:34 +00:00
Quentin Colombet	244af24faf	[mlir][MemRef] Simplify extract_strided_metadata(reinterpret_cast) This patch adds a pattern to simplify ``` base, offset, sizes, strides = extract_strided_metadata( reinterpret_cast(src, srcOffset, srcSizes, srcStrides)) ``` Into ``` base, baseOffset, ... = extract_strided_metadata(src) offset = srcOffset sizes = srcSizes strides = srcStrides ``` Note: Reinterpret_cast with unranked sources are not simplified since they cannot feed extract_strided_metadata operations. Differential Revision: https://reviews.llvm.org/D135837	2022-11-14 18:36:31 +00:00
Quentin Colombet	42263fb52d	[mlir][MemRef] Make reinterpret_cast(extract_strided_metadata) more robust Prior to this patch, the canonicalization pattern that turns `reinterpret_cast(extract_strided_metadata)` into a cast was only applied when the input operands of the `reinterpret_cast` were exactly the output results of the `extract_strided_metadata`. This missed simplification opportunities when the values held the same constant values but came from different actual values. E.g., prior to this patch, a pattern of the form: ``` %base, %offset = extract_strided_metadata %source : memref<i16> reinterpret_cast %base to offset:[0] ``` Wouldn't have been simplified into a simple cast, because %offset is not directly the same value object as 0. This patch teaches this pattern how to check if the constant values match what the results of the `extract_strided_metadata` operation would hold. Differential Revision: https://reviews.llvm.org/D135736	2022-11-14 18:02:15 +00:00
River Riddle	38c219b4a8	[mlir] Infer SubElementInterface implementations using the storage KeyTy The KeyTy of attribute/type storage classes provide enough information for automatically implementing the necessary sub element interface methods. This removes the need for derived classes to do it themselves, which is both much nicer and easier to handle certain invariants (e.g. null handling). In cases where explicitly handling for parameter types is necessary, they can provide an implementation of `AttrTypeSubElementHandler` to opt-in to support. This tickles a few things alias wise, which annoyingly messes with tests that hard code specific affine map numbers. Differential Revision: https://reviews.llvm.org/D137374	2022-11-04 18:15:03 -07:00
Zequan Wu	a7fa5febaa	[Test] Fix CHECK typo. Differential Revision: https://reviews.llvm.org/D137287	2022-11-04 10:18:04 -07:00
River Riddle	c8496d292e	[mlir] Refactor alias generation to support nested aliases We currently only support one level of aliases, which isn't great in situations where an attribute/type can have multiple duplicated components nested within it (e.g. debuginfo metadata). This commit refactors alias generation to support nested aliases, which requires changing alias grouping to take into account the depth of child aliases, to ensure that attributes/types aren't printed before the aliases they use. The only real user-facing change here is that we no longer print 0 as an alias suffix, which would be unnecessarily expensive to keep in the new alias generation method (and isn't that valuable of a behavior to preserve). Differential Revision: https://reviews.llvm.org/D136541	2022-10-23 23:59:55 -07:00
Quentin Colombet	98c529652a	[mlir][MemRef] Move the forwarding patterns for `extract_strided_metadata` The `SimplifyExtractStridedMetadata` pass features a pattern that forward statically known information (offset, sizes, strides) to their respective users. This patch moves this pattern from this pass to the `extract_strided_metadata` folding patterns. Differential Revision: https://reviews.llvm.org/D135797	2022-10-18 22:34:50 +00:00
Quentin Colombet	df455beedf	[mlir][MemRef] Fix the simplification of extract_strided_metadata(subview) Prior to this patch we were wrongly applying the sub-strides to the computation of the final offset of the subview. Put differently, we were computing the offset as: ``` offset = baseOffset + sum(subOffset#i * baseStrides#i * subSizes#i) ``` Whereas we should be doing: ``` offset = baseOffset + sum(subOffset#i * baseStrides#i) ``` I.e., drop the subSizes#i term from the sum. Differential Revision: https://reviews.llvm.org/D136107	2022-10-18 19:29:49 +00:00
Quentin Colombet	3a33c146ed	[mlir][MemRef] Add a extract_strided_metadata(extract_strided_metadata) pattern This pattern will be useful to get cleaner code when lowering view like operations. Differential Revision: https://reviews.llvm.org/D135836	2022-10-14 19:02:10 +00:00
Jakub Kuderski	fae258e6c6	[mlir][memref] Add initial Wide Int Emulation pass and patterns Add a new pass and conversions to emulate wide integer operations over memrefs. The emulation is implemented on top of the existing pass to emulate wide integer arith ops. Improve naming in the arith pass to avoid potential name clashes. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D135722	2022-10-14 11:37:52 -04:00
Alex Zinenko	59bb8af4c3	[mlir] switch the transform loop extension to use types Add types to the Loop (SCF) extension of the transform dialect. See https://discourse.llvm.org/t/rfc-type-system-for-the-transform-dialect/65702 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D135587	2022-10-11 09:55:23 +00:00
Kirsten Lee	a8aeb651cd	[mlir][memref] Extend multi-buffering transform Extend multi-buffering to simplify the affine map created if any of its operands are constants. This avoids downstream problems where more complex affine.apply operations cannot be expanded. Transfer attributes from the old allocation to the new allocation. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D134894	2022-10-03 18:45:38 +00:00
Quentin Colombet	d831568171	[mlir][MemRef] Simplify extract_strided_metadata(collapse_shape) The new pattern gets rid of the collapse_shape operation while materializing its effects on the sizes, and the strides of the base object. In other words, this simplification replaces: ``` baseBuffer, offset, sizes, strides = extract_strided_metadata(collapse_shape(memref)) ``` With ``` baseBuffer, offset, baseSizes, baseStrides = extract_strided_metadata(memref) for reassDim in {0 .. collapseRank - 1} sizes#reassDim = product(baseSizes#i for i in group[reassDim]) strides#reassDim = baseStrides[group[reassDim].back()] ``` Note: baseBuffer and offset are unaffected by the collapse_shape operation. Differential Revision: https://reviews.llvm.org/D134826	2022-09-30 16:54:56 +00:00
Nicolas Vasilache	435debea69	[mlir][test] NFC - Fix some worst offenders "capture by SSA name" tests Many tests still depend on specific names of SSA values (!!). This commit is a best effort cleanup that will set the stage for adding some pretty SSA result names.	2022-09-30 08:24:13 -07:00
Nicolas Vasilache	df6387079e	[mlir][memref]Add pattern to forward memref.extract_aligned_pointer_as_index(view_like_op) to its source Differential Revision: https://reviews.llvm.org/D134835	2022-09-29 02:27:01 -07:00
Kirsten Lee	3f050f6ac4	[mlir][transform] Add multi-buffering to the transform dialect Add the plumbing necessary to call the memref dialect's multiBuffer function. This will allow separation between choosing which buffers to multi-buffer and the actual transform. Alter the multibuffer function to return the newly created allocation if multi-buffering succeeds. This is necessary to communicate with the transform dialect hooks what allocation multi-buffering created. Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D133985	2022-09-28 14:30:02 -07:00
Quentin Colombet	9d259916e1	[mlir][MemRef] Simplify extract_strided_metadata(allocLikeOp) Teach the pass that simplifies extract_strided_metadata(other_op(memref)) how to get rid of extract_strided_metadata when they are fed by allocLikeOp. For the simplification to happen the allocLikeOp needs to have been normalized. I.e., no weird offset and strides. When this is the case, we replace: ``` base, offset, sizes, strides = extract_strided_metadata(allocLikeOp(allocSizes)) ``` With ``` base = reinterpret_cast allocLikeOp(allocSizes) to a flat memref<eltTy> offset = 0 sizes = allocSizes strides#i = prod(allocSizes#j, for j in {i+1..rank-1}) ``` The computation involving dynamic sizes are expanded in affine.apply. Differential Revision: https://reviews.llvm.org/D134577	2022-09-26 16:14:29 +00:00
Nicolas Vasilache	b3d48a60ff	[mlir][Memref] Introduce a memref::ExtractAlignedPointerAsIndexOp As experience with memref::ExtractStridedMetadataOp grows, we are still missing a simple way to extract the pointer held by a memref and lower to different backends (LLVM, SPIRV, library calls). This revision introduces a memref.extract_aligned_pointer_as_index that returns an index containing the aligned pointer of the strided memref. This operation is intended to be used solely as a step during lowering; it has no side effects. A reverse operation that creates a memref from an index interpreted as a pointer is explicitly discouraged. Differential Revision: https://reviews.llvm.org/D134651	2022-09-26 08:55:05 -07:00
Nicolas Vasilache	f7e1ce0f30	[mlir][MemRef] Add pattern that forwards constant strided metadata. `memref.extract_strided_metadata` can forward constants independently of the existence of other operations such as subview or reshape. Differential Revision: https://reviews.llvm.org/D134603	2022-09-26 08:34:31 -07:00
Quentin Colombet	d0aeb74e88	[mlir][MemRef] Simplify extract_strided_metadata(expand_shape) Add a pattern to the pass that simplifies extract_strided_metadata(other_op(memref)). The new pattern gets rid of the expand_shape operation while materializing its effects on the sizes, and the strides of the base object. In other words, this simplification replaces: ``` baseBuffer, offset, sizes, strides = extract_strided_metadata(expand_shape(memref)) ``` With ``` baseBuffer, offset, baseSizes, baseStrides = extract_strided_metadata(memref) sizes#reassIdx = baseSizes#reassDim / product(expandShapeSizes#j, for j in group excluding reassIdx) strides#reassIdx = baseStrides#reassDim * product(expandShapeSizes#j, for j in reassIdx+1.. reassIdx+group.size-1) ``` Where `reassIdx` is a reassociation index for the group at `reassDim` and `expandShapeSizes#j` is either: - The constant size at dimension j, derived directly from the result type of the expand_shape op, or - An affine expression: baseSizes#reassDim / product of all constant sizes in expandShapeSizes. Note: baseBuffer and offset are unaffected by the expand_shape operation. Differential Revision: https://reviews.llvm.org/D133625	2022-09-22 19:07:09 +00:00
bixia1	9f13b9346b	[mlir][memref] Add realloc op. Add memref.realloc and canonicalization of the op. Add conversion patterns for lowering the op to LLVM using unaligned alloc or aligned alloc based on the conversion option. Add filecheck tests for parsing and converting the op. Add an integration test. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D133424	2022-09-21 08:04:00 -07:00
Ivan Butygin	54d81e49e3	[mlir] Allow negative strides and offset in StridedLayoutAttr Negative strides are useful for creating a reverse view of an array. We don't have a specific example for a negative offset yet but will add it for consistency. Differential Revision: https://reviews.llvm.org/D134147	2022-09-21 13:21:53 +02:00
Alex Zinenko	f3fae035c7	[mlir] use strided layout in structured codegen-related tests All relevant operations have been switched to primarily use the strided layout, but still support the affine map layout. Update the relevant tests to use the strided format instead for compatibility with how ops now print by default. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D134045	2022-09-17 08:11:28 +02:00
Alex Zinenko	46b90a7b5d	[mlir] make remaining memref dialect ops produce strided layouts The three following ops in the memref dialect: transpose, expand_shape, collapse_shape, have been originally designed to operate on memrefs with strided layouts but had to go through the affine map representation as the type did not support anything else. Make these ops produce memref values with StridedLayoutAttr instead now that it is available. Depends On D133938 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133947	2022-09-16 10:56:48 +02:00
Alex Zinenko	2791162b01	[mlir] make memref.subview produce strided layout The memref subview operation was initially designed to work on memrefs with strided layouts only and has never supported anything else. Port it to use the recently added StridedLayoutAttr instead of extracting the strides implicitly from affine maps. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133938	2022-09-16 10:56:46 +02:00
Nicolas Vasilache	b7d47ed1da	[mlir][memref] Add support for 0-D transfer / subview fold. The 0-d case simply forwards the indexing from the source memref and works out of the box. Differential Revision: https://reviews.llvm.org/D133536	2022-09-08 15:25:05 -07:00
Quentin Colombet	63a2536f77	[mlir][MemRef] Simplify extract_strided_metadata(subview) Add a dedicated pass to simplify extract_strided_metadata(other_op(memref)). Currently the pass features only one pattern: extract_strided_metadata(subview). The goal is to get rid of the subview while materializing its effects on the offset, sizes, and strides with respect to the base object. In other words, this simplification replaces: ``` baseBuffer, offset, sizes, strides = extract_strided_metadata( subview(memref, subOffset, subSizes, subStrides)) ``` With ``` baseBuffer, baseOffset, baseSizes, baseStrides = extract_strided_metadata(memref) strides#i = baseStrides#i * subSizes#i offset = baseOffset + sum(subOffset#i * strides#i) sizes = subSizes ``` Differential Revision: https://reviews.llvm.org/D133166	2022-09-08 17:10:02 +00:00
Alex Zinenko	519847fefc	[mlir] materialize strided memref layout as attribute Introduce a new attribute to represent the strided memref layout. Strided layouts are omnipresent in code generation flows and are the only kind of layouts produced and supported by half of the operations in the memref dialect (view-related, shape-related). However, they are internally represented as affine maps that require a somewhat fragile extraction of the strides from the linear form that also comes with an overhead. Furthermore, the textual representation of strided layouts as affine maps is difficult to read: compare `affine_map<(d0, d1, d2)[s0, s1] -> (d0 * 32 + d1 * s0 + s1 + d2)>` with `strides: [32, ?, 1], offset: ?`. While rudimentary support for parsing a syntactically sugared version of the strided layout has existed in the codebase for a long time, it does not go as far as this commit to make the strided layout a first-class attribute in the IR. This introduces the attribute and updates the tests that use the pre-existing sugared form to use the new attribute instead. Most memrefs created programmatically, e.g., in passes, still use the affine form with further extraction of strides and will be updated separately. Update and clean up the memref type documentation that has gotten stale and has been referring to the details of affine map composition that are long gone. See https://discourse.llvm.org/t/rfc-materialize-strided-memref-layout-as-an-attribute/64211. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D132864	2022-08-30 17:19:58 +02:00
Quentin Colombet	ba916c0cf6	[mlir][MemRef] Canonicalize reinterpret_cast(extract_strided_metadata) Add a canonicalization step for reinterpret_cast(extract_strided_metadata). This step replaces this sequence of operations by either: - A noop, i.e., the original memref is directly used, or - A plain cast of the original memref The choice is ultimately made based on whether the original memref type is equal to what the reinterpret_cast is producing. For instance, the reinterpret_cast could be changing some dimensions from static to dynamic and in such case, we need to keep a cast. The transformation is currently only performed when the reinterpret_cast uses exactly the same arguments as what the extract_strided_metadata produces. It may be possible to be more aggressive here but I wanted to start with a relatively simple MLIR patch for my first one! Differential Revision: https://reviews.llvm.org/D132776	2022-08-29 17:00:50 +00:00
Arnab Dutta	1b002d2768	Fold memref.expand_shape and memref.collapse_shape ops Fold memref.expand_shape and memref.collapse_shape ops into their memref/affine load/store ops. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D128986	2022-08-28 06:56:06 +05:30
Nicolas Vasilache	325426d72c	[mlir][MemRef] Introduce a memref.extract_metadata op. This is the counterpart of `memref.reinterpret_cast` and is useful to lift strided memref manipulation out of the LLVM dialect. Discussion: https://discourse.llvm.org/t/extracting-dynamic-offsets-strides-from-memref/64170 Differential Revision: https://reviews.llvm.org/D132243	2022-08-26 09:09:15 -07:00
Ivan Kosarev	ad1d60c3be	[FileCheck] Catch missspelled directives. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D125604	2022-05-26 11:37:19 +01:00
Yi Zhang	1cddcfdc3c	Fix CollapsedLayoutMap for dim size 1 case This change fixes `CollapsedLayoutMap` for cases where the collapsed dims are size 1. The cases where inner most dims are size 1 and noncontiguous can be represented by the strided form and therefore can be allowed. For such cases, the new stride should be of the next entry in an association whose dimension is not size 1. If the next entry is dynamic, it's not possible to decide which stride to use at compilation time and the stride is set to dynamic. Differential Revision: https://reviews.llvm.org/D124137	2022-04-22 17:48:24 -04:00
River Riddle	0fd3a1ce60	[mlir][NFC] Update remaining textual references of un-namespaced `func` operations The special case parsing of operations in the `func` dialect is being removed, and operations will require the dialect namespace prefix.	2022-04-20 22:17:31 -07:00
River Riddle	0254b0bcf0	[mlir][NFC] Update textual references of `func` to `func.func` in LLVM/Math/MemRef/NVGPU/OpenACC/OpenMP/Quant/SCF/Shape tests The special case parsing of `func` operations is being removed.	2022-04-20 22:17:28 -07:00
Chia-hung Duan	5232c5c5d4	[mlir] Fix verification order of nested ops. In order to increase parallelism, certain ops that have regions and the IsIsolatedFromAbove trait will have their verification delayed. That means the region verifier may access invalid ops, which may lead to a crash. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D122771	2022-04-15 04:41:10 +00:00
Alexander Belyaev	747b10be95	Revert "Revert "[mlir] Rewrite canonicalization of collapse(expand) and expand(collapse)."" This reverts commit 96e9b6c9dc60946f08399def879a19395bc98107.	2022-04-06 12:18:30 +02:00
Nicolas Vasilache	fc8f465a00	[mlir][MemRef] Allow transposed layouts in ExpandShapeOp. https://reviews.llvm.org/D122641 introduced fixes to the ExpandShapeOp verifier but also introduced an artificial layout limitation that prevents the consideration of transposed layouts. This revision fixes the omissions and reimplements the logic using saturated arithmetic which is more idiomatic and avoids leaking internal implementation details. Test cases are added for transposed layouts. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D122845	2022-04-06 04:19:30 -04:00
Hanhan Wang	96e9b6c9dc	Revert "[mlir] Rewrite canonicalization of collapse(expand) and expand(collapse)." This reverts commit 64f659bee67b5a024defeb3cd2ecf65e1ad8c0a7. An invalid tensor.expand_shape op is generated with the commit. To repro: $ mlir-opt -canonicalize a.mlir ``` func @foo(%0: tensor<1x1xf32>, %1: tensor<1x1xf32>, %2: tensor<1x1xf32>) -> tensor<1x1xf32> { %cst = arith.constant 0.000000e+00 : f32 %3 = linalg.init_tensor [8, 1] : tensor<8x1xf32> %4 = linalg.fill ins(%cst : f32) outs(%3 : tensor<8x1xf32>) -> tensor<8x1xf32> %5 = tensor.collapse_shape %0 [] : tensor<1x1xf32> into tensor<f32> %6 = tensor.insert_slice %5 into %4[0, 0] [1, 1] [1, 1] : tensor<f32> into tensor<8x1xf32> %7 = linalg.init_tensor [8, 1] : tensor<8x1xf32> %8 = linalg.fill ins(%cst : f32) outs(%7 : tensor<8x1xf32>) -> tensor<8x1xf32> %9 = tensor.collapse_shape %2 [] : tensor<1x1xf32> into tensor<f32> %10 = tensor.insert_slice %9 into %8[0, 0] [1, 1] [1, 1] : tensor<f32> into tensor<8x1xf32> %11 = tensor.collapse_shape %6 [[0, 1]] : tensor<8x1xf32> into tensor<8xf32> %12 = linalg.init_tensor [8] : tensor<8xf32> %13 = linalg.generic {indexing_maps = [affine_map<(d0) -> (d0)>, affine_map<(d0) -> (d0)>], iterator_types = ["parallel"]} ins(%11 : tensor<8xf32>) outs(%12 : tensor<8xf32>) { ^bb0(%arg3: f32, %arg4: f32): linalg.yield %arg3 : f32 } -> tensor<8xf32> %14 = tensor.expand_shape %13 [[0, 1, 2, 3]] : tensor<8xf32> into tensor<1x1x8x1xf32> %15 = tensor.collapse_shape %1 [] : tensor<1x1xf32> into tensor<f32> %16 = linalg.init_tensor [] : tensor<f32> %17 = linalg.generic {indexing_maps = [affine_map<() -> ()>, affine_map<() -> ()>], iterator_types = []} ins(%15 : tensor<f32>) outs(%16 : tensor<f32>) { ^bb0(%arg3: f32, %arg4: f32): linalg.yield %arg3 : f32 } -> tensor<f32> %18 = tensor.expand_shape %17 [] : tensor<f32> into tensor<1x1x1x1xf32> %19 = tensor.collapse_shape %10 [[0, 1]] : tensor<8x1xf32> into tensor<8xf32> %20 = linalg.init_tensor [8] : tensor<8xf32> %21 = linalg.generic 
{indexing_maps = [affine_map<(d0) -> (d0)>, affine_map<(d0) -> (d0)>], iterator_types = ["parallel"]} ins(%19 : tensor<8xf32>) outs(%20 : tensor<8xf32>) { ^bb0(%arg3: f32, %arg4: f32): linalg.yield %arg3 : f32 } -> tensor<8xf32> %22 = tensor.expand_shape %21 [[0, 1, 2, 3]] : tensor<8xf32> into tensor<1x1x8x1xf32> %23 = linalg.mmt4d {comment = "f32f32->f32, aarch64, matrixvector"} ins(%14, %18 : tensor<1x1x8x1xf32>, tensor<1x1x1x1xf32>) outs(%22 : tensor<1x1x8x1xf32>) -> tensor<1x1x8x1xf32> %24 = tensor.collapse_shape %23 [[0, 1, 2, 3]] : tensor<1x1x8x1xf32> into tensor<8xf32> %25 = linalg.init_tensor [8] : tensor<8xf32> %26 = linalg.generic {indexing_maps = [affine_map<(d0) -> (d0)>, affine_map<(d0) -> (d0)>], iterator_types = ["parallel"]} ins(%24 : tensor<8xf32>) outs(%25 : tensor<8xf32>) { ^bb0(%arg3: f32, %arg4: f32): linalg.yield %arg3 : f32 } -> tensor<8xf32> %27 = tensor.expand_shape %26 [[0, 1]] : tensor<8xf32> into tensor<8x1xf32> %28 = tensor.extract_slice %27[0, 0] [1, 1] [1, 1] : tensor<8x1xf32> to tensor<f32> %29 = tensor.expand_shape %28 [] : tensor<f32> into tensor<1x1xf32> return %29 : tensor<1x1xf32> } ``` Differential Revision: https://reviews.llvm.org/D123161	2022-04-05 15:05:41 -07:00
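
Several of the commit messages above state closed-form rules for strided metadata: the corrected subview offset (df455beedf), the strides of a normalized allocation (9d259916e1), and the sizes and strides after collapse_shape (d831568171). The following is a minimal Python sketch of those rules on plain integer lists, not the MLIR implementation. The function names are hypothetical, all sizes are assumed static, and the rule that a subview's result stride is the base stride times the sub-stride is taken from the general `memref.subview` semantics rather than from the messages themselves.

```python
from math import prod


def identity_strides(sizes):
    """Strides of a normalized allocation: strides#i = prod(sizes#j, j in i+1..rank-1)."""
    return [prod(sizes[i + 1:]) for i in range(len(sizes))]


def subview_metadata(base_offset, base_strides, sub_offsets, sub_sizes, sub_strides):
    """Offset, sizes, and strides of subview(src) relative to the base buffer.

    Uses the corrected formula: offset = baseOffset + sum(subOffset#i * baseStrides#i).
    The stride rule (baseStrides#i * subStrides#i) is an assumption from subview semantics.
    """
    offset = base_offset + sum(o * s for o, s in zip(sub_offsets, base_strides))
    strides = [b * s for b, s in zip(base_strides, sub_strides)]
    return offset, list(sub_sizes), strides


def collapse_metadata(base_sizes, base_strides, groups):
    """Sizes and strides after collapse_shape; each group is a non-empty list of source dims."""
    sizes = [prod(base_sizes[i] for i in group) for group in groups]
    # The stride of a collapsed dimension is the stride of the last dim in its group.
    strides = [base_strides[group[-1]] for group in groups]
    return sizes, strides


if __name__ == "__main__":
    strides = identity_strides([2, 3, 4])
    print(strides)
    print(subview_metadata(0, strides, [1, 0, 2], [1, 3, 2], [1, 1, 2]))
    print(collapse_metadata([2, 3, 4], strides, [[0, 1], [2]]))
```

The sketch ignores dynamic sizes and the size-1 special case discussed in 64f99842a6; handling those would require the per-dimension checks described in that message.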


113 Commits