DenseSet, SmallPtrSet, SmallSet, SetVector, and StringSet recently
gained C++23-style insert_range. This patch replaces:
Dest.insert(Src.begin(), Src.end());
with:
Dest.insert_range(Src);
This patch does not touch cases using custom begin functions like succ_begin for now.
The greedy rewriter is used in many different flows and it has a lot of
convenience (work list management, debugging actions, tracing, etc.). But
it combines two kinds of greedy behavior: 1) how ops are matched, 2)
folding wherever it can.
These are independent forms of greediness, and combining them leads to
inefficiency. E.g., there are cases where one needs to create different
phases in lowering and is required to apply patterns in a specific order,
split across different passes. Using the driver, one ends up needlessly
retrying folding/having multiple rounds of folding attempts, where one
final run would have sufficed.
Of course folks can avoid this behavior locally by just building their
own driver, but this is also a commonly requested feature that folks keep
working around locally in suboptimal ways.
For downstream users, there should be no behavioral change. Updating
from the deprecated API should just be a find-and-replace (e.g., of the `find ./
-type f -exec sed -i
's|applyPatternsAndFoldGreedily|applyPatternsGreedily|g' {} \;` variety),
as the API arguments haven't changed between the two.
This is a reasonable canonicalization because `extract` is more
constrained than `extract_strided_slice`, so there is no loss of
semantics here, just lifting an op to a more special-cased/constrained
op. And the additional `shape_cast` merely adds back the leading unit dims
to match the original result type.
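A minimal sketch of the kind of rewrite described, with hypothetical shapes and offsets (op syntax may differ between MLIR versions):
```mlir
%s = vector.extract_strided_slice %v
       {offsets = [1, 0], sizes = [1, 4], strides = [1, 1]}
       : vector<2x4xf32> to vector<1x4xf32>
```
could be lifted to:
```mlir
%e = vector.extract %v[1] : vector<4xf32> from vector<2x4xf32>
%s = vector.shape_cast %e : vector<4xf32> to vector<1x4xf32>
```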
Context: discussion on #111541. I wasn't sure how this would turn out,
but in the process of writing this PR, I discovered at least 2 bugs in
the pattern introduced in #111541, which shows the value of shared
canonicalization patterns which are exercised on a high number of
testcases.
---------
Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>
Group all patterns that re-order vector.transpose and vector.broadcast
Ops (*) under `populateSinkVectorOpsPatterns`. These patterns are
normally used to "sink" redundant Vector Ops, hence grouping together.
Example:
```mlir
%at = vector.transpose %a, [1, 0]: vector<4x2xf32> to vector<2x4xf32>
%bt = vector.transpose %b, [1, 0]: vector<4x2xf32> to vector<2x4xf32>
%r = arith.addf %at, %bt : vector<2x4xf32>
```
would get converted to:
```mlir
%0 = arith.addf %a, %b : vector<4x2xf32>
%r = vector.transpose %0, [1, 0] : vector<4x2xf32> to vector<2x4xf32>
```
This patch also moves all tests for these patterns so that all of them
are:
* run under one test-flag: `test-vector-sink-patterns`,
* located in one file: "vector-sink.mlir".
To facilitate this change:
* `-test-sink-vector-broadcast` is renamed as
`test-vector-sink-patterns`,
* "sink-vector-broadcast.mlir" is renamed as "vector-sink.mlir",
* tests for `ReorderCastOpsOnBroadcast` and
`ReorderElementwiseOpsOnTranspose` patterns are moved from
"vector-reduce-to-contract.mlir" to "vector-sink.mlir",
* `ReorderElementwiseOpsOnTranspose` patterns are removed from
`populateVectorReductionToContractPatterns` and added to (newly
created) `populateSinkVectorOpsPatterns`,
* `ReorderCastOpsOnBroadcast` patterns are removed from
`populateVectorReductionToContractPatterns` - these are already
present in `populateSinkVectorOpsPatterns`.
This should give us better layering and more straightforward testing.
For the latter, the goal is to be able to easily identify which pattern
a particular test is exercising (especially when it's a specific
pattern).
NOTES FOR DOWNSTREAM USERS
In order to preserve the current functionality, please make sure to add
* `populateSinkVectorOpsPatterns`,
wherever you are using `populateVectorReductionToContractPatterns`.
Also, rename `populateSinkVectorBroadcastPatterns` as
`populateSinkVectorOpsPatterns`.
(*) I didn't notice any other re-order patterns.
This adds a new transform `eliminateVectorMasks()` which aims at
removing scalable `vector.create_mask` ops that will be all-true at
runtime. It attempts to do this by simply pattern-matching the mask
operands (similar to some canonicalizations); if that does not lead to
an answer (is it all-true? yes/no), then value bounds analysis will be used
to find the lower bound of the unknown operands. If the lower bound is
>= the corresponding mask vector type dim, then that dimension of the
mask is all-true.
Note that the pattern matching prevents expensive value-bounds analysis
in cases where the mask won't be all-true.
For example:
```mlir
%mask = vector.create_mask %dynamicValue, %c2 : vector<8x4xi1>
```
From looking at `%c2` we can tell this is not going to be an all-true
mask, so we don't need to run the value-bounds analysis for
`%dynamicValue` (and can exit the transform early).
Note: Eliminating create_masks here means replacing them with all-true
constants (which will then lead to the masks folding away).
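A minimal fixed-size sketch of that replacement (the transform itself targets the scalable case; `%n` is a hypothetical dynamic operand):
```mlir
// If the analysis proves %n >= 8, this mask is known to be all-true ...
%mask = vector.create_mask %n : vector<8xi1>
// ... and can be replaced with an all-true constant (masked ops using it then fold away).
%all_true = arith.constant dense<true> : vector<8xi1>
```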
This PR adds support for converting `vector.extract_strided_slice` and
`vector.extract` operations to equivalent `vector.shuffle` operations
that operate on linearized (1-D) vectors. `vector.shuffle` operations
operating on n-D (n > 1) vectors are also converted to equivalent shuffle
operations working on linearized vectors.
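For illustration only (hypothetical shapes): extracting row 1 of a 2x4 vector can be expressed on the linearized 8-element form as a shuffle selecting elements 4..7.
```mlir
%flat = vector.shape_cast %v : vector<2x4xf32> to vector<8xf32>
%row  = vector.shuffle %flat, %flat [4, 5, 6, 7] : vector<8xf32>, vector<8xf32>
```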
Adds support for scalable vectors to patterns defined in
VectorLinearize.cpp.
Linearization is disabled in 2 notable cases:
* vectors with more than 1 scalable dimension (we cannot represent
vscale^2),
* vectors initialized with an arith.constant that's not a vector splat
(such arith.constant ops cannot be flattened); see the sketch below.
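A small sketch of the supported splat case, with hypothetical shapes:
```mlir
// A splat constant with a single scalable dim can be flattened by
// multiplying out the shape ...
%cst = arith.constant dense<1.0> : vector<2x[4]xf32>
// ... becoming:
%cst_lin = arith.constant dense<1.0> : vector<[8]xf32>
```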
Currently n-D transfer write distribution can be inconsistent with
distribution of reductions if a value has multiple users, one of which
is a transfer_write with a non-standard distribution map, and the other
of which is a vector.reduction.
We may want to consider removing the distribution map functionality in
the future for this reason.
This PR adds an optional bitwidth parameter to the vector xfer op
flattening transformation so that the flattening doesn't happen if the
trailing dimension of the read/written vector is larger than this
bitwidth (i.e., we are already able to fill at least one vector register
with that size).
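A sketch of the intent, assuming a hypothetical 128-bit threshold (names and shapes are illustrative):
```mlir
// Left unflattened: the trailing dim already spans 8 x i32 = 256 bits.
%a = vector.transfer_read %m0[%c0, %c0], %pad {in_bounds = [true, true]}
       : memref<4x8xi32>, vector<4x8xi32>
// Still flattened: the trailing dim only spans 2 x i32 = 64 bits.
%b = vector.transfer_read %m1[%c0, %c0], %pad {in_bounds = [true, true]}
       : memref<4x2xi32>, vector<4x2xi32>
```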
Common backends (LLVM, SPIR-V) only support 1D vectors; the LLVM conversion
handles ND vectors (N >= 2) as `array<array<... vector>>` and the SPIR-V
conversion doesn't handle them at all at the moment. Sometimes it's
preferable to treat multidim vectors as linearized 1D. Add a pass to do
this. Only constants and simple elementwise ops are supported for now.
@krzysz00 I've extracted your result type conversion code from
LegalizeToF32 and moved it to a common place.
Also, add a ConversionPattern class operating on traits.
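A rough sketch of the effect on a simple elementwise op (the `shape_cast` ops stand in for whatever casts the type converter materializes at the boundary):
```mlir
// Before: 2-D addition.
// %r = arith.addf %a, %b : vector<2x4xf32>
// After linearization:
%a1 = vector.shape_cast %a : vector<2x4xf32> to vector<8xf32>
%b1 = vector.shape_cast %b : vector<2x4xf32> to vector<8xf32>
%r1 = arith.addf %a1, %b1 : vector<8xf32>
%r  = vector.shape_cast %r1 : vector<8xf32> to vector<2x4xf32>
```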
The number of vector elements considered 'small' enough to extract is
parameterized.
This is to avoid going into specialized reduction lowering when a
single arith op (or a couple of them) will do. Targets without dedicated
reduction intrinsics can use that as an emulation path too.
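For instance, a minimal sketch of a 2-element `add` reduction lowered this way (op syntax per current MLIR):
```mlir
%e0 = vector.extract %v[0] : f32 from vector<2xf32>
%e1 = vector.extract %v[1] : f32 from vector<2xf32>
%r  = arith.addf %e0, %e1 : f32
```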
Depends on https://github.com/llvm/llvm-project/pull/75846.
For vectors with either leading or trailing unit dim, replaces:
elementwise(a, b)
with:
sc_a = shape_cast(a)
sc_b = shape_cast(b)
res = elementwise(sc_a, sc_b)
return shape_cast(res)
The newly inserted shape_cast ops drop the unit dim (before the
elementwise op) and then restore it (after the elementwise op). Vectors
`a` and `b` are required to have rank > 1.
Example:
```mlir
%mul = arith.mulf %B_row, %A_row : vector<1x[4]xf32>
%cast = vector.shape_cast %mul : vector<1x[4]xf32> to vector<[4]xf32>
```
gets converted to:
```mlir
%B_row_sc = vector.shape_cast %B_row : vector<1x[4]xf32> to vector<[4]xf32>
%A_row_sc = vector.shape_cast %A_row : vector<1x[4]xf32> to vector<[4]xf32>
%mul = arith.mulf %B_row_sc, %A_row_sc : vector<[4]xf32>
%mul_sc = vector.shape_cast %mul : vector<[4]xf32> to vector<1x[4]xf32>
%cast = vector.shape_cast %mul_sc : vector<1x[4]xf32> to vector<[4]xf32>
```
In practice, the bottom 2 shape_cast(s) will be folded away.
Add a configuration option to allow vector distribution with multiple
elements written by a single lane.
This is so that we can perform vector multi-reduction with multiple
results per workgroup.
Updates patterns for flattening `vector.transfer_read` by relaxing the
requirement that the "collapsed" indices are all zero. This enables
collapsing cases like this one:
```mlir
%2 = vector.transfer_read %arg4[%c0, %arg0, %arg1, %c0] ... :
memref<1x43x4x6xi32>, vector<1x2x6xi32>
```
Previously, only the following case would be considered for collapsing
(all indices are 0):
```mlir
%2 = vector.transfer_read %arg4[%c0, %c0, %c0, %c0] ... :
memref<1x43x4x6xi32>, vector<1x2x6xi32>
```
Also adds some new comments and renames the `firstContiguousInnerDim`
parameter as `firstDimToCollapse` (the latter better matches the actual
meaning).
Similar updates for `vector.transfer_write` will be implemented in a
follow-up patch.
Chained reductions get created during vector unrolling. These patterns
simplify them into a series of adds followed by a final reduction.
This is preferred on GPU targets like SPIR-V/Vulkan where vector
reduction gets lowered into subgroup operations that are generally more
expensive than simple vector additions.
For now, only the `add` combining kind is handled.
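A sketch with hypothetical values:
```mlir
%r0 = vector.reduction <add>, %v0, %acc : vector<8xf32> into f32
%r1 = vector.reduction <add>, %v1, %r0 : vector<8xf32> into f32
```
is simplified to:
```mlir
%sum = arith.addf %v0, %v1 : vector<8xf32>
%r1  = vector.reduction <add>, %sum, %acc : vector<8xf32> into f32
```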
Currently, when there is a mix of transfer read ops and transfer write
ops that need to be distributed, it is hard to guarantee that the write
gets distributed after the read when the two aren't directly connected
by SSA, because the pattern for write distribution is rooted on the
transfer write. This is likely still relatively unsafe when there are
undistributable ops, but structurally these patterns are a bit difficult
to work with. For now, pattern benefits give fairly good guarantees for
the happy paths.
This patch updates one specific hook in "VectorDropLeadUnitDim.cpp" to
make sure that "scalable dims" are handled correctly. While this change
affects multiple patterns, I am only adding one regression test that
captures one specific case that affects me right now.
I am also adding the Vector dialect to the list of dependencies of
`-test-vector-to-vector-lowering`. Otherwise my test case won't work as
a standalone test.
Differential Revision: https://reviews.llvm.org/D157993
The new pattern will replace elementwise(broadcast) with
broadcast(elementwise) when safe.
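A minimal sketch of the rewrite, with hypothetical values:
```mlir
%a_bc = vector.broadcast %a : f32 to vector<4xf32>
%b_bc = vector.broadcast %b : f32 to vector<4xf32>
%r    = arith.addf %a_bc, %b_bc : vector<4xf32>
```
becomes:
```mlir
%s = arith.addf %a, %b : f32
%r = vector.broadcast %s : f32 to vector<4xf32>
```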
This change affects tests for vectorising nD-extract. In one case
("vectorize_nd_tensor_extract_with_tensor_extract") I just trimmed the
test and only preserved the key parts (scalar and contiguous load from
the original Op). We could do the same with some other tests if that
helps maintainability.
Differential Revision: https://reviews.llvm.org/D152812
Consider mixed precision data type, i.e., F16 input lhs, F16 input rhs, F32 accumulation, and F32 output. This is typically written as F32 <= F16*F16 + F32.
During vectorization from linalg to vector for mixed precision data type (F32 <= F16*F16 + F32), linalg.matmul introduces arith.extf on input lhs and rhs operands.
"linalg.matmul"(%lhs, %rhs, %acc) ({
^bb0(%arg1: f16, %arg2: f16, %arg3: f32):
%lhs_f32 = "arith.extf"(%arg1) : (f16) -> f32
%rhs_f32 = "arith.extf"(%arg2) : (f16) -> f32
%mul = "arith.mulf"(%lhs_f32, %rhs_f32) : (f32, f32) -> f32
%acc = "arith.addf"(%arg3, %mul) : (f32, f32) -> f32
"linalg.yield"(%acc) : (f32) -> ()
})
There are backends that natively support mixed-precision data types and do not need the arith.extf. For example, the NVIDIA A100 GPU has mma.sync.aligned.*.f32.f16.f16.f32, which can support mixed-precision data types. However, the presence of arith.extf in the IR introduces unnecessary casting, targeting F32 Tensor Cores instead of F16 Tensor Cores on the NVIDIA backend. This patch adds a folding pattern to fold arith.extf into vector.contract.
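A hedged sketch of the folded form (shapes and indexing maps are illustrative): the `arith.extf` ops are absorbed and the contract takes the f16 operands directly with an f32 accumulator.
```mlir
%res = vector.contract {
         indexing_maps = [affine_map<(m, n, k) -> (m, k)>,
                          affine_map<(m, n, k) -> (k, n)>,
                          affine_map<(m, n, k) -> (m, n)>],
         iterator_types = ["parallel", "parallel", "reduction"],
         kind = #vector.kind<add>
       } %lhs, %rhs, %acc
       : vector<4x8xf16>, vector<8x4xf16> into vector<4x4xf32>
```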
Differential Revision: https://reviews.llvm.org/D151918
This patch extends the vector.extract(vector.transfer_read) -> scalar
load patterns to support vector.transfer_read with multiple uses. For
now, we check that all the uses are vector.extract operations.
Supporting multiple uses is predicated under a flag.
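A rough sketch of the multi-use case this enables (hypothetical shapes; op syntax per current MLIR):
```mlir
%v  = vector.transfer_read %mem[%i], %pad {in_bounds = [true]}
        : memref<32xf32>, vector<4xf32>
%e0 = vector.extract %v[0] : f32 from vector<4xf32>
%e1 = vector.extract %v[2] : f32 from vector<4xf32>
```
can, with the flag enabled, become roughly:
```mlir
%e0 = memref.load %mem[%i] : memref<32xf32>
%i2 = affine.apply affine_map<(d0) -> (d0 + 2)>(%i)
%e1 = memref.load %mem[%i2] : memref<32xf32>
```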
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D150812
These patterns touches the structure generated from tiling so it
affects later steps like bufferization and vector hoisting.
Instead of putting them in canonicalization, this commit creates
separate entry points for them to be called explicitly.
This is NFC regarding the functionality and tests of those patterns.
It also addresses two TODO items in the codebase.
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D150702
The MLIR classes Type/Attribute/Operation/Op/Value support
cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast
machinery in addition to defining methods with the same names.
This change begins the migration of uses of the methods to the
corresponding free function calls, which has been decided to be more consistent.
Note that there still exist classes that only define methods directly,
such as AffineExpr, and this does not include work currently to support
a functional cast/isa call.
Caveats include:
- This clang-tidy script probably has more problems.
- This only touches C++ code, so nothing that is being generated.
Context:
- https://mlir.llvm.org/deprecation/ at "Use the free function variants
for dyn_cast/cast/isa/…"
- Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443
Implementation:
This first patch was created with the following steps. The intention is
to only do automated changes at first, so I waste less time if it's
reverted, and so the first mass change is more clear as an example to
other teams that will need to follow similar steps.
Steps are described per line, as comments are removed by git:
0. Retrieve the change from the following to build clang-tidy with an
additional check:
https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check
1. Build clang-tidy
2. Run clang-tidy over your entire codebase while disabling all checks
and enabling the one relevant one. Run on all header files also.
3. Delete .inc files that were also modified, so the next build rebuilds
them to a pure state.
4. Some changes have been deleted for the following reasons:
- Some files had a variable also named cast
- Some files had not included a header file that defines the cast
functions
- Some files are definitions of the classes that have the casting
methods, so the code still refers to the method instead of the
function without adding a prefix or removing the method declaration
at the same time.
```
ninja -C $BUILD_DIR clang-tidy
run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-*,misc-cast-functions'\
-header-filter=mlir/ mlir/* -fix
rm -rf $BUILD_DIR/tools/mlir/**/*.inc
git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\
mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\
mlir/lib/**/IR/\
mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\
mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\
mlir/test/lib/Dialect/Test/TestTypes.cpp\
mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\
mlir/test/lib/Dialect/Test/TestAttributes.cpp\
mlir/unittests/TableGen/EnumsGenTest.cpp\
mlir/test/python/lib/PythonTestCAPI.cpp\
mlir/include/mlir/IR/
```
Differential Revision: https://reviews.llvm.org/D150123
The pattern added here is intended as a last resort for targets like
SPIR-V where there are vector size restrictions and we need to be able
to break down large vector types. Vectorizing loads/stores for small
bitwidths (e.g. i8) relies on bitcasting to a larger element type and
patterns to bubble bitcast ops to where they can cancel.
This fails for cases such as
```
%1 = arith.trunci %0 : vector<2x32xi32> to vector<2x32xi8>
vector.transfer_write %1, %destination[%c0, %c0] {in_bounds = [true, true]} : vector<2x32xi8>, memref<2x32xi8>
```
where the `arith.trunci` op essentially does the job of one of the
bitcasts, leading to a bitcast that needs to be further broken down
```
vector.bitcast %0 : vector<16xi8> to vector<4xi32>
```
Differential Revision: https://reviews.llvm.org/D149065
This pattern is useful for SPIR-V to unroll to a supported vector size
before later lowerings. The unrolling pattern is closer to an
elementwise op than the transfer ops because the index values from which
to extract elements are captured by the index vector and thus there is
no need to update the base offsets when unrolling gather.
Differential Revision: https://reviews.llvm.org/D149066
This revision adds vector transform operations that allow us to better inspect the composition
of various lowerings that were previously very opaque.
This commit is NFC in that it does not change patterns beyond adding `rewriter.notifyMatchFailure` messages
and it does not change the tests beyond breaking them into pieces and using transforms instead of
throwaway opaque test passes.
Reviewed By: ftynse, springerm
Co-authored-by: Alex Zinenko <zinenko@google.com>
Differential Revision: https://reviews.llvm.org/D146755
Vector dialect patterns have grown enormously in the past year to a point where they are now impenetrable.
Start reorganizing them towards finer-grained control.
Differential Revision: https://reviews.llvm.org/D146736
This is for targets that do not support gather-like ops, e.g., SPIR-V.
Gather is expanded into lower-level vector ops with memory accesses
guarded with `scf.if`.
I also considered generating `vector.maskedload`s, but decided against
it to keep the `memref` and `tensor` codepath closer together. There's a
good chance that if a target doesn't support gather it does not support
masked loads either.
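A rough per-lane sketch (lane 0 only; names are hypothetical and the gather's base offset is omitted for brevity):
```mlir
%m0 = vector.extract %mask[0] : i1 from vector<2xi1>
%v0 = scf.if %m0 -> (f32) {
  %idx = vector.extract %indices[0] : index from vector<2xindex>
  %ld  = memref.load %base[%idx] : memref<16xf32>
  scf.yield %ld : f32
} else {
  %pt = vector.extract %passthru[0] : f32 from vector<2xf32>
  scf.yield %pt : f32
}
```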
Issue: https://github.com/llvm/llvm-project/issues/60905
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D145942
This pattern is not specific to nvgpu; I intend to use it in SPIR-V codegen. `VectorTransforms` seems like a more generally useful place.
In addition:
- Fix a bug in the second condition (the dimensions were swapped for RHS).
- Add tests.
- Add support for externally provided filter functions, similar to other vector transforms.
- Prefer to transpose before zero/sign-extending inputs.
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D145638
This patch mechanically replaces None with std::nullopt where the
compiler would warn if None were deprecated. The intent is to reduce
the amount of manual work required in migrating from Optional to
std::optional.
This is part of an effort to migrate from llvm::Optional to
std::optional:
https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716
This helper handles non-trivial cases of broadcast + optional transpose creation
that should not leak to the outside world.
Differential Revision: https://reviews.llvm.org/D139003
This revision refactors and cleans up a bunch of infra related to vector, shapes and indexing into more reusable APIs.
Differential Revision: https://reviews.llvm.org/D138501
Ops such as `%1 = vector.extractelement %0[%pos : index] : vector<96xf32>`.
In case of an extract from a 1D vector, the source vector is distributed. The lane into which the requested position falls extracts the element and shuffles it to all other lanes.
Differential Revision: https://reviews.llvm.org/D137336
This is useful for breaking down extract_strided_slice and potentially
cancel with other extract / insert ops before or after.
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D137471
Quantization methods are crucial and ubiquitous in accelerating machine
learning workloads. Most of these methods use f16 and i8 types.
This patch relaxes the type constraints on warp reduce distribution to
allow these types. Furthermore, this patch also changes the interface
and moves the initial reduction of data to a single thread into the
distributedReductionFn. This gives developers the flexibility to control
how they obtain the initial lane value, which might differ based
on the input types (e.g., to shuffle a 32-bit-wide value, we need to
reduce f16 to 2xf16 rather than to a single element).
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D137691
When a value used in the forOp is defined outside the region but within
the parent warpOp, we need to return and distribute the value to pass it
to new operations created within the loop.
Also simplify the lambda interface.
Differential Revision: https://reviews.llvm.org/D137146