llvm-project

Author	SHA1	Message	Date
Vivek	817d343fb0	[MLIR] Fix tilePerfectlyNested utility for handling non-unit step size The current implementation of tilePerfectlyNested utility doesn't handle the non-unit step size. We have added support to perform tiling correctly even if the step size of the loop to be tiled is non-unit. Fixes https://bugs.llvm.org/show_bug.cgi?id=49188. Differential Revision: https://reviews.llvm.org/D97037	2021-02-23 00:50:04 +05:30
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit 8aa6c3765b924d86f623d452777eb76b83bf2787.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Nicolas Vasilache	d01ea0edaa	[mlir] Drop reliance of SliceAnalysis on specific ops. SliceAnalysis originally was developed in the context of affine.for within mlfunc. It predates the notion of region. This revision updates it to not hardcode specific ops like scf::ForOp. When rooted at an op, the behavior of the slice computation changes as it recurses into the regions of the op. This does not support gathering all values transitively depending on a loop induction variable anymore. Additional variants rooted at a Value are added to also support the existing behavior. Differential revision: https://reviews.llvm.org/D96702	2021-02-16 06:34:32 +00:00
Uday Bondhugula	5400f602cd	[MLIR] Update affine.for unroll utility for iter_args support Update affine.for loop unroll utility for iteration arguments support. Fix promoteIfSingleIteration as well. Fixes PR49084: https://bugs.llvm.org/show_bug.cgi?id=49084 Differential Revision: https://reviews.llvm.org/D96383	2021-02-10 10:38:47 +05:30
River Riddle	e21adfa32d	[mlir] Mark LogicalResult as LLVM_NODISCARD This makes ignoring a result explicit by the user, and helps to prevent accidental errors with dropped results. Marking LogicalResult as no discard was always the intention from the beginning, but got lost along the way. Differential Revision: https://reviews.llvm.org/D95841	2021-02-04 15:10:10 -08:00
Alexander Belyaev	8d7cbcf582	[mlir] Preserve lexicographic order after loop collapsing. Currently, for a scf.parallel (i,j,k) after the loop collapsing to 1D is done, the IVs would be traversed as for an scf.parallel(k,j,i). Differential Revision: https://reviews.llvm.org/D95693	2021-01-29 21:32:36 +01:00
Kazuaki Ishizaki	f88fab5006	[mlir] NFC: fix trivial typos fix typo under include and lib directories Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D94220	2021-01-08 02:10:12 +09:00
Christian Sigg	0bf4a82a5a	[mlir] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove the corresponding methods from OpState. Reviewed By: silvas, rriddle Differential Revision: https://reviews.llvm.org/D92878	2020-12-09 12:11:32 +01:00
Christian Sigg	c4a0405902	Add `Operation* OpState::operator->()` to provide more convenient access to members of Operation. Given that OpState already implicit converts to Operator*, this seems reasonable. The alternative would be to add more functions to OpState which forward to Operation. Reviewed By: rriddle, ftynse Differential Revision: https://reviews.llvm.org/D92266	2020-12-02 15:46:20 +01:00
River Riddle	65fcddff24	[mlir][BuiltinDialect] Resolve comments from D91571 * Move ops to a BuiltinOps.h * Add file comments	2020-11-19 11:12:49 -08:00
River Riddle	73ca690df8	[mlir][NFC] Remove references to Module.h and Function.h These includes have been deprecated in favor of BuiltinDialect.h, which contains the definitions of ModuleOp and FuncOp. Differential Revision: https://reviews.llvm.org/D91572	2020-11-17 00:55:47 -08:00
Kazuaki Ishizaki	41b09f4eff	[mlir] NFC: fix trivial typos fix typos in comments and documents Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D90089	2020-10-29 04:05:22 +09:00
River Riddle	b6eb26fd0e	[mlir][NFC] Move around the code related to PatternRewriting to improve layering There are several pieces of pattern rewriting infra in IR/ that really shouldn't be there. This revision moves those pieces to a better location such that they are easier to evolve in the future(e.g. with PDL). More concretely this revision does the following: * Create a Transforms/GreedyPatternRewriteDriver.h and move the applyandFold methods there. The definitions for these methods are already in Transforms/ so it doesn't make sense for the declarations to be in IR. Create a new lib/Rewrite library and move PatternApplicator there. This new library will be focused on applying rewrites, and will also include compiling rewrites with PDL. Differential Revision: https://reviews.llvm.org/D89103	2020-10-26 18:01:06 -07:00
Tatiana Shpeisman	9909ef292d	[mlir][scf] Fix a bug in scf::ForOp loop unroll with an epilogue Fixes a bug in formation and simplification of an epilogue loop generated during loop unroll of scf::ForOp (https://bugs.llvm.org/show_bug.cgi?id=46689) Differential Revision: https://reviews.llvm.org/D87583	2020-10-10 14:18:25 +05:30
Navdeep Kumar	0602e8f77f	[MLIR][Affine] Add parametric tile size support for affine.for tiling Add support to tile affine.for ops with parametric sizes (i.e., SSA values). Currently supports hyper-rectangular loop nests with constant lower bounds only. Move methods - moveLoopBody() - getTileableBands() - checkTilingLegality() - tilePerfectlyNested() - constructTiledIndexSetHyperRect(*) to allow reuse with constant tile size API. Add a test pass -test-affine -parametric-tile to test parametric tiling. Differential Revision: https://reviews.llvm.org/D87353	2020-09-17 23:39:14 +05:30
Lubomir Litchev	e2394245eb	Add an option for unrolling loops up to a factor. Currently, there is no option to allow for unrolling a loop up to a specific factor (specified by the user). The code for doing that is there and there are benefits when unrolling is done to smaller loops (smaller than the factor specified). Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D87111	2020-09-08 09:23:38 -07:00
Vincent Zhao	654e8aadfd	[MLIR] Consider AffineIfOp when getting the index set of an Op wrapped in nested loops This diff attempts to resolve the TODO in `getOpIndexSet` (formerly known as `getInstIndexSet`), which states "Add support to handle IfInsts surronding `op`". Major changes in this diff: 1. Overload `getIndexSet`. The overloaded version considers both `AffineForOp` and `AffineIfOp`. 2. The `getInstIndexSet` is updated accordingly: its name is changed to `getOpIndexSet` and its implementation is based on a new API `getIVs` instead of `getLoopIVs`. 3. Add `addAffineIfOpDomain` to `FlatAffineConstraints`, which extracts new constraints from the integer set of `AffineIfOp` and merges it to the current constraint system. 4. Update how a `Value` is determined as dim or symbol for `ValuePositionMap` in `buildDimAndSymbolPositionMaps`. Differential Revision: https://reviews.llvm.org/D84698	2020-08-09 03:16:03 +05:30
Uday Bondhugula	ec85d7c8f3	[MLIR][NFC] Fix clang tidy warnings in misc utilities Fix clang tidy warnings in misc utilities - missing const or a star in declaration. Differential Revision: https://reviews.llvm.org/D83861	2020-07-16 00:27:30 +05:30
River Riddle	9db53a1827	[mlir][NFC] Remove usernames and google bug numbers from TODO comments. These were largely leftover from when MLIR was a google project, and don't really follow LLVM guidelines.	2020-07-07 01:40:52 -07:00
Tobias Gysi	652a79659a	[mlir] fix off-by-one error in collapseParallelLoops Summary: The patch fixes an off by one error in the method collapseParallelLoops. It ensures the same normalized bound is used for the computation of the division and the remainder. Reviewers: herhut Reviewed By: herhut Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, Kayjukh, jurahul, msifontes Tags: #mlir Differential Revision: https://reviews.llvm.org/D82634	2020-06-26 15:39:46 +02:00
Alex Zinenko	3adced3494	[mlir] Introduce callback-based builders to SCF Parallel and Reduce ops Similarly to `scf::ForOp`, introduce additional `function_ref` arguments to `::build` functions of SCF `ParallelOp` and `ReduceOp`. The provided functions will be called to construct the body of the respective operations while constructing the operation itself. Exercise them in LoopUtils. Differential Revision: https://reviews.llvm.org/D81872	2020-06-16 20:51:32 +02:00
Nicolas Vasilache	38c407bf00	[mlir][SCF] Add single iteration scf.for promotion to the FuncOp level helper. Previously only the Affine version would be folded. Differential Revision: https://reviews.llvm.org/D81261	2020-06-05 11:28:21 -04:00
Sean Silva	98eead8186	[mlir][Value] Add v.getDefiningOp<OpTy>() Summary: This makes a common pattern of `dyn_cast_or_null<OpTy>(v.getDefiningOp())` more concise. Differential Revision: https://reviews.llvm.org/D79681	2020-05-11 12:55:27 -07:00
Alex Zinenko	c25b20c0f6	[mlir] NFC: Rename LoopOps dialect to SCF (Structured Control Flow) This dialect contains various structured control flow operaitons, not only loops, reflect this in the name. Drop the Ops suffix for consistency with other dialects. Note that this only moves the files and changes the C++ namespace from 'loop' to 'scf'. The visible IR prefix remains the same and will be updated separately. The conversions will also be updated separately. Differential Revision: https://reviews.llvm.org/D79578	2020-05-11 15:04:27 +02:00
Andy Davis	93d1108801	[MLIR][LoopOps] Adds the loop unroll transformation for loop::ForOp. Summary: Adds the loop unroll transformation for loop::ForOp. Adds support for promoting the body of single-iteration loop::ForOps into its containing block. Adds check tests for loop::ForOps with dynamic and static lower/upper bounds and step. Care was taken to share code (where possible) with the AffineForOp unroll transformation to ease maintenance and potential future transition to a LoopLike construct on which loop transformations for different loop types can implemented. Reviewers: ftynse, nicolasvasilache Reviewed By: ftynse Subscribers: bondhugula, mgorny, zzheng, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79184	2020-05-05 10:42:36 -07:00
Alexander Belyaev	ed5363a674	[MLIR] Add getBody() method to SingleImplicitBlockTerminator op trait. Many ops with this trait have `getBody()` and `getBodyBuilder()` methods defined in `extraClassDeclaration` in tablegen. `getBody()` implementation is the same accross all these ops, but `getBodyBuilder()` can return builders with varying insertion points set. In this PR, `getBody()` is moved into `SingleImplicitBlockTerminator` struct and `getBodyBuilder()` is replaced with `OpBuilder::atBlock(End\|Terminator)(op.getBody);`. Differential Revision: https://reviews.llvm.org/D78864	2020-04-27 21:48:52 +02:00
Alexander Belyaev	def3e10eac	[MLIR] Add #include "llvm/ADT/SmallPtrSet.h" back to LoopUtils.h.	2020-04-20 10:21:18 +02:00
Alexander Belyaev	ad9988f4da	[MLIR] Move `replaceAllUsesExcept` from LoopUtil.h to Value.h. Differential Revision: https://reviews.llvm.org/D78426	2020-04-20 09:21:06 +02:00
Uday Bondhugula	ecddafd84a	[MLIR] NFC affine for op tiling cleanup / utility rename Rename mlir::tileCodeGen -> mlir::tilePerfectlyNested to be consistent. NFC clean up tiling utility code, drop dead code, better comments. Expose isPerfectlyNested and reuse. Differential Revision: https://reviews.llvm.org/D78423	2020-04-19 00:53:34 +05:30
Uday Bondhugula	f043677f6d	[MLIR] Make isPerfectlyNested check more efficient Make mlir::isPerfectlyNested more efficient; use O(1) check instead of O(N) size() method. Differential Revision: https://reviews.llvm.org/D78428	2020-04-18 23:34:49 +05:30
Alexander Belyaev	be9c3bdc44	[MLIR] Fix fusion of linalg.indexed_generic producer into tiled (Indexed)GenericOp. Differential Revision: https://reviews.llvm.org/D78209	2020-04-16 10:45:17 +02:00
Jeremy Bruestle	9f3ab92ec8	[MLIR] Improve support for 0-dimensional Affine Maps. Summary: Modified AffineMap::get to remove support for the overload which allowed an ArrayRef of AffineExpr but no context (and gathered the context from a presumed first entry, resulting in bugs when there were 0 results). Instead, we support only a ArrayRef and a context, and a version which takes a single AffineExpr. Additionally, removed some now needless case logic which previously special cased which call to AffineMap::get to use. Reviewers: flaub, bondhugula, rriddle!, nicolasvasilache, ftynse, ulysseB, mravishankar, antiagainst, aartbik Subscribers: mehdi_amini, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, bader, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78226	2020-04-15 14:15:02 -07:00
Uday Bondhugula	04b5274ede	[MLIR] Introduce applyOpPatternsAndFold for op local rewrites Introduce mlir::applyOpPatternsAndFold which applies patterns as well as any folding only on a specified op (in contrast to applyPatternsAndFoldGreedily which applies patterns only on the regions of an op isolated from above). The caller is made aware of the op being folded away or erased. Depends on D77485. Differential Revision: https://reviews.llvm.org/D77487	2020-04-15 14:10:01 +05:30
River Riddle	204c3b5516	[llvm][STLExtras] Move various iterator/range utilities from MLIR to LLVM This revision moves the various range utilities present in MLIR to LLVM to enable greater reuse. This revision moves the following utilities: * indexed_accessor_* This is set of utility iterator/range base classes that allow for building a range class where the iterators are represented by an object+index pair. * make_second_range Given a range of pairs, returns a range iterating over the `second` elements. * hasSingleElement Returns if the given range has 1 element. size() == 1 checks end up being very common, but size() is not always O(1) (e.g., ilist). This method provides O(1) checks for those cases. Differential Revision: https://reviews.llvm.org/D78064	2020-04-14 15:14:40 -07:00
Uday Bondhugula	ac047d9fce	[MLIR] Remove dead affine.applys while generating pointwise copies This makes no impact on the test cases because affine-data-copy-generate runs whole function canonicalization at its end; however, the latter will be removed in a pending revision. It is thus useful to clean up these affine.applys right here, and eventually, not even generate these (when the right API to compose by construction is in place). Differential Revision: https://reviews.llvm.org/D78055	2020-04-14 09:47:14 +05:30
Uday Bondhugula	42ada5fee9	[MLIR] NFC cleanup/modernize memref-dataflow-opt / getNestingDepth Bring code to date with recent changes to the core infrastructure / coding style. Differential Revision: https://reviews.llvm.org/D77998	2020-04-14 00:03:06 +05:30
Uday Bondhugula	70da33bf30	[MLIR] fix/update affine data copy utility for max/min bounds Fix point-wise copy generation to work with bounds that have max/min. Change structure of copy loop nest to use absolute loop indices and subtracting base from the indexes of the fast buffers. Update supporting utilities: Fix FlatAffineConstraints::getLowerAndUpperBound to look at equalities as well and for a missing division. Update unionBoundingBox to not discard common constraints (leads to a tighter system). Update MemRefRegion::getConstantBoundingSizeAndShape to add memref dimension constraints. Run removeTrivialRedundancy at the end of MemRefRegion::compute. Run single iteration loop promotion and load/store canonicalization after affine data copy (in its test pass as well). Differential Revision: https://reviews.llvm.org/D77320	2020-04-07 13:55:42 +05:30
Kazuaki Ishizaki	5aacce3db2	[mlir] NFC: Fix trivial typo Differential Revision: https://reviews.llvm.org/D77473	2020-04-05 11:30:30 +09:00
Uday Bondhugula	5e8093134a	[MLIR] Add method to drop duplicate result exprs from AffineMap Add a method that given an affine map returns another with just its unique results. Use this to drop redundant bounds in max/min for affine.for. Update affine.for's canonicalization pattern and createCanonicalizedForOp to use this. Differential Revision: https://reviews.llvm.org/D77237	2020-04-02 03:00:19 +05:30
Mehdi Amini	0dd21130ef	Add LLVM_ATTRIBUTE_UNUSED to function used only in assert (NFC)	2020-04-01 17:21:07 +00:00
Uday Bondhugula	68316afb29	[MLIR][NFC] loop transforms/analyis utils cleanup / modernize Modernize/cleanup code in loop transforms utils - a lot of this code was written prior to the currently available IR support / code style. This patch also does some variable renames including inst -> op, comment updates, turns getCleanupLoopLowerBound into a local function. Differential Revision: https://reviews.llvm.org/D77175	2020-04-01 22:36:25 +05:30
Tres Popp	90b7bbffdd	[MLIR] Rename collapsePLoops -> collapseParallelLoops Summary: Additionally, NFC code cleanups were done. This is to address additional comments on https://reviews.llvm.org/D76363 Differential Revision: https://reviews.llvm.org/D77052	2020-04-01 10:15:13 +02:00
Uday Bondhugula	f273e5c507	[MLIR] Fix permuteLoops utility Rewrite mlir::permuteLoops (affine loop permutation utility) to fix incorrect approach. Avoiding using sinkLoops entirely - use single move approach. Add test pass. This fixes https://bugs.llvm.org/show_bug.cgi?id=45328 Depends on D77003. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77004	2020-03-30 23:38:23 +05:30
scentini	3b20970de8	Fix unused-variable error when assertions are disabled	2020-03-30 13:55:43 +02:00
Uday Bondhugula	4e4ea2cde4	[MLIR] Add missing asserts in interchangeLoops util, doc comment update Add missing assert checks for input to mlir::interchangeLoops utility. Rename interchangeLoops -> permuteLoops; update doc comments to clarify inputs / return val. Other than the assert checks, this is NFC. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D77003	2020-03-30 00:03:12 +05:30
Uday Bondhugula	43a95a543f	[MLIR] Introduce full/partial tile separation using if/else This patch introduces a utility to separate full tiles from partial tiles when tiling affine loop nests where trip counts are unknown or where tile sizes don't divide trip counts. A conditional guard is generated to separate out the full tile (with constant trip count loops) into the then block of an 'affine.if' and the partial tile to the else block. The separation allows the 'then' block (which has constant trip count loops) to be optimized better subsequently: for eg. for unroll-and-jam, register tiling, vectorization without leading to cleanup code, or to offload to accelerators. Among techniques from the literature, the if/else based separation leads to the most compact cleanup code for multi-dimensional cases (because a single version is used to model all partial tiles). INPUT affine.for %i0 = 0 to %M { affine.for %i1 = 0 to %N { "foo"() : () -> () } } OUTPUT AFTER TILING W/O SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0)[s0] -> (d0 + 32, s0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.for %arg4 = #map0(%arg2) to min #map1(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map1(%arg3)[%N] { "foo"() : () -> () } } } } OUTPUT AFTER TILING WITH SEPARATION map0 = affine_map<(d0) -> (d0)> map1 = affine_map<(d0) -> (d0 + 32)> map2 = affine_map<(d0)[s0] -> (d0 + 32, s0)> #set0 = affine_set<(d0, d1)[s0, s1] : (-d0 + s0 - 32 >= 0, -d1 + s1 - 32 >= 0)> affine.for %arg2 = 0 to %M step 32 { affine.for %arg3 = 0 to %N step 32 { affine.if #set0(%arg2, %arg3)[%M, %N] { // Full tile. affine.for %arg4 = #map0(%arg2) to #map1(%arg2) { affine.for %arg5 = #map0(%arg3) to #map1(%arg3) { "foo"() : () -> () } } } else { // Partial tile. affine.for %arg4 = #map0(%arg2) to min #map2(%arg2)[%M] { affine.for %arg5 = #map0(%arg3) to min #map2(%arg3)[%N] { "foo"() : () -> () } } } } } The separation is tested via a cmd line flag on the loop tiling pass. The utility itself allows one to pass in any band of contiguously nested loops, and can be used by other transforms/utilities. The current implementation works for hyperrectangular loop nests. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76700	2020-03-28 06:58:35 +05:30
Tres Popp	27c201aa1d	[MLIR] Add parallel loop collapsing. This allows conversion of a ParallelLoop from N induction variables to some nuber of induction variables less than N. The first intended use of this is for the GPUDialect to convert ParallelLoops to iterate over 3 dimensions so they can be launched as GPU Kernels. To implement this: - Normalize each iteration space of the ParallelLoop - Use the same induction variable in a new ParallelLoop for multiple original iterations. - Split the new induction variable back into the original set of values inside the body of the ParallelLoop. Subscribers: mgorny, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76363	2020-03-26 09:32:52 +01:00
Uday Bondhugula	98fa615002	[MLIR] move loopUnrollJamBy*Factor to loop transforms utils The declarations for these were already part of transforms utils, but the definitions were left in affine transforms. Move definitions to loop transforms utils. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76633	2020-03-24 08:08:57 +05:30
Rob Suderman	e708471395	[mlir][NFC] Cleanup AffineOps directory structure Summary: Change AffineOps Dialect structure to better group both IR and Tranforms. This included extracting transforms directly related to AffineOps. Also move AffineOps to Affine. Differential Revision: https://reviews.llvm.org/D76161	2020-03-20 14:23:43 -07:00

1 2 3 4

159 Commits