llvm-project

Author	SHA1	Message	Date
Lei Zhang	50000abe3c	[mlir] Use affine.apply when distributing to processors This makes it easy to compose the distribution computation with other affine computations. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D98171	2021-03-09 08:37:20 -05:00
River Riddle	3dfa86149e	[mlir][IR] Refactor the internal implementation of Value The current implementation of Value involves a pointer int pair with several different kinds of owners, i.e. BlockArgumentImpl, Operation , TrailingOpResult. This design arose from the desire to save memory overhead for operations that have a very small number of results (generally 0-2). There are, unfortunately, many problematic aspects of the current implementation that make Values difficult to work with or just inefficient. Operation result types are stored as a separate array on the Operation. This is very inefficient for many reasons: we use TupleType for multiple results, which can lead to huge amounts of memory usage if multi-result operations change types frequently(they do). It also means that simple methods like Value::getType/Value::setType now require complex logic to get to the desired type. Value only has one pointer bit free, severely limiting the ability to use it in things like PointerUnion/PointerIntPair. Given that we store the kind of a Value along with the "owner" pointer, we only leave one bit free for users of Value. This creates situations where we end up nesting PointerUnions to be able to use Value in one. As noted above, most of the methods in Value need to branch on at least 3 different cases which is both inefficient, possibly error prone, and verbose. The current storage of results also creates problems for utilities like ValueRange/TypeRange, which want to efficiently store base pointers to ranges (of which Operation isn't really useful as one). This revision greatly simplifies the implementation of Value by the introduction of a new ValueImpl class. This class contains all of the state shared between all of the various derived value classes; i.e. the use list, the type, and the kind. This shared implementation class provides several large benefits: * Most of the methods on value are now branchless, and often one-liners. * The "kind" of the value is now stored in ValueImpl instead of Value This frees up all of Value's pointer bits, allowing for users to take full advantage of PointerUnion/PointerIntPair/etc. It also allows for storing more operation results as "inline", 6 now instead of 2, freeing up 1 word per new inline result. * Operation result types are now stored in the result, instead of a side array This drops the size of zero-result operations by 1 word. It also removes the memory crushing use of TupleType for operations results (which could lead up to hundreds of megabytes of "dead" TupleTypes in the context). This also allowed restructured ValueRange, making it simpler and one word smaller. This revision does come with two conceptual downsides: * Operation::getResultTypes no longer returns an ArrayRef<Type> This conceptually makes some usages slower, as the iterator increment is slightly more complex. * OpResult::getOwner is slightly more expensive, as it now requires a little bit of arithmetic From profiling, neither of the conceptual downsides have resulted in any perceivable hit to performance. Given the advantages of the new design, most compiles are slightly faster. Differential Revision: https://reviews.llvm.org/D97804	2021-03-03 14:33:37 -08:00
KareemErgawy-TomTom	3b021fbdc0	[MLIR][LinAlg] Detensorize interal function control flow. This patch continues detensorizing implementation by detensoring internal control flow in functions. In order to detensorize functions, all the non-entry block's arguments are detensored and branches between such blocks are properly updated to reflect the detensored types as well. Function entry block (signature) is left intact. This continues work towards handling github/google/iree#1159. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D97148	2021-03-02 11:46:20 +01:00
Vladislav Vinogradov	37eca08e5b	[mlir][NFC] Rename `MemRefType::getMemorySpace` to `getMemorySpaceAsInt` Just a pure method renaming. It is a preparation step for replacing "memory space as raw integer" with more generic "memory space as attribute", which will be done in separate commit. The `MemRefType::getMemorySpace` method will return `Attribute` and become the main API, while `getMemorySpaceAsInt` will be declared as deprecated and will be replaced in all in-tree dialects (also in separate commits). Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D97476	2021-03-02 11:08:54 +03:00
River Riddle	e6260ad043	[mlir] Simplify various pieces of code now that Identifier has access to the Context/Dialect This also exposed a bug in Dialect loading where it was not correctly identifying identifiers that had the dialect namespace as a prefix. Differential Revision: https://reviews.llvm.org/D97431	2021-02-26 18:00:05 -08:00
Vinayaka Bandishti	ce0f10a1d1	[MLIR][affine] Certain Call Ops to prevent fusion Fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a Call Op that potentially modifies memrefs under consideration exists between source and target. Fixes part of https://bugs.llvm.org/show_bug.cgi?id=49220 Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97252	2021-02-26 15:27:41 +05:30
Tung D. Le	203d5eeec5	[MLIR][affine-loop-fusion] Handle defining ops between the source and dest loops This patch handles defining ops between the source and dest loop nests, and prevents loop nests with `iter_args` from being fused. If there is any SSA value in the dest loop nest whose defining op has dependence from the source loop nest, we cannot fuse the loop nests. If there is a `affine.for` with `iter_args`, prevent it from being fused. Reviewed By: dcaballe, bondhugula Differential Revision: https://reviews.llvm.org/D97030	2021-02-25 18:12:34 +02:00
River Riddle	59f0e4627a	[mlir][Inliner] Don't optimize callees in async mode if there is only one to optimize This avoids unnecessary async overhead in situations that won't benefit from it.	2021-02-23 18:44:09 -08:00
River Riddle	16a50c9e64	[mlir][Inliner] Keep the number of async pass managers constant This prevents a bug in the pass instrumentation implementation where the main thread would end up with a different pass manager in different runs of the pass.	2021-02-23 16:40:24 -08:00
River Riddle	abd3c6f24c	[mlir][Inliner] Use llvm::parallelForEach instead of llvm::parallelTransformReduce llvm::parallelTransformReduce does not schedule work on the caller thread, which becomes very costly for the inliner where a majority of SCCs are small, often ~1 element. The switch to llvm::parallelForEach solves this, and also aligns the implementation with the PassManager (which realistically should share the same implementation). This change dropped compile time on an internal benchmark by ~1(25%) second. Differential Revision: https://reviews.llvm.org/D96086	2021-02-23 14:36:45 -08:00
Adam Straw	af8adea155	make Affine parallel and yield ops MemRefsNormalizable Affine parallel ops may contain and yield results from MemRefsNormalizable ops in the loop body. Thus, both affine.parallel and affine.yield should have the MemRefsNormalizable trait. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96821	2021-02-23 10:16:47 -08:00
Vivek	817d343fb0	[MLIR] Fix tilePerfectlyNested utility for handling non-unit step size The current implementation of tilePerfectlyNested utility doesn't handle the non-unit step size. We have added support to perform tiling correctly even if the step size of the loop to be tiled is non-unit. Fixes https://bugs.llvm.org/show_bug.cgi?id=49188. Differential Revision: https://reviews.llvm.org/D97037	2021-02-23 00:50:04 +05:30
Vinayaka Bandishti	15332982c3	[MLIR][affine] Prevent fusion when ops with memory effect free are present between producer and consumer This commit fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a dealloc op is present between a producer and a consumer. This is done by creating a node for dealloc op in the MDG. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97032	2021-02-22 23:21:02 +05:30
Jacques Pienaar	02d7b260c6	[mlir] Register the print-op-graph pass using ODS Move over to ODS & use pass options.	2021-02-20 15:42:02 -08:00
Alexander Belyaev	a89035d750	Revert "[MLIR] Create memref dialect and move several dialect-specific ops from std." This commit introduced a cyclic dependency: Memref dialect depends on Standard because it used ConstantIndexOp. Std depends on the MemRef dialect in its EDSC/Intrinsics.h Working on a fix. This reverts commit 8aa6c3765b924d86f623d452777eb76b83bf2787.	2021-02-18 12:49:52 +01:00
Julian Gross	8aa6c3765b	[MLIR] Create memref dialect and move several dialect-specific ops from std. Create the memref dialect and move several dialect-specific ops without dependencies to other ops from std dialect to this dialect. Moved ops: AllocOp -> MemRef_AllocOp AllocaOp -> MemRef_AllocaOp DeallocOp -> MemRef_DeallocOp MemRefCastOp -> MemRef_CastOp GetGlobalMemRefOp -> MemRef_GetGlobalOp GlobalMemRefOp -> MemRef_GlobalOp PrefetchOp -> MemRef_PrefetchOp ReshapeOp -> MemRef_ReshapeOp StoreOp -> MemRef_StoreOp TransposeOp -> MemRef_TransposeOp ViewOp -> MemRef_ViewOp The roadmap to split the memref dialect from std is discussed here: https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667 Differential Revision: https://reviews.llvm.org/D96425	2021-02-18 11:29:39 +01:00
Adam Straw	99c0458f2f	separate AffineMapAccessInterface from AffineRead/WriteOpInterface Separating the AffineMapAccessInterface from AffineRead/WriteOp interface so that dialects which extend Affine capabilities (e.g. PlaidML PXA = parallel extensions for Affine) can utilize relevant passes (e.g. MemRef normalization). Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D96284	2021-02-16 13:05:27 -08:00
Nicolas Vasilache	d01ea0edaa	[mlir] Drop reliance of SliceAnalysis on specific ops. SliceAnalysis originally was developed in the context of affine.for within mlfunc. It predates the notion of region. This revision updates it to not hardcode specific ops like scf::ForOp. When rooted at an op, the behavior of the slice computation changes as it recurses into the regions of the op. This does not support gathering all values transitively depending on a loop induction variable anymore. Additional variants rooted at a Value are added to also support the existing behavior. Differential revision: https://reviews.llvm.org/D96702	2021-02-16 06:34:32 +00:00
Alexander Belyaev	891e769ab6	[mlir] Initialize `isSmallAlloc` in `initialize` method.	2021-02-12 10:28:58 +01:00
Alexander Belyaev	16213e1f50	[mlir] Allow users of `PromoteBuffersToStackPass` to customize `small buffer` func. Differential Revision: https://reviews.llvm.org/D96579	2021-02-12 10:11:58 +01:00
Mehdi Amini	b1aaed023e	Enable `Pass::initialize()` to fail by returning a LogicalResult Differential Revision: https://reviews.llvm.org/D96474	2021-02-11 01:51:53 +00:00
Uday Bondhugula	5400f602cd	[MLIR] Update affine.for unroll utility for iter_args support Update affine.for loop unroll utility for iteration arguments support. Fix promoteIfSingleIteration as well. Fixes PR49084: https://bugs.llvm.org/show_bug.cgi?id=49084 Differential Revision: https://reviews.llvm.org/D96383	2021-02-10 10:38:47 +05:30
River Riddle	fe7c0d90b2	[mlir][IR] Remove the concept of `OperationProperties` These properties were useful for a few things before traits had a better integration story, but don't really carry their weight well these days. Most of these properties are already checked via traits in most of the code. It is better to align the system around traits, and improve the performance/cost of traits in general. Differential Revision: https://reviews.llvm.org/D96088	2021-02-09 12:00:15 -08:00
Tung D. Le	05c6c648ec	[MLIR] [affine-loop-fusion] Fix a bug about non-result ops in affine-loop-fusion This patch fixes the following bug when calling --affine-loop-fusion Input program: ```mlir func @should_not_fuse_since_top_level_non_affine_non_result_users( %in0 : memref<32xf32>, %in1 : memref<32xf32>) { %c0 = constant 0 : index %cst_0 = constant 0.000000e+00 : f32 affine.for %d = 0 to 32 { %lhs = affine.load %in0[%d] : memref<32xf32> %rhs = affine.load %in1[%d] : memref<32xf32> %add = addf %lhs, %rhs : f32 affine.store %add, %in0[%d] : memref<32xf32> } store %cst_0, %in0[%c0] : memref<32xf32> affine.for %d = 0 to 32 { %lhs = affine.load %in0[%d] : memref<32xf32> %rhs = affine.load %in1[%d] : memref<32xf32> %add = addf %lhs, %rhs: f32 affine.store %add, %in0[%d] : memref<32xf32> } return } ``` call --affine-loop-fusion, we got an incorrect output: ```mlir func @should_not_fuse_since_top_level_non_affine_non_result_users(%arg0: memref<32xf32>, %arg1: memref<32xf32>) { %c0 = constant 0 : index %cst = constant 0.000000e+00 : f32 store %cst, %arg0[%c0] : memref<32xf32> affine.for %arg2 = 0 to 32 { %0 = affine.load %arg0[%arg2] : memref<32xf32> %1 = affine.load %arg1[%arg2] : memref<32xf32> %2 = addf %0, %1 : f32 affine.store %2, %arg0[%arg2] : memref<32xf32> %3 = affine.load %arg0[%arg2] : memref<32xf32> %4 = affine.load %arg1[%arg2] : memref<32xf32> %5 = addf %3, %4 : f32 affine.store %5, %arg0[%arg2] : memref<32xf32> } return } ``` This happened because when analyzing the source and destination nodes, affine loop fusion ignored non-result ops sandwitched between them. In other words, the MemRefDependencyGraph in the affine loop fusion ignored these non-result ops. This patch solves the issue by adding these non-result ops to the MemRefDependencyGraph. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D95668	2021-02-06 13:30:16 +05:30
River Riddle	e21adfa32d	[mlir] Mark LogicalResult as LLVM_NODISCARD This makes ignoring a result explicit by the user, and helps to prevent accidental errors with dropped results. Marking LogicalResult as no discard was always the intention from the beginning, but got lost along the way. Differential Revision: https://reviews.llvm.org/D95841	2021-02-04 15:10:10 -08:00
Alex Zinenko	5b91060dcc	[mlir] Apply source materialization in case of transitive conversion In dialect conversion infrastructure, source materialization applies as part of the finalization procedure to results of the newly produced operations that replace previously existing values with values having a different type. However, such operations may be created to replace operations created in other patterns. At this point, it is possible that the results of the _original_ operation are still in use and have mismatching types, but the results of the _intermediate_ operation that performed the type change are not in use leading to the absence of source materialization. For example, %0 = dialect.produce : !dialect.A dialect.use %0 : !dialect.A can be replaced with %0 = dialect.other : !dialect.A %1 = dialect.produce : !dialect.A // replaced, scheduled for removal dialect.use %1 : !dialect.A and then with %0 = dialect.final : !dialect.B %1 = dialect.other : !dialect.A // replaced, scheduled for removal %2 = dialect.produce : !dialect.A // replaced, scheduled for removal dialect.use %2 : !dialect.A in the same rewriting, but only the %1->%0 replacement is currently considered. Change the logic in dialect conversion to look up all values that were replaced by the given value and performing source materialization if any of those values is still in use with mismatching types. This is performed by computing the inverse value replacement mapping. This arguably expensive manipulation is performed only if there were some type-changing replacements. An alternative could be to consider all replaced operations and not only those that resulted in type changes, but it would harm pattern-level composability: the pattern that performed the non-type-changing replacement would have to be made aware of the type converter in order to call the materialization hook. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95626	2021-02-04 11:15:11 +01:00
Mehdi Amini	a1d5bdf819	Make the folder more robust against op fold() methods that generate a type mismatch We could extend this with an interface to allow dialect to perform a type conversion, but that would make the folder creating operation which isn't the case at the moment, and isn't necessarily always desirable. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95991	2021-02-04 01:58:56 +00:00
Alex Zinenko	0409eb2874	[mlir] Keep track of region signature conversions as argument replacements In dialect conversion, signature conversions essentially perform block argument replacement and are added to the general value remapping. However, the replaced values were not tracked, so if a signature conversion was rolled back, the construction of operand lists for the following patterns could have obtained block arguments from the mapping and give them to the pattern leading to use-after-free. Keep track of signature conversions similarly to normal block argument replacement, and erase such replacements from the general mapping when the conversion is rolled back. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95688	2021-02-02 10:38:31 +01:00
Alexander Belyaev	8d7cbcf582	[mlir] Preserve lexicographic order after loop collapsing. Currently, for a scf.parallel (i,j,k) after the loop collapsing to 1D is done, the IVs would be traversed as for an scf.parallel(k,j,i). Differential Revision: https://reviews.llvm.org/D95693	2021-01-29 21:32:36 +01:00
Diego Caballero	c8fc5c0385	[mlir][Affine] Add support for multi-store producer fusion This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including: * For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached. * Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration. In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior. Producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please, see modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876	2021-01-25 20:31:17 +02:00
mikeurbach	0a7a1ac73d	[mlir] Support FuncOpSignatureConversion for more FunctionLike ops. This extracts the implementation of getType, setType, and getBody from FunctionSupport.h into the mlir::impl namespace and defines them generically in FunctionSupport.cpp. This allows them to be used elsewhere for any FunctionLike ops that use FunctionType for their type signature. Using the new helpers, FuncOpSignatureConversion is generalized to work with all such FunctionLike ops. Convenience helpers are added to configure the pattern for a given concrete FunctionLike op type. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95021	2021-01-21 18:35:09 -07:00
Diego Caballero	735a07f047	Revert "[mlir][Affine] Add support for multi-store producer fusion" This reverts commit 7dd198852b4db52ae22242dfeda4eccda83aa8b2. ASAN issue.	2021-01-21 00:37:23 +02:00
Jacques Pienaar	cad16e4a92	Avoid unused variable warning in opt mode	2021-01-20 09:45:22 -08:00
Diego Caballero	7dd198852b	[mlir][Affine] Add support for multi-store producer fusion This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including: * For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached. * Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration. In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior. Producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please, see modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876	2021-01-20 19:03:07 +02:00
Julian Gross	43f34f5834	Added check if there are regions that do not implement the RegionBranchOpInterface. Add a check if regions do not implement the RegionBranchOpInterface. This is not allowed in the current deallocation steps. Furthermore, we handle edge-cases, where a single region is attached and the parent operation has no results. This fixes: https://bugs.llvm.org/show_bug.cgi?id=48575 Differential Revision: https://reviews.llvm.org/D94586	2021-01-20 12:15:28 +01:00
River Riddle	c8fb6ee341	[mlir][PatternRewriter] Add a new hook to selectively replace uses of an operation This revision adds a new `replaceOpWithIf` hook that replaces uses of an operation that satisfy a given functor. If all uses are replaced, the operation gets erased in a similar manner to `replaceOp`. DialectConversion support will be added in a followup as this requires adjusting how replacements are tracked there. Differential Revision: https://reviews.llvm.org/D94632	2021-01-14 11:58:21 -08:00
Mehdi Amini	d8113cda78	Add newline to terminate debug message (NFC)	2021-01-14 19:29:18 +00:00
Tres Popp	3bd620d450	[mlir] Correct 2 places that result in corrupted conversion rollbacks This corrects the last 2 issues caught by tests when causing dialect conversion rollbacks to occur. Differential Revision: https://reviews.llvm.org/D94623	2021-01-13 20:31:15 +01:00
Rob Suderman	f75f391fc6	[MLIR][Linalg] Refactor transforms to use linalg::getDynOperands helper getDynOperands behavior is commonly used in a number of passes. Refactored to use a helper function and avoid code reuse. Differential Revision: https://reviews.llvm.org/D94340	2021-01-11 16:24:59 -08:00
River Riddle	1ba5ea67a3	[mlir] Add a hook for initializing passes before execution and use it in the Canonicalizer This revision adds a new `initialize(MLIRContext *)` hook to passes that allows for them to initialize any heavy state before the first execution of the pass. A concrete use case of this is with patterns that rely on PDL, given that PDL is compiled at run time it is imperative that compilation results are cached as much as possible. The first use of this hook is in the Canonicalizer, which has the added benefit of reducing the number of expensive accesses to the context when collecting patterns. Differential Revision: https://reviews.llvm.org/D93147	2021-01-08 13:36:12 -08:00
Kazuaki Ishizaki	f88fab5006	[mlir] NFC: fix trivial typos fix typo under include and lib directories Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D94220	2021-01-08 02:10:12 +09:00
Jacques Pienaar	e03266994a	[mlir] Skip empty op-pipelines in inliner textual opt parsing Avoids failing on cases like inline{default-pipeline=canonicalize max-iterations=4 op-pipelines=}, as produced by crash reproducer.	2020-12-29 13:59:53 -08:00
River Riddle	fc5cf50e89	[mlir] Remove the MutableDictionaryAttr class This class used to serve a few useful purposes: * Allowed containing a null DictionaryAttr * Provided some simple mutable API around a DictionaryAttr The first of which is no longer an issue now that there is much better caching support for attributes in general, and a cache in the context for empty dictionaries. The second results in more trouble than it's worth because it mutates the internal dictionary on every action, leading to a potentially large number of dictionary copies. NamedAttrList is a much better alternative for the second use case, and should be modified as needed to better fit it's usage as a DictionaryAttrBuilder. Differential Revision: https://reviews.llvm.org/D93442	2020-12-17 17:18:42 -08:00
River Riddle	1b97cdf885	[mlir][IR][NFC] Move context/location parameters of builtin Type::get methods to the start of the parameter list This better matches the rest of the infrastructure, is much simpler, and makes it easier to move these types to being declaratively specified. Differential Revision: https://reviews.llvm.org/D93432	2020-12-17 13:01:36 -08:00
Alexander Belyaev	9ca67d7f44	Revert "[mlir] Lookup the latest value with a legal type when remapping values." This reverts commit f8184d4c44dff1fab13122221f0c23ab50936647.	2020-12-16 23:09:04 +01:00
Alexander Belyaev	f8184d4c44	[mlir] Lookup the latest value with a legal type when remapping values. The current condition implies that the target materialization will be called even if the type is the new operand type is legal, but slightly different. For example, if there is a bufferization pattern that changes memref layout, then target materialization for an illegal type (TensorType) would be called. Differential Revision: https://reviews.llvm.org/D93126	2020-12-16 09:53:19 +01:00
River Riddle	d7eba20052	[mlir][Inliner] Refactor the inliner to use nested pass pipelines instead of just canonicalization Now that passes have support for running nested pipelines, the inliner can now allow for users to provide proper nested pipelines to use for optimization during inlining. This revision also changes the behavior of optimization during inlining to optimize before attempting to inline, which should lead to a more accurate cost model and prevents the need for users to schedule additional duplicate cleanup passes before/after the inliner that would already be run during inlining. Differential Revision: https://reviews.llvm.org/D91211	2020-12-14 18:09:47 -08:00
Sean Silva	444822d77a	Revert "Revert "[mlir] Start splitting the `tensor` dialect out of `std`."" This reverts commit 0d48d265db6633e4e575f81f9d3a52139b1dc5ca. This reapplies the following commit, with a fix for CAPI/ir.c: [mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 14:30:50 -08:00
Sean Silva	0d48d265db	Revert "[mlir] Start splitting the `tensor` dialect out of `std`." This reverts commit cab8dda90f48e15ee94b0d55ceac5b6a812e4743. I mistakenly thought that CAPI/ir.c failure was unrelated to this change. Need to debug it.	2020-12-11 14:15:41 -08:00
Sean Silva	cab8dda90f	[mlir] Start splitting the `tensor` dialect out of `std`. This starts by moving `std.extract_element` to `tensor.extract` (this mirrors the naming of `vector.extract`). Curiously, `std.extract_element` supposedly works on vectors as well, and this patch removes that functionality. I would tend to do that in separate patch, but I couldn't find any downstream users relying on this, and the fact that we have `vector.extract` made it seem safe enough to lump in here. This also sets up the `tensor` dialect as a dependency of the `std` dialect, as some ops that currently live in `std` depend on `tensor.extract` via their canonicalization patterns. Part of RFC: https://llvm.discourse.group/t/rfc-split-the-tensor-dialect-from-std/2347/2 Differential Revision: https://reviews.llvm.org/D92991	2020-12-11 13:50:55 -08:00

1 2 3 4 5 ...

986 Commits