llvm-project

Author	SHA1	Message	Date
Nirvedh Meshram	c441070665	[mlir][spirv] Add conversion from GPU WMMA ops to SPIRV Cooperative matrix Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D136521	2022-10-22 18:29:40 -07:00
Hanhan Wang	00767cb452	[mlir] Delete dup code and use unified methods. The foldMemRefCast method is defined in memref namespace; the foldTensorCast method is defined in tensor namespace. This revision deletes the dup code and use the unified methods. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D136379	2022-10-21 16:51:44 -07:00
Alex Zinenko	b0bf7ffffc	[mlir] add utilites for DiagnosedSilenceableFailure This class adds helper functions similar to `emitError` for the DiagnosedSilenceableFailure class in both the silenceable and definite failure cases. These helpers simplify the use of said class and make tranfsorm op application code idiomatic. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D136072	2022-10-17 15:31:28 +00:00
Mehdi Amini	5fbec2dfbb	Apply clang-tidy fixes for readability-identifier-naming in GPUTransformOps.cpp (NFC)	2022-10-11 16:54:33 +00:00
Ivan Butygin	b845addae8	[mlir][gpu] Add `subgroup_reduce` operation Introduce `subgroup_reduce` operation, similar to `all_reduce`, but operating on subgroup scope instead of workgroup. It is intended as low-level building block for more high level abstractions (e.g for workgroup-wide `all_reduce` ops). Only introduce version taking reduce operation enum for simplicity sake. Differential Revision: https://reviews.llvm.org/D135323	2022-10-11 11:47:15 +02:00
Guray Ozen	e68a7bed59	[mlir][transform] Add failing test for GPU transform dialect The GPU transform dialect currently has restrictions and several situations where we can't use transform dialect. This update includes a method to test a failing cases in GPU transform dialect. Differential Revision: https://reviews.llvm.org/D135063	2022-10-05 13:10:13 +02:00
Guray Ozen	78305720f3	[mlir][transform][nfc] typo fix fix typo Reviewed By: nicolasvasilache, ftynse Differential Revision: https://reviews.llvm.org/D135242	2022-10-05 13:05:46 +02:00
Guray Ozen	89bb0cae46	[mlir][transform] Create GPU transform dialect This revision adds GPU transform dialect. It also introduce a prefix such as "transform.gpu" for all ops related to this dialect. MLIR already had two GPU transform op in linalg. This revision moves these ops into GPUTransformOps. The Ops are as follows: `transform.structured.map_nested_foreach_thread_to_gpu_blocks` -> `transform.gpu.map_foreach_to_blocks` This op selects the outermost (toplevel) foreach_thread and parallelize across GPU blocks. It can also generate `gpu_launch`. `transform.structured.map_nested_foreach_thread_to_gpu_threads` -> `transform.gpu.map_nested_foreach_to_threads` This op parallelizes nested foreach_thread that are inside `gpu_launch` across GPU threads. It doesn't add new functionality, but there are some minor refactoring of the code. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D134800	2022-10-04 13:09:08 +02:00
River Riddle	10c04f4641	[mlir:GPU][NFC] Update GPU API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
River Riddle	a5aa783685	[mlir:Async][NFC] Update Async API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Ivan Radanov Ivanov	e01c7f092f	[MLIR] Revert default NVIDIA GPU version Due to integration tests failing revert mlir::SerializeToCubinPass defaults to old ones (changed in https://reviews.llvm.org/D134153) Reviewed By: akuegel Differential Revision: https://reviews.llvm.org/D134414	2022-09-22 10:19:38 +02:00
River Riddle	986b5c56ea	[mlir] Flip Async/GPU/OpenACC/OpenMP to use Both accessors This allows for incrementally updating the old API usages without needing to update everything at once. These will be left on Both for a little bit and then flipped to prefixed when all APIs have been updated. Differential Revision: https://reviews.llvm.org/D134386	2022-09-21 17:36:13 -07:00
Ivan Radanov Ivanov	f9211330f6	[MLIR] Set default NVIDIA GPU version	2022-09-21 18:10:59 -04:00
Ivan Radanov Ivanov	2f7a774ed7	[MLIR] Add a create function for mlir::SerializeToCubinPass Differential Revision: https://reviews.llvm.org/D134153	2022-09-21 18:02:59 -04:00
Mehdi Amini	28c17a4b06	Apply clang-tidy fixes for performance-unnecessary-value-param in InferIntRangeInterfaceImpls.cpp (NFC)	2022-09-01 14:50:14 +00:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit 2be8af8f0e0780901213b6fd3013a5268ddc3359.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Christian Sigg	50c33a3a9c	[MLIR] Harden gpu.func verification GPUFuncOpLowering moves the body out of gpu.func op and erases it. An empty gpu.func may fail verification but should not crash it. Verification of an erased op is triggered e.g. with debug printing on. Reviewed By: akuegel Differential Revision: https://reviews.llvm.org/D132446	2022-08-23 14:58:46 +02:00
Jeff Niu	58a47508f0	(Reland) [mlir] Switch segment size attributes to DenseI32ArrayAttr This reland includes changes to the Python bindings. Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131801 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D131803	2022-08-12 19:44:52 -04:00
Alex Zinenko	e8e718fa4b	Revert "[mlir] Switch segment size attributes to DenseI32ArrayAttr" This reverts commit 30171e76f0e5ea8037bc4d1450dd3e12af4d9938. Breaks Python tests in MLIR, missing C API and Python changes.	2022-08-12 10:22:47 +02:00
Jeff Niu	30171e76f0	[mlir] Switch segment size attributes to DenseI32ArrayAttr Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131738 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D131702	2022-08-11 20:56:45 -04:00
Benjamin Kramer	9fa59e7643	[mlir] Use C++17 structured bindings instead of std::tie where applicable. NFCI	2022-08-09 13:34:17 +02:00
River Riddle	c60b897d22	[mlir] Refactor the Parser library in preparation for an MLIR binary format The current Parser library is solely focused on providing API for the textual MLIR format, but MLIR will soon also provide a binary format. This commit renames the current Parser library to AsmParser to better correspond to what the library is actually intended for. A new Parser library is added which will act as a unified parser interface between both text and binary formats. Most parser clients are unaffected, given that the unified interface is essentially the same as the current interface. Only clients that rely on utilizing the AsmParserState, or those that want to parse Attributes/Types need to be updated to point to the AsmParser library. Differential Revision: https://reviews.llvm.org/D129605	2022-07-25 16:33:01 -07:00
Jeff Niu	b7f93c2809	[mlir] (NFC) run clang-format on all files	2022-07-14 13:32:13 -07:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Jacques Pienaar	136d746ec7	[mlir] Flip accessors to prefixed form (NFC) Another mechanical sweep to keep diff small for flip to _Prefixed.	2022-07-10 21:19:11 -07:00
Christian Sigg	3e01af093f	[mlir] Add InferIntRangeInterface to gpu.launch Infers block/grid dimensions/indices or ranges of such dimensions/indices. Reviewed By: krzysz00 Differential Revision: https://reviews.llvm.org/D129036	2022-07-05 07:14:54 +02:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit aa8feeefd3ac6c78ee8f67bf033976fc7d68bc6d.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Kazu Hirata	6d5fc1e3d5	[mlir] Don't use Optional::getValue (NFC)	2022-06-20 23:20:25 -07:00
Mogball	d883a02a7c	[mlir][ods] Remove StructAttr Depends on D127373 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D127375	2022-06-21 01:10:05 +00:00
Kazu Hirata	037f09959a	[mlir] Don't use Optional::hasValue (NFC)	2022-06-20 11:22:37 -07:00
Alex Zinenko	8b68da2c7d	[mlir] move SCF headers to SCF/{IR,Transforms} respectively This aligns the SCF dialect file layout with the majority of the dialects. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D128049	2022-06-20 10:18:01 +02:00
Mogball	e16d13322b	[mlir] (NFC) Clean up bazel and CMake target names All dialect targets in bazel have been named Dialect and all dialect targets in CMake have been named MLIRDialect.	2022-06-13 16:24:15 +00:00
Krzysztof Drewniak	a2cdb9791b	[mlir][AMDGPU] Set ABI version constant when linking device libs Currently, linking the device libraries requires setting a constant that indicates the code object ABI version the compilation is targeting. This fixes the MLIR linking process by setting this constant to 400, which is the value corresponding to the current code object ABI default, version 4. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D126913	2022-06-10 18:40:52 +00:00
Mogball	d7ef488bb6	[mlir][gpu] Move GPU headers into IR/ and Transforms/ Depends on D127350 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D127352	2022-06-09 22:49:03 +00:00
Mogball	7bdd3722f2	[mlir][gpu] Change ParalellLoopMappingAttr to AttrDef It was a StructAttr. Also adds a FieldParser for AffineMap. Depends on D127348 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D127350	2022-06-09 22:23:21 +00:00
Christian Sigg	bcf3d52486	[MLIR][GPU] Expose GpuParallelLoopMapping as non-test pass. Reviewed By: bondhugula, herhut Differential Revision: https://reviews.llvm.org/D126199	2022-05-30 09:20:48 +02:00
Mehdi Amini	59c3be748f	Apply clang-tidy fixes for performance-move-const-arg in SerializeToHsaco.cpp (NFC)	2022-05-16 13:58:49 +00:00
Arnab Dutta	16219f8c94	[MLIR][GPU] Add canonicalizer for gpu.memcpy Erase gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula, csigg Differential Revision: https://reviews.llvm.org/D124257	2022-05-14 19:01:04 +05:30
Christian Sigg	0e3d1ca54a	[MLIR][GPU] NFC: simplify kernel operand accessor implementations. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D125112	2022-05-14 14:14:42 +02:00
Daniil Dudkin	70c463efc8	[mlir][NFC] Fix `GpuKernelOutliningPass` copy constructor warnings 1. Call copy constructor of the base class 2. Assign value of the option directly Reviewed By: dcaballe, rriddle Differential Revision: https://reviews.llvm.org/D125101	2022-05-12 11:41:18 +03:00
Thomas Raoux	15bcc36eed	[mlir][gpu] Move async copy ops to NVGPU and add caching hints Move async copy operations to NVGPU as they only exist on NV target and are designed to match ptx semantic. This allows us to also add more fine grain caching hint attribute to the op. Add hint to bypass L1 and hook it up to NVVM op. Differential Revision: https://reviews.llvm.org/D125244	2022-05-10 22:30:24 +00:00
Nikita Popov	03ab30686d	[MLIR] Split off MLIRExecutionEngineUtils to fix libMLIR.so build (PR54242) Building libMLIR.so currently fails with: > /usr/bin/ld: /tmp/ccNzulEA.ltrans39.ltrans.o: in function `(anonymous namespace)::SerializeToHsacoPass::optimizeLlvm(llvm::Module&, llvm::TargetMachine&)': > /builddir/build/BUILD/llvm-project-15.0.0.src/mlir/lib/Dialect/GPU/Transforms/SerializeToHsaco.cpp:328: undefined reference to `mlir::makeOptimizingTransformer(unsigned int, unsigned int, llvm::TargetMachine*)' This is because MLIRGPUTransforms depends on MLIRExecutionEngine in `61bb2e4ea8/mlir/lib/Dialect/GPU/Transforms/SerializeToHsaco.cpp (L328)`, but MLIRExecutionEngine is marked as excluded from libMLIR.so. However, this code doesn't require the full execution engine: It only performs middle-end optimization, and does not need any of the JIT/codegen infrastructure. As such, split off a separate library MLIRExecutionEngineUtils, which only contains that part and is not excluded from libMLIR.so. Fixes https://github.com/llvm/llvm-project/issues/54242. Differential Revision: https://reviews.llvm.org/D125214	2022-05-10 10:17:52 +02:00
Chris Lattner	d85eb4e2d6	[AsmParser] Introduce a new "Argument" abstraction + supporting logic MLIR has a common pattern for "arguments" that uses syntax like `%x : i32 {attrs} loc("sourceloc")` which is implemented in adhoc ways throughout the codebase. The approach this uses is verbose (because it is implemented with parallel arrays) and inconsistent (e.g. lots of things drop source location info). Solve this by introducing OpAsmParser::Argument and make addRegion (which sets up BlockArguments for the region) take it. Convert the world to propagating this down. This means that we correctly capture and propagate source location information in a lot more cases (e.g. see the affine.for testcase example), and it also simplifies much code. Differential Revision: https://reviews.llvm.org/D124649	2022-04-29 12:19:34 -07:00
Chris Lattner	5dedf911de	[AsmParser] Rework logic around "region argument parsing" The asm parser had a notional distinction between parsing an operand (like "%foo" or "%4#3") and parsing a region argument (which isn't supposed to allow a result number like #3). Unfortunately the implementation has two problems: 1) It didn't actually check for the result number and reject it. parseRegionArgument and parseOperand were identical. 2) It had a lot of machinery built up around it that paralleled operand parsing. This also was functionally identical, but also had some subtle differences (e.g. the parseOptional stuff had a different result type). I thought about just removing all of this, but decided that the missing error checking was important, so I reimplemented it with a `allowResultNumber` flag on parseOperand. This keeps the codepaths unified and adds the missing error checks. Differential Revision: https://reviews.llvm.org/D124470	2022-04-28 11:12:44 -07:00
Vitaly Buka	6e1ac68a0c	[mlir] Don't iterate mutable user list executeOp.operandsMutable().append(asyncTokens) in addAsyncDependencyAfter can resize and invalidate iterators. Fixes reports like https://reviews.llvm.org/P8286 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D124577	2022-04-28 08:59:55 -07:00

1 2 3 4 5 ...

331 Commits