llvm-project

Author	SHA1	Message	Date
Christian Ulmann	51e36f217f	[mlir][llvm] Add branch weights to call and invoke This commit introduces branch weight attributes to the LLVM::CallOp and LLVM::InvokeOp and adds both import and export of them. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D141122	2023-01-09 10:25:07 +01:00
Slava Zakharin	2f66c89130	[mlir] Support TBAA metadata in LLVMIR dialect. This change introduces new LLVMIR dialect operations to represent TBAA root, type descriptor and access tag metadata nodes. For the purpose of importing TBAA metadata from LLVM IR it only supports the current version of TBAA format described in https://llvm.org/docs/LangRef.html#tbaa-metadata (i.e. size-aware representation introduced in D41501 is not supported). TBAA attribute support is only added for LLVM::LoadOp and LLVM::StoreOp. Support for intrinsics operations (e.g. LLVM::MemcpyOp) may be added later. The TBAA attribute is represented as an array of access tags, though, LLVM IR supports only single access tag per memory accessing instruction. I implemented it as an array anticipating similar support in LLVM IR to combine TBAA graphs with different roots for Flang - one of the options described in https://docs.google.com/document/d/16kKZVmI585wth01VSaJAqZMZpoX68rcdBmgfj0kNAt0/edit#heading=h.jzzheaz9vqac It should be easy to restrict MLIR operation to a single access tag, if we end up using a different approach for Flang. Differential Revision: https://reviews.llvm.org/D140768	2023-01-06 11:16:31 -08:00
Christian Ulmann	b72dd6f775	[mlir] Add function_entry_count to LLVMFuncOp This commit introduces the function_entry_count metadata field to the LLVMFuncOp and adds both the corresponding import and export funtionalities. The import of the function metadata uses the same infrastructure as the instruction metadata, i.e., it dispatches through a dialect interface. Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D141001	2023-01-05 13:40:56 +01:00
Tobias Gysi	0cf066392f	[mlir][llvm] Make the import of LLVM IR metadata extensible. This revision extends the LLVMImportDialectInterface to make the import of LLVM IR instruction-level metadata extensible. It extends the signature of the existing dialect interface to provide a method to import specific metadata kinds and attach them to the imported operation. The conversion function can rely on the ModuleImport class to perform support tasks. The revision implements the second part of the "extensible llvm ir import" rfc: https://discourse.llvm.org/t/rfc-extensible-llvm-ir-import/67256/6 The interface method names changed a bit compared to the suggested design. The hook to set the instruction level metadata is now called setMetadataAttrs and takes the metadata kind as an additional parameter. We do not hand in the original LLVM IR instruction since it is not used at this point. Importing named module-level meta data can be added in a later stage after gaining some experience with this extension mechanism. Depends on D140374 Reviewed By: ftynse, Dinistro Differential Revision: https://reviews.llvm.org/D140556	2023-01-03 14:47:25 +01:00
Krzysztof Drewniak	f6076bd81f	[mlir][ROCDL] Translate known block size attributes to ROCDL 1. When converting from the GPU dialect to the ROCDL dialect, if the function that contains a gpu.thread_id or gpu.block_id op is annotated with gpu.known_{block,grid}_size, use that size to set a "range" attribute on the corresponding rocdl intrinsic so that the LLVM frontend can optimize based on that range information. 1b. When translating from the rocdl dialect to LLVM IR, use the "range" attribute, if present, to set !range metadata on the relevant function call. 2. Deprecate the old rocdl.max_flat_work_group_size attribute, which was used in a tensorflow backend. Instead, use rocdl.flat_work_group_size going forward to allow kernel generators to specify the minimum and maximum work group sizes a kernel may be launched with in one attribute, thus more closely matching the backend. 3. When translating from gpu.func to llvm.func within gpu-to-rocdl, copy the known_block_size attribute as rocdl.reqd_work_group_size to enable further translations to set the corresponding metadata on the LLVM IR function. Also, set the rocdl.flat_work_group_size attribute to ensure that the reqd_work_group_size metadata and the amdgpu-flat-work-group-size metadata are consistent. 3b. Extend the ROCDL to LLVM IR translation to set the !reqd_work_group_size metadata on LLVM functions Also update tests and add functions to the ROCDL dialect to ensure attribute names are used consistently. Depends on D139865 Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D139866	2023-01-02 21:04:13 +00:00
Tobias Gysi	cf487cce6f	[mlir][llvm] Make the import of LLVM IR intrinsics extensible. The revision introduces the LLVMImportDialectInterface to make the import of LLVM IR intrinsics extensible. It uses a dialect interface that enables external projects to provide their own conversion functions for custom intrinsics. These conversion functions can rely on the ModuleImport class to perform support tasks such as mapping LLVM values to MLIR values or for converting types between the two worlds. The implementation largely mirrors the export implementation. One major difference is the dispatch to the appropriate dialect interface, since LLVM IR intrinsics have no direct association to an MLIR dialect. The dialect interfaces thus have to publish the supported intrinsics to ensure incoming conversion calls are dispatched to the right dialect interface. The revision implements the extensible intrinsic import discussed as part of the "extensible llvm ir import" rfc: https://discourse.llvm.org/t/rfc-extensible-llvm-ir-import/67256/6 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D140374	2023-01-02 11:35:44 +01:00
Fangrui Song	cbb0981388	[mlir] llvm::Optional::value => operator*/operator-> std::optional::value() has undesired exception checking semantics and is unavailable in older Xcode (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). The call sites block std::optional migration.	2022-12-17 19:07:38 +00:00
Ramkumar Ramachandra	22426110c5	mlir/tblgen: use std::optional in generation This is part of an effort to migrate from llvm::Optional to std::optional. This patch changes the way mlir-tblgen generates .inc files, and modifies tests and documentation appropriately. It is a "no compromises" patch, and doesn't leave the user with an unpleasant mix of llvm::Optional and std::optional. A non-trivial change has been made to ControlFlowInterfaces to split one constructor into two, relating to a build failure on Windows. See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D138934	2022-12-17 11:13:26 +01:00
Kazu Hirata	1a36588ec6	[mlir] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 18:50:27 -08:00
electriclilies	0efff7cdcb	[mlir] Add call_intrinsic op to LLVMIIR The call_intrinsic op allows us to call LLVM intrinsics from the LLVMDialect without implementing a new op every time. Reviewed By: lattner, rriddle Differential Revision: https://reviews.llvm.org/D137187	2022-11-02 15:56:46 -07:00
Slava Zakharin	b925509938	Fix MLIR build after D136931 ld.lld: error: undefined symbol: mlir::extractFromI64ArrayAttr(mlir::Attribute) >>> referenced by NVVMToLLVMIRTranslation.cpp:142 (/llvm-project/mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp:142) >>> referenced by NVVMToLLVMIRTranslation.cpp:152 (/llvm-project/mlir/lib/Target/LLVMIR/Dialect/NVVM/NVVMToLLVMIRTranslation.cpp:152)	2022-10-28 17:50:11 -07:00
Guray Ozen	3ac17449cf	[mlir][nvvm] Introduce performance tuning directives PTX programming models provides some performance tuning directives; see https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#performance-tuning-directives The downstream compiler namely `ptxas` leverages these information for better register allocation or to handle other resource management that improves the performance. This revision introduce all the kernel based directives to MLIR's NVVM dialect. The list is below ``` maxnreg -> max register per thread in CTA maxntid -> max threads per CTA reqntid -> exact number of threads per CTA minnctapersm -> min CTA per SM ``` Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D136931	2022-10-28 14:02:40 +02:00
Jeremy Furtek	b56e65d318	[mlir][arith] Initial support for fastmath flag attributes in the Arithmetic dialect (v2) This diff adds initial (partial) support for "fastmath" attributes for floating point operations in the arithmetic dialect. The "fastmath" attributes are implemented using a default-valued bit enum. The defined flags currently mirror the fastmath flags in the LLVM dialect (and in LLVM itself). Extending the set of flags (if necessary) is left as a future task. In this diff: - Definition of FastMathAttr as a custom attribute in the Arithmetic dialect that inherits from the EnumAttr class. - Definition of ArithFastMathInterface, which is an interface that is implemented by operations that have an arith::fastmath attribute. - Declaration of a default-valued fastmath attribute for unary and (some) binary floating point operations in the Arithmetic dialect. - Conversion code to lower arithmetic fastmath flags to LLVM fastmath flags NOT in this diff (but planned or currently in progress): - Documentation of flag meanings - Addition of FastMathAttr attributes to other dialects that might lower to the Arithmetic dialect (e.g. Math and Complex) - Folding/rewrite implementations that are enabled by fastmath flags - Specification of fastmath values from Python bindings (pending other in- progress diffs) Reviewed By: mehdi_amini, vzakhari Differential Revision: https://reviews.llvm.org/D126305	2022-10-26 11:56:16 -07:00
Dominik Adamski	ccd314d320	[OpenMP][OMPIRBuilder] Add generation of SIMD align assumptions to OMPIRBuilder Currently generation of align assumptions for OpenMP simd construct is done outside OMPIRBuilder for C code and it is not supported for Fortran. According to OpenMP 5.0 standard (2.9.3) only pointers and arrays can be aligned for C code. If given aligned variable is pointer, then Clang generates the following set of the LLVM IR isntructions to support simd align clause: ; memory allocation for pointer address: %A.addr = alloca ptr, align 8 ; some LLVM IR code ; Alignment instructions (alignment is equal to 32): %0 = load ptr, ptr %A.addr, align 8 call void @llvm.assume(i1 true) [ "align"(ptr %0, i64 32) ] If given aligned variable is array, then Clang generates the following set of the LLVM IR isntructions to support simd align clause: ; memory allocation for array: %B = alloca [10 x i32], align 16 ; some LLVM IR code ; Alignment instructions (alignment is equal to 32): %arraydecay = getelementptr inbounds [10 x i32], ptr %B, i64 0, i64 0 call void @llvm.assume(i1 true) [ "align"(ptr %arraydecay, i64 32) ] OMPIRBuilder was modified to generate aligned assumptions. It generates only llvm.assume calls. Frontend is responsible for generation of aligned pointer and getting the default alignment value if user does not specify it in aligned clause. Unit and regression tests were added to check if aligned clause was handled correctly. Differential Revision: https://reviews.llvm.org/D133578 Reviewed By: jdoerfert	2022-10-18 02:04:18 -05:00
Dominik Adamski	6842d35012	[OpenMP][OMPIRBuilder] Add support for order(concurrent) to OMPIRBuilder for SIMD directive If 'order(concurrent)' clause is specified, then the iterations of SIMD loop can be executed concurrently. This patch adds support for LLVM IR codegen via OMPIRBuilder for SIMD loop with 'order(concurrent)' clause. The functionality added to OMPIRBuilder is similar to the functionality implemented in 'CodeGenFunction::EmitOMPSimdInit'. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D134046 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-10-04 08:30:00 -05:00
River Riddle	4fb4e12bab	[mlir:OpenMP][NFC] Update OpenMP API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:11 -07:00
River Riddle	f9806b3e3e	[mlir:OpenACC][NFC] Update OpenACC API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
Hanhan Wang	aac844a4b1	[mlir] Improve bitEnumContains methods. `839b436c93` changes the behavior. Based on the discussion, we also want to support "and" behavior. The revision changes it into two functions, bitEnumContainsAny and bitEnumContainsAll. Reviewed By: krzysz00, antiagainst Differential Revision: https://reviews.llvm.org/D133507	2022-09-09 11:56:36 -07:00
Shraiysh Vaishay	80c1be8d14	[mlir][OpenMP] Translation to LLVM IR for omp.taskgroup This patch adds translation from OpenMP Dialect to LLVM IR for omp.taskgroup. This patch also adds missing tests for the clauses in omp.taskgroup operation. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D130157	2022-08-31 04:55:01 +00:00
Prabhdeep Singh Soni	b8055c5115	[MLIR][OpenMP] Add support for safelen clause This supports translation from MLIR to LLVM IR using OMPIRBuilder for OpenMP safelen clause in SIMD construct. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D132245	2022-08-24 12:30:54 -04:00
Prabhdeep Singh Soni	4fce38cde2	Revert "[MLIR][OpenMP] Add support for safelen clause" This reverts commit 172fe1706d83832a330170f43fe52aab1b75e7de.	2022-08-24 12:30:54 -04:00
Prabhdeep Singh Soni	172fe1706d	[MLIR][OpenMP] Add support for safelen clause This supports translation from MLIR to LLVM IR using OMPIRBuilder for OpenMP safelen clause in SIMD construct.	2022-08-24 12:23:31 -04:00
Prabhdeep Singh Soni	cfef6561a7	Revert "Add support for safelen clause" This reverts commit 3dd4d6a0cec85d96af0340a48aaacf638215fe76.	2022-08-24 12:15:41 -04:00
Prabhdeep Singh Soni	3dd4d6a0ce	Add support for safelen clause This supports translation from MLIR to LLVM IR using OMPIRBuilder for OpenMP safelen clause in SIMD construct.	2022-08-24 12:06:00 -04:00
Prabhdeep Singh Soni	bce94ea551	[OMPIRBuilder] Add support for safelen clause This patch adds OMPIRBuilder support for the safelen clause for the simd directive. Reviewed By: shraiysh, Meinersbur Differential Revision: https://reviews.llvm.org/D131526	2022-08-18 15:43:08 -04:00
Jeff Niu	5c5af910fe	[mlir][LLVMIR] "Modernize" Insert/ExtractValueOp This patch "modernizes" the LLVM `insertvalue` and `extractvalue` operations to use DenseI64ArrayAttr, since they only require an array of indices and previously there was confusion about whether to use i32 or i64 arrays, and to use assembly format. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D131537	2022-08-10 12:51:11 -04:00
Eugene Zhulenev	5f1c7e2cc5	[mlir] Use SymbolTableCollection to lookup referenced symbol in AddressOfOp Depends On D131285 Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D131291	2022-08-05 14:05:03 -07:00
Dominik Adamski	d90b7bf2c5	Add support for lowering simd if clause to LLVM IR Scope of changes: 1) Added new function to generate loop versioning 2) Added support for if clause to applySimd function 2) Added tests which confirm that lowering is successful If ifCond is specified, then collapsed loop is duplicated and if branch is added. Duplicated loop is executed if simd ifCond is evaluated to false. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D129368 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-08-01 04:43:32 -05:00
Kazu Hirata	e5a1ccbf25	Use value instead of getValue (NFC)	2022-07-29 21:18:41 -07:00
Prabhdeep Singh Soni	f5efa1892e	[Flang][MLIR][OpenMP] Add support for simdlen clause This supports lowering from parse-tree to MLIR and translation from MLIR to LLVM IR using OMPIRBuilder for OpenMP simdlen clause in SIMD construct. Reviewed By: shraiysh, peixin, arnamoy10 Differential Revision: https://reviews.llvm.org/D130195	2022-07-28 23:49:17 +08:00
Jeff Niu	b7f93c2809	[mlir] (NFC) run clang-format on all files	2022-07-14 13:32:13 -07:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Prabhdeep Singh Soni	ac892c70a4	[OMPIRBuilder] Add support for simdlen clause This patch adds OMPIRBuilder support for the simdlen clause for the simd directive. It uses the simdlen support in OpenMPIRBuilder when it is enabled in Clang. Simdlen is lowered by OpenMPIRBuilder by generating the loop.vectorize.width metadata. Reviewed By: jdoerfert, Meinersbur Differential Revision: https://reviews.llvm.org/D129149	2022-07-11 13:29:06 -04:00
Dominik Adamski	2c915e3b26	[mlir][OpenMP] Add if clause to OpenMP simd construct This patch adds if clause to OpenMP TableGen for simd construct. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D128940 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-07-06 07:24:48 -05:00
Shraiysh Vaishay	fdf505f3f2	[mlir][OpenMP] omp.task translation to LLVM IR This patch adds translation for omp.task from OpenMPDialect to LLVM IR Dialect and adds tests for the same. Depends on D71989 Reviewed By: ftynse, kiranchandramohan, peixin, Meinersbur Differential Revision: https://reviews.llvm.org/D123919	2022-07-04 21:03:02 +05:30
Peixin-Qiao	1795f8cd2e	[NFC][OpenMP] Fix worksharing-loop 1. Remove the redundant collapse clause in MLIR OpenMP worksharing-loop operation. 2. Fix several typos. 3. Refactor the chunk size type conversion since CreateSExtOrTrunc has both type check and type conversion. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D128338	2022-06-29 12:20:03 +08:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit aa8feeefd3ac6c78ee8f67bf033976fc7d68bc6d.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Kazu Hirata	6d5fc1e3d5	[mlir] Don't use Optional::getValue (NFC)	2022-06-20 23:20:25 -07:00
Kazu Hirata	0916d96d12	Don't use Optional::hasValue (NFC)	2022-06-20 20:17:57 -07:00
Kazu Hirata	037f09959a	[mlir] Don't use Optional::hasValue (NFC)	2022-06-20 11:22:37 -07:00
Kazu Hirata	30c675878c	Use value_or instead of getValueOr (NFC)	2022-06-19 10:34:41 -07:00
Mogball	e16d13322b	[mlir] (NFC) Clean up bazel and CMake target names All dialect targets in bazel have been named Dialect and all dialect targets in CMake have been named MLIRDialect.	2022-06-13 16:24:15 +00:00
jfurtek	5c3b20520b	[mlir] Update LLVMIR Fastmath flags use of MLIR BitEnum functionality This diff updates the LLVMIR dialect Fastmath flags attribute to use recently added features of `BitEnum` attributes. Specifically, this diff uses the bit enum "group" case to represent the `fast` value as an alias for a combination of other values (`ninf`, `nnan`, ...), instead of using a separate integer value. (This is in line with LLVM's fastmath flags representation.) This diff also leverages the `printBitEnumPrimaryGroups` `tblgen` field for concise enum printing. The `BitEnum` features were developed for an upcoming diff that adds `fastmath` support to the arithmetic dialect. This diff simply applies some of the relevant new features to the LLVM dialect attribute. Reviewed By: ftynse, Mogball Differential Revision: https://reviews.llvm.org/D124720	2022-05-17 18:19:14 +00:00
Michael Kruse	ff289feeba	[OpenMPIRBuilder] Remove ContinuationBB argument from Body callback. The callback is expected to create a branch to the ContinuationBB (sometimes called FiniBB in some lambdas) argument when finishing. This creates problems: 1. The InsertPoint used for CodeGenIP does not need to be the end of a block. If it is not, a naive callback will insert a branch instruction into the middle of the block. 2. The BasicBlock the CodeGenIP is pointing to may or may not have a terminator. There is an conflict where to branch to if the block already has a terminator. 3. Some API functions work only with block having a terminator. Some workarounds have been used to insert a temporary terminator that is removed again. 4. Some callbacks are sensitive to whether the BasicBlock has a terminator or not. This creates a callback ordering problem where different callback may have different behaviour depending on whether a previous callback created a terminator or not. The problem also exists for FinalizeCallbackTy where some callbacks do create branch to another "continue" block, but unlike BodyGenCallbackTy does not receive the target as argument. This is not addressed in this patch. With this patch, the callback receives an CodeGenIP into a BasicBlock where to insert instructions. If it has to insert control flow, it can split the block at that position as needed but otherwise no separate ContinuationBB is needed. In particular, a callback can be empty without breaking the emitted IR. If the caller needs the control flow to branch to a specific target, it can insert the branch instruction itself and pass an InsertPoint before the terminator to the callback. Certain frontends such as Clang may expect the current IRBuilder position to be at the end of a basic block. In this case its callbacks must split the block at CodeGenIP before setting the IRBuilder position such that the instructions after CodeGenIP are moved to another basic block and before returning create a new branch instruction to the split block. Some utility functions such as `splitBB` are supporting correct splitting of BasicBlocks, independent of whether they have a terminator or not, returning/setting the InsertPoint of an IRBuilder to the end of split predecessor block, and optionally omitting creating a branch to the split successor block to be added later. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D118409	2022-04-26 16:35:01 -05:00
Mehdi Amini	8608ed1441	Apply clang-tidy fixes for llvm-twine-local in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-20 00:39:10 +00:00
Mehdi Amini	2d6335421f	Apply clang-tidy fixes for llvm-qualified-auto in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-19 17:20:57 +00:00
Michael Kruse	2d92ee97f1	Reapply "[OpenMP] Refactor OMPScheduleType enum." This reverts commit af0285122f306573d9bcc4c4ad7f904cfdd4d869. The test "libomp::loop_dispatch.c" on builder openmp-gcc-x86_64-linux-debian fails from time-to-time. See #54969. This patch is unrelated.	2022-04-18 21:56:47 -05:00
Michael Kruse	af0285122f	Revert "[OpenMP] Refactor OMPScheduleType enum." This reverts commit 9ec501da76fc1559cadd6d6dac32766bf4376a3d. It may have caused the openmp-gcc-x86_64-linux-debian buildbot to fail. https://lab.llvm.org/buildbot/#/builders/4/builds/20377	2022-04-18 14:38:31 -05:00

1 2 3

146 Commits