llvm-project

Author	SHA1	Message	Date
Sergio Afonso	18dd299fb1	[Flang][MLIR][OpenMP] Host-evaluation of omp.loop bounds (#133908 ) This patch updates Flang lowering and kernel flags identification in MLIR so that loop bounds on `target teams loop` constructs are evaluated on the host, making the trip count available to the corresponding `__tgt_target_kernel` call emitted for the target region. This is necessary in order to properly execute these constructs as `target teams distribute parallel do`. Co-authored-by: Kareem Ergawy <kareem.ergawy@amd.com>	2025-04-03 15:06:19 +01:00
Tom Eccles	e17d864f55	[flang][OpenMP][Lower] lower array subscripts for task depend (#132994 ) The OpenMP standard says that all dependencies in the same set of inter-dependent tasks must be non-overlapping. This simplification means that the OpenMP only needs to keep track of the base addresses of dependency variables. This can be seen in kmp_taskdeps.cpp, which stores task dependency information in a hash table, using the base address as a key. This patch generates a rebox operation to slice boxed arrays, but only the box data address is used for the task dependency. The extra box is optimized away by LLVM at O3. Vector subscripts are TODO (I will address in my next patch). This also fixes a bug for ordinary subscripts when the symbol was mapped to a box: Fixes #132647	2025-04-01 10:26:14 +01:00
swatheesh-mcw	fe30cf18ab	Revert "Revert "[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses."" (#132343 ) Reverts llvm/llvm-project#132005	2025-03-28 15:21:52 +00:00
Krzysztof Parzyszek	68180d8d16	[flang][OpenMP] Use OmpDirectiveSpecification in standalone directives (#131163 ) This uses OmpDirectiveSpecification in the rest of the standalone directives.	2025-03-20 06:50:43 -05:00
Sergio Afonso	ac9e4e9b33	[Flang][OpenMP] Simplify entry block creation for BlockArgOpenMPOpInterface ops, NFC (#132036 ) This patch adds the `OpWithBodyGenInfo::blockArgs` field and updates `createBodyOfOp()` to prevent the need for `BlockArgOpenMPOpInterface` operations to pass the same callback, minimizing chances of introducing inconsistent behavior.	2025-03-19 17:29:40 +00:00
Krzysztof Parzyszek	cd26dd5595	[flang][OpenMP] Use OmpDirectiveSpecification in simple directives (#131162 ) The `OmpDirectiveSpecification` contains directive name, the list of arguments, and the list of clauses. It was introduced to store the directive specification in METADIRECTIVE, and could be reused everywhere a directive representation is needed. In the long term this would unify the handling of common directive properties, as well as creating actual constructs from METADIRECTIVE by linking the contained directive specification with any associated user code.	2025-03-19 11:34:40 -05:00
Kiran Chandramohan	96b112fb61	Revert "[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses." (#132005 ) Reverts llvm/llvm-project#120584 Reverting due to CI failure https://lab.llvm.org/buildbot/#/builders/157/builds/22946	2025-03-19 11:13:52 +00:00
swatheesh-mcw	ee8a759bfb	[flang][openmp] Adds Parser and Semantic Support for Interop Construct, and Init and Use Clauses. (#120584 ) Adds Parser and Semantic Support for the below construct and clauses: - Interop Construct - Init Clause - Use Clause Note: The other clauses supported by Interop Construct such as Destroy, Use, Depend and Device are added already.	2025-03-19 10:49:17 +00:00
Tom Eccles	e7c6e3557b	[flang][OpenMP] Fix threadprivate pointer variable in common block (#131888 ) Fixes #112538 The problem was that the host associated symbol for the threadprivate variable doesn't have all of the symbol attributes (e.g. POINTER). This caused the lowering code to generate the wrong type, eventually hitting an assertion.	2025-03-19 10:12:52 +00:00
Akash Banerjee	cbc5c11fec	[MLIR][OpenMP] Add Lowering support for implicitly linking to default declare mappers (#131006 )	2025-03-18 13:17:10 +00:00
Kareem Ergawy	83658ddb1b	[flang][OpenMP] Enable delayed privatization by default for `omp.distribute` (#131574 ) Switches delayed privatization for `omp.distribute` to be on by default: controlled by the `-openmp-enable-delayed-privatization` instead of by `-openmp-enable-delayed-privatization-staging`. ### GFortran & Fujitsu test suite results: #### gfotran test-suite (this PR): ``` Testing Time: 34.51s Passed: 6569 ``` #### Fujitsu without changes (commit: 0813c5cf5f52): ``` Testing Time: 155.39s Passed : 88325 Failed : 156 Executable Missing: 408 ``` #### Fujitsu with changes (this PR): ``` Testing Time: 158.54s Passed : 88325 Failed : 156 Executable Missing: 408 ```	2025-03-18 14:07:41 +01:00
Krzysztof Parzyszek	f4fc2d731c	[flang][OpenMP] Map ByRef if size/alignment exceed that of a pointer (#130832 ) Improve the check for whether a type can be passed by copy. Currently, passing by copy is done via the OMP_MAP_LITERAL mapping, which can only transfer as much data as can be contained in a pointer representation.	2025-03-12 19:41:11 -05:00
Krzysztof Parzyszek	d67947162f	[flang][OpenMP] Implement HAS_DEVICE_ADDR clause (#128568 ) The HAS_DEVICE_ADDR indicates that the object(s) listed exists at an address that is a valid device address. Specifically, `has_device_addr(x)` means that (in C/C++ terms) `&x` is a device address. When entering a target region, `x` does not need to be allocated on the device, or have its contents copied over (in the absence of additional mapping clauses). Passing its address verbatim to the region for use is sufficient, and is the intended goal of the clause. Some Fortran objects use descriptors in their in-memory representation. If `x` had a descriptor, both the descriptor and the contents of `x` would be located in the device memory. However, the descriptors are managed by the compiler, and can be regenerated at various points as needed. The address of the effective descriptor may change, hence it's not safe to pass the address of the descriptor to the target region. Instead, the descriptor itself is always copied, but for objects like `x`, no further mapping takes place (as this keeps the storage pointer in the descriptor unchanged). --------- Co-authored-by: Sergio Afonso <safonsof@amd.com>	2025-03-10 08:11:01 -05:00
Kareem Ergawy	9543e9e927	[flang][OpenMP] Handle pre-detemined `lastprivate` for `simd` (#129507 ) This PR tries to fix `lastprivate` update issues in composite constructs. In particular, pre-determined `lastprivate` symbols are attached to the wrong leaf of the composite construct (the outermost one). When using delayed privatization (should be the default mode in the future), this results in trying to update the `lastprivate` symbol in the wrong construct (outside the `omp.loop_nest` op). For example, given the following input: ```fortran !$omp target teams distribute parallel do simd collapse(2) private(y_max) do i=x_min,x_max do j=y_min,y_max enddo enddo ``` Without the fixes introduced in this PR, the `DataSharingProcessor` tries to generate the `lastprivate` update ops in the `parallel` op since this is the op for which the DSP instance is created. The fix consists of 2 main parts: 1. Instead of creating a single DSP instance, one instance is created for the leaf constructs that might need privatization (whether for explicit, implicit, or pre-determined symbols). 2. When generating the `lastprivate` comparison ops, we don't directly use the SSA values of the UBs and steps. Instead, we regenerated these SSA values from the original loop bounds' expressions. We have to do this to avoid using `host_eval` values in the `lastprivate` comparison logic which is illegal.	2025-03-07 05:44:39 +01:00
Kiran Chandramohan	e2911aa2c2	[Flang][OpenMP] Fix crash when loop index var is pointer or allocatable (#129717 ) Use hlfir dereferencing for pointers and allocatables and use hlfir assign. Also, change the code updating IV in lastprivate. Note: This is a small change. Modifications in existing tests are changes from fir.store to hlfir.assign. Fixes #121290	2025-03-06 12:19:34 +00:00
Krzysztof Parzyszek	9573c62114	[flang][OpenMP] Accept modern syntax of FLUSH construct (#128975 ) The syntax with the object list following the memory-order clause has been removed in OpenMP 5.2. Still, accept that syntax with versions >= 5.2, but treat it as deprecated (and emit a warning).	2025-03-03 07:59:19 -06:00
Mats Petersson	24b7759a9d	[FLANG][OpenMP]Add frontend support for ASSUME and ASSUMES (#120770 ) Enough suport to parse correctly formed directives of !$OMP ASSUME and !$OMP ASSUMES with teh related clauses that go with them: ABSENT, CONTAINS, NO_OPENPP, NO_OPENMP_ROUTINES, NO_PARALLELISM and HOLDS. Tests added for unparsing and dump parse-tree. Semantics support is very minimal and no specific tests added. The lowering will hit a TODO, and there are tests in Lower/OpenMP/Todo to make it clear that this is currently expected behaviour. --------- Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com> Co-authored-by: Krzysztof Parzyszek <Krzysztof.Parzyszek@amd.com>	2025-02-25 17:36:25 +00:00
Sergio Afonso	25c19eb117	[Flang][OpenMP] Allow host evaluation of loop bounds for distribute (#127822 ) This patch adds `target teams distribute [simd]` and equivalent construct nests to the list of cases where loop bounds can be evaluated in the host, as they represent kernels for which the trip count must also be evaluated in advance to the kernel call.	2025-02-25 10:35:21 +00:00
Akash Banerjee	9905728e2f	[MLIR][OpenMP] Add Lowering support for OpenMP Declare Mapper directive (#117046 ) This patch adds HLFIR/FIR lowering support for OpenMP Declare Mapper directive. Depends on #117045.	2025-02-18 16:36:01 +00:00
Kareem Ergawy	dcb124e820	[flang][OpenMP] Enable delayed privatization by default `omp.wsloop` (#125732 ) Reapplies #122471 This is based on https://github.com/llvm/llvm-project/pull/125699, only the latest commit is relevant. With changes in this PR and the parent one, the previously reported failures in the Fujitsu() test suite should hopefully be resolved (I verified all the 14 reported failures and they pass now). () https://linaro.atlassian.net/browse/LLVM-1521	2025-02-06 19:11:04 +01:00
Michael Kruse	b815a3942a	[Flang] Move non-common headers to FortranSupport (#124416 ) Move non-common files from FortranCommon to FortranSupport (analogous to LLVMSupport) such that * declarations and definitions that are only used by the Flang compiler, but not by the runtime, are moved to FortranSupport * declarations and definitions that are used by both ("common"), the compiler and the runtime, remain in FortranCommon * generic STL-like/ADT/utility classes and algorithms remain in FortranCommon This allows a for cleaner separation between compiler and runtime components, which are compiled differently. For instance, runtime sources must not use STL's `<optional>` which causes problems with CUDA support. Instead, the surrogate header `flang/Common/optional.h` must be used. This PR fixes this for `fast-int-sel.h`. Declarations in include/Runtime are also used by both, but are header-only. `ISO_Fortran_binding_wrapper.h`, a header used by compiler and runtime, is also moved into FortranCommon.	2025-02-06 15:29:10 +01:00
Anchu Rajendran S	ccd92ec4c6	[flang][openmp] Changes for invoking scan Op (#123254 )	2025-02-05 06:55:32 -08:00
Leandro Lupori	6fc66d322b	[flang][OpenMP] Fix sections lastprivate for common blocks (#125504 ) Common block handling was missing in sections' lastprivate lowering. Fixes #121719	2025-02-04 10:28:14 -03:00
Krzysztof Parzyszek	6dfe20dbbd	[flang][OpenMP] Parse METADIRECTIVE in specification part (#123397 ) Add METADIRECTIVE to the OpenMP declarative constructs as well. Emit a TODO error for both declarative and executable cases.	2025-02-03 11:13:44 -06:00
Krzysztof Parzyszek	15ab7be2e0	[flang][OpenMP] Parse WHEN, OTHERWISE, MATCH clauses plus METADIRECTIVE (#121817 ) Parse METADIRECTIVE as a standalone executable directive at the moment. This will allow testing the parser code. There is no lowering, not even clause conversion yet. There is also no verification of the allowed values for trait sets, trait properties.	2025-01-29 15:07:20 -06:00
Mats Petersson	8035d38daa	[Flang][OpenMP]Add parsing support for DISPATCH construct (#121982 ) This allows the Flang parser to accept the !$OMP DISPATCH and related clauses. Lowering is currently not implemented. Tests for unparse and parse-tree dump is provided, and one for checking that the lowering ends in a "not yet implemented" --------- Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com>	2025-01-26 09:44:04 +00:00
Kareem Ergawy	937cbce14c	Revert "[flang][OpenMP] Enable delayed privatization by default `omp.wsloop` (#122471 )" (#123324 ) This seems to have caused some regressions in Fujitsu's test-suite: https://linaro.atlassian.net/browse/LLVM-1521 This reverts commit 6f82408bb53f57a859953d8f1114f1634a5d3ee9.	2025-01-22 10:16:40 +01:00
jeanPerier	662133a278	[flang][OpenMP][OpenACC] remove libEvaluate dependency in passes (#123784 ) Move OpenACC/OpenMP helpers from Lower/DirectivesCommon.h that are also used in OpenACC and OpenMP mlir passes into a new Optimizer/Builder/DirectivesCommon.h so that parser and evaluate headers are not included in Optimizer libraries (this both introduce compile-time and link-time pointless overheads). This should fix https://github.com/llvm/llvm-project/issues/123377	2025-01-21 20:32:42 +01:00
Thirumalai Shaktivel	c2aa11d148	[Flang] Add LLVM lowering support for UNTIED clause in Task (#121052 ) Implementation details: The UNTIED clause is recognized by setting the flag=0 for the default case or performing logical OR to flag if other clauses are specified, and this flag is passed as an argument to the `__kmpc_omp_task_alloc` runtime call. Resubmitting the PR with fix for the failure, as it was reverted here: 927a70daf31b1610627f346b0dc140eda72144b9 and previously merged here: https://github.com/llvm/llvm-project/pull/115283	2025-01-21 09:10:25 +05:30
Kareem Ergawy	a0406ce823	[flang][OpenMP] Add `hostIsSource` paramemter to `copyHostAssociateVar` (#123162 ) This fixes a bug when the same variable is used in `firstprivate` and `lastprivate` clauses on the same construct. The issue boils down to the fact that `copyHostAssociateVar` was deciding the direction of the copy assignment (i.e. the `lhs` and `rhs`) based on whether the `copyAssignIP` parameter is set. This is not the best way to do it since it is not related to whether we doing a copy from host to localized copy or the other way around. When we set the insertion for `firstprivate` in delayed privatization, this resulted in switching the direction of the copy assignment. Instead, this PR adds a new paramter to explicitely tell the function the direction of the assignment. This is a follow up PR for https://github.com/llvm/llvm-project/pull/122471, only the latest commit is relevant.	2025-01-16 19:10:12 +01:00
Kareem Ergawy	6f82408bb5	[flang][OpenMP] Enable delayed privatization by default `omp.wsloop` (#122471 ) This enable delayed privatization by default for `omp.wsloop` ops, with one caveat! I had to workaround the "impure" alloc region issue that being resolved at the moment. The workaround detects whether the alloc region's argument is used in the region and at the same time defined in block that does not dominate the chosen alloca insertion point. If so, we move the alloca insertion point below the defining instruction of the alloc region argument. This basically reverts to the non-delayed-privatizaiton behavior.	2025-01-16 15:44:59 +01:00
Kazu Hirata	0d150817c3	[flang] Fix a warning This patch fixes: flang/lib/Lower/OpenMP/OpenMP.cpp:599:15: error: unused variable 'ompEval' [-Werror,-Wunused-variable]	2025-01-14 11:08:53 -08:00
Sergio Afonso	8fe11a26ae	[Flang][OpenMP] Lowering of host-evaluated clauses (#116219 ) This patch adds support for lowering OpenMP clauses and expressions attached to constructs nested inside of a target region that need to be evaluated in the host device. This is done through the use of the `OpenMP_HostEvalClause` `omp.target` set of operands and entry block arguments. When lowering clauses for a target construct, a more involved `processHostEvalClauses()` function is called, which looks at the current and potentially other nested constructs in order to find and lower clauses that need to be processed outside of the `omp.target` operation under construction. This populates an instance of a global structure with the resulting MLIR values. The resulting list of host-evaluated values is used to initialize the `host_eval` operands when constructing the `omp.target` operation, and then replaced with the corresponding block arguments after creating that operation's region. Afterwards, while lowering nested operations, those that might potentially be evaluated on the host (i.e. `num_teams`, `thread_limit`, `num_threads` and `collapse`) check first whether there is an active global host-evaluated information structure and whether it holds values referring to these clauses. If that is the case, the stored values (`omp.target` entry block arguments at that stage) are used instead of lowering these clauses again.	2025-01-14 13:55:17 +00:00
Sergio Afonso	82b9eb1086	[Flang][OpenMP] Support teams reductions lowering (#122683 ) This patch adds PFT to MLIR lowering of teams reductions. Since there is still no MLIR to LLVM IR translation implemented, compilation of programs including these constructs will still trigger not-yet-implemented errors.	2025-01-13 12:31:29 +00:00
Kareem Ergawy	42da12063f	[flang][OpenMP] Extend delayed privatization for `omp.simd` (#122156 ) Adds support for delayed privatization for `simd` directives. This PR includes PFT down to LLVM IR lowering.	2025-01-12 07:46:58 +01:00
agozillon	5137c209f0	[Flang][OpenMP] Fix allocating arrays with size intrinisic (#119226 ) Attempt to address the following example from causing an assert or ICE: ``` subroutine test(a) implicit none integer :: i real(kind=real64), dimension(:) :: a real(kind=real64), dimension(size(a, 1)) :: b !$omp target map(tofrom: b) do i = 1, 10 b(i) = i end do !$omp end target end subroutine ``` Where we utilise a Fortran intrinsic (size) to calculate the size of allocatable arrays and then map it to device.	2025-01-03 16:46:15 +01:00
Krzysztof Parzyszek	adeff9f63a	[flang][OpenMP] Allow utility constructs in specification part (#121509 ) Allow utility constructs (error and nothing) to appear in the specification part as well as the execution part. The exception is "ERROR AT(EXECUTION)" which should only be in the execution part. In case of ambiguity (the boundary between the specification and the execution part), utility constructs will be parsed as belonging to the specification part. In such cases move them to the execution part in the OpenMP canonicalization code.	2025-01-03 09:21:36 -06:00
Krzysztof Parzyszek	df859f90aa	[flang][OpenMP] Frontend support for NOTHING directive (#120606 ) Create OpenMPUtilityConstruct and put the two utility directives in it (error and nothing). Rename OpenMPErrorConstruct to OmpErrorDirective.	2025-01-03 08:36:34 -06:00
Muhammad Omair Javaid	927a70daf3	Revert "[Flang OpenMP] Add LLVM translation support for UNTIED in Task (#115283 )" This reverts commit 919aead1db64b2f1444842bc75a3af7836238671. It breaks following LLVM bots: https://lab.llvm.org/buildbot/#/builders/199 https://lab.llvm.org/buildbot/#/builders/143 https://lab.llvm.org/buildbot/#/builders/17	2024-12-24 01:47:24 +05:00
Thirumalai Shaktivel	919aead1db	[Flang OpenMP] Add LLVM translation support for UNTIED in Task (#115283 ) Implementation details: The UNTIED clause is recognized by setting the flag=0 for the default case or performing logical OR to flag if other clauses are specified, and this flag is passed as an argument to the `__kmpc_omp_task_alloc` runtime call.	2024-12-20 16:36:51 +05:30
Kareem Ergawy	e532241b02	Re-apply (#117867 ): [flang][OpenMP] Implicitly map allocatable record fields (#120374 ) This re-applies #117867 with a small fix that hopefully prevents build bot failures. The fix is avoiding `dyn_cast` for the result of `getOperation()`. Instead we can assign the result to `mlir::ModuleOp` directly since the type of the operation is known statically (`OpT` in `OperationPass`).	2024-12-18 09:19:45 +01:00
Kareem Ergawy	dc936f3c19	Revert "[flang][OpenMP] Implicitly map allocatable record fields (#117867 )" (#120360 )	2024-12-18 06:52:24 +01:00
Kareem Ergawy	db09014a07	[flang][OpenMP] Implicitly map allocatable record fields (#117867 ) This is a starting PR to implicitly map allocatable record fields. This PR contains the following changes: 1. Re-purposes some of the utils used in `Lower/OpenMP.cpp` so that these utils work on the `mlir::Value` level rather than the `semantics::Symbol` level. This takes one step towards to enabling MLIR passes to more easily do some lowering themselves (e.g. creating `omp.map.bounds` ops for implicitely caputured data like this PR does). 2. Adds support for implicitely capturing and mapping allocatable fields in record types. There is quite some distant to still cover to have full support for this. I added a number of todos to guide further development. Co-authored-by: Andrew Gozillon <andrew.gozillon@amd.com> Co-authored-by: Andrew Gozillon <andrew.gozillon@amd.com>	2024-12-18 05:37:58 +01:00
Mats Petersson	75e6d0eb4d	[flang][OpenMP]Add support for OpenMP ERROR directive (#119582 ) Lowering leads to a TODO, with a test to confirm. Also testing unparse. --------- Co-authored-by: Krzysztof Parzyszek <Krzysztof.Parzyszek@amd.com>	2024-12-13 14:05:48 +00:00
Ivan R. Ivanov	7c9404c279	[flang][OpenMP] Add frontend support for ompx_bare clause (#111106 )	2024-12-13 21:44:43 +09:00
Leandro Lupori	db9856b516	[flang][OpenMP][NFC] Turn symTable into a reference (#119435 ) Convert `DataSharingProcessor::symTable` from pointer to reference. This avoids accidental null pointer dereferences and makes it possible to use `symTable` when delayed privatization is disabled.	2024-12-11 16:26:19 -03:00
NimishMishra	edc50f3954	[flang][OpenMP] Add lowering support for task detach (#119128 ) This PR adds lowering task detach to MLIR.	2024-12-10 03:25:06 -08:00
Mats Petersson	03b5f8f0f0	[flang][OpenMP]Add parsing and semantics support for ATOMIC COMPARE (#117032 ) This adds a minimalistic implementation of parsing and semantics for the ATOMIC COMPARE feature from OpenMP 5.1. There is no lowering, just a TODO for that part. Some of the Semantics is also just a comment explaining that more is needed.	2024-12-02 15:05:21 +00:00
Kareem Ergawy	94488445cd	[flang][MLIR] Support delayed privatization for `wsloop` (PFT -> MLIR) (#118271 ) Adds PFT to MLIR lowering for delayed privatization of `omp.wsloop` ops. Lowering to LLVM IR will be added in a later PR.	2024-12-02 15:01:09 +01:00
Kareem Ergawy	81f544d465	[flang][OpenMP] Rewrite `omp.loop` to semantically equivalent ops (#115443 ) Introduces a new conversion pass that rewrites `omp.loop` ops to their semantically equivalent op nests bases on the surrounding/binding context of the `loop` op. Not all forms of `omp.loop` are supported yet. See `isLoopConversionSupported` for more info on which forms are supported.	2024-11-28 05:15:06 +01:00

1 2 3 4

172 Commits