llvm-project

Author	SHA1	Message	Date
Shilei Tian	52b4bec939	[Clang][OpenMP] Emit unroll directive w/o captured stmt (#65862 ) The front end doesn't create captured stmt for unroll directive. This leads to a crash when `-fopenmp-simd` is used, as reported in #63570. Fix #63570.	2023-09-09 18:51:58 -04:00
Sandeep Kosuri	08bbff4aad	[OpenMP] Codegen support for thread_limit on target directive for host offloading - This patch adds support for thread_limit clause on target directive according to OpenMP 51 [2.14.5] - The idea is to create an outer task for target region, when there is a thread_limit clause, and manipulate the thread_limit of task instead. This way, thread_limit will be applied to all the relevant constructs enclosed by the target region. Differential Revision: https://reviews.llvm.org/D152054	2023-08-26 22:18:49 -05:00
Podchishchaeva, Mariya	cc928c6830	[NFC][clang] Fix static analyzer concerns OMPTransformDirectiveScopeRAII doesn't have user-written copy constructor/assignment operator but it frees memory in the destructor. Delete these members since doesn't seem that OMPTransformDirectiveScopeRAII objects are intended for copy. Reviewed By: tahonermann, ABataev Differential Revision: https://reviews.llvm.org/D155849	2023-07-24 05:15:50 -07:00
Michael Halkenhaeuser	7d4e14c76b	[clang][OpenMP] Add interop support for multiple depend clauses This patch removes the constraint of the `interop` directive where only a single `depend` clause was allowed. Differential Revision: https://reviews.llvm.org/D155692	2023-07-20 06:26:19 -04:00
Akash Banerjee	227012cbd7	[OpenMP] Migrate device code privatisation from Clang CodeGen to OMPIRBuilder This patch migrates the UseDevicePtr and UseDeviceAddr clause related code for handling privatisation from Clang codegen to the OMPIRBuilder Depends on D150860 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D152554	2023-07-12 12:03:28 +01:00
Sergio Afonso	63ca93c7d1	[OpenMP][OMPIRBuilder] Rename IsEmbedded and IsTargetCodegen flags This patch renames the `OpenMPIRBuilderConfig` flags to reduce confusion over their meaning. `IsTargetCodegen` becomes `IsGPU`, whereas `IsEmbedded` becomes `IsTargetDevice`. The `-fopenmp-is-device` compiler option is also renamed to `-fopenmp-is-target-device` and the `omp.is_device` MLIR attribute is renamed to `omp.is_target_device`. Getters and setters of all these renamed properties are also updated accordingly. Many unit tests have been updated to use the new names, but an alias for the `-fopenmp-is-device` option is created so that external programs do not stop working after the name change. `IsGPU` is set when the target triple is AMDGCN or NVIDIA PTX, and it is only valid if `IsTargetDevice` is specified as well. `IsTargetDevice` is set by the `-fopenmp-is-target-device` compiler frontend option, which is only added to the OpenMP device invocation for offloading-enabled programs. Differential Revision: https://reviews.llvm.org/D154591	2023-07-10 14:14:16 +01:00
Dave Pagan	eb61bde829	[OpenMP][CodeGen] Add codegen for combined 'loop' directives. The loop directive is a descriptive construct which allows the compiler flexibility in how it generates code for the directive's associated loop(s). See OpenMP specification 5.2 [257:8-9]. Codegen added in this patch for the combined 'loop' directives are: 'target teams loop' -> 'target teams distribute parallel for' 'teams loop' -> 'teams distribute parallel for' 'target parallel loop' -> 'target parallel for' 'parallel loop' -> 'parallel for' NOTE: The implementation of the 'loop' directive itself is unchanged. Differential Revision: https://reviews.llvm.org/D145823	2023-07-05 12:31:59 -05:00
Jennifer Yu	35041a435d	[OPENMP52] Codegen support for doacross clause. Differential Revision: https://reviews.llvm.org/D154180	2023-07-03 15:24:05 -07:00
Youngsuk Kim	5f32baf17d	[clang] Replace uses of CreateElementBitCast (NFC) Partial progress towards replacing uses of CreateElementBitCast, as it no longer does what its name suggests. Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D154229	2023-06-30 17:35:36 -04:00
Zhiheng Xie	08513cbea4	[OpenMP] Fix lvalue reference type generation in untied task loop For variables with lvalue reference type in untied task loop, it now wrongly sets its actual type as ElementType. It should be converted to pointer type. It fixes https://github.com/llvm/llvm-project/issues/62965 Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D153321	2023-06-29 09:11:10 -07:00
Youngsuk Kim	44e63ffe2b	[clang] Replace uses of CGBuilderTy::CreateElementBitCast (NFC) * Add `Address::withElementType()` as a replacement for `CGBuilderTy::CreateElementBitCast`. * Partial progress towards replacing `CreateElementBitCast`, as it no longer does what its name suggests. Either replace its uses with `Address::withElementType()`, or remove them if no longer needed. * Remove unused parameter 'Name' of `CreateElementBitCast` Reviewed By: barannikov88, nikic Differential Revision: https://reviews.llvm.org/D153196	2023-06-18 04:13:15 +03:00
Itay Bookstein	782c59a4ee	[OpenMP] Prefix outlined and reduction func names with original func's name This patch prefixes omp outlined helpers and reduction funcs with the original function's name. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D140722	2023-04-19 23:00:26 +03:00
Itay Bookstein	6fdd13e0ec	Revert "[OpenMP] Prefix outlined and reduction func names with original func's name" This reverts commit 029bfc311d4d7d3cd90be81bb08c046848796d02.	2023-04-19 19:08:49 +03:00
Itay Bookstein	029bfc311d	[OpenMP] Prefix outlined and reduction func names with original func's name This patch attempts to prefix omp outlined helpers and reduction funcs with the original function's name. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D140722	2023-04-19 19:05:21 +03:00
Rafael A. Herrera Guaitero	64549f0903	[OpenMP][5.1] Fix parallel masked is ignored #59939 Code generation support for 'parallel masked' directive. The `EmitOMPParallelMaskedDirective` was implemented. In addition, the appropiate device functions were added. Fix #59939. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D143527	2023-04-03 20:33:55 +00:00
Alexey Bataev	acc30a169e	[OpenMP]Emit captured decls for target data if no devices were specified. If use_device_ptr/use_device_addr clauses are used on target data directive and no device was specified during the compilation, only host part should be emitted. But it still required to emit captured decls for partially mapped data fields. Differential Revision: https://reviews.llvm.org/D144993	2023-02-28 12:18:00 -08:00
Joseph Huber	853d405913	[OpenMP] Ignore implicit casts on assertion for `use_device_ptr` There was an assertion triggering when invoking a captured member whose initializer was in a blase class. This patch fixes it by allowing the assertion on implicit casts to the base class rather than only the base class itself. Fixes https://github.com/llvm/llvm-project/issues/61027 Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D144873	2023-02-27 10:48:20 -06:00
Johannes Doerfert	915602e096	[OpenMP][FIX] Add default clause to switch	2023-01-21 19:50:22 -08:00
Doru Bercea	1407dbeabc	Allow a target loop to be used inside a parallel.	2023-01-20 14:10:43 -06:00
Kazu Hirata	6ad0788c33	[clang] Use std::optional instead of llvm::Optional (NFC) This patch replaces (llvm::\|)Optional< with std::optional<. I'll post a separate patch to remove #include "llvm/ADT/Optional.h". This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-14 12:31:01 -08:00
Kazu Hirata	a1580d7b59	[clang] Add #include <optional> (NFC) This patch adds #include <optional> to those files containing llvm::Optional<...> or Optional<...>. I'll post a separate patch to actually replace llvm::Optional with std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-14 11:07:21 -08:00
Matt Arsenault	8efb8f776a	OpenMP: Use inbounds in EmitOMPAggregateAssign This looked like a plausibly correct out of tree patch. The changed testcases with the pragmas stripped out only use inbounds GEPs so I assume this is correct.	2023-01-12 19:03:10 -05:00
serge-sans-paille	a3c248db87	Move from llvm::makeArrayRef to ArrayRef deduction guides - clang/ part This is a follow-up to https://reviews.llvm.org/D140896, split into several parts as it touches a lot of files. Differential Revision: https://reviews.llvm.org/D141139	2023-01-09 12:15:24 +01:00
Sunil Kuravinakop	e9babe7571	[OpenMP] Clang Support for taskwait nowait clause Support for taskwait nowait clause with placeholder for runtime changes. Reviewed By: cchen, ABataev Differential Revision: https://reviews.llvm.org/D131830	2022-12-20 12:13:56 -06:00
Chi Chun Chen	e0fd86db09	Revert "[OpenMP] Clang Support for taskwait nowait clause" This reverts commit 100dfe7a8ad3789a98df623482b88d9a3a02e176.	2022-12-09 11:06:45 -06:00
Jennifer Yu	af781f7042	[OPENMP51]Codegen for error directive. Added codegen for `omp error` directive. This is to generate IR to call: void __kmpc_error(ident_t loc, int severity, const char message); Differential Revision: https://reviews.llvm.org/D139166	2022-12-08 13:07:08 -08:00
Sunil K	100dfe7a8a	[OpenMP] Clang Support for taskwait nowait clause Support for taskwait nowait clause with placeholder for runtime changes. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D131830	2022-12-08 12:40:44 -08:00
Kazu Hirata	bb666c6930	[CodeGen] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 11:13:43 -08:00
Jennifer Yu	9d90cf2fca	[OPENMP5.1] Initial support for message clause.	2022-11-18 17:59:23 -08:00
Jennifer Yu	1e054e6b52	[OPENMP5.1] Initial support for severity clause Differential Revision:https://reviews.llvm.org/D138227	2022-11-17 16:05:02 -08:00
Jennifer Yu	628fdc3f57	[OPENMP]Initial support for at clause Error directive is allowed in both declared and executable contexts. The function ActOnOpenMPAtClause is called in both places during the parsers. Adding a param "bool InExContext" to identify context which is used to emit error massage. Differential Revision: https://reviews.llvm.org/D137851	2022-11-15 14:06:50 -08:00
Jennifer Yu	ea64e66f7b	[OPENMP]Initial support for error directive. Differential Revision: https://reviews.llvm.org/D137209	2022-11-02 14:25:28 -07:00
Dominik Adamski	ccd314d320	[OpenMP][OMPIRBuilder] Add generation of SIMD align assumptions to OMPIRBuilder Currently generation of align assumptions for OpenMP simd construct is done outside OMPIRBuilder for C code and it is not supported for Fortran. According to OpenMP 5.0 standard (2.9.3) only pointers and arrays can be aligned for C code. If given aligned variable is pointer, then Clang generates the following set of the LLVM IR isntructions to support simd align clause: ; memory allocation for pointer address: %A.addr = alloca ptr, align 8 ; some LLVM IR code ; Alignment instructions (alignment is equal to 32): %0 = load ptr, ptr %A.addr, align 8 call void @llvm.assume(i1 true) [ "align"(ptr %0, i64 32) ] If given aligned variable is array, then Clang generates the following set of the LLVM IR isntructions to support simd align clause: ; memory allocation for array: %B = alloca [10 x i32], align 16 ; some LLVM IR code ; Alignment instructions (alignment is equal to 32): %arraydecay = getelementptr inbounds [10 x i32], ptr %B, i64 0, i64 0 call void @llvm.assume(i1 true) [ "align"(ptr %arraydecay, i64 32) ] OMPIRBuilder was modified to generate aligned assumptions. It generates only llvm.assume calls. Frontend is responsible for generation of aligned pointer and getting the default alignment value if user does not specify it in aligned clause. Unit and regression tests were added to check if aligned clause was handled correctly. Differential Revision: https://reviews.llvm.org/D133578 Reviewed By: jdoerfert	2022-10-18 02:04:18 -05:00
Dominik Adamski	6842d35012	[OpenMP][OMPIRBuilder] Add support for order(concurrent) to OMPIRBuilder for SIMD directive If 'order(concurrent)' clause is specified, then the iterations of SIMD loop can be executed concurrently. This patch adds support for LLVM IR codegen via OMPIRBuilder for SIMD loop with 'order(concurrent)' clause. The functionality added to OMPIRBuilder is similar to the functionality implemented in 'CodeGenFunction::EmitOMPSimdInit'. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D134046 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-10-04 08:30:00 -05:00
Dhruva Chakrabarti	839ac62c50	Revert "[OpenMP] Codegen aggregate for outlined function captures" This reverts commit 7539e9cf811e590d9f12ae39673ca789e26386b4.	2022-09-15 03:08:46 +00:00
Giorgis Georgakoudis	7539e9cf81	[OpenMP] Codegen aggregate for outlined function captures Parallel regions are outlined as functions with capture variables explicitly generated as distinct parameters in the function's argument list. That complicates the fork_call interface in the OpenMP runtime: (1) the fork_call is variadic since there is a variable number of arguments to forward to the outlined function, (2) wrapping/unwrapping arguments happens in the OpenMP runtime, which is sub-optimal, has been a source of ABI bugs, and has a hardcoded limit (16) in the number of arguments, (3) forwarded arguments must cast to pointer types, which complicates debugging. This patch avoids those issues by aggregating captured arguments in a struct to pass to the fork_call. Reviewed By: jdoerfert, jhuber6, ABataev Differential Revision: https://reviews.llvm.org/D102107	2022-09-15 00:54:05 +00:00
utsumi	2e2caea37f	[Clang][OpenMP] Make copyin clause on combined and composite construct work (patch by Yuichiro Utsumi (utsumi.yuichiro@fujitsu.com)) Make copyin clause on the following constructs work. - parallel for - parallel for simd - parallel sections Fixes https://github.com/llvm/llvm-project/issues/55547 Patch by Yuichiro Utsumi (utsumi.yuichiro@fujitsu.com) Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D132209	2022-08-23 07:58:35 -07:00
Prabhdeep Singh Soni	bce94ea551	[OMPIRBuilder] Add support for safelen clause This patch adds OMPIRBuilder support for the safelen clause for the simd directive. Reviewed By: shraiysh, Meinersbur Differential Revision: https://reviews.llvm.org/D131526	2022-08-18 15:43:08 -04:00
Shilei Tian	e21202dac1	[Clang][OpenMP] Fix the issue that `llvm.lifetime.end` is emitted too early for variables captured in linear clause Currently if an OpenMP program uses `linear` clause, and is compiled with optimization, `llvm.lifetime.end` for variables listed in `linear` clause are emitted too early such that there could still be uses after that. Let's take the following code as example: ``` // loop.c int j; int u; void loop(int n) { int i; for (i = 0; i < n; ++i) { ++j; u = &j; } } ``` We compile using the command: ``` clang -cc1 -fopenmp-simd -O3 -x c -triple x86_64-apple-darwin10 -emit-llvm loop.c -o loop.ll ``` The following IR (simplified) will be generated: ``` @j = local_unnamed_addr global i32 0, align 4 @u = local_unnamed_addr global ptr null, align 8 define void @loop(i32 noundef %n) local_unnamed_addr { entry: %j = alloca i32, align 4 %cmp = icmp sgt i32 %n, 0 br i1 %cmp, label %simd.if.then, label %simd.if.end simd.if.then: ; preds = %entry call void @llvm.lifetime.start.p0(i64 4, ptr nonnull %j) store ptr %j, ptr @u, align 8 call void @llvm.lifetime.end.p0(i64 4, ptr nonnull %j) %0 = load i32, ptr %j, align 4 store i32 %0, ptr @j, align 4 br label %simd.if.end simd.if.end: ; preds = %simd.if.then, %entry ret void } ``` The most important part is: ``` call void @llvm.lifetime.end.p0(i64 4, ptr nonnull %j) %0 = load i32, ptr %j, align 4 store i32 %0, ptr @j, align 4 ``` `%j` is still loaded after `@llvm.lifetime.end.p0(i64 4, ptr nonnull %j)`. This could cause the backend incorrectly optimizes the code and further generates incorrect code. The root cause is, when we emit a construct that could have `linear` clause, it usually has the following pattern: ``` EmitOMPLinearClauseInit(S) { OMPPrivateScope LoopScope(this); ... EmitOMPLinearClause(S, LoopScope); ... (void)LoopScope.Privatize(); ... } EmitOMPLinearClauseFinal(S, [](CodeGenFunction &) { return nullptr; }); ``` Variables that need to be privatized are added into `LoopScope`, which also serves as a RAII object. When `LoopScope` is destructed and if optimization is enabled, a `@llvm.lifetime.end` is also emitted for each privatized variable. However, the writing back to original variables in `linear` clause happens after the scope in `EmitOMPLinearClauseFinal`, causing the issue we see above. A quick "fix" seems to be, moving `EmitOMPLinearClauseFinal` inside the scope. However, it doesn't work. That's because the local variable map has been updated by `LoopScope` such that a variable declaration is mapped to the privatized variable, instead of the actual one. In that way, the following code will be generated: ``` %0 = load i32, ptr %j, align 4 store i32 %0, ptr %j, align 4 call void @llvm.lifetime.end.p0(i64 4, ptr nonnull %j) ``` Well, now the life time is correct, but apparently the writing back is broken. In this patch, a new function `OMPPrivateScope::restoreMap` is added and called before calling `EmitOMPLinearClauseFinal`. This can make sure that `EmitOMPLinearClauseFinal` can find the orignal varaibls to write back. Fixes #56913. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D131272	2022-08-06 16:50:37 -04:00
Dominik Adamski	d90b7bf2c5	Add support for lowering simd if clause to LLVM IR Scope of changes: 1) Added new function to generate loop versioning 2) Added support for if clause to applySimd function 2) Added tests which confirm that lowering is successful If ifCond is specified, then collapsed loop is duplicated and if branch is added. Duplicated loop is executed if simd ifCond is evaluated to false. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D129368 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-08-01 04:43:32 -05:00
Shraiysh Vaishay	61fa7a88c7	[clang][OpenMP] Add IRBuilder support for taskgroup This patch makes use of OMPIRBuilder support for codegen of taskgroup construct in clang. Depends on D128203 Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D129992	2022-07-21 11:13:57 +05:30
Prabhdeep Singh Soni	ac892c70a4	[OMPIRBuilder] Add support for simdlen clause This patch adds OMPIRBuilder support for the simdlen clause for the simd directive. It uses the simdlen support in OpenMPIRBuilder when it is enabled in Clang. Simdlen is lowered by OpenMPIRBuilder by generating the loop.vectorize.width metadata. Reviewed By: jdoerfert, Meinersbur Differential Revision: https://reviews.llvm.org/D129149	2022-07-11 13:29:06 -04:00
Shilei Tian	83837a6198	[Clang][OpenMP] Enable floating-point operation for `atomic compare` series D127041 introduced the support for `fmax` and `fmin` such that we can also reprent `atomic compare` and `atomic compare capture` with `atomicrmw` instruction. This patch simply lifts the limitation we set before. Depend on D127041. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D127042	2022-07-06 13:05:11 -04:00
Ritanya B Bharadwaj	8322fe200d	Adding support for target in_reduction Implementing target in_reduction by wrapping target task with host task with in_reduction and if clause. This is in compliance with OpenMP 5.0 section: 2.19.5.6. So, this ``` for (int i=0; i<N; i++) { res = res+i } ``` will become ``` #pragma omp task in_reduction(+:res) if(0) #pragma omp target map(res) for (int i=0; i<N; i++) { res = res+i } ``` Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D125669	2022-06-27 10:36:46 -05:00
Kazu Hirata	452db157c9	[clang] Don't use Optional::hasValue (NFC)	2022-06-20 10:51:34 -07:00
Shilei Tian	c4a90db720	[Clang][OpenMP] Add the codegen support for `atomic compare capture` This patch adds the codegen support for `atomic compare capture` in clang. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120290	2022-06-02 21:38:21 -04:00
Shilei Tian	3a96256b7e	[Clang][OpenMP] Avoid using `IgnoreImpCasts` if possible This patch removes all `IgnoreImpCasts` in Sema, and only uses it if necessary. If the expression is not of the same type as the pointer value, a cast is inserted. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D126602	2022-06-02 17:45:02 -04:00
Shilei Tian	eb673be5ac	[OMPIRBuilder] Add the support for compare capture This patch adds the support for `compare capture` in `OMPIRBuilder`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D120007	2022-06-01 19:53:43 -04:00
Joel E. Denny	d2e3cb7374	[OpenMP][Clang] Fix atomic compare for signed vs. unsigned Without this patch, arguments to the `llvm::OpenMPIRBuilder::AtomicOpValue` initializer are reversed. Reviewed By: ABataev, tianshilei1992 Differential Revision: https://reviews.llvm.org/D126619	2022-05-30 11:02:20 -04:00
Aaron Ballman	9368bf9023	Removing this as part of the revert done in 69da3b6aead2e7a18a2578aad661d6d36b8d30cf This appears to have been added in a follow-up commit that I missed.	2022-05-25 13:45:17 -04:00

1 2 3 4 5 ...

721 Commits