llvm-project

Author	SHA1	Message	Date
Johannes Doerfert	95ea3e86bb	[OpenMP] Regenerate the check lines for 2 tests Somehow those check lines were mostly untested prefixes and the ones we were looking for have been removed. Simple cleanup.	2022-03-29 10:00:03 -05:00
Jake Egan	f5a9b5cc12	[NFC][tests][AIX] XFAIL test for lack of visibility support With the addition of `__attribute__((visibility("hidden")))` to the test, the test fails because AIX's current default behaviour is to ignore hidden visibility, so the expected error is not seen. This patch marks the test `XFAIL` on AIX for now. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D122519	2022-03-28 09:43:48 -04:00
Joseph Huber	392bb8cf1f	[OpenMP] Fix AMDGPU globals test	2022-03-25 23:05:41 -04:00
Joseph Huber	9d3550c517	[OpenMP] Add AMDGPU calling convention to ctor / dtor functions This patch adds the necessary AMDGPU calling convention to the ctor / dtor kernels. These are fundamentally device kenels called by the host on image load. Without this calling convention information the AMDGPU plugin is unable to identify them. Depends on D122504 Fixes #54091 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122515	2022-03-25 22:44:20 -04:00
Joseph Huber	3c6d32ec6c	[OpenMP] Make Ctor / Dtor functions have external visibility The default construction of constructor functions by LLVM tends to make them have internal linkage. When we call a ctor / dtor function in the target region we are actually creating a kernel that is called at registration. Because the ctor is a kernel we need to make sure it's externally visible so we can actually call it. This prevented AMDGPU from correctly using constructors while NVPTX could use them simply because it ignored internal visibility. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D122504	2022-03-25 22:44:17 -04:00
Joseph Huber	b9f67d44ba	[OpenMP] Replace device kernel linkage with weak_odr Currently the device kernels all have weak linkage to prevent linkage errors on multiple defintions. However, this prevents some optimizations from adequately analyzing them because of the nature of weak linkage. This patch replaces the weak linkage with weak_odr linkage so we can statically assert that multiple declarations of the same kernel will have the same definition. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122443	2022-03-25 11:29:15 -04:00
Joseph Huber	bfda79341b	[OpenMP] Add a semantic check for updating hidden or internal values A previous patch removed the compiler generating offloading entries for variables that were declared on the device but were internal or hidden. This allowed us to compile programs but turns any attempt to run '#pragma omp target update' on one of those variables a silent failure. This patch adds a check in the semantic analysis for if the user is attempting the update a variable on the device from the host that is not externally visible. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122403	2022-03-24 19:38:30 -04:00
Jennifer Yu	a6cdac48ff	Eliminate extra set of simd variant function attribute. Current clang generates extra set of simd variant function attribute with extra 'v' encoding. For example: _ZGVbN2v__Z5add_1Pf vs _ZGVbN2vv__Z5add_1Pf The problem is due to declaration of ParamAttrs following: llvm::SmallVector<ParamAttrTy, 8> ParamAttrs(ParamPositions.size()); where ParamPositions.size() is grown after following assignment: Pos = ParamPositions[PVD]; So the PVD is not find in ParamPositions. The problem is ParamPositions need to set for each FD decl. To fix this Move ParamPositions's init inside while loop for each FD. Differential Revision: https://reviews.llvm.org/D122338	2022-03-24 13:27:28 -07:00
Mike Rice	f82ec5532b	[OpenMP] Initial parsing/sema for the 'omp target parallel loop' construct Adds basic parsing/sema/serialization support for the #pragma omp target parallel loop directive. Differential Revision: https://reviews.llvm.org/D122359	2022-03-24 09:19:00 -07:00
Joseph Huber	0d16c23af1	[OpenMP] Do not create offloading entries for internal or hidden symbols Currently we create offloading entries to register device variables with the host. When we register a variable we will look up the symbol in the device image and map the device address to the host address. This is a problem when the symbol is declared with hidden visibility or internal linkage. This means the symbol is not accessible externally and we cannot get its address. We should still allow static variables to be declared on the device, but ew should not create an offloading entry for them so they exist independently on the host and device. Fixes #54309 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D122352	2022-03-23 18:27:16 -04:00
Nikita Popov	aaf2bccf1f	[CodeGen][OpenMP] Add alignment to test (NFC) Check which alignments are generated for loads/stores.	2022-03-23 12:01:00 +01:00
Nikita Popov	47eb4f7dcd	[CGOpenMPRuntime] Specify correct type in EmitLoadOfPointerLValue() Perform a bitcast first, so we can specify the correct pointer type inf EmitLoadOfPointerLValue(), rather than using a dummy void pointer.	2022-03-23 11:51:14 +01:00
Nikita Popov	a451a29127	[CodeGen][OpenMP] Add alignment to test (NFC) Check which alignments are generated for loads and stores.	2022-03-23 10:28:04 +01:00
Mike Rice	2cedaee6f7	[OpenMP] Initial parsing/sema for the 'omp parallel loop' construct Adds basic parsing/sema/serialization support for the #pragma omp parallel loop directive. Differential Revision: https://reviews.llvm.org/D122247	2022-03-22 13:55:47 -07:00
Nikita Popov	a9656bd1bc	[CodeGen][OpenMP] Make EmitLoadOfPointer() type consistent If necessary insert a bitcast beforehand, so the LLVM-level pointer type and the Clang-level pointer type line up.	2022-03-22 09:37:48 +01:00
Tom Honermann	059a953d88	[clang] [OpenMP] Diagnose use of 'target_clones' in OpenMP variant declarations. Previously, OpenMP variant declarations for a function declaration that included the 'cpu_dispatch', 'cpu_specific', or 'target' attributes was diagnosed, but one with the 'target_clones' attribute was not. Now fixed. Reviewed By: erichkeane, jdoerfert Differential Revision: https://reviews.llvm.org/D121963	2022-03-21 13:39:44 -04:00
Tom Honermann	8ff8c3ac0d	[clang] [OpenMP] Extend OpenMP variant declaration tests. This change extends the existing diagnostic tests for OpenMP variant declarations to cover diagnostics for declarations that include multiversion function attributes. The new tests demonstrate a missing check for the 'target_clones' attribute. Reviewed By: erichkeane, jdoerfert Differential Revision: https://reviews.llvm.org/D121962	2022-03-21 13:39:44 -04:00
Nikita Popov	7a2e12e0a7	[CodeGen][OpenMP] Use correct type in EmitLoadOfPointer() The EmitLoadOfPointer() call already specified the right pointer type, but it did not match the Address we're loading from, so we need to insert a bitcast first.	2022-03-21 15:22:37 +01:00
Nikita Popov	afb9cbb324	[OpenMP] Regenerate test checks (NFC)	2022-03-21 14:25:02 +01:00
Nikita Popov	b6f85d8539	[CodeGen][OpenMP] Use correct type in EmitLoadOfPointer() Rather than using a dummy void pointer type, we should specify the correct private type and perform the bitcast beforehand rather than afterwards. This way, the Address will have correct alignment information.	2022-03-21 12:08:05 +01:00
Mike Rice	6bd8dc91b8	[OpenMP] Initial parsing/sema for the 'omp target teams loop' construct Adds basic parsing/sema/serialization support for the #pragma omp target teams loop directive. Differential Revision: https://reviews.llvm.org/D122028	2022-03-18 13:48:32 -07:00
Johannes Doerfert	1df3a913ef	[OpenMP][FIX] Make test check lines less strict The ppc64be bot emits the dtor metadata first for some reason. We should investigate this or make the _cc_ update script able to use variables instead of fixed numbers (e.g., !1). The IR update script does that already.	2022-03-18 10:53:32 -05:00
Nikita Popov	52cc65d474	[OpenMPRuntime] Specify correct pointer type Rather than specifying a dummy type in EmitLoadOfPointer() and then casting it to the correct one, we should instead specify the correct type and cast beforehand. Otherwise the computed alignment will be incorrect.	2022-03-18 14:25:51 +01:00
Johannes Doerfert	b4cc3b1dd8	[OpenMP][FIX] Make metadata and attribute check lines less detailed The update_cc script should really do this automatically :(	2022-03-17 14:58:22 -05:00
Johannes Doerfert	052a6c744a	[OpenMP][FIX] Relax test check lines	2022-03-17 14:01:47 -05:00
Johannes Doerfert	f02550bdd9	Reapply "[OpenMP][FIX] Allow device constructors for AMD GPU" This reverts commit a597d6a780b184539f504392168b004bf392a135 and reapplies 07b176646134. In AMD GPU device code the globals are in AS(1). Before, we crashed if the global was a structure. Now we simply cast away the AS before we generate the code to initialize the global. Differential Revision: https://reviews.llvm.org/D121837 Fixes: https://github.com/llvm/llvm-project/issues/54421	2022-03-17 12:53:47 -05:00
Johannes Doerfert	a597d6a780	Revert "[OpenMP][FIX] Allow device constructors for AMD GPU" This reverts commit 07b176646134c3d88a4cecef5e0058e2de6b2409 as it broke the buildbots: https://lab.llvm.org/buildbot#builders/193/builds/8594	2022-03-16 17:35:54 -05:00
Johannes Doerfert	07b1766461	[OpenMP][FIX] Allow device constructors for AMD GPU In AMD GPU device code the globals are in AS(1). Before, we crashed if the global was a structure. Now we simply cast away the AS before we generate the code to initialize the global. Differential Revision: https://reviews.llvm.org/D121837	2022-03-16 17:04:28 -05:00
Mike Rice	79f661edc1	[OpenMP] Initial parsing/sema for the 'omp teams loop' construct Adds basic parsing/sema/serialization support for the #pragma omp teams loop directive. Differential Revision: https://reviews.llvm.org/D121713	2022-03-16 14:39:18 -07:00
Peixin-Qiao	4e159e4c7b	[clang] Fix OpenMP critical hint parameter check The paramemter of hint clause in OpenMP critical hint should be non-negative. The omp_lock_hint_none is 0 in omp.h. Reviewed By: Alexey Bataev Differential Revision: https://reviews.llvm.org/D121101	2022-03-08 09:04:31 +08:00
William S. Moses	87ec6f41bb	[OpenMPIRBuilder] Allocate temporary at the correct block in a nested parallel The OpenMPIRBuilder has a bug. Specifically, suppose you have two nested openmp parallel regions (writing with MLIR for ease) ``` omp.parallel { %a = ... omp.parallel { use(%a) } } ``` As OpenMP only permits pointer-like inputs, the builder will wrap all of the inputs into a stack allocation, and then pass this allocation to the inner parallel. For example, we would want to get something like the following: ``` omp.parallel { %a = ... %tmp = alloc store %tmp[] = %a kmpc_fork(outlined, %tmp) } ``` However, in practice, this is not what currently occurs in the context of nested parallel regions. Specifically to the OpenMPIRBuilder, the entirety of the function (at the LLVM level) is currently inlined with blocks marking the corresponding start and end of each region. ``` entry: ... parallel1: %a = ... ... parallel2: use(%a) ... endparallel2: ... endparallel1: ... ``` When the allocation is inserted, it presently inserted into the parent of the entire function (e.g. entry) rather than the parent allocation scope to the function being outlined. If we were outlining parallel2, the corresponding alloca location would be parallel1. This causes a variety of bugs, including https://github.com/llvm/llvm-project/issues/54165 as one example. This PR allows the stack allocation to be created at the correct allocation block, and thus remedies such issues. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D121061	2022-03-06 18:34:25 -05:00
Michael Kruse	a66f7769a3	[OpenMPIRBuilder] Implement static-chunked workshare-loop schedules. Add applyStaticChunkedWorkshareLoop method implementing static schedule when chunk-size is specified. Unlike a static schedule without chunk-size (where chunk-size is chosen by the runtime such that each thread receives one chunk), we need two nested loops: one for looping over the iterations of a chunk, and a second for looping over all chunks assigned to the threads. This patch includes the following related changes: * Adapt applyWorkshareLoop to triage between the schedule types, now possible since all schedules have been implemented. The default schedule is assumed to be non-chunked static, as without OpenMPIRBuilder. * Remove the chunk parameter from applyStaticWorkshareLoop, it is ignored by the runtime. Change the value for the value passed to the init function to 0, as without OpenMPIRBuilder. * Refactor CanonicalLoopInfo::setTripCount and CanonicalLoopInfo::mapIndVar as used by both, applyStaticWorkshareLoop and applyStaticChunkedWorkshareLoop. * Enable Clang to use the OpenMPIRBuilder in the presence of the schedule clause. Differential Revision: https://reviews.llvm.org/D114413	2022-02-28 18:18:33 -06:00
Alexey Bataev	d04d9220e1	[OPENMP]Fix PR50347: Mapping of global scope deep object fails. Changed the we handle llvm::Constants in sizes arrays. ConstExprs and GlobalValues cannot be used as initializers, need to put them at the runtime, otherwise there wight be the compilation errors. Differential Revision: https://reviews.llvm.org/D105297	2022-02-25 10:54:24 -08:00
Aaron Ballman	2ceee2f884	Add -Wno-strict-prototypes to C tests; NFC This patch adds -Wno-strict-prototypes to all of the test cases that use functions without prototypes, but not as the primary concern of the test. e.g., attributes testing whether they can/cannot be applied to a function without a prototype, etc. This is done in preparation for enabling -Wstrict-prototypes by default.	2022-02-24 15:30:30 -05:00
Alexey Bataev	ca6fa71b7e	Revert "[OPENMP]Fix PR50347: Mapping of global scope deep object fails." This reverts commit 638938117aeae5518d6cacd066ffd9830ef4fc9a. Need to fix reported fail https://lab.llvm.org/buildbot/#/builders/193/builds/7496	2022-02-24 12:04:39 -08:00
Alexey Bataev	638938117a	[OPENMP]Fix PR50347: Mapping of global scope deep object fails. Changed the we handle llvm::Constants in sizes arrays. ConstExprs and GlobalValues cannot be used as initializers, need to put them at the runtime, otherwise there wight be the compilation errors. Differential Revision: https://reviews.llvm.org/D105297	2022-02-24 11:49:14 -08:00
Aaron Ballman	dcc4feb9a4	Use function prototypes when appropriate; NFC	2022-02-23 17:12:25 -05:00
Joseph Huber	2b97b16f29	[OpenMP] Add option to make offloading mandatory Currently when we generate OpenMP offloading code we always make fallback code for the CPU. This is necessary for implementing features like conditional offloading and ensuring that unhandled pragmas don't result in missing symbols. However, this is problematic for a few cases. For offloading tests we can silently fail to the host without realizing that offloading failed. Additionally, this makes it impossible to provide interoperabiility to other offloading schemes like HIP or CUDA because those methods do not provide any such host fallback guaruntee. this patch adds the `-fopenmp-offload-mandatory` flag to prevent generating the fallback symbol on the CPU and instead replaces the function with a dummy global and the failed branch with 'unreachable'. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120353	2022-02-23 16:45:36 -05:00
Shilei Tian	104d9a6743	[Clang][OpenMP] Add the codegen support for `atomic compare` This patch adds the codegen support for `atomic compare` in clang. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D118632	2022-02-22 13:01:39 -05:00
Alexey Bataev	f9c3310d32	[OPENMP]Fix PR49366: crash on VLAs in task untied regions. We need to capture the local variables into a record in task untied regions but clang does not support record with VLA data members. Differential Revision: https://reviews.llvm.org/D99436	2022-02-21 12:28:47 -08:00
Shilei Tian	e2855e1760	[Clang][OpenMP] Add Sema support for atomic compare capture This patch adds Sema support for `atomic compare capture`. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120200	2022-02-21 14:21:02 -05:00
Shilei Tian	3a3d9ae545	[Clang][OpenMP] Fix wrong form of 'cond-update-stmt' in atomic_ast_print.cpp In `clang/test/OpenMP/atomic_ast_print.cpp` for `atomic compare capture`, it was using 'cond-expr-stmt' instead of 'cond-update-stmt'. The spec only supports 'cond-update-stmt'. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120252	2022-02-21 11:40:09 -05:00
Shilei Tian	6da60647cd	[Clang][Sema] Check unexpected else statement in cond-update-stmt In 'cond-update-stmt', `else` statement is not expected. This patch adds the check in Sema. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D120225	2022-02-21 08:20:34 -05:00
Shilei Tian	68b7b357fd	[Clang][OpenMP][Sema] Remove support for floating point values in atomic compare This is a follow-up patch of D119378. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D119392	2022-02-18 10:24:29 -05:00
Shilei Tian	ccebf8ac8c	[Clang][OpenMP] Add support for compare capture in parser This patch adds the support for `atomic compare capture` in parser and part of sema. We don't create an AST node for this because the spec doesn't say `compare` and `capture` clauses should be used tightly, so we cannot look one more token ahead in the parser. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D116261	2022-02-18 10:23:59 -05:00
Joseph Huber	0870a4f59a	[OpenMP] Add flag for disabling thread state in runtime The runtime uses thread state values to indicate when we use an ICV or are in nested parallelism. This is done for OpenMP correctness, but it not needed in the majority of cases. The new flag added is `-fopenmp-assume-no-thread-state`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D120106	2022-02-18 08:35:05 -05:00
hyeongyukim	b529744c29	[Clang] Rename `disable-noundef-analysis` flag to `-[no-]enable-noundef-analysis` This flag was previously renamed `enable_noundef_analysis` to `disable-noundef-analysis,` which is not a conventional name. (Driver and CC1's boolean options are using [no-] prefix) As discussed at https://reviews.llvm.org/D105169, this patch reverts its name to `[no-]enable_noundef_analysis` and enables noundef-analysis as default. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D119998	2022-02-18 17:02:41 +09:00
Aaron Ballman	5824d2bb0f	Fix the declaration printer to properly handle prototypes in C Previously, we would take a declaration like void f(void) and print it as void f(). That's correct in C++ as far as it goes, but is incorrect in C because that converts the function from having a prototype to one which does not. This turns out to matter for some of our tests that use the pretty printer where we'd like to get rid of the K&R prototypes from the test but can't because the test is checking the pretty printed function signature, as done with the ARCMT tests.	2022-02-17 13:54:09 -05:00
Mike Rice	383f3a467c	[OpenMP] Diagnose bad 'omp declare variant' that references itself. When an a variant is specified that is the same as the base function the compiler will end up crashing in CodeGen. Give an error instead. Differential Revision: https://reviews.llvm.org/D119979	2022-02-17 10:36:28 -08:00
Mike Rice	83a407d176	[OpenMP]Fix parsing of OpenMP directive nested in a metadirective Differential Revision: https://reviews.llvm.org/D119761	2022-02-14 16:15:20 -08:00

1 2 3 4 5 ...

1888 Commits