llvm-project

Author	SHA1	Message	Date
Matt Arsenault	81849497b4	clang/AMDGPU: Remove flat-address-space from feature map This was only used for checking if is_shared/is_private were legal, which we're not bothering to do anymore. This is apparently visible to more than the target attribute (which seems to silently ignore unrecognized features), so this has the potential to break something (i.e. see the OpenMP test change)	2023-01-05 16:35:04 -05:00
Matt Arsenault	ce6ae0b2a2	clang: Don't emit "frame-pointer"="none" This is the default behavior and cuts down on attribute spam. Probably should also do something to consolidate the option spellings; printing and parsing it is repeated in at least 3 different places. In the OpenMP tests, I had to manually delete some metadata check lines update_cc_test_checks was inserting that included the local build revision.	2023-01-03 19:42:46 -05:00
Johannes Doerfert	9ab0d4d66f	[OpenMP][2/2] Make device functions have hidden visibility Similar to https://reviews.llvm.org/D136111, this time for class methods. D136111 summary: In OpenMP target offloading and in other offloading languages, we maintain a difference between device functions and kernel functions. Kernel functions must be visible to the host and act as the entry point to the target device. Device functions however cannot be called directly by the host and must be called by a kernel function. Currently, we make all definitions on the device protected by default. Because device functions cannot be called or used by the host they should have hidden visibility. This allows for the definitions to be better optimized via LTO or other passes. This patch marks every device class method in the AST as having hidden visibility. The kernel function is generated later at code-gen and we set its visibility explicitly so it should not be affected. This prevents the user from overriding the visibility, but since the user can't do anything with these symbols anyway there is no point exporting them right now.	2023-01-03 12:18:30 -08:00
Matt Arsenault	f4bcd7f598	AMDGPU/clang: Add builtins for llvm.amdgcn.ballot Use explicit _w32/_w64 suffixes for the wave size to be consistent with the existing other wave dependent intrinsics. Also start diagnosing trying to use both wave32 and wave64. I would have preferred to avoid the +wavefrontsize64 spam on targets where that's the only option, but avoiding this seems to be more work than I expected.	2022-12-29 17:58:55 -05:00
Joseph Huber	f74e3d2f81	[OpenMP] Fix test on 32-bit platforms Summary: This test didn't specify the triple so it defaulted to the user's, if this was 32-bit then it failed due to a diagnostic message.	2022-12-25 09:47:04 -06:00
Phoebe Wang	e746a9a600	[Clang] Emit "min-legal-vector-width" attribute for X86 only This is an alternative to D139627 suggested by Craig. Currently only the X86 backend uses this attribute. Let's just emit for X86 only. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139701	2022-12-21 11:54:05 +08:00
Sunil Kuravinakop	e9babe7571	[OpenMP] Clang Support for taskwait nowait clause Support for taskwait nowait clause with placeholder for runtime changes. Reviewed By: cchen, ABataev Differential Revision: https://reviews.llvm.org/D131830	2022-12-20 12:13:56 -06:00
Doru Bercea	b5c809acd3	Fix tests for commit 658ed9547cdd6657895339a6c390c31aa77a5698.	2022-12-19 07:46:34 -06:00
Doru Bercea	658ed9547c	Fix host call to nohost function with host variant.	2022-12-19 06:13:26 -06:00
Johannes Doerfert	90609fb68f	[OpenMP][NFCI] Remove effectively dead code in clang and the runtime Differential Revision: https://reviews.llvm.org/D136903	2022-12-13 18:44:19 -08:00
Johannes Doerfert	f9c29878b0	Revert "[OpenMP][NFCI] Remove effectively dead code in clang and the runtime" This reverts commit c1c8cbbf5f29257d084a23a2f6c4236c40b7afb9. One of the tests seems to be flaky/non-deterministic.	2022-12-12 22:08:28 -08:00
Johannes Doerfert	c1c8cbbf5f	[OpenMP][NFCI] Remove effectively dead code in clang and the runtime	2022-12-12 20:55:36 -08:00
Chi Chun Chen	7c34e74c25	[OpenMP] Basic parse and sema support for modifiers in order clause This patch gives basic parsing and semantic support for "modifiers" of order clause introduced in OpenMP 5.1 ( section 2.11.3 ) Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D127855	2022-12-12 15:51:38 -06:00
Nikita Popov	bb9ccb49d6	[Clang] Convert some OpenMP tests to opaque pointers (NFC)	2022-12-12 16:15:49 +01:00
Manuel Brito	2482dbff46	[Clang] Use poison instead of undef where it's used as placeholder [NFC] Differential Revision: https://reviews.llvm.org/D139745	2022-12-11 16:18:06 +00:00
Chi Chun Chen	e0fd86db09	Revert "[OpenMP] Clang Support for taskwait nowait clause" This reverts commit 100dfe7a8ad3789a98df623482b88d9a3a02e176.	2022-12-09 11:06:45 -06:00
Jennifer Yu	af781f7042	[OPENMP51]Codegen for error directive. Added codegen for `omp error` directive. This is to generate IR to call: void __kmpc_error(ident_t *loc, int severity, const char *message); Differential Revision: https://reviews.llvm.org/D139166	2022-12-08 13:07:08 -08:00
Sunil K	100dfe7a8a	[OpenMP] Clang Support for taskwait nowait clause Support for taskwait nowait clause with placeholder for runtime changes. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D131830	2022-12-08 12:40:44 -08:00
Johannes Doerfert	6133942796	[OpenMP][FIX] Remove AssertingVHs that outlive their values The map with AssertingVHs has been moved into the OpenMPIRBuilder which extended their lifetime. On NVIDIA this will cause an assertion. This simply removes the AssertingVH wrapper.	2022-12-07 18:27:55 -08:00
John Brawn	6b8900f7f9	[clang][CodeGen] Add default attributes to __clang_call_terminate When generating __clang_call_terminate use SetLLVMFunctionAttributes to set the default function attributes, like we do for all the other functions generated by clang. This fixes a problem where target features from the command line weren't being applied to this function. Differential Revision: https://reviews.llvm.org/D138679	2022-11-29 13:09:52 +00:00
Jennifer Yu	9d90cf2fca	[OPENMP5.1] Initial support for message clause.	2022-11-18 17:59:23 -08:00
Fazlay Rabbi	56c1660170	[OpenMP] Initial parsing/sema for 'strict' modifier with 'num_tasks' clause This patch gives basic parsing and semantic analysis support for 'strict' modifier with 'num_tasks' clause of 'taskloop' construct introduced in OpenMP 5.1 (section 2.12.2) Differential Revision: https://reviews.llvm.org/D138328	2022-11-18 16:26:47 -08:00
Doru Bercea	9e595e911e	[Clang][OpenMP] Add support for default to/from map types on target enter/exit data	2022-11-18 16:12:35 -06:00
Fazlay Rabbi	ab9eac762c	[OpenMP] Initial parsing/sema for 'strict' modifier with 'grainsize' clause This patch gives basic parsing and semantic analysis support for 'strict' modifier with 'grainsize' clause of 'taskloop' construct introduced in OpenMP 5.1 (section 2.12.2) Differential Revision: https://reviews.llvm.org/D138217	2022-11-17 20:59:07 -08:00
Jennifer Yu	1e054e6b52	[OPENMP5.1] Initial support for severity clause Differential Revision:https://reviews.llvm.org/D138227	2022-11-17 16:05:02 -08:00
Doru Bercea	98bfd7f976	Fix declare target implementation to support enter.	2022-11-17 17:35:53 -06:00
Tom Honermann	3e25ae605e	[Clang] Correct when Itanium ABI guard variables are set for non-block variables with static or thread storage duration. Previously, Itanium ABI guard variables were set after initialization was complete for non-block declared variables with static and thread storage duration. That resulted in initialization of such variables being restarted in cases where the variable was referenced while it was still under construction. Per C++20 [class.cdtor]p2, such references are permitted (though the value obtained by such an access is unspecified). The late initialization resulted in recursive reinitialization loops for cases like this: template<typename T> struct ct { struct mc { mc() { ct<T>::smf(); } void mf() const {} }; thread_local static mc tlsdm; static void smf() { tlsdm.mf(); } }; template<typename T> thread_local typename ct<T>::mc ct<T>::tlsdm; int main() { ct<int>::smf(); } With this change, guard variables are set before initialization is started so as to avoid such reinitialization loops. Fixes https://github.com/llvm/llvm-project/issues/57828 Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D135919	2022-11-16 16:31:35 -05:00
Jennifer Yu	628fdc3f57	[OPENMP]Initial support for at clause Error directive is allowed in both declared and executable contexts. The function ActOnOpenMPAtClause is called in both places during parsing. Adding a param "bool InExContext" to identify the context, which is used to emit the error message. Differential Revision: https://reviews.llvm.org/D137851	2022-11-15 14:06:50 -08:00
Animesh Kumar	0f8e7b4329	[OpenMP] Add map clause to the LIT test on use_device_addr clause As per the OpenMP Spec, "A list item in a use_device_addr clause must have a corresponding list item in the device data environment" . Therefore a `map` clause is added which will make sure that the respective list items are mapped to the device data environment before the `use_device_addr` clause is specified. The CHECK lines are also modified based on this change. Differential Revision: https://reviews.llvm.org/D134974	2022-11-09 12:23:39 +05:30
Jennifer Yu	de14befa77	Remove redundant loads. They are caused by regenerating the captured var value when processing has_device_addr; the captured var value has already been generated in GenerateOpenMPCapturedVars and passed as Arg to generateInfoForCapture. The fix just uses Arg instead of regenerating the value, the same as for is_device_ptr.	2022-11-04 15:22:25 -07:00
Mike Rice	c954cfeb57	Some uses of the preprocessor can result in multiple target regions on the same line. Cases such as those in the associated lit tests can now be supported. This adds a 'Count' field to TargetRegionEntryInfo to differentiate regions with the same source position. The OffloadEntriesInfoManager routines are updated to maintain a count of regions seen at a location. The registration of regions proceeds the same as before, but now the next available count is always determined and used in the offload entry. Fixes: https://github.com/llvm/llvm-project/issues/52707 Differential Revision: https://reviews.llvm.org/D134816	2022-11-04 12:54:22 -07:00
Zequan Wu	a7fa5febaa	[Test] Fix CHECK typo. Differential Revision: https://reviews.llvm.org/D137287	2022-11-04 10:18:04 -07:00
Nikita Popov	304f1d59ca	[IR] Switch everything to use memory attribute This switches everything to use the memory attribute proposed in https://discourse.llvm.org/t/rfc-unify-memory-effect-attributes/65579. The old argmemonly, inaccessiblememonly and inaccessiblemem_or_argmemonly attributes are dropped. The readnone, readonly and writeonly attributes are restricted to parameters only. The old attributes are auto-upgraded both in bitcode and IR. The bitcode upgrade is a policy requirement that has to be retained indefinitely. The IR upgrade is mainly there so it's not necessary to update all tests using memory attributes in this patch, which is already large enough. We could drop that part after migrating tests, or retain it longer term, to make it easier to import IR from older LLVM versions. High-level Function/CallBase APIs like doesNotAccessMemory() or setDoesNotAccessMemory() are mapped transparently to the memory attribute. Code that directly manipulates attributes (e.g. via AttributeList) on the other hand needs to switch to working with the memory attribute instead. Differential Revision: https://reviews.llvm.org/D135780	2022-11-04 10:21:38 +01:00
Jennifer Yu	ea64e66f7b	[OPENMP]Initial support for error directive. Differential Revision: https://reviews.llvm.org/D137209	2022-11-02 14:25:28 -07:00
Jan Sjodin	67f8521cd4	[OpenMP] [OMPIRBuilder] Create a new datatype to hold the unique target region info Re-apply of: 3d0e9edd8e53fb72e85084f4170513159212839a Reverted in: 0cb65b0a585c8b3d4a8a2aefe994a8fc907934f8 A function parameter was using the wrong type 'llvm::TargetRegion' instead of 'const llvm:: TargetRegion&', which caused the error in the address sanitizer. The correct type is now used. This patch puts the individual target region information attributes into a struct so that the nested mappings are not needed and passing the information around is simplified. Reviewed By: jdoerfert, mikerice Differential Revision: https://reviews.llvm.org/D136601	2022-10-31 10:49:44 -04:00
Kevin Athey	0cb65b0a58	Revert "[OpenMP] [OMPIRBuilder] Create a new datatype to hold the unique target region info" This reverts commit 3d0e9edd8e53fb72e85084f4170513159212839a. Breaking HWASAN buildbot: https://lab.llvm.org/buildbot/#/builders/236/builds/786 Shown by targeted builds breaking at this patch: Built at this patch: https://lab.llvm.org/buildbot/#/builders/236/builds/803 Built at prior patch: https://lab.llvm.org/buildbot/#/builders/236/builds/804	2022-10-27 13:57:25 -07:00
Johannes Doerfert	126db4674a	[OpenMP][FIX] Adjust to clang tests after D136740	2022-10-26 12:33:55 -07:00
Jan Sjodin	3d0e9edd8e	[OpenMP] [OMPIRBuilder] Create a new datatype to hold the unique target region info This patch puts the individual target region information attributes into a struct so that the nested mappings are not needed and passing the information around is simplified. Reviewed By: jdoerfert, mikerice Differential Revision: https://reviews.llvm.org/D136601	2022-10-25 11:15:36 -04:00
Joseph Huber	8c1449a84d	[OpenMP] Make kernels have protected visibility This patch changes the kernels generated by OpenMP to have protected visibility. This is unlikely to change anything functionally. However, protected visibility better matches the behaviour of these GPU kernels. We do not expect any pending shared library load to preempt these kernels so we can specify a more restrictive visibility. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D136198	2022-10-18 16:37:28 -05:00
Joseph Huber	bb3c90d3ec	[OpenMP] Make device functions have hidden visibility In OpenMP target offloading and in other offloading languages, we maintain a difference between device functions and kernel functions. Kernel functions must be visible to the host and act as the entry point to the target device. Device functions however cannot be called directly by the host and must be called by a kernel function. Currently, we make all definitions on the device protected by default. Because device functions cannot be called or used by the host they should have hidden visibility. This allows for the definitions to be better optimized via LTO or other passes. This patch marks every device function in the AST as having `hidden` visibility. The kernel function is generated later at code-gen and we set its visibility explicitly so it should not be affected. This prevents the user from overriding the visibility, but since the user can't do anything with these symbols anyway there is no point exporting them right now. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D136111	2022-10-18 08:15:39 -05:00
Dominik Adamski	ccd314d320	[OpenMP][OMPIRBuilder] Add generation of SIMD align assumptions to OMPIRBuilder Currently generation of align assumptions for OpenMP simd construct is done outside OMPIRBuilder for C code and it is not supported for Fortran. According to OpenMP 5.0 standard (2.9.3) only pointers and arrays can be aligned for C code. If given aligned variable is pointer, then Clang generates the following set of the LLVM IR instructions to support simd align clause: ; memory allocation for pointer address: %A.addr = alloca ptr, align 8 ; some LLVM IR code ; Alignment instructions (alignment is equal to 32): %0 = load ptr, ptr %A.addr, align 8 call void @llvm.assume(i1 true) [ "align"(ptr %0, i64 32) ] If given aligned variable is array, then Clang generates the following set of the LLVM IR instructions to support simd align clause: ; memory allocation for array: %B = alloca [10 x i32], align 16 ; some LLVM IR code ; Alignment instructions (alignment is equal to 32): %arraydecay = getelementptr inbounds [10 x i32], ptr %B, i64 0, i64 0 call void @llvm.assume(i1 true) [ "align"(ptr %arraydecay, i64 32) ] OMPIRBuilder was modified to generate aligned assumptions. It generates only llvm.assume calls. Frontend is responsible for generation of aligned pointer and getting the default alignment value if user does not specify it in aligned clause. Unit and regression tests were added to check if aligned clause was handled correctly. Differential Revision: https://reviews.llvm.org/D133578 Reviewed By: jdoerfert	2022-10-18 02:04:18 -05:00
Animesh Kumar	06da9b94ae	[OpenMP] Extend the lit test for uses_allocators in target region This patch improves the LIT tests on the following : 1. The test on `uses_allocators` clause in the `target` region by adding the respective CHECK lines. Allocator `omp_thread_mem_alloc` is also added in the test. 2. The `defaultmap` clause wasn't being tested for the variable- category `scalar` and the implicit-behavior `tofrom` with respect to the OpenMP default version. These improvements are inspired from SOLLVE tests. SOLLVE repo: https://github.com/SOLLVE/sollve_vv Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D132855	2022-10-14 19:12:33 +05:30
Shilei Tian	4cdfab12bb	[Clang][OpenMP] Add one missing form of atomic compare capture Two other atomic compare capture forms, `{ v = x; expr-stmt }` and `{ expr-stmt; v = x; }` where `expr-stmt` could be `cond-expr-stmt`, are missing. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D135236	2022-10-07 13:30:38 -04:00
Nikita Popov	40e353d0f9	[OpenMP] Convert more tests to opaque pointers (NFC) These were converted using the script at https://gist.github.com/nikic/98357b71fd67756b0f064c9517b62a34 followed by a re-run of update_cc_test_checks.py.	2022-10-07 15:36:44 +02:00
Nikita Popov	a290f3c8fc	[OpenMP] Convert tests to opaque pointers (NFC) Conversion performed using the script at: https://gist.github.com/nikic/98357b71fd67756b0f064c9517b62a34 These are only tests where no manual fixup was required.	2022-10-07 14:58:27 +02:00
Joseph Huber	4aa87a131f	[OpenMP][AMDGPU] Add 'uniform-work-group' attribute to OpenMP kernels The `cl-uniform-work-group` attribute asserts that the global work-size be a multiple of the work-group specified work group size. This should allow optimizations. It is already present by default in the AMD compiler and for HIP kernels so it should be safe to allow this for OpenMP kernels by default. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D135374	2022-10-06 18:22:09 -05:00
Joseph Huber	a8ec170e01	[OpenMP] Make the exec_mode global have protected visibility We use protected visibility for almost everything with offloading. This is because it provides us with the ability to read things from the host without the expectation that it will be preempted by a shared library load, bugs related to this have happened when offloading to the host. This patch just makes the `exec_mode` global generated for each plugin have protected visibility. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D135285	2022-10-05 14:39:22 -05:00
Shilei Tian	0c623ab1bf	[Clang][OpenMP] Only check value if the expression is not instantiation dependent Currently the following case fails: ``` template<typename Ty> Ty foo(Ty addr, Ty val) { Ty v; #pragma omp atomic compare capture { v = addr; if (addr > val) addr = val; } return v; } ``` The compiler complains `addr` is not a lvalue. That's because when an expression is instantiation dependent, we cannot tell if it is lvalue or not. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D135224	2022-10-05 08:44:56 -04:00
Dominik Adamski	6842d35012	[OpenMP][OMPIRBuilder] Add support for order(concurrent) to OMPIRBuilder for SIMD directive If 'order(concurrent)' clause is specified, then the iterations of SIMD loop can be executed concurrently. This patch adds support for LLVM IR codegen via OMPIRBuilder for SIMD loop with 'order(concurrent)' clause. The functionality added to OMPIRBuilder is similar to the functionality implemented in 'CodeGenFunction::EmitOMPSimdInit'. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D134046 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-10-04 08:30:00 -05:00
Jennifer Yu	30cc712eb6	[Clang][OpenMP] Fix run time crash when use_device_addr is used. It is a data mapping ordering problem. According to the OpenMP spec, if one or more map clauses are present, the list item conversions that are performed for any use_device_ptr or use_device_addr clause occur after all variables are mapped on entry to the region according to those map clauses. The change is to put the mapping data for use_device_addr at the end of the data mapping array. Differential Revision: https://reviews.llvm.org/D134556	2022-09-27 11:53:57 -07:00

2023 Commits