llvm-project

Author	SHA1	Message	Date
Nick Sarnie	38a46a12c4	[offload][lit] Disable tests failing on Intel GPU (#189422 ) Fix some tests causing hangs, one fail, and a few XPASSing. We are seeing new passes/fails because of the named barrier changes being merged. Signed-off-by: Nick Sarnie <nick.sarnie@intel.com>	2026-03-30 18:02:34 +00:00
fineg74	1611a23a5b	[OFFLOAD] Add spirv implementation for named barrier (#180393 ) This change adds an implementation of named barriers for the SPIRV backend. Since there is no built-in API or intrinsic for named barriers in SPIRV, the implementation loosely follows the AMD implementation	2026-03-27 20:14:09 +01:00
fineg74	2890f9883c	[OFFLOAD] Improve handling of synchronization errors in L0 plugin and reenable tests (#186927 ) This change improves handling of errors during synchronization in the Level Zero plugin by ensuring cleanup of queues and events in case of a synchronization error. As a result, multiple tests stopped hanging. --------- Co-authored-by: Duran, Alex <alejandro.duran@intel.com>	2026-03-18 05:50:06 +01:00
Kevin Sala Penades	ac71b185c2	[offload] Remove LIBOMPTARGET_SHARED_MEMORY_SIZE envar (#186231 ) This commit removes the `LIBOMPTARGET_SHARED_MEMORY_SIZE` envar and outputs a runtime warning if it is defined. Access to dynamic shared memory should be obtained through the `dyn_groupprivate` clause (OpenMP 6.1) or the launch arguments in liboffload kernel launch.	2026-03-12 21:21:29 -07:00
Jason Van Beusekom	2d4c8e0d0f	[OpenMP][clang] Indirect and Virtual function call mapping from host to device (#184412 ) This patch implements the CodeGen logic for calling __llvm_omp_indirect_call_lookup on the device when an indirect function call or a virtual function call is made within an OpenMP target region. --------- Co-authored-by: Youngsuk Kim	2026-03-03 13:20:24 -06:00
Jason Van Beusekom	f95662d159	Revert "[OpenMP][clang] Indirect and Virtual function call mapping from host to device" (#184378 ) Reverts llvm/llvm-project#159857	2026-03-03 17:11:14 +00:00
Jason Van Beusekom	b23438661c	[OpenMP][clang] Indirect and Virtual function call mapping from host to device (#159857 ) This patch implements the CodeGen logic for calling __llvm_omp_indirect_call_lookup on the device when an indirect function call or a virtual function call is made within an OpenMP target region. --------- Co-authored-by: Youngsuk Kim	2026-03-03 02:52:34 +00:00
Nick Sarnie	78ff5b55fd	[offload][lit] Enable/disable tests on Level Zero when using DeviceRTL (#182128 ) Since we can now build the DeviceRTL with SPIR-V, redo the `XFAIL/UNSUPPORTED` specifications for the tests we see passing/failing on the Level Zero backend with the DeviceRTL being used. The tests marked `UNSUPPORTED` hang or sporadically fail and those are tracked in https://github.com/llvm/llvm-project/issues/182119. This change will allow us to enable CI testing with the DeviceRTL. Here are the full test results with this change applied, running only the `spirv64-intel` `check-offload` tests: ``` Total Discovered Tests: 453 Unsupported : 206 (45.47%) Passed : 141 (31.13%) Expectedly Failed: 106 (23.40%) ``` 31% is not a bad start. --------- Signed-off-by: Nick Sarnie <nick.sarnie@intel.com>	2026-02-18 21:53:19 +00:00
Nick Sarnie	26b777444b	[offload][lit] XFAIL all failing tests on the Level Zero plugin (#174804 ) We finally got our buildbot added (to staging, at least) so we want to start running L0 tests in CI. We need `check-offload` to pass though, so XFAIL everything failing. There are a couple of `UNSUPPORTED` as well; those are for sporadic fails. Also set the `gpu` and `intelgpu` LIT variables when testing the `spirv64-intel` triple. We have no DeviceRTL yet so basically everything fails, but we manage to get ``` Total Discovered Tests: 432 Unsupported : 169 (39.12%) Passed : 67 (15.51%) Expectedly Failed: 196 (45.37%) ``` We still don't build the level zero plugin by default and these tests don't run unless the plugin was built, so this has no effect on most builds. --------- Signed-off-by: Nick Sarnie <nick.sarnie@intel.com>	2026-01-07 19:20:30 +00:00
Robert Imschweiler	8808beeb1a	Reland: [OpenMP] Implement omp_get_uid_from_device() / omp_get_device_from_uid() (#168554 ) Reland https://github.com/llvm/llvm-project/pull/164392 with Fortran support moved to follow-up PR	2025-12-01 14:18:31 +01:00
Jason-VanBeusekom	84d511df8d	[OpenMP][clang] Register vtables on device for indirect calls runtime (#167011 ) This is a branch off of https://github.com/llvm/llvm-project/pull/159856, which consists of the runtime portion of the changes required to support indirect function and virtual function calls on an `omp target device` when the virtual class / indirect function is mapped to the device from the host. Key Changes - Introduced a new flag OMP_DECLARE_TARGET_INDIRECT_VTABLE to mark VTable registrations - Modified setupIndirectCallTable to support both VTable entries and indirect function pointers Details: The setupIndirectCallTable implementation was modified to support this registration type by retrieving the first address of the VTable and inferring the remaining data needed to build the indirect call table. Since the VTables / classes registered as indirect can be larger than 8 bytes, and the VTables may not be at the first address, we either need to pass the size to __llvm_omp_indirect_call_lookup and have a check at each step of the binary search, or add multiple entries to the indirect table for each address registered. The latter was chosen. Commit: a00def3f20e166d4fb9328e6f0bc0742cd0afa31 is not a part of this PR and is handled / reviewed in: https://github.com/llvm/llvm-project/pull/159856, This is PR (2/3) Register Vtable PR (1/3): https://github.com/llvm/llvm-project/pull/159856, Codegen / __llvm_omp_indirect_call_lookup PR (3/3): https://github.com/llvm/llvm-project/pull/159857	2025-11-26 17:33:26 +00:00
Robert Imschweiler	9a0fd22da1	Revert "[OpenMP] Implement omp_get_uid_from_device() / omp_get_device_from_uid()" (#168547 ) Reverts llvm/llvm-project#164392 due to fortran issues	2025-11-18 15:10:42 +00:00
Robert Imschweiler	65c4a534bd	[OpenMP] Implement omp_get_uid_from_device() / omp_get_device_from_uid() (#164392 ) Use the implementation in libomptarget. If libomptarget is not available, always return the UID / device number of the host / the initial device.	2025-11-18 15:22:49 +01:00
Joseph Huber	b26baf1779	[Offload] Make AMDGPU plugin handle empty allocation properly (#142383 ) Summary: `malloc(0)` and `free(nullptr)` are both defined by the standard but we currently trigger errors and assertions on them. Fix that so this works with empty arguments.	2025-06-02 08:12:20 -05:00
Joseph Huber	2f41fa387d	[AMDGPU] Fix code object version not being set to 'none' (#135036 ) Summary: Previously, we removed the special handling for the code object version global. I erroneously thought that this meant we could get rid of this weird `-Xclang` option. However, this also emits an LLVM IR module flag, which will then cause linking issues.	2025-04-10 11:31:21 -05:00
Christian Clauss	1f56bb3137	[Offload][NFC] Fix typos discovered by codespell (#125119 ) https://github.com/codespell-project/codespell % `codespell --ignore-words-list=archtype,hsa,identty,inout,iself,nd,te,ths,vertexes --write-changes`	2025-01-31 09:35:29 -06:00
Shilei Tian	92376c3ff5	[Offload][OMPX] Add the runtime support for multi-dim grid and block (#118042 )	2024-12-06 09:07:50 -05:00
Joseph Huber	91f5f974cb	[OpenMP] Unconditionally provide an RPC client interface for OpenMP (#117933 ) Summary: This patch adds an RPC interface that lives directly in the OpenMP device runtime. This allows OpenMP to implement custom opcodes. Currently this is only providing the host call interface, which is the raw version of reverse offloading. Previously this lived in `libc/` as an extension which is not the correct place. The interface here uses a weak symbol for the RPC client by the same name that the `libc` interface uses. This means that it will defer to the libc one if both are present so we don't need to set up multiple instances. The presence of this symbol is what controls whether or not we set up the RPC server. Because this is an external symbol it normally won't be optimized out, so there's a special pass in OpenMPOpt that deletes this symbol if it is unused during linking. That means at `O0` the RPC server will always be present now, but will be removed trivially if it's not used at O1 and higher.	2024-12-02 14:31:51 -06:00
Jan Patrick Lehr	1a0cf245ac	[Offload] Change x86_64-pc-linux to x86_64-unknown-linux (#107023 ) It appears that the RUNTIMES build prefers the x86-64-unknown-linux-gnu triple notation for the host. This fixes runtime / test breakages when compiler-rt is used as the CLANG_DEFAULT_RTLIB.	2024-09-03 14:25:33 +02:00
Joseph Huber	e96146cd46	[OpenMP] Temporarily disable test to keep bots green Summary: This test mysteriously fails on the bots but not locally, disable until I can figure out why.	2024-08-20 15:16:05 -05:00
Joseph Huber	e0326b668e	[OpenMP] Map `omp_default_mem_alloc` to global memory (#104790 ) Summary: Currently, we assign this to private memory. This causes failures on some SOLLVE tests. The standard isn't clear on the semantics of this allocation type, but there seems to be a consensus that it's supposed to be shared memory.	2024-08-20 12:00:41 -05:00
Joseph Huber	161e250add	[OpenMP] Fix buildbot failing on allocator test	2024-08-14 13:56:12 -05:00
Joseph Huber	74d23f15b6	[OpenMP] Implement 'omp_alloc' on the device (#102526 ) Summary: The 'omp_alloc' function should be callable from a target region. This patch implements it by simply calling `malloc` for every non-default trait value allocator. All the special access modifiers are unimplemented and return null. The null allocator returns null as the spec states it should not be usable from the target.	2024-08-14 13:38:55 -05:00
Joseph Huber	dcc27ea41e	[LinkerWrapper] Always pass `-flto` if the linker supports it (#102972 ) Summary: Now that we use the linker to do LTO / device linking, we need to inform the `clang` invocation to use `-flto` so it forwards arguments like `-On` correctly.	2024-08-13 11:23:55 -05:00
Joseph Huber	4854e25359	[Offload] Re-enable tests that are now passing Summary: Some recent patches made these stop failing so the XFAIL now makes the bots go red. Fixes https://github.com/llvm/llvm-project/issues/98903	2024-07-23 10:56:55 -05:00
Jan Patrick Lehr	4ed0f84d38	[Offload] XFAIL four tests while working on fix (#98899 ) omp_dynamic_shared_memory_mixed_amdgpu.c omp_dynamic_shared_memory_amdgpu.c amdgcn-amd-amdhsa::bug51982.c amdgcn-amd-amdhsa::bug51781.c	2024-07-15 15:45:59 +02:00
Ethan Luis McDonough	8823448807	[Offload] Refactor offload test requirements (#95196 ) Many tests in the `offload` project have requirements defined by which targets are not supported rather than which platforms are supported. This patch aims to streamline the requirement definitions by adding four new feature tags: `host`, `gpu`, `amdgpu`, and `nvidiagpu`.	2024-06-29 00:56:18 -05:00
Krzysztof Parzyszek	adc4e45f2e	[Offload] Update test to use `target parallel for reduction` Re-enable test disabled in 1bf1f93d with a fix.	2024-05-30 09:17:17 -05:00
Krzysztof Parzyszek	1bf1f93d94	[Offload] Temporarily disable failing test after eb88e7c1 The `target reduction` combination is no longer accepted. Disable the test to avoid build failures, until a better fix is ready.	2024-05-30 08:52:29 -05:00
Johannes Doerfert	330d8983d2	[Offload] Move `/openmp/libomptarget` to `/offload` (#75125 ) In a nutshell, this moves our libomptarget code to populate the offload subproject. With this commit, users need to enable the new LLVM/Offload subproject as a runtime in their cmake configuration. No further changes are expected for downstream code. Tests and other components still depend on OpenMP and have also not been renamed. The results below are for a build in which OpenMP and Offload are enabled runtimes. In addition to the pure `git mv`, we needed to adjust some CMake files. Nothing is intended to change semantics. ``` ninja check-offload ``` Works with the X86 and AMDGPU offload tests ``` ninja check-openmp ``` Still works but doesn't build offload tests anymore. ``` ls install/lib ``` Shows all expected libraries, incl. - `libomptarget.devicertl.a` - `libomptarget-nvptx-sm_90.bc` - `libomptarget.rtl.amdgpu.so` -> `libomptarget.rtl.amdgpu.so.18git` - `libomptarget.so` -> `libomptarget.so.18git` Fixes: https://github.com/llvm/llvm-project/issues/75124 --------- Co-authored-by: Saiyedul Islam <Saiyedul.Islam@amd.com>	2024-04-22 09:51:33 -07:00

30 Commits
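
The design choice described in commit 84d511df8d (adding one table entry per registered address instead of passing a size into the lookup) can be illustrated with a small sketch. This is not the libomptarget implementation: the helper names `build_table` and `lookup`, the 8-byte `SLOT_SIZE`, the addresses, and returning `None` for an unregistered address are all hypothetical, chosen only to show why expanding a registration into several entries keeps the lookup a plain exact-match binary search.

```python
from bisect import bisect_left

SLOT_SIZE = 8  # assumed pointer-sized slot (hypothetical value for this sketch)


def build_table(registrations):
    """Expand (host_addr, device_addr, size_in_bytes) registrations.

    A registration larger than one slot (for example a vtable) becomes one
    entry per slot, so every address inside it can be matched exactly.
    """
    entries = []
    for host, device, size in registrations:
        for offset in range(0, max(size, SLOT_SIZE), SLOT_SIZE):
            entries.append((host + offset, device + offset))
    entries.sort()
    hosts = [h for h, _ in entries]
    devices = [d for _, d in entries]
    return hosts, devices


def lookup(table, host_addr):
    """Exact-match binary search; no per-step size check is required."""
    hosts, devices = table
    i = bisect_left(hosts, host_addr)
    if i < len(hosts) and hosts[i] == host_addr:
        return devices[i]
    return None  # sketch-only convention for "not registered"


table = build_table([
    (0x1000, 0xA000, 8),   # a single indirect function pointer
    (0x2000, 0xB000, 24),  # a vtable spanning three slots
])
```

With this layout, `lookup(table, 0x2010)` resolves an address in the middle of the vtable directly. The alternative mentioned in the commit message would keep a single entry per registration and compare against a `[start, start + size)` range at each search step.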