llvm-project

Author	SHA1	Message	Date
Jan Patrick Lehr	ee797883e8	[Offload] Escape \; in command string (#186120 ) This adds a \ in front of the ; between the two cache files to stop the run function to interpret it as a shell statement separator (or so).	2026-03-12 15:02:40 +01:00
Jan Patrick Lehr	bb72ec480f	[Offload] AMD Flang bot to use CMake cache file (#186070 ) Converting the current bot config to use the CMake cache file that we use in other bots (offload/cmake/caches/AMDGPUBot.cmake). This PR removes all CMake settings that the cache file already sets and only leaves those that were either not set explicitly or which differ. Thus, first load the cache file and then adjust the settings to override existing values.	2026-03-12 14:10:04 +01:00
Kevin Sala Penades	1f583c6dee	[OpenMP][Offload] Add offload runtime support for dyn_groupprivate clause (#152831 ) Part 3 adding offload runtime support. See https://github.com/llvm/llvm-project/pull/152651. --------- Co-authored-by: Krzysztof Parzyszek <Krzysztof.Parzyszek@amd.com>	2026-03-12 01:13:06 -07:00
Łukasz Plewa	0e122bea82	[OFFLOAD] Enable Level Zero unittests (#185492 )	2026-03-11 14:09:59 +00:00
Amit Tiwari	a15dcd4117	[Clang][OpenMP] Handled `NonContig` Descriptor `DimCount` (#181987 ) ### Issue: Dimension override missing When variable count expressions were used with stride, the constant subsection path computed size first. This marked `ArgSizes` with byte size semantics. Variable expression logic later triggered, but reused `ArgSizes` assuming "bytes" semantics `OMPIRBuilder.cpp` didn't handle dimension count for `OMP_MAP_NON_CONTIG` flag Result: `ArgSizes` wasn't overwritten with dimension count, breaking non-contiguous mapping. Fixes: `llvm/lib/Frontend/OpenMP/OMPIRBuilder.cpp` - Expression semantics for non-contiguous. stride/count. Generate 3D descriptor structures with runtime dimensions. Fix dimension override to use dimension count instead of byte size. Added testcases to cover stack arrays, heap pointers, struct members, etc.	2026-03-11 19:39:55 +05:30
Alex Duran	789fea83bb	[offload][l0][nfc] remove duplicated entry (#185855 ) Remove left over function by mistake from #185404	2026-03-11 11:55:30 +01:00
Alex Duran	3ff332ad0f	[Offload][L0] Add support for OffloadBinary format in L0 plugin (#185404 ) - Accept OffloadBinaries as valid images by plugins that support them in the PluginInterface. - Add support in L0 plugin to extract SPIRV images and their associated metadata from an OffloadBinary image. Depends on: - #185663 Follow-up PRs: - #185413 (Changes SPIRV wrapper generation to use OffloadBinary) - #185425 (Adjusts llvm-objdump) - #184774 (Adjusts llvm-offload-binary)	2026-03-11 11:42:36 +01:00
Joseph Huber	fd069a46bf	[copmiler-rt] Initial support for building profile library on the GPU (#185552 ) Summary: As suggested in https://github.com/llvm/llvm-project/pull/177665, we should build a GPU version of the compiler-rt profile library instead of writing it in-line in the lowering. This PR does not define anything GPU specific, it simply re-uses the baremetal handling. Later PRs will prevent the GPU specific handling we would want to do to optimize counter handling on the GPU. Note that this will require using the cache file, or setting these options manually for existing users. Hopefully if people are using the cache file as they should it won't break anything.	2026-03-10 13:45:18 -05:00
Alex Duran	be021b8433	[OFFLOAD] Add interface to extend image validation (#185663 ) As discussed in #185404 we might want to provide a way for plugins to validate images not recognized by the common layer. This PR adds such extension and uses it to validate pure SPIRV images by the Level Zero plugin.	2026-03-10 18:41:23 +01:00
Amit Tiwari	14de1bb711	[Clang][OpenMP] Support expression semantics in target update fields with non-contiguous array sections (#176708 ) ### Issue: Variable stride not recognized as non-contiguous `CGOpenMPRuntime.cpp` failed to detect `DeclRefExpr`, `MemberExpr`, `ArraySubscriptExpr` as non-contiguous. Fixes: `clang/lib/CodeGen/CGOpenMPRuntime.cpp` - Variable stride detection + dimension count logic Detect variable stride expressions (`DeclRefExpr/MemberExpr/ArraySubscriptExpr`) as non-contiguous Added testcases to cover stack arrays, heap pointers, struct members, etc., for expression semantics in non-contiguous update.	2026-03-10 19:26:17 +05:30
Joseph Huber	a9e457a82f	[Offload][AMDGPU] Fix RPC server on mixed w32 w64 workloads (#185496 ) Summary: This was a regression from the original LLVM-gpu-loader. We used to handle `-mwavefrontsize64` correctly in the loader by over-allocating memory and just leaving the upper 32-bits masked off. In order to handle this in offload we need to scan loaded kernels to see how much memory we need to allocate. This should be safe, the protocol is designed to handle an arbitrary size and worst-case this just wastes space.	2026-03-09 17:13:59 -05:00
Kareem Ergawy	0bf9bb5c42	[Flang][OpenMP] Fix close map flag propagation for derived types in USM (#185330 ) This fixes a bug in USM mode where the `close` map type modifer was attached to some `map.info.op`'s corresponding to user-defined type members while the parent type instance itself is not marked as `close`. This fix ensures that if a parent record type map does not have the 'close' flag, it is cleared from its members as well, maintaining consistency. Gemini was used to create tests. AI generated test code was reviewed line-by-line by me. Which were derived from a reproducer I was working with to debug the issue. Assisted-by: Gemini <gemini@google.com>	2026-03-09 15:55:53 +01:00
Łukasz Plewa	57614e8810	[OFFLOAD] Replace C-style casts with C++ style casts in obtainInfoImpl (#185023 ) Replace C-style bool casts (bool)TmpInt with C++ functional casts bool(TmpInt)	2026-03-06 10:28:38 -06:00
Jason Van Beusekom	2d4c8e0d0f	[OpenMP][clang] Indirect and Virtual function call mapping from host to device (#184412 ) This patch implements the CodeGen logic for calling __llvm_omp_indirect_call_lookup on the device when an indirect function call or a virtual function call is made within an OpenMP target region. --------- Co-authored-by: Youngsuk Kim	2026-03-03 13:20:24 -06:00
Jason Van Beusekom	f95662d159	Revert "[OpenMP][clang] Indirect and Virtual function call mapping from host to device" (#184378 ) Reverts llvm/llvm-project#159857	2026-03-03 17:11:14 +00:00
Jason Van Beusekom	b23438661c	[OpenMP][clang] Indirect and Virtual function call mapping from host to device (#159857 ) This patch implements the CodeGen logic for calling __llvm_omp_indirect_call_lookup on the device when an indirect function call or a virtual function call is made within an OpenMP target region. --------- Co-authored-by: Youngsuk Kim	2026-03-03 02:52:34 +00:00
Abhinav Gaba	0ced81f7ea	[NFC][OpenMP] Remove redundant prints in `target` regions from tests added in #184260 . (#184266 ) Some buildbots don't like them, and the correctness of the values in the `target` region is ensured via prints after the region.	2026-03-03 00:02:28 +00:00
Abhinav Gaba	1d1c83ad73	Reland "[OpenMP][Offload] Handle `present/to/from` when a different entry did `alloc/delete`." (#184260 ) Some tests that were checking for prints inside/outside `target` regions needed to be updated to work on systems where the ordering wasn't deterministic. Reverts llvm/llvm-project#184240 Original description from #165494: ----- OpenMP allows cases like the following: ```c int p1, p2, x; p1 = p2 = &x; ... #pragma omp target_exit_data map(delete: p1[:]) from(p2[0]) ``` Which means, when the runtime encounters the `from` entry, the ref-count may not be zero, but it will go down to zero at the end of the current construct, which should cause the "from" transfer to happen. Similarly, a user may have: ```c struct S { int *p; }; #pragma omp declare_mapper (id1: S s) map(s.p) map(present, alloc: s.p[0:10]) #pragma omp declare_mapper (id2: S s) map(s.p, s.p[0:10]) S s1; // present-check should fail here #pragma omp target_enter_data map(alloc: s.p[0:10]) map(mapper(id1), to: s) // "to" should be honored here #pragma omp target_enter_data map(alloc: s.p[0:10]) map(mapper(id2), to: s) ``` Where the allocation happens before the "to" entry is encountered by the runtime. Or, an allocation happens before a "present" entry is encountered. To handle cases like this, we need to use the state information of previously seen new allocations, deletions, "from" entries, when honoring `to`/`from`/`present` map entries. -----	2026-03-02 15:32:30 -08:00
Abhinav Gaba	42a0fbc2c7	Revert "[OpenMP][Offload] Handle `present/to/from` when a different entry did `alloc/delete`." (#184240 ) Reverts llvm/llvm-project#165494 Some buildbots are not happy about CHECKs enforcing strict ordering of prints inside/after target regions.	2026-03-02 21:52:20 +00:00
Abhinav Gaba	1a7060a7b0	[OpenMP][Offload] Handle `present/to/from` when a different entry did `alloc/delete`. (#165494 ) OpenMP allows cases like the following: ```c int p1, p2, x; p1 = p2 = &x; ... #pragma omp target_exit_data map(delete: p1[:]) from(p2[0]) ``` Which means, when the runtime encounters the `from` entry, the ref-count may not be zero, but it will go down to zero at the end of the current construct, which should cause the "from" transfer to happen. Similarly, a user may have: ```c struct S { int *p; }; #pragma omp declare_mapper (id1: S s) map(s.p) map(present, alloc: s.p[0:10]) #pragma omp declare_mapper (id2: S s) map(s.p, s.p[0:10]) S s1; // present-check should fail here #pragma omp target_enter_data map(alloc: s.p[0:10]) map(mapper(id1), to: s) // "to" should be honored here #pragma omp target_enter_data map(alloc: s.p[0:10]) map(mapper(id2), to: s) ``` Where the allocation happens before the "to" entry is encountered by the runtime. Or, an allocation happens before a "present" entry is encountered. To handle cases like this, we need to use the state information of previously seen new allocations, deletions, "from" entries, when honoring `to`/`from`/`present` map entries.	2026-03-02 13:13:51 -08:00
Krish Gupta	148b10be8a	[flang][OpenMP] Support custom mappers in target update to/from clauses (#169673 ) Implement support for the OpenMP `mapper` modifier on `target update` `to` and `from` clauses in Flang. Semantic name resolution is extended to bind the mapper symbol for `OmpClause::To` and `OmpClause::From` via a shared `ResolveMapperModifier` helper. Lowering is extended in `ClauseProcessor` with a `getMapperIdentifier` template helper to extract the mapper name for both `map` and `target update` clauses and forward it to `omp.map_info`. Fixes #168701. Reviewed By: TIFitis, kparzysz Assited By: Copilot( For review and articulations of messages)	2026-03-02 21:59:56 +05:30
Hansang Bae	8f268e63e4	[Offload] Remove unused data type (#183840 )	2026-02-27 15:46:59 -06:00
Joseph Huber	c49460bae7	[flang-rt] Enable more runtime functions for the GPU target (#183649 ) Summary: This enables primarily `stop.cpp` and `descriptor.cpp`. Requires a little bit of wrangling to get it to compile. Unlike the CUDA build, this build uses an in-tree libc++ configured for the GPU. This is configured without thread support, environment, or filesystem, and it is not POSIX at all. So, no mutexes, pthreads, or get/setenv. I tested stop, but i don't know if it's actually legal to exit from OpenMP offloading.	2026-02-27 12:27:39 -06:00
Johannes Doerfert	9f4636210d	[Offload] Fix type mismatch by using `uint64_t` instead of `size_t` (#183375 ) The variant uses uint64_t, so should the get.	2026-02-25 13:31:03 -08:00
Nick Sarnie	600919ac32	[Offload][clang-linker-wrapper][SPIRV] Tell spirv-link to not optimize out exported symbols (#182930 ) `spirv-link` seems to internalize all symbols, which ends up causing the OpenMP Device Environment global generated by the OMP FE to get optimized out which causes `liboffload` to run in the wrong parallelization mode which breaks at least one liboffload lit test. Pass `--create-library` to tell it not to do that. ``` --create-library Link the binaries into a library, keeping all exported symbols. ``` This fixes the test. Closes: https://github.com/llvm/llvm-project/issues/182901 Signed-off-by: Nick Sarnie <nick.sarnie@intel.com>	2026-02-24 15:10:59 +00:00
Hansang Bae	a347e1298c	[Offload] Enable memory usage printing with `alloc` debug type (#182938 )	2026-02-23 17:19:41 -06:00
Jan Patrick Lehr	92447ed273	[Offload] Fix copy-elision warning (#182848 ) This fixes a warning about a prohibited copy-elision due to the move of a temporary object.	2026-02-23 13:58:07 +00:00
Alex Duran	7ed0aa2652	[OFFLOAD][L0] Remove leftover global constructor (#182611 ) (#182665 ) fixes #182611	2026-02-21 18:09:46 +01:00
Joseph Huber	70b5a1d050	[flang-rt] Add support for formatted I/O on the GPU (#182580 ) Summary: Expands on the previous support to enable formatted output, characters, and checking basic iostat. We intentionally do not handle cases where the descriptor is non-null as this is a non-trivial class that cannot easily be shepherded across the wire.	2026-02-20 14:43:06 -06:00
Joseph Huber	5f5d27d7d3	[libc] Support array tags in the RPC dispatch helpers (#181395 ) Summary: This PR adds support for tagging a pointer as an array when marshaling between the CPU and GPU.	2026-02-20 09:35:47 -06:00
Joseph Huber	21b3461440	[flang-rt] Implement basic support for I/O from OpenMP GPU Offloading (#181039 ) Summary: This PR provides the minimal support for Fortran I/O coming from a GPU in OpenMP offloading. We use the same support the `libc` uses for its printing through the RPC server. The helper functions `rpc::dispatch` and `rpc::invoke` help make this mostly automatic. Becaus Fortran I/O is not reentrant, the vast majority of complexity comes from needing to stitch together calls from the GPU until they can be executed all at once. This is needed not only because of the limitations of recursive I/O, but without this the output would all be interleaved because of the GPU's lock-step execution. As such, the return values from the intermediate functions are meaningless, all returning true. The final value is correct however. For cookies we create a context pointer on the server to chain these together. Works on both my AMD and NVIDIA GPUs. ```fortran program hello_gpu implicit none !$omp target teams num_teams(1) !$omp parallel num_threads(2) ! Print strings print *, "Hello from GPU" !$omp end parallel !$omp end target teams end program hello_gpu ``` ```console > flang hello.f90 -O2 -fopenmp --offload-arch=gfx1030 > ./a.out Hello from GPU Hello from GPU > flang hello.f90 -O2 -fopenmp --offload-arch=sm_89 > ./a.out Hello from GPU Hello from GPU ```	2026-02-20 07:56:59 -06:00
Nick Sarnie	78ff5b55fd	[offload][lit] Enable/disable tests on Level Zero when using DeviceRTL (#182128 ) Since we can now build the DeviceRTL with SPIR-V, redo the `XFAIL/UNSUPPORTED` specifications for the tests we see passing/failing on the Level Zero backend with the DeviceRTL being used. The tests marked `UNSUPPORTED` hang or sporadically fail and those are tracked in https://github.com/llvm/llvm-project/issues/182119. This change will allow us to enable CI testing with the DeviceRTL. Here are the full test results with this change applied, running only the `spirv64-intel` `check-offload` tests: ``` Total Discovered Tests: 453 Unsupported : 206 (45.47%) Passed : 141 (31.13%) Expectedly Failed: 106 (23.40%) ``` 31% is not a bad start. --------- Signed-off-by: Nick Sarnie <nick.sarnie@intel.com>	2026-02-18 21:53:19 +00:00
Jan Patrick Lehr	e1e0e86e60	[Offload] Always check/consume Error (#182008 ) This fixes an issue introduced in https://github.com/llvm/llvm-project/pull/172226 where an llvm::Error is not checked in the "good" code path.	2026-02-18 13:46:21 +01:00
Joseph Huber	6282a7b993	[Offload] Fix missing end to string in .td file	2026-02-17 15:32:17 -06:00
fineg74	1c6d774baa	[OFFLOAD] Extend olMemRegister API to handle cases when a memory block may have been mapped outside of liboffload. (#172226 ) This PR adds extends liboffload olMemRegister API to handle a case when a memory block may have been mapped before calling olMemRegister to support some use cases in libomptarget	2026-02-17 20:53:00 +00:00
Joseph Huber	d62cd1b89d	[Offload] Add argument to 'olInit' for global configuration options (#181872 ) Summary: This PR adds a pointer argument to the initialization routine to be used for global options. Right now this is used to allow the user to constrain which backends they wish to use. If a null argument is passed, the same behavior as before is observed. This is epxected to be extensible by forcing the user to encode the size of the struct. So, old executables will encode which fields they have access to. We use a macro helper to get this struct rather than a runtime call so that the current state of the size is baked into the executable rather than something looked up by the runtime. Otherwise it would just return the size that the (potentially newer) runtime would see	2026-02-17 14:04:00 -06:00
Nick Sarnie	5317575dd5	Reapply x2 "[Offload][lit] Link against SPIR-V DeviceRTL if present" (#181429 ) The change to `llvm-zorg` to start building the DeviceRTL for SPIR-V on our builder finally got taken by the infra, so we can merge this now. --------- Co-authored-by: Joseph Huber <huberjn@outlook.com>	2026-02-17 15:18:03 +00:00
Michael Kruse	0e1cc6138e	[Offload][test] Use just-compiled lld and llvm-symbolizer (#181793 ) In a bootstrapping build with LLVM_ENABLE_PROJECTS=lld, the lld executable will be found in LLVM's bin/ directory. But `check-offload` will currently ignore it because the JIT plugin will look for `lld` in `$PATH`. Similarly, the sanitizer tests require llvm-symbolizer during execution to FileCheck the expected stack trace. Add the llvm tools directory to `$PATH` so tests can use lld and llvm-symbolizer in it. It should be prefered over a system-installed lld/llvm-symbolizer. Fixes the JIT and sanitizer tests of [openmp-offload-amdgpu-clang-flang](https://lab.llvm.org/staging/#/builders/105).	2026-02-17 13:08:26 +00:00
Joseph Huber	d85576d368	[libc] Replace RPC 'close()' mechanism with RAII handler (#181690 ) Summary: Closing ports was previously done manually, This makes the protocol more error prone as unclosed ports will leak and eventually the locks will run out. I believe the original fear was that the RAII portion would negatively impact code generation but I have not noticed anything significant.	2026-02-16 15:14:30 -06:00
Alex Duran	e0182ebd40	[OFFLOAD] Fix issue where host plugin is added twice to the plugin list (#181346 )	2026-02-13 12:08:04 +01:00
fineg74	b58a31d3ce	[OFFLOAD] Add support for host offloading device (#177307 ) The purpose of this PR is to add support of host as an offloading device to liboffload. Both OpenMP and sycl support offloading to a host as their normal workflow and therefore would require such capability from liboffload library.	2026-02-13 10:27:52 +01:00
Joseph Huber	846e61c0fb	[libc] Small change to accept lambda types in rpc::dispatch Summary: This change allows lambdas to be used in the RPC dispatching functions. Just requires an extra function trait to convert a lambda with no captures into a function pointer. Also rearranged where the `Port` lives because it looks better no that we may use a lambda and it's more consistent with the dispatch usage (putting the client at the start).	2026-02-12 09:18:56 -06:00
Hansang Bae	0deb1b6e05	[Offload] Try to load Level Zero loader with version suffix (#180042 ) The default Level Zero loader `libze_loader.so` may not be available on systems that don't have Level Zero development package. Level Zero loaders with major version suffix are searched in that case.	2026-02-11 15:13:26 -06:00
Joseph Huber	6d6feb7655	[libc] Add RPC helpers for dispatching functions to the host (#179085 ) Summary: The RPC interface is useful for forwarding functions. This PR adds helper functions for doing a completely bare forwarding of a function from the client to the server. This is intended to facilitate heterogenous libraries that implement host functions on the GPU (like MPI or Fortran).	2026-02-11 08:13:52 -06:00
Alex Duran	8b9fd4803c	[OFFLOAD] Support host plugin on Windows (#180401 ) Changes to make host plugin compile on Windows: * Change IO code to be portable * Adjust Makefiles Allow plugin to work partially when libffi support is not found dynamically (compilation works fine even on Windows because of the wrapper support).	2026-02-11 08:54:47 +01:00
Nick Sarnie	6c0ff8d12f	Revert "Reapply [Offload][lit] Link against SPIR-V DeviceRTL if present" (#180743 ) Reverts llvm/llvm-project#180231	2026-02-10 14:22:40 +00:00
Alex Duran	e17226e20b	[OFFLOAD] Implement excluding filters for debugging (#180538 ) Allow a to define a set of Types that are not shown by default when doing default debug loggin (e.g., LIBOMPTARGET_DEBUG=All). Users can enable output of those types of messages by explicitly adding them to LIBOMPTARGET_DEBUG. Used to implement: #180545 --------- Co-authored-by: Michael Klemm <michael.klemm@amd.com>	2026-02-10 10:01:52 +01:00
Nick Sarnie	7fcc12c3be	Reapply [Offload][lit] Link against SPIR-V DeviceRTL if present (#180231 ) I'll merge this at the same time as some llvm-zorg changes that start building the DeviceRTL. We only see one new test passing because everything still fails because of the issue described in https://github.com/llvm/llvm-project/pull/178980 Once a fix for that issue is merged we will see many new passes.	2026-02-09 16:52:33 +00:00
Robert Imschweiler	33e3384aa8	[offload] Adapt tests to new PluginInterface quoting [NFC] (#180505 ) `4096cb6017` removed the quotes around PluginInterface	2026-02-09 07:47:04 -06:00
Nick Sarnie	6844af142e	Revert "[Offload][lit] Link against SPIR-V DeviceRTL if present" (#180211 ) Reverts https://github.com/llvm/llvm-project/pull/180030 Need to make changes to buildbot first	2026-02-06 15:05:16 +00:00

1 2 3 4 5 ...

645 Commits