
Dominik Adamski b69fd34e76

[Offload] Add oneInterationPerThread param to loop device RTL (#151959)

Currently, Flang can generate no-loop kernels for all OpenMP target
kernels in the program if the flags
-fopenmp-assume-teams-oversubscription or
-fopenmp-assume-threads-oversubscription are set.
If we add an additional parameter, we can choose
in the future which OpenMP kernels should be generated as no-loop
kernels.

This PR doesn't modify the current behavior of the oversubscription flags.

RFC for no-loop kernels:
https://discourse.llvm.org/t/rfc-no-loop-mode-for-openmp-gpu-kernels/87517

2025-08-21 09:03:56 +02:00

cmake

[NFC][CMake] quote ${CMAKE_SYSTEM_NAME} consistently (#154537)

2025-08-20 12:45:41 -04:00

DeviceRTL

[Offload] Add oneInterationPerThread param to loop device RTL (#151959)

2025-08-21 09:03:56 +02:00

docs

[Offload] Add Offload API Sphinx documentation (#147323)

2025-07-10 11:50:51 +01:00

include

[Offload] Introduce ATTACH map-type support for pointer attachment. (#149036)

2025-08-17 15:17:04 -07:00

liboffload

[Offload] Guard olMemAlloc/Free with a mutex (#153786)

2025-08-20 13:23:57 +01:00

libomptarget

[Offload] Introduce ATTACH map-type support for pointer attachment. (#149036)

2025-08-17 15:17:04 -07:00

plugins-nextgen

[Offload] Add olCalculateOptimalOccupancy (#142950)

2025-08-19 15:16:47 +01:00

test

Fix test added in 1fd1d634630754cc9b9c4b5526961d5856f64ff9

2025-08-18 13:29:23 +01:00

tools

[Offload] Define additional device info properties (#152533)

2025-08-19 13:02:01 +01:00

unittests

[Offload][Conformance] Add RandomGenerator for large input spaces (#154252)

2025-08-20 13:37:01 -05:00

utils

…

CMakeLists.txt

[Offload] Add Offload API Sphinx documentation (#147323)

2025-07-10 11:50:51 +01:00

Maintainers.md

[Offload] Add 'Maintainers.md' file for offload (#138177)

2025-05-01 14:06:33 -05:00

README.md

[Offload][NFC] Update README.md

2024-11-17 07:32:29 -08:00

README.txt

…

README.md

The LLVM/Offload Subproject

The Offload subproject aims to provide tooling, runtimes, and APIs that allow users to execute code on accelerators or other "co-processors" that may or may not match the architecture of their "host". In the long run, all kinds of targets are in scope of this effort, including but not limited to: CPUs, GPUs, FPGAs, AI/ML accelerators, distributed resources, etc.

For OpenMP offload users, the project is ready and fully usable. The final API design is still under development. More content will show up here and on our webpage soon. In the meantime, people are encouraged to participate in our meetings (see below) and check our development board as well as the discussions on Discourse.

Meetings

Every second Wednesday, 7:00 - 8:00am PT, starting Jan 24, 2024. Alternates with the OpenMP in LLVM meeting. See invite.ics and the Meeting Minutes and Agenda.