[Offload] Make the RPC thread sleep briefly when idle (#168596)

Summary:
We start this thread if the RPC client symbol is detected in the loaded
binary. The thread should sleep when there is no work, so that it does
not keep running at high priority while waiting for the (scarcely used)
RPC call to actually be required. Right now, after 25 microseconds
without work we assume the server is inactive and begin sleeping. This
resets once we do find work.
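
The idle behavior described above can be illustrated with a minimal sketch. This is not the code from the Offload plugins: `IdlePoller`, `TryHandleWork`, `NumSleeps`, and the 250 microsecond sleep interval are all hypothetical names and values chosen for illustration. Only the 25 microsecond idle threshold and the reset-on-work behavior come from the commit message.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <functional>
#include <thread>

// Hypothetical polling loop; not the actual Offload RPC server thread.
struct IdlePoller {
  // Idle threshold taken from the commit message.
  std::chrono::microseconds IdleThreshold{25};
  // Assumed sleep length; the commit message does not state one.
  std::chrono::microseconds SleepInterval{250};
  // Number of times the loop chose to sleep (for illustration and testing).
  unsigned long NumSleeps = 0;

  // Calls TryHandleWork repeatedly until Running becomes false.
  // TryHandleWork returns true if it found and handled work.
  void run(const std::atomic<bool> &Running,
           const std::function<bool()> &TryHandleWork) {
    auto LastWork = std::chrono::steady_clock::now();
    while (Running.load(std::memory_order_acquire)) {
      if (TryHandleWork()) {
        // Finding work resets the idle timer, so the loop spins again.
        LastWork = std::chrono::steady_clock::now();
        continue;
      }
      // No work for at least IdleThreshold: yield the CPU for a while.
      if (std::chrono::steady_clock::now() - LastWork >= IdleThreshold) {
        std::this_thread::sleep_for(SleepInterval);
        ++NumSleeps;
      }
    }
  }
};
```

A fixed sleep trades a small amount of wake-up latency for not occupying a core while idle. A signal-based wake-up, as the commit message mentions for AMD, would avoid that latency but is outside the scope of this sketch.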

AMD supports a more intelligent way to do this. HSA signals can wake a
sleeping thread from the kernel, and signals can be sent from the GPU
side. This would be nice to have and I'm planning on working with it in
the future to make this infrastructure more usable with existing AMD
workloads.

2025-11-19 15:56:25 -06:00

cmake

[Runtimes] Default build must use its own output dirs (#168266)

2025-11-19 13:51:14 +01:00

docs

[Offload] Add Offload API Sphinx documentation (#147323)

2025-07-10 11:50:51 +01:00

include

Revert "[OpenMP] Implement omp_get_uid_from_device() / omp_get_device_from_uid()" (#168547)

2025-11-18 15:10:42 +00:00

liboffload

[Offload] Add device info for shared memory (#167817)

2025-11-13 11:00:12 -08:00

libomptarget

Revert "[OpenMP] Implement omp_get_uid_from_device() / omp_get_device_from_uid()" (#168547)

2025-11-18 15:10:42 +00:00

plugins-nextgen

[Offload] Make the RPC thread sleep briefly when idle (#168596)

2025-11-19 15:56:25 -06:00

test

[Runtimes] Default build must use its own output dirs (#168266)

2025-11-19 13:51:14 +01:00

tools

[Offload] Add device info for shared memory (#167817)

2025-11-13 11:00:12 -08:00

unittests

[Offload] Add device info for shared memory (#167817)

2025-11-13 11:00:12 -08:00

utils

…

CMakeLists.txt

[Runtimes] Default build must use its own output dirs (#168266)

2025-11-19 13:51:14 +01:00

Maintainers.md

[Offload] Add 'Maintainers.md' file for offload (#138177)

2025-05-01 14:06:33 -05:00

README.md

[Offload][NFC] Update README.md

2024-11-17 07:32:29 -08:00

README.txt

…

README.md

The LLVM/Offload Subproject

The Offload subproject aims to provide tooling, runtimes, and APIs that allow users to execute code on accelerators or other "co-processors" that may or may not match the architecture of their "host". In the long run, all kinds of targets are in scope of this effort, including but not limited to: CPUs, GPUs, FPGAs, AI/ML accelerators, distributed resources, etc.

For OpenMP offload users, the project is ready and fully usable. The final API design is still under development. More content will show up here and on our webpage soon. In the meantime, people are encouraged to participate in our meetings (see below) and check our development board as well as the discussions on Discourse.

Meetings

Every second Wednesday, 7:00 - 8:00am PT, starting Jan 24, 2024. Alternates with the OpenMP in LLVM meeting.

invite.ics

Meeting Minutes and Agenda