llvm-project/surface_indirect_functions.h at 2abcdd8cf08b9a170e6e5ad1b9facbf71135522f - llvm-project - shylie's gitea

shylie/llvm-project

Austin Schuh 2abcdd8cf0

[CUDA] Add support for CUDA surfaces (#132883 )

This adds support for all the surface read and write calls to clang. It
extends the pattern used for textures to surfaces too.

I tested this by generating all the various permutations of the calls
and argument types in a python script, compiling them with both clang
and nvcc, and comparing the generated ptx for equivilence. They all
agree, ignoring register allocation, and some places where Clang picks
different memory write instructions. An example kernel is:

```
__global__ void testKernel(cudaSurfaceObject_t surfObj, int x, float2* result) {
    *result = surf1Dread<float2>(surfObj, x, cudaBoundaryModeZero);
}
```

---------

Signed-off-by: Austin Schuh <austin.linux@gmail.com>

2025-04-03 10:08:02 -07:00

3 lines

66 B

C

Raw Blame History

	`// required for __clang_cuda_runtime_wrapper.h tests`
	`#pragma once`