Youngsuk Kim
|
0f0a96b862
|
[llvm][NVPTX] Strip unneeded '+0' in PTX load/store (#113017)
Remove the extraneous '+0' immediate offset part in PTX load/stores, to
improve readability of output PTX code.
|
2024-10-19 10:05:36 -04:00 |
|
Youngsuk Kim
|
5a0942cd74
|
[llvm][NVPTX] Don't emit unused var 'temp_param_reg' (NFC) (#89004)
Don't emit unused variable 'temp_param_reg' which has been around since
ae556d3ef72dfe5f40a337b7071f42b7bf5b66a4 .
|
2024-04-17 14:45:33 -04:00 |
|
Adrian Kuegel
|
f0a5e50550
|
[llvm][NVPTX] Add missing feature guard.
|
2024-03-19 06:53:14 +00:00 |
|
Alex MacLean
|
89b7b3b995
|
[NVPTX] support dynamic allocas with PTX alloca instruction (#84585)
Add support for dynamically sized alloca instructions with the PTX
alloca instruction introduced in PTX 7.3
([9.7.15.3. Stack Manipulation Instructions: alloca]
(https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#stack-manipulation-instructions-alloca))
|
2024-03-15 11:51:46 -07:00 |
|
Youngsuk Kim
|
f9304974cc
|
[llvm][NVPTX] Inform that 'DYNAMIC_STACKALLOC' is unsupported (#74684)
Catch unsupported path early up, and emit error with information.
Motivated by the following threads:
* https://discourse.llvm.org/t/nvptx-problems-with-dynamic-alloca/70745
* #64017
|
2023-12-14 22:06:22 -05:00 |
|