Go to file

AMDGPU: Move enqueued block handling into clang (#128519 )

The previous implementation wasn't maintaining a faithful IR
representation of how this really works. The value returned by
createEnqueuedBlockKernel wasn't actually used as a function, and
hacked up later to be a pointer to the runtime handle global
variable. In reality, the enqueued block is a struct where the first
field is a pointer to the kernel descriptor, not the kernel itself. We
were also relying on passing around a reference to a global using a
string attribute containing its name. It's better to base this on a
proper IR symbol reference during final emission.

This now avoids using a function attribute on kernels and avoids using
the additional "runtime-handle" attribute to populate the final
metadata. Instead, associate the runtime handle reference to the
kernel with the !associated global metadata. We can then get a final,
correctly mangled name at the end.

I couldn't figure out how to get rename-with-external-symbol behavior
using a combination of comdats and aliases, so leaves an IR pass to
externalize the runtime handles for codegen. If anything breaks, it's
most likely this, so leave avoiding this for a later step. Use a
special section name to enable this behavior. This also means it's
possible to declare enqueuable kernels in source without going through
the dedicated block syntax or other dedicated compiler support.

We could move towards initializing the runtime handle in the
compiler/linker. I have a working patch where the linker sets up the
first field of the handle, avoiding the need to export the block
kernel symbol for the runtime. We would need new relocations to get
the private and group sizes, but that would avoid the runtime's
special case handling that requires the device_enqueue_symbol metadata
field.

https://reviews.llvm.org/D141700

2025-03-10 19:54:04 +07:00

.ci

[CI] Add Logging for Workflow Jobs

2025-03-01 03:06:57 +00:00

.github

[libclang/python] Update maximum Python version for CI to 3.13 (#130385 )

2025-03-08 10:09:04 +01:00

bolt

[BOLT][AArch64] Keep relocations for linker-relaxed instructions. NFCI (#129980 )

2025-03-05 23:06:01 -08:00

clang

AMDGPU: Move enqueued block handling into clang (#128519 )

2025-03-10 19:54:04 +07:00

clang-tools-extra

[clang-tidy] Fix invalid fixit from modernize-use-ranges for nullptr used with std::unique_ptr (#127162 )

2025-03-09 20:09:59 +08:00

cmake

Bump version to 21.0.0git (#124870 )

2025-01-28 19:48:43 -08:00

compiler-rt

Spelling in lit.cfg.py

2025-03-10 11:27:23 +00:00

cross-project-tests

[Clang][AMDGPU] Use 32-bit index for SWMMAC builtins (#129101 )

2025-02-27 23:28:48 -05:00

flang

[flang] Move parser invocations into ParserActions (#130309 )

2025-03-10 11:33:47 +00:00

flang-rt

[Flang] explicitly link the pthread library when building shared flang-rt. (#129956 )

2025-03-07 14:13:38 -05:00

libc

[libc] Added type-generic macros for fixed-point functions (#129371 )

2025-03-09 00:11:34 -05:00

libclc

[libclc] Stop installing CLC headers (#126908 )

2025-03-06 08:52:23 +00:00

libcxx

[libc++][CI] Update action runner base image. (#130433 )

2025-03-09 17:36:10 +01:00

libcxxabi

[libc++abi] Add a missing include for abort() (#126865 )

2025-02-12 14:18:02 +01:00

libunwind

[libunwind][RISCV] Make asm statement volatile (#130286 )

2025-03-10 10:13:33 +01:00

lld

Reland [lld][LoongArch] Relax call36/tail36: R_LARCH_CALL36

2025-03-10 11:02:23 +08:00

lldb

[lldb] Remove an extraneous printf statement. (#130453 )

2025-03-10 11:38:03 +00:00

llvm

AMDGPU: Move enqueued block handling into clang (#128519 )

2025-03-10 19:54:04 +07:00

llvm-libgcc

[runtimes] Correctly apply libdir subdir for multilib (#93354 )

2024-05-31 11:48:45 -07:00

mlir

[mlir] Refactor ConvertVectorToLLVMPass options (#128219 )

2025-03-10 10:32:03 +00:00

offload

[Flang][OpenMP][MLIR] Implement close, present and ompx_hold modifiers for Flang maps (#129586 )

2025-03-07 22:22:30 +01:00

openmp

[OMPD] Remove deprecated/unused module that is causing error (#127434 ) (#129999 )

2025-03-06 06:13:18 -06:00

polly

[IR] Store Triple in Module (NFC) (#129868 )

2025-03-06 10:27:47 +01:00

pstl

Bump version to 21.0.0git (#124870 )

2025-01-28 19:48:43 -08:00

runtimes

[libc++] Diagnose when nullptrs are passed to string APIs (#122790 )

2025-02-27 22:57:19 +01:00

third-party

[benchmark] Sync a few commits from upstream to help with CPU count (#126410 )

2025-02-10 00:06:25 -05:00

utils/bazel

[libc][bazel] Introduce libc_test_library macros. (#130355 )

2025-03-07 15:09:23 -08:00

.clang-format

…

.clang-tidy

Disable clang-tidy misc-include-cleaner (#83945 )

2024-03-05 12:14:09 -08:00

.git-blame-ignore-revs

[lldb] Add lldb/source/Host/posix/MainLoopPosix.cpp to git blame ignores

2024-12-18 09:46:06 +00:00

.gitattributes

Revert "Finally formalise our defacto line-ending policy"

2024-10-18 21:16:24 +01:00

.gitignore

[JITLink] Switch to SymbolStringPtr for Symbol names (#115796 )

2024-12-06 10:22:09 +11:00

.mailmap

add me to mailmap (#126226 )

2025-02-13 17:49:48 +00:00

CODE_OF_CONDUCT.md

…

CONTRIBUTING.md

Update CONTRIBUTING.md to remove the not about not accepting PR

2023-09-10 15:21:06 -07:00

LICENSE.TXT

…

pyproject.toml

[Py Reformat] Exclude third-party from reformat (#83491 )

2024-03-02 14:51:06 -08:00

README.md

[docs] README: Switch link to clang.llvm.org to use HTTPS.

2024-02-17 12:28:31 +01:00

SECURITY.md

…

README.md

The LLVM Compiler Infrastructure

Welcome to the LLVM project!

This repository contains the source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The LLVM project has multiple components. The core of the project is itself called "LLVM". This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer.

C-like languages use the Clang frontend. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

Consult the Getting Started with LLVM page for information on building and running LLVM.

For information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting in touch

Join the LLVM Discourse forums, Discord chat, LLVM Office Hours or Regular sync-ups.

The LLVM project has adopted a code of conduct for participants to all modes of communication within the project.

Languages

LLVM 42.4%

C++ 30.1%

C 12.8%

Assembly 9.8%

MLIR 1.6%

Other 2.9%