llvm-project

History

[AMDGPU][SIInsertWaitCnt] Optimize loadcnt insertion at function boundaries (#169647 )

On GFX12+, GLOBAL_INV increments the loadcnt counter but does not write
results to any VGPRs. Previously, we unconditionally inserted
s_wait_loadcnt 0 at function returns even when the only pending loadcnt
was from GLOBAL_INV instructions.

This patch optimizes waitcnt insertion by skipping the loadcnt wait at
function boundaries when no VGPRs have pending loads. This is determined
by checking if any VGPR has a score greater than the lower bound for
LOAD_CNT - if not, the pending loadcnt must be from non-VGPR-writing
instructions like GLOBAL_INV.

The optimization is limited to GFX12+ targets where GLOBAL_INV exists
and uses the extended wait count instructions.

This is a follow-up optimization to PR #135340 which added tracking for
GLOBAL_INV in the waitcnt pass.

2025-12-17 17:53:00 +05:30

benchmarks

…

bindings

[OCaml] Fix build

2025-12-09 17:00:45 +01:00

cmake

llvm/cmake/config.guess: add support for e2k (Elbrus-2000) (#162460 )

2025-12-13 18:35:41 +00:00

docs

[llvm-objdump] Fix memory leak in mcpuHelp() (#172594 )

2025-12-17 10:10:54 +00:00

examples

[LLVM] Add plugin hook for back-ends

2025-12-16 16:33:39 +01:00

include

Revert "[mlir][amdgpu] Expose waitcnt bitpacking infra (#172313 )" (#172636 )

2025-12-17 12:13:44 +00:00

lib

[AMDGPU][SIInsertWaitCnt] Optimize loadcnt insertion at function boundaries (#169647 )

2025-12-17 17:53:00 +05:30

projects

…

resources

…

runtimes

Revert: check-builtins target for LLVM_ENABLE_RUNTIMES (#171940 )

2025-12-11 16:22:27 -08:00

test

[AMDGPU][SIInsertWaitCnt] Optimize loadcnt insertion at function boundaries (#169647 )

2025-12-17 17:53:00 +05:30

tools

[AArch64][llvm-objdump] Fix arm64_32 symbolization (#171164 )

2025-12-17 13:17:39 +01:00

unittests

[IR] Optimize PHINode::removeIncomingValue() by swapping removed incoming value with the last incoming value. (#171963 )

2025-12-17 19:44:01 +08:00

utils

[TableGen][SchedModel] Add logical combiners for SchedPredicates (#172106 )

2025-12-16 13:43:45 -08:00

.clang-format

…

.clang-tidy

…

.gitattributes

…

.gitignore

…

CMakeLists.txt

[llvm][clang] Enable IO sandbox for assert builds (#171935 )

2025-12-16 08:01:02 -08:00

CMakePresets.json

Add CMake configure preset building blocks (#170019 )

2025-12-09 00:07:11 -07:00

configure

…

CREDITS.TXT

…

LICENSE.TXT

…

Maintainers.md

…

README.txt

…

RELEASE_TESTERS.TXT

…

README.txt

The LLVM Compiler Infrastructure
================================

This directory and its subdirectories contain source code for LLVM,
a toolkit for the construction of highly optimized compilers,
optimizers, and runtime environments.

LLVM is open source software. You may freely distribute it under the terms of
the license agreement found in LICENSE.txt.

Please see the documentation provided in docs/ for further
assistance with LLVM, and in particular docs/GettingStarted.rst for getting
started with LLVM and docs/README.txt for an overview of LLVM's
documentation setup.

If you are writing a package for LLVM, see docs/Packaging.rst for our
suggestions.