llvm-project

Leandro Lacerda 08ff017fb0

[libc] Improve GPU benchmarking (#153512)

This patch improves GPU benchmarking in the following ways:

* Replace `rand`/`srand` with a deterministic per-thread RNG seeded by
`call_index`: reproducible, apples-to-apples libc vs vendor comparisons.
* Fix input generation: sample the unbiased exponent uniformly in
`[min_exp, max_exp]`, clamp bounds, and skip `Inf`, `NaN`, `-0.0`, and
`+0.0`.
* Fix standard deviation: use an explicit estimator from sums and
sums-of-squares (`sqrt(E[x^2] − E[x]^2)`) across samples.
* Fix throughput overhead: subtract a loop-only baseline inside
NVPTX/AMDGPU timing backends so `benchmark()` gets cycles-per-call
already corrected (no `overhead()` call).
* Adapt existing math benchmarks to the new RNG/timing plumbing (pass
`call_index` through, drop `rand`/`srand`, clean up includes).
* Correct inter-thread aggregation: use iteration-weighted pooling to
compute the global mean/variance, ensuring statistically sound `Cycles
(Mean)` and `Stddev`.
* Remove `Time / Iteration` column from the results table: it reported
per-thread convergence time (not per-call latency) and was
redundant/misleading next to `Cycles (Mean)`.
* Remove unused `BenchmarkLogger` files: dead code that added
maintenance and cognitive overhead without providing functionality.
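
The standard-deviation and aggregation items above can be illustrated with a minimal sketch. It is not the code from this patch: `ThreadStats`, `PooledStats`, and `pool` are hypothetical names, and the sketch assumes each thread reports its iteration count, mean, and population variance. Each thread is weighted by its iteration count, the global sums and sums of squares are rebuilt from the per-thread values, and the estimator `sqrt(E[x^2] - E[x]^2)` is applied to the pooled sample.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical per-thread summary; the field names are illustrative only.
struct ThreadStats {
  std::uint64_t iterations; // number of samples this thread collected
  double mean;              // per-thread mean cycles per call
  double variance;          // per-thread population variance
};

struct PooledStats {
  double mean;
  double stddev;
};

// Pools per-thread results, weighting each thread by its iteration count.
inline PooledStats pool(const std::vector<ThreadStats> &threads) {
  std::uint64_t total_n = 0;
  double sum = 0.0;    // rebuilt sum of x over all samples
  double sum_sq = 0.0; // rebuilt sum of x^2 over all samples
  for (const ThreadStats &t : threads) {
    const double n = static_cast<double>(t.iterations);
    total_n += t.iterations;
    sum += n * t.mean;
    // E[x^2] = Var[x] + E[x]^2, so the sum of x^2 is n * (variance + mean^2).
    sum_sq += n * (t.variance + t.mean * t.mean);
  }
  assert(total_n > 0 && "need at least one sample");
  const double n = static_cast<double>(total_n);
  const double mean = sum / n;
  // Clamp at zero: rounding can make the difference slightly negative.
  const double variance = std::fmax(0.0, sum_sq / n - mean * mean);
  return {mean, std::sqrt(variance)};
}
```

Averaging the per-thread means without weights would give every thread equal influence regardless of how many iterations it ran. Weighting by iteration count makes the result match what a single pass over all samples would produce.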

---

## TODO (before merge)

* [ ] Investigate compiler warnings and address their root causes.
* [x] Review how per-thread results are aggregated into the overall
result.

## Follow-ups (future PRs)

* Add support to run throughput benchmarks with uniform (linear) input
distributions, alongside the current log2-uniform scheme.
* Review/adjust the configuration and coverage of existing math
benchmarks.
* Add more math benchmarks (e.g., `exp`/`expf`, others).
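
For reference, the log2-uniform scheme mentioned above can be sketched as follows. This is an illustration under stated assumptions, not the patch's implementation: `sample_log2_uniform` is a hypothetical name, SplitMix64 stands in for whatever deterministic generator the patch uses, and only positive `float` values are produced. The unbiased exponent is drawn uniformly from the clamped `[min_exp, max_exp]` range, so the result is never zero, `Inf`, or `NaN`.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>

// SplitMix64 step: a small deterministic generator. It is a stand-in chosen
// for this sketch; the patch's actual RNG may differ.
inline std::uint64_t splitmix64(std::uint64_t &state) {
  state += 0x9E3779B97F4A7C15ULL;
  std::uint64_t z = state;
  z = (z ^ (z >> 30)) * 0xBF58476D1CE4E5B9ULL;
  z = (z ^ (z >> 27)) * 0x94D049BB133111EBULL;
  return z ^ (z >> 31);
}

// Returns a positive normal float whose unbiased exponent is (approximately)
// uniform in [min_exp, max_exp]. Seeding from call_index makes the same call
// index always yield the same input.
inline float sample_log2_uniform(std::uint64_t call_index, int min_exp,
                                 int max_exp) {
  // Clamp to the normal float exponent range to rule out 0, Inf, and NaN.
  min_exp = std::max(min_exp, -126);
  max_exp = std::min(max_exp, 127);
  assert(min_exp <= max_exp);

  std::uint64_t state = call_index;
  const std::uint64_t span = static_cast<std::uint64_t>(max_exp - min_exp) + 1;
  // The modulo has a negligible bias because span is at most 254.
  const int exp = min_exp + static_cast<int>(splitmix64(state) % span);

  // The top 23 random bits form the fraction, giving a significand in [1, 2).
  const std::uint64_t frac_bits = splitmix64(state) >> 41;
  const float significand =
      1.0f + std::ldexp(static_cast<float>(frac_bits), -23);
  return std::ldexp(significand, exp);
}
```

A uniform (linear) distribution over the same interval would instead concentrate almost all samples in the top few binades, which is why the two schemes exercise different code paths in a math function.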

2025-08-15 11:00:17 -05:00

| Name | Last commit | Date |
| --- | --- | --- |
| AOR_v20.02 | [libc][NFC] Remove all trailing spaces from libc (#82831) | 2024-02-23 16:34:00 -06:00 |
| benchmarks | [libc] Improve GPU benchmarking (#153512) | 2025-08-15 11:00:17 -05:00 |
| cmake | Revert "[libc] Add -Wextra for libc tests" (#153169) | 2025-08-12 11:40:14 +00:00 |
| config | [libc][math][c++23] Add bf16fma{,f,l,f128} math functions (#153231) | 2025-08-13 23:26:15 +05:30 |
| docs | [libc][math][docs] Add documentation for BFloat16 type (#153475) | 2025-08-15 20:07:33 +05:30 |
| examples | [libc] Fix broken links in libc (#145199) | 2025-06-23 15:51:43 -07:00 |
| fuzzing | [libc] Fuzz tests for fsqrt, f16sqrt, and hypot (#150489) | 2025-07-25 17:15:26 +00:00 |
| hdr | [libc] Add struct_sched_param proxy header (#151722) | 2025-08-01 11:34:06 -07:00 |
| include | [libc] Fix typo and amend restrict qualifier (#152410) | 2025-08-07 16:45:14 -07:00 |
| lib | [libc] Fix building bitcode library for GPU (#100491) | 2024-07-26 13:17:17 -05:00 |
| shared | [libc][math] Refactor coshf implementation to header-only in src/__support/math folder. (#153427) | 2025-08-14 17:19:47 +03:00 |
| src | [libc][math] Refactor coshf implementation to header-only in src/__support/math folder. (#153427) | 2025-08-14 17:19:47 +03:00 |
| startup | [libc] Add startup code for ARM v7-A, ARM v7-R variants (#153576) | 2025-08-15 09:17:50 +00:00 |
| test | [libc] Fix mbrtowc test (#153721) | 2025-08-15 11:44:33 -03:00 |
| utils | [libc][math][c++23] Add bf16fma{,f,l,f128} math functions (#153231) | 2025-08-13 23:26:15 +05:30 |
| .clang-tidy | [libc] fix readability-identifier-naming.ConstexprFunctionCase (#83345) | 2024-02-28 14:52:02 -08:00 |
| .gitignore | [libc][Obvious] Add build folder to .gitignore. | 2022-03-04 13:16:55 -05:00 |
| CMakeLists.txt | [libc] Add hooks for extra options in running hermetic tests (#147931) | 2025-07-15 11:43:51 +01:00 |
| LICENSE.TXT | … | |
| Maintainers.rst | [libc] Add myself as maintainer for Public Headers / hdrgen (#135209) | 2025-04-11 11:33:52 -07:00 |
| README.txt | … | |

README.txt

LLVM libc
=========

This directory and its subdirectories contain source code for llvm-libc,
a retargetable implementation of the C standard library.

LLVM is open source software. You may freely distribute it under the terms of
the license agreement found in LICENSE.TXT.