llvm-project

Author	SHA1	Message	Date
jameshu15869	deb6b45c32	[libc][gpu] Add Atan2 Benchmarks (#104708 ) This PR adds benchmarking for `atan2()`, `__nv_atan2()`, and `__ocml_atan2_f64()` using the same setup as `sin()`. This PR also adds support for throughout bencmarking for functions with 2 inputs.	2024-08-18 12:50:30 -05:00
jameshu15869	1248698e9b	[libc] [gpu] Fix Minor Benchmark UI Issues (#102529 ) Previously, `AmdgpuSinTwoPow_128` and others were too large for their table cells. This PR shortens the name to `AmdSin...` There were also some `-` missing in the separator. This PR instead creates the separator string using the length of the headers.	2024-08-08 15:32:20 -05:00
jameshu15869	39826b1030	[libc] [gpu] Change Time To Be Per Iteration (#101919 ) Previously, the time field was the total time take to run all iterations of the benchmark. This PR changes the value displayed to be the average time take by each iteration.	2024-08-05 08:27:31 -05:00
jameshu15869	677796cab3	[libc] Add Generic and NVPTX Sin Benchmark (#99795 ) This PR adds sin benchmarking for a range of values and on a pregenerated random distribution.	2024-07-29 22:09:11 -05:00
jameshu15869	a09c0f676d	[libc] Add Minimum Time and Iterations, Reduce Epsilon (#100838 ) This PR adds minimums (50 iterations, 500 us, and epsilon of 0.0001) to ensure that all benchmarks run at least a set number of times before outputting a final measurement.	2024-07-26 20:30:19 -05:00
Joseph Huber	6911f823ad	[libc] Fix invalid format specifier in benchmark Summary: This value is a uint32_t but is printed as a uint64_t, leading to invalid offsets when done on AMDGPU due to its packed format extending past the buffer.	2024-07-22 11:21:22 -05:00
jameshu15869	197b142232	[libc] Add N Threads Benchmark Helper (#99834 ) This PR adds a `BENCHMARK_N_THREADS()` helper to register benchmarks with a specific number of threads. This PR replaces the flags used originally to allow any amount of threads.	2024-07-21 21:56:40 -05:00
jameshu15869	a964f2e8a1	[libc] Improve Benchmark UI (#99796 ) This PR changes the output to resemble Google Benchmark. e.g. ``` Running Suite: LlvmLibcIsAlNumGpuBenchmark Benchmark \| Cycles \| Min \| Max \| Iterations \| Time (ns) \| Stddev \| Threads \| ----------------------------------------------------------------------------------------------------- IsAlnum \| 92 \| 76 \| 482 \| 23 \| 86500 \| 76 \| 64 \| IsAlnumSingleThread \| 87 \| 76 \| 302 \| 20 \| 72000 \| 49 \| 1 \| IsAlnumSingleWave \| 87 \| 76 \| 302 \| 20 \| 72000 \| 49 \| 32 \| IsAlnumCapital \| 89 \| 76 \| 299 \| 17 \| 78500 \| 52 \| 64 \| IsAlnumNotAlnum \| 87 \| 76 \| 303 \| 20 \| 76000 \| 49 \| 64 \| ```	2024-07-21 16:40:01 -05:00
jameshu15869	8badfccefe	[libc] Add Multithreaded GPU Benchmarks (#98964 ) This PR runs benchmarks on a 32 threads (A single warp on NVPTX) by default, adding the option for single threaded benchmarks. We can specify that a benchmark should be run on a single thread using the `SINGLE_THREADED_BENCHMARK()` macro. I chose to use a flag here so that other options could be added in the future.	2024-07-18 07:18:23 -05:00
jameshu15869	b42c332d73	[libc] Use Atomics in GPU Benchmarks (#98842 ) This PR replaces our old method of reducing the benchmark results by using an array to using atomics instead. This should help us implement single threaded benchmarks.	2024-07-15 07:08:23 -05:00
Petr Hosek	5ff3ff33ff	[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration (#98597 ) This is a part of #97655.	2024-07-12 09:28:41 -07:00
Mehdi Amini	ce9035f5bd	Revert "[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration" (#98593 ) Reverts llvm/llvm-project#98075 bots are broken	2024-07-12 09:12:13 +02:00
Petr Hosek	3f30effe1b	[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration (#98075 ) This is a part of #97655.	2024-07-11 12:35:22 -07:00
jameshu15869	eeed5896de	[libc] Correctly Run Multiple Benchmarks in the Same File (#98467 ) There was previously an issue where registering multiple benchmarks in the same file would only give the results for the last benchmark to run. This PR fixes the issue. @jhuber6	2024-07-11 06:58:10 -05:00
jameshu15869	f4e6ddbc2e	[libc] Fix Cppcheck Issues (#96999 ) This PR fixes linting issues discovered by `cppcheck`. Fixes: https://github.com/llvm/llvm-project/issues/96863	2024-07-06 17:53:36 -05:00
jameshu15869	02b57dedb7	[libc] NVPTX Profiling (#92009 ) PR for adding microbenchmarking infrastructure for NVPTX. `nvlink` cannot perform LTO, so we cannot inline `libc` functions and this function call overhead is not adjusted for during microbenchmarking.	2024-06-26 16:38:39 -05:00

16 Commits