llvm-project

Author	SHA1	Message	Date
Luke Drummond	b55c52c047	Revert "Renormalize line endings whitespace only after dccebddb3b80" This reverts commit 9d98acb196a40fee5229afeb08f95fd36d41c10a.	2024-10-18 21:16:50 +01:00
Luke Drummond	9d98acb196	Renormalize line endings whitespace only after dccebddb3b80 Line ending policies were changed in the parent, dccebddb3b80. To make it easier to resolve downstream merge conflicts after line-ending policies are adjusted this is a separate whitespace-only commit. If you have merge conflicts as a result, you can simply `git add --renormalize -u && git merge --continue` or `git add --renormalize -u && git rebase --continue` - depending on your workflow.	2024-10-17 14:49:26 +01:00
vporpo	e22b07e766	[SandboxIR][NFC] Move Function class to a separate file (#110526 )	2024-09-30 10:12:47 -07:00
vporpo	eba106d461	[SandboxIR][NFC] Move Instruction classes into a separate file (#110294 )	2024-09-27 10:54:11 -07:00
vporpo	31d4837273	[SandboxIR][Bench] SandboxIR creation (#108278 ) Adds a benchmark for the overhead of SandboxIR creation.	2024-09-11 15:28:37 -07:00
vporpo	0cfa5abd9d	[SandboxIR][Bench] Add tests with tracking enabled (#108273 ) Benchmarks RAUW and RUOW when tracking is enabled.	2024-09-11 12:14:50 -07:00
vporpo	bd4e0dfa94	[SandboxIR][Bench] Benchmark RUOW (#107456 ) This patch adds a benchmark for ReplaceUsesOfWith().	2024-09-11 11:40:16 -07:00
Justin Bogner	80c47ad3ae	[SandboxIR][Bench] Fix missing include In 362da640dd18 "[SandboxIR][Bench] Test RAUW (#107440)" we started using std::stringstream, but didn't include `<sstream>` for the definition. This is resulting in a build failure on windows.	2024-09-08 12:34:10 -07:00
vporpo	362da640dd	[SandboxIR][Bench] Test RAUW (#107440 )	2024-09-05 12:32:07 -07:00
vporpo	5e25291b3c	[SandboxIR][Bench] Initial patch for performance tracking (#107296 ) This patch adds a new benchmark suite for SandboxIR. It measures the performance of some of the most commonly used API functions and compares it against LLVM IR.	2024-09-05 10:35:02 -07:00
Rahul Joshi	dcf0160bd6	[TableGen] Optimize intrinsic info type signature encoding (#106809 ) Change the "fixed encoding" table used for encoding intrinsic type signature to use 16-bit encoding as opposed to 32-bit. This results in both space and time improvements. For space, the total static storage size (in bytes) of this info reduces by 50%: - Current = 141934 (Fixed table) + 16058 + 3 (Long Table) = 72833 - New size = 141932 (Fixed table) + 19879 + 3 (Long Table) = 48268. - Reduction = 50.9% For time, with the added benchmark, we see a 7.3% speedup in `GetIntrinsicInfoTableEntries` benchmark. Actual output of the benchmark in included in the GitHub MR.	2024-09-04 14:46:48 -07:00
Rahul Joshi	9ce4af5cad	Revert "Revert "[Support] Validate number of arguments passed to formatv()"" (#106592 ) Reverts llvm/llvm-project#106589 The fix for bot failures caused by the reverted commit was committed already, so this revert is not needed.	2024-08-29 10:39:40 -07:00
Mehdi Amini	ed37b5f6c3	Revert "[Support] Validate number of arguments passed to formatv()" (#106589 ) Reverts llvm/llvm-project#105745 Some bots are broken apparently.	2024-08-29 10:30:11 -07:00
Rahul Joshi	fc110202df	[Support] Validate number of arguments passed to formatv() (#105745 ) Change formatv() to validate that the number of arguments passed matches number of replacement fields in the format string, and that the replacement indices do not contain holes. To support cases where this cannot be guaranteed, introduce a formatv() overload that allows disabling validation with a bool flag as its first argument.	2024-08-29 08:00:25 -07:00
Rahul Joshi	389f339c11	[TableGen] Rework `EmitIntrinsicToBuiltinMap` (#104681 ) Rework `IntrinsicEmitter::EmitIntrinsicToBuiltinMap` for improved peformance as well as refactor the code. Performance: - Current generated code does a linear search on the TargetPrefix, followed by a binary search on the builtin names for that target's builtins. - Improve the performance of this code in 2 ways: (a) Use binary search on the target prefix to lookup the builtin table for the target. (b) Improve the (common) case of when all builtins for a target share a common prefix. Check this common prefix first, and then do the binary search in the builtin table using the builtin name with the common prefix removed. This should help both data size (by creating a smaller static string table) and runtime (by reducing the cost of binary search on smaller strings). Refactor: - Use range based for loops for iterating over maps. - Use formatv() and C++ raw string literals to simplify the emission code. - Change the generated `getIntrinsicForClangBuiltin` and `getIntrinsicForMSBuiltin` to take a `StringRef` instead of `const char *` for the prefix.	2024-08-20 14:22:48 -07:00
Daniel Bertalan	90569e02e6	[Support] Add Arm NEON implementation for `llvm::xxh3_64bits` (#99634 ) Compared to the generic scalar code, using Arm NEON instructions yields a ~11x speedup: 31 vs 339.5 ms to hash 1 GiB of random data on the Apple M1. This follows the upstream implementation closely, with some simplifications made: - Removed workarounds for suboptimal codegen on older GCC - Removed instruction reordering barriers which seem to have a negligible impact according to my measurements - We do not support WebAssembly's mostly NEON-compatible API - There is no configurable mixing of SIMD and scalar code; according to the upstream comments, this is only relevant for smaller Cortex cores which can dispatch relatively few NEON micro-ops per cycle. This commit intends to use only standard ACLE intrinsics and datatypes, so it should build with all supported versions of GCC, Clang and MSVC. This feature is enabled by default when targeting AArch64, but the `LLVM_XXH_USE_NEON=0` macro can be set to explicitly disable it. XXH3 is used for ICF, string deduplication and computing the UUID in ld64.lld; this commit results in a -1.77% +/- 0.59% speed improvement for a `--threads=8` link of Chromium.framework.	2024-07-22 19:06:43 +02:00
Kirill Bobyrev	0addd170ab	Pull google/benchmark library to the LLVM tree This patch pulls google/benchmark v1.4.1 into the LLVM tree so that any project could use it for benchmark generation. A dummy benchmark is added to `llvm/benchmarks/DummyYAML.cpp` to validate the correctness of the build process. The current version does not utilize LLVM LNT and LLVM CMake infrastructure, but that might be sufficient for most users. Two introduced CMake variables: * `LLVM_INCLUDE_BENCHMARKS` (`ON` by default) generates benchmark targets * `LLVM_BUILD_BENCHMARKS` (`OFF` by default) adds generated benchmark targets to the list of default LLVM targets (i.e. if `ON` benchmarks will be built upon standard build invocation, e.g. `ninja` or `make` with no specific targets) List of modifications: * `BENCHMARK_ENABLE_TESTING` is disabled * `BENCHMARK_ENABLE_EXCEPTIONS` is disabled * `BENCHMARK_ENABLE_INSTALL` is disabled * `BENCHMARK_ENABLE_GTEST_TESTS` is disabled * `BENCHMARK_DOWNLOAD_DEPENDENCIES` is disabled Original discussion can be found here: http://lists.llvm.org/pipermail/llvm-dev/2018-August/125023.html Reviewed by: dberris, lebedev.ri Subscribers: ilya-biryukov, ioeric, EricWF, lebedev.ri, srhines, dschuff, mgorny, krytarowski, fedor.sergeev, mgrang, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D50894 llvm-svn: 340809	2018-08-28 09:42:41 +00:00

17 Commits