llvm-project

Author	SHA1	Message	Date
Daniel Bertalan	90569e02e6	[Support] Add Arm NEON implementation for `llvm::xxh3_64bits` (#99634 ) Compared to the generic scalar code, using Arm NEON instructions yields a ~11x speedup: 31 vs 339.5 ms to hash 1 GiB of random data on the Apple M1. This follows the upstream implementation closely, with some simplifications made: - Removed workarounds for suboptimal codegen on older GCC - Removed instruction reordering barriers which seem to have a negligible impact according to my measurements - We do not support WebAssembly's mostly NEON-compatible API - There is no configurable mixing of SIMD and scalar code; according to the upstream comments, this is only relevant for smaller Cortex cores which can dispatch relatively few NEON micro-ops per cycle. This commit intends to use only standard ACLE intrinsics and datatypes, so it should build with all supported versions of GCC, Clang and MSVC. This feature is enabled by default when targeting AArch64, but the `LLVM_XXH_USE_NEON=0` macro can be set to explicitly disable it. XXH3 is used for ICF, string deduplication and computing the UUID in ld64.lld; this commit results in a -1.77% +/- 0.59% speed improvement for a `--threads=8` link of Chromium.framework.	2024-07-22 19:06:43 +02:00
Fangrui Song	72055622e9	[Support] Fix xxh3_128bits for Win32 builds after #95863 `__emulu` is used without including `intrin.h`. Actually, it's better to rely on compiler optimizations. In this LLVM copy, we try to eliminate unneceeded workarounds for old compilers. Pull Request: https://github.com/llvm/llvm-project/pull/96931	2024-06-28 00:19:46 -07:00
Brendan Duke	f991ebbb46	[Support] Add llvm::xxh3_128bits (#95863 ) Add a 128-bit xxhash function, following the existing `llvm::xxh3_64bits` and `llvm::xxHash` implementations. Previously, 48e93f57f1ee914ca29aa31bf2ccd916565a3610 added support for `llvm::xxh3_64bits`, which closely follows the upstream implementation at https://github.com/Cyan4973/xxHash, with simplifications from Devin Hussey's xxhash-clean. However, it is desirable to have a larger 128-bit hash key for use cases such as filesystem checksums where chance of collision needs to be negligible. So to that end this also ports over the 128-bit xxh3_128bits as `llvm::xxh3_128bits`. Testing: - Add a test based on xsum_sanity_check.c in upstream xxhash.	2024-06-19 15:24:54 -04:00
Fangrui Song	48e93f57f1	[Support] Add llvm::xxh3_64bits ld.lld SHF_MERGE\|SHF_STRINGS duplicate elimination is computation heavy and utilitizes llvm::xxHash64, a simplified version of XXH64. Externally many sources confirm that a new variant XXH3 is much faster. I have picked a few hash implementations and computed the proportion of time spent on hashing in the overall link time (a debug build of clang 16 on a machine using AMD Zen 2 architecture): * llvm::xxHash64: 3.63% * official XXH64 (`#define XXH_VECTOR XXH_SCALAR`): 3.53% * official XXH3_64bits (`#define XXH_VECTOR XXH_SCALAR`): 1.21% * official XXH3_64bits (default, essentially `XXH_SSE2`): 1.22% * this patch llvm::xxh3_64bits: 1.19% The remaining part of lld remains unchanged. Consequently, a lower ratio indicates that hashing is faster. Therefore, it is evident that XXH3 from xxhash is significantly faster than both the official version and our llvm::xxHash64. ( string length: count 1-3: 393434 4-8: 2084056 9-16: 2846249 17-128: 5598928 129-240: 1317989 241-: 328058 ) This patch adds heavily simplified https://github.com/Cyan4973/xxHash, taking account of many simplification ideas from Devin Hussey's xxhash-clean. Important x86-64 optimization ideas: * Make XXH3_len_129to240_64b and XXH3_hashLong_64b noinline * Unroll XXH3_len_17to128_64b * __restrict does not affect Clang code generation Beside SHF_MERGE\|SHF_STRINGS duplicate elimination, llvm/ADT/StringMap.h StringMapImpl::LookupBucketFor and a few places in lld can potentially be accelerated by switching to llvm::xxh3_64bits. Link: https://github.com/llvm/llvm-project/issues/63750 Reviewed By: serge-sans-paille Differential Revision: https://reviews.llvm.org/D154812	2023-07-18 13:36:11 -07:00
Benjamin Kramer	72eac42f21	[xxHash] Don't trigger UB on empty StringRef This is quite silly, but casting to uintptr_t seems like the easiest option to quiet ubsan. llvm/lib/Support/xxhash.cpp:107:12: runtime error: applying non-zero offset 8 to null pointer #0 0x7fe3660404c0 in llvm::xxHash64(llvm::StringRef) llvm/lib/Support/xxhash.cpp:107:12	2023-02-08 12:53:54 +01:00
serge-sans-paille	fbbc41f8dd	Cleanup include: TableGen This also includes a few cleanup from Support. Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121331	2022-03-11 11:41:32 +01:00
Rui Ueyama	7f97570e79	Make ICF log output order deterministic. This patch does the same thing as r338153 for COFF. Note that this patch affects only the order of log messages. The output file is already deterministic. Differential Revision: https://reviews.llvm.org/D50023 llvm-svn: 338406	2018-07-31 18:04:58 +00:00
Fangrui Song	9c85d7acbe	[Support] Use unsigned char for xxHash 64-bit Before, the last 3 bytes were char-signedness dependent. llvm-svn: 338128	2018-07-27 16:01:09 +00:00
Rui Ueyama	0fcbb2893e	Revert r301487: Replace HashString algorithm with xxHash64 This reverts commit r301487 to make buildbots green. llvm-svn: 301491	2017-04-26 23:15:10 +00:00
Rui Ueyama	87b30ac9d3	Replace HashString algorithm with xxHash64 The previous algorithm processed one character at a time, which is very painful on a modern CPU. Replace it with xxHash64, which both already exists in the codebase and is fairly fast. Patch from Scott Smith! Differential Revision: https://reviews.llvm.org/D32509 llvm-svn: 301487	2017-04-26 22:45:04 +00:00
Rafael Espindola	eaeb6d91a1	Add xxhash to llvm. It will be used for fast fingerprinting in lld at least. llvm-svn: 282493	2016-09-27 15:45:57 +00:00

11 Commits