Previously, the flow was: 1. Parallel scan adds relative relocs to per-thread `relocsVec` 2. `mergeRels()` copies all into `relocs` 3. `partitionRels()` uses `stable_partition` to separate Now, relative relocs are routed at `addReloc` time by checking `reloc.type == relativeRel`. In `mergeRels`, sharded entries are classified through the same `addReloc` path rather than blindly appended. `relocsVec` may contain non-relative entries like `R_AARCH64_AUTH_RELATIVE`. This eliminates the `stable_partition` on the full relocation vector (543K entries for clang) and avoids copying relative relocations into `relocs` only to move them out again. Linking an x86_64 release+assertions build of clang is 1.04x as fast. `numRelativeRelocs` caches `relativeRelocs.size()` at `finalizeContents` time for `DT_RELACOUNT`. Using a live `relativeRelocs.size()` would cause `DynamicSection::writeTo` to emit an extra entry when thunks add relocs after `.dynamic` is sized, overflowing into adjacent sections. Tested by ppc64-long-branch-rel14.s.
LLVM Linker (lld)
This directory and its subdirectories contain source code for the LLVM Linker, a modular cross platform linker which is built as part of the LLVM compiler infrastructure project.
lld is open source software. You may freely distribute it under the terms of the license agreement found in LICENSE.txt.
Benchmarking
In order to make sure various developers can evaluate patches over the same tests, we create a collection of self contained programs.
It is hosted at https://s3-us-west-2.amazonaws.com/linker-tests/lld-speed-test.tar.xz
The current sha256 is 10eec685463d5a8bbf08d77f4ca96282161d396c65bd97dc99dbde644a31610f.