llvm-project

Author	SHA1	Message	Date
Yanzuo Liu	3b27d50cc7	[LLVM][utils] Add script which clears release notes (#153593 ) The script copies `ReleaseNotesTemplate.txt` to corresponding `ReleaseNotes.rst`/`.md` to clear release notes. The suffix of `ReleaseNotesTemplate.txt` must be `.txt`. If it is `.rst`/`.md`, it will be treated as a documentation source file when building documentation.	2025-08-15 19:00:08 +08:00
Tobias Hieta	3d8db8ef50	Clear release notes on main for LLVM 22	2025-07-24 11:36:11 +02:00
SivanShani-Arm	8bb97d2d1e	[LLD][Docs] Document -z gcs= option in the man page (#146522 ) Add documentation for the -z gcs= option to the LLD man page. This flag controls how the GCS bit is set in the output: - implicit (default): inferred from input objects - never: GCS bit is never set - always: GCS bit is always set Clarifies behavior for users and aligns the man page with existing functionality.	2025-07-21 15:23:18 +01:00
bd1976bris	bbbbc093fe	[DTLTO][LLD][COFF] Add support for Integrated Distributed ThinLTO (#148594 ) This patch introduces support for Integrated Distributed ThinLTO (DTLTO) in COFF LLD. DTLTO enables the distribution of ThinLTO backend compilations via external distribution systems, such as Incredibuild, during the traditional link step: https://llvm.org/docs/DTLTO.html. Note: Bitcode members of non-thin archives are not currently supported. This will be addressed in a future change. This patch is sufficient to allow for self-hosting an LLVM build with DTLTO if thin archives are used. Testing: - LLD `lit` test coverage has been added, using a mock distributor to avoid requiring Clang. - Cross-project `lit` tests cover integration with Clang. For the design discussion of the DTLTO feature, see: https://github.com/llvm/llvm-project/pull/126654	2025-07-20 14:47:00 +01:00
Ami-zhang	9ef293ea24	[LoongArch] Add supplemental release notes for LLVM 21 (#148771 )	2025-07-15 15:39:00 +08:00
Fangrui Song	8983b22ca1	ReleaseNotes: add lld/ELF notes Move linker script changes to the middle and target-specific options/behavior changes to the end.	2025-07-13 23:24:14 -07:00
bd1976bris	3b4e79398d	[DTLTO][LLD][ELF] Add support for Integrated Distributed ThinLTO (#142757 ) This patch introduces support for Integrated Distributed ThinLTO (DTLTO) in ELF LLD. DTLTO enables the distribution of ThinLTO backend compilations via external distribution systems, such as Incredibuild, during the traditional link step: https://llvm.org/docs/DTLTO.html. It is expected that users will invoke DTLTO through the compiler driver (e.g., Clang) rather than calling LLD directly. A Clang-side interface for DTLTO will be added in a follow-up patch. Note: Bitcode members of archives (thin or non-thin) are not currently supported. This will be addressed in a future change. As a consequence of this lack of support, this patch is not sufficient to allow for self-hosting an LLVM build with DTLTO. Theoretically, --start-lib/--end-lib could be used instead of archives in a self-host build. However, it's unclear how --start-lib/--end-lib can be easily used with the LLVM build system. Testing: - ELF LLD `lit` test coverage has been added, using a mock distributor to avoid requiring Clang. - Cross-project `lit` tests cover integration with Clang. For the design discussion of the DTLTO feature, see: #126654.	2025-07-02 16:12:27 +01:00
Peter Collingbourne	494a74882b	Reapply "ELF: Add branch-to-branch optimization." Fixed assertion failure when reading .eh_frame sections, and added .eh_frame sections to tests. This reverts commit 1e95349dbe329938d2962a78baa0ec421e9cd7d1. Original commit message follows: When code calls a function which then immediately tail calls another function there is no need to go via the intermediate function. By branching directly to the target function we reduce the program's working set for a slight increase in runtime performance. Normally it is relatively uncommon to have functions that just tail call another function, but with LLVM control flow integrity we have jump tables that replace the function itself as the canonical address. As a result, when a function address is taken and called directly, for example after a compiler optimization resolves the indirect call, or if code built without control flow integrity calls the function, the call will go via the jump table. The impact of this optimization was measured using a large internal Google benchmark. The results were as follows: CFI enabled: +0.1% ± 0.05% queries per second CFI disabled: +0.01% queries per second [not statistically significant] The optimization is enabled by default at -O2 but may also be enabled or disabled individually with --{,no-}branch-to-branch. This optimization is implemented for AArch64 and X86_64 only. lld's runtime performance (real execution time) after adding this optimization was measured using firefox-x64 from lld-speed-test [1] with ldflags "-O2 -S" on an Apple M2 Ultra. The results are as follows: ``` N Min Max Median Avg Stddev x 512 1.2264546 1.3481076 1.2970261 1.2965788 0.018620888 + 512 1.2561196 1.3839965 1.3214632 1.3209327 0.019443971 Difference at 95.0% confidence 0.0243538 +/- 0.00233202 1.87831% +/- 0.179859% (Student's t, pooled s = 0.0190369) ``` [1] https://discourse.llvm.org/t/improving-the-reproducibility-of-linker-benchmarking/86057 Reviewers: zmodem, MaskRay Reviewed By: MaskRay Pull Request: https://github.com/llvm/llvm-project/pull/145579	2025-06-24 22:16:18 -07:00
Hans Wennborg	1e95349dbe	Revert "ELF: Add branch-to-branch optimization." This caused assertion failures in applyBranchToBranchOpt(): llvm/include/llvm/Support/Casting.h:578: decltype(auto) llvm::cast(From*) [with To = lld:🧝:InputSection; From = lld:🧝:InputSectionBase]: Assertion `isa<To>(Val) && "cast<Ty>() argument of incompatible type!"' failed. See comment on the PR (https://github.com/llvm/llvm-project/pull/138366) This reverts commit 491b82a5ec1add78d2c93370580a2f1897b6a364. This also reverts the follow-up "[lld] Use llvm::partition_point (NFC) (#145209)" This reverts commit 2ac293f5ac4cf65c0c038bf75a88f1d6715e467d.	2025-06-23 13:26:02 +02:00
Peter Collingbourne	491b82a5ec	ELF: Add branch-to-branch optimization. When code calls a function which then immediately tail calls another function there is no need to go via the intermediate function. By branching directly to the target function we reduce the program's working set for a slight increase in runtime performance. Normally it is relatively uncommon to have functions that just tail call another function, but with LLVM control flow integrity we have jump tables that replace the function itself as the canonical address. As a result, when a function address is taken and called directly, for example after a compiler optimization resolves the indirect call, or if code built without control flow integrity calls the function, the call will go via the jump table. The impact of this optimization was measured using a large internal Google benchmark. The results were as follows: CFI enabled: +0.1% ± 0.05% queries per second CFI disabled: +0.01% queries per second [not statistically significant] The optimization is enabled by default at -O2 but may also be enabled or disabled individually with --{,no-}branch-to-branch. This optimization is implemented for AArch64 and X86_64 only. lld's runtime performance (real execution time) after adding this optimization was measured using firefox-x64 from lld-speed-test [1] with ldflags "-O2 -S" on an Apple M2 Ultra. The results are as follows: ``` N Min Max Median Avg Stddev x 512 1.2264546 1.3481076 1.2970261 1.2965788 0.018620888 + 512 1.2561196 1.3839965 1.3214632 1.3209327 0.019443971 Difference at 95.0% confidence 0.0243538 +/- 0.00233202 1.87831% +/- 0.179859% (Student's t, pooled s = 0.0190369) ``` [1] https://discourse.llvm.org/t/improving-the-reproducibility-of-linker-benchmarking/86057 Pull Request: https://github.com/llvm/llvm-project/pull/138366	2025-06-20 13:16:24 -07:00
Fangrui Song	07dad4ecba	[ELF] Implement -z dynamic-undefined-weak The behavior of an undefined weak reference is implementation defined. For static -no-pie linking, dynamic relocations are generally avoided (except IRELATIVE). -shared linking generally emits dynamic relocations. Dynamic -no-pie linking and -pie allow flexibility. Changes adjust the behavior for better consistency and simpler internal representation, e.g. https://reviews.llvm.org/D63003 https://reviews.llvm.org/D105164 (generalized to undefined non-weak in 2fcaa00d1e2317a90c9071b735eb0e758b5dd58b). GNU ld introduced -z [no]dynamic-undefined-weak option to fine-tune the behavior. (The option is not very effective with -no-pie, e.g. on x86-64, `ld.bfd a.o s.so -z dynamic-undefined-weak` generates R_X86_64_NONE relocations instead of GLOB_DAT/JUMP_SLOT) This patch implements -z [no]dynamic-undefined-weak option. The effects are summarized as follows: * Static -no-pie: no-op * Dynamic -no-pie: nodynamic-undefined-weak suppresses GLOB_DAT/JUMP_SLOT * Static -pie: dynamic-undefined-weak generates ABS/GLOB_DAT/JUMP_SLOT. https://discourse.llvm.org/t/lld-weak-undefined-symbols-in-vdso-only/86749 * Dynamic -pie: nodynamic-undefined-weak suppresses ABS/GLOB_DAT/JUMP_SLOT The -pie behavior likely stays stable while -no-pie (`!ctx.arg.isPic` in `isStaticLinkTimeConstant`) behavior will likely change in the future. The current default value of ctx.arg.zDynamicUndefined is selected to prevent behavior changes. Pull Request: https://github.com/llvm/llvm-project/pull/143831	2025-06-12 19:50:41 -07:00
Kazu Hirata	7341031718	[lld] Fix a typo in documentation (#141383 )	2025-05-25 08:20:00 -07:00
Fangrui Song	0edc8b59ab	[ELF] Error if a section address is smaller than image base When using `-no-pie` without a `SECTIONS` command, the linker uses the target's default image base. If `-Ttext=` or `--section-start` specifies an output section address below this base, the result is likely unintended. - With `--no-rosegment`, the PT_LOAD segment covering the ELF header cannot include `.text` if `.text`'s address is too low, causing an `error: output file too large`. - With default `--rosegment`: - If a read-only section (e.g., `.rodata`) exists, a similar `error: output file too large` occurs. - Without read-only sections, the PT_LOAD segment covering the ELF header and program headers includes no sections, which is unusual and likely undesired. This also causes non-ascending PT_LOAD `p_vaddr` values related to the PT_LOAD that overlaps with PT_PHDR (#138584). To prevent these issues, report an error if a section address is below the image base and suggest `--image-base`. This check also applies when `--image-base` is explicitly set but is skipped when a `SECTIONS` command is used. Pull Request: https://github.com/llvm/llvm-project/pull/140187	2025-05-21 09:19:47 -07:00
Kazu Hirata	db099f14d9	[lld] Fix typos in documentation (#139854 )	2025-05-14 10:55:29 -07:00
Peter Collingbourne	4b1f1f77b0	ELF: Fix man page formatting.	2025-05-01 16:50:37 -07:00
Csanád Hajdú	427a7792b1	[LLD][Docs] Add release notes for execute-only support on AArch64 (#134804 )	2025-04-17 16:36:02 +02:00
Csanád Hajdú	2c1bdd4a08	[LLD][ELF] Allow merging XO and RX sections, and add `--[no-]xosegment` flag (#132412 ) Following from the discussion in #132224, this seems like the best approach to deal with a mix of XO and RX output sections in the same binary. This change will also simplify the implementation of the PURECODE section flag for AArch64. To control this behaviour, the `--[no-]xosegment` flag is added to LLD (similarly to `--[no-]rosegment`), which determines whether to allow merging XO and RX sections in the same segment. The default value is `--no-xosegment`, which is a breaking change compared to the previous behaviour. Release notes are also added, since this will be a breaking change.	2025-04-08 08:47:51 +02:00
Daniel Thornburgh	e84b57dfbf	[LLD][ELF] Support OVERLAY NOCROSSREFS (#133807 ) This allows NOCROSSREFS to be specified in OVERLAY linker script descriptions. This is a particularly useful part of the OVERLAY syntax, since it's very rarely possible for one overlay section to sensibly reference another. Closes #128790	2025-04-02 09:25:18 -07:00
Daniel Thornburgh	074af0f30f	[lld][ELF] Add --why-live flag (inspired by Mach-O) (#127112 ) This prints a stack of reasons that symbols that match the given glob(s) survived GC. It has no effect unless section GC occurs. This implementation does not require -ffunction-sections or -fdata-sections to produce readable results, althought it does tend to work better (as does GC). Details about the semantics: - Some chain of liveness reasons is reported; it isn't specified which chain. - A symbol or section may be live: - Intrisically (e.g., entry point) - Because needed by a live symbol or section - (Symbols only) Because part of a section live for another reason - (Sections only) Because they contain a live symbol - Both global and local symbols (`STB_LOCAL`) are supported. - References to symbol + offset are considered to point to: - If the referenced symbol is a section (`STT_SECTION`): - If a sized symbol encloses the referenced offset, the enclosing symbol. - Otherwise, the section itself, generically. - Otherwise, the referenced symbol.	2025-03-26 09:56:33 -07:00
Alexey Karyakin	c0b2c10e9f	[hexagon] Bump the default version to v68 (#132304 ) Set the default processor version to v68 when the user does not specify one in the command line. This includes changes in the LLVM backed and linker (lld). Since lld normally sets the version based on inputs, this change will only affect cases when there are no inputs. Fixes #127558	2025-03-21 20:08:45 -05:00
Jack Styles	4286f4dcce	[AArch64][GCS][LLD] Introduce -zgcs-report-dynamic Command Line Option (#127787 ) When GCS was introduced to LLD, the gcs-report option allowed for a user to gain information relating to if their relocatable objects supported the feature. For an executable or shared-library to support GCS, all relocatable objects must declare that they support GCS. The gcs-report checks were only done on relocatable object files, however for a program to enable GCS, the executable and all shared libraries that it loads must enable GCS. gcs-report-dynamic enables checks to be performed on all shared objects loaded by LLD, and in cases where GCS is not supported, a warning or error will be emitted. It should be noted that only shared files directly passed to LLD are checked for GCS support. Files that are noted in the `DT_NEEDED` tags are assumed to have had their GCS support checked when they were created. The behaviour of the -zgcs-dynamic-report option matches that of GNU ld. The behaviour is as follows unless the user explicitly sets the value: * -zgcs-report=warning or -zgcs-report=error implies -zgcs-report-dynamic=warning. This approach avoids inheriting an error level if the user wishes to continue building a module without rebuilding all the shared libraries. The same approach was taken for the GNU ld linker, so behaviour is identical across the toolchains. This implementation matches the error message and command line interface used within the GNU ld Linker. See here: `724a8341f6` To support this option being introduced, two other changes are included as part of this PR. The first converts the -zgcs-report option to utilise an Enum, opposed to StringRef values. This enables easier tracking of the value the user defines when inheriting the value for the gas-report-dynamic option. The second is to parse the Dynamic Objects program headers to locate the GNU Attribute flag that shows GCS is supported. This is needed so, when using the gcs-report-dynamic option, LLD can correctly determine if a dynamic object supports GCS. --------- Co-authored-by: Fangrui Song <i@maskray.me>	2025-03-15 18:15:05 -07:00
Fangrui Song	0a470a9264	[ELF] --package-metadata: support %[0-9a-fA-F][0-9a-fA-F] (This application-specific option is probably not appropriate as a linker option (.o file offers more flexibility and decouples JSON verification from linkers). However, the option has gained some traction in Linux distributions, with support in GNU ld, gold, and mold.) GNU ld has supported percent-encoded bytes and extensions like `%[comma]` since November 2024. mold supports just percent-encoded bytes. To prepare for potential adoption by Ubuntu, let's support percent-encoded bytes. Link: https://sourceware.org/bugzilla/show_bug.cgi?id=32003 Link: https://bugs.launchpad.net/ubuntu/+source/dpkg/+bug/2071468 Pull Request: https://github.com/llvm/llvm-project/pull/126396	2025-02-10 09:21:31 -08:00
Tom Stellard	3bd3e06f3f	Bump version to 21.0.0git (#124870 ) Also clear the release notes.	2025-01-28 19:48:43 -08:00
Fangrui Song	ad697b28f4	ReleaseNotes: add lld/ELF notes Pull Request: https://github.com/llvm/llvm-project/pull/124508	2025-01-27 18:08:18 -08:00
Weining Lu	aa273fd83e	[LoongArch] Update lld20 release notes	2025-01-23 12:38:04 +08:00
Brad Smith	52574b5f40	[ELF] Add support for PT_OPENBSD_NOBTCFI (#120005 )	2024-12-19 19:41:42 -05:00
Ivan G.	e6ced4da44	Typo fix in large_sections.rst (#120101 ) Remove duplicate word.	2024-12-17 14:01:03 +00:00
Peter Collingbourne	64da33a589	ELF: Introduce --randomize-section-padding option. The --randomize-section-padding option randomly inserts padding between input sections using the given seed. It is intended to be used in A/B experiments to determine the average effect of a change on program performance, while controlling for effects such as false sharing in the cache which may introduce measurement bias. For more details, see the RFC: https://discourse.llvm.org/t/rfc-lld-feature-for-controlling-for-code-size-dependent-measurement-bias/83334 Reviewers: smithp35, MaskRay Reviewed By: MaskRay, smithp35 Pull Request: https://github.com/llvm/llvm-project/pull/117653	2024-12-13 11:52:09 -08:00
Feng Zou	94c6dd62fa	[docs] Update release notes for APX relocation types (#118575 )	2024-12-07 21:27:10 +08:00
Sam Elliott	65ced158e9	[RISCV] Remove R_RISCV_RVC_LUI Relocation (#118714 ) This was removed from the ABI in riscv-non-isa/riscv-elf-psabi-doc#398. It is not emitted by LLVM, and seems to have been an internal implementation detail in binutils. This is a follow-up to 26ec5da744b8 which removed previous binutils internal relocations when they were removed from the ABI. The LLD implementation was not tested when it was added in https://reviews.llvm.org/D39322	2024-12-05 10:10:27 +00:00
Jacek Caban	e6cc58922f	[LLD] Add ARM64EC release note (#116282 )	2024-11-15 16:19:01 +01:00
Miguel A. Arroyo	5cd6e21bdd	[LLD][COFF] allow saving intermediate files with /lldsavetemps (#115131 ) * Parity with the `-save-temps=` flag in the `ELF` `lld` driver.	2024-11-12 22:30:48 +02:00
Miguel A. Arroyo	99cd4cb123	[LLD][MINGW] Add `--undefined-glob` flag support (#109866 )	2024-09-25 11:29:40 +03:00
Miguel A. Arroyo	b9bd8ca24e	[LLD][COFF] Adds `/includeglob` flag (#109721 ) This implements parity with the `--undefined-glob` flag on [ELF](https://reviews.llvm.org/D63244), but for COFF.	2024-09-24 23:57:01 +03:00
Daniel Thornburgh	7e8a9020b1	[LLD] Add CLASS syntax to SECTIONS (#95323 ) This allows the input section matching algorithm to be separated from output section descriptions. This allows a group of sections to be assigned to multiple output sections, providing an explicit version of --enable-non-contiguous-regions's spilling that doesn't require altering global linker script matching behavior with a flag. It also makes the linker script language more expressive even if spilling is not intended, since input section matching can be done in a different order than sections are placed in an output section. The implementation reuses the backend mechanism provided by --enable-non-contiguous-regions, so it has roughly similar semantics and limitations. In particular, sections cannot be spilled into or out of INSERT, OVERWRITE_SECTIONS, or /DISCARD/. The former two aren't intrinsic, so it may be possible to relax those restrictions later.	2024-08-05 13:06:45 -07:00
Fangrui Song	5d972c582a	[ELF] Add -z nosectionheader GNU ld since 2.41 supports this option, which is mildly useful. It omits the section header table and non-ALLOC sections (including .symtab/.strtab (--strip-all)). This option is simple to implement and might be used by LLDB to test program headers parsing without the section header table (#100900). -z sectionheader, which is the default, is also added. Pull Request: https://github.com/llvm/llvm-project/pull/101286	2024-07-31 12:57:23 -07:00
Tobias Hieta	10c6d6349e	Clear release notes for upcoming LLVM 20 dev cycle	2024-07-23 11:04:06 +02:00
Daniel Bertalan	9a8b0407fc	Reapply "[lld] enable fixup chains by default (#79894 )" (#99255 ) This reverts commit f55b79f59a77b4be586d649e9ced9f8667265011. The known issues with chained fixups have been addressed by #98913, #98305, #97156 and #95171. Compared to the original commit, support for xrOS (which postdates chained fixups' introduction) was added and an unnecessary test change was removed. ---------- Original commit message: Enable chained fixups in lld when all platform and version criteria are met. This is an attempt at simplifying the logic used in ld 907: `93d74eafc3/src/ld/Options.cpp (L5458-L5549)` Some changes were made to simplify the logic: - only enable chained fixups for macOS from 13.0 to avoid the arch check - only enable chained fixups for iphonesimulator from 16.0 to avoid the arch check - don't enable chained fixups for not specifically listed platforms - don't enable chained fixups for arm64_32	2024-07-22 22:03:32 +02:00
Fangrui Song	0778f5c1f1	[ELF] Support NOCROSSREFS and NOCROSSERFS_TO Implement the two commands described by https://sourceware.org/binutils/docs/ld/Miscellaneous-Commands.html After `outputSections` is available, check each output section described by at least one `NOCROSSREFS`/`NOCROSSERFS_TO` command. For each checked output section, scan relocations from its input sections. This step is slow, therefore utilize `parallelForEach(isd->sections, ...)`. To support non SHF_ALLOC sections, `InputSectionBase::relocations` (empty) cannot be used. In addition, we may explore eliminating this member to speed up relocation scanning. Some parse code is adapted from #95714. Close #41825 Pull Request: https://github.com/llvm/llvm-project/pull/98773	2024-07-17 10:45:59 -07:00
Fangrui Song	6464dd21b5	[ELF] OUTPUT_FORMAT: support "binary" and ignore extra OUTPUT_FORMAT commands This patch improves GNU ld compatibility. Close #87891: Support `OUTPUT_FORMAT(binary)`, which is like --oformat=binary. --oformat=binary takes precedence over an ELF `OUTPUT_FORMAT`. In addition, if more than one OUTPUT_FORMAT command is specified, only check the first one. Pull Request: https://github.com/llvm/llvm-project/pull/98837	2024-07-16 10:28:09 -07:00
Brad Smith	14b9d12039	[docs] Remove the History section (#98715 ) This does not really serve any purpose nowadays.	2024-07-13 01:38:39 -04:00
Fangrui Song	8a41327948	ReleaseNotes: add lld/ELF notes	2024-07-06 18:03:39 -07:00
Tatsuyuki Ishi	1d96e4bc2d	[ELF] Change build-id default to sha1 (#93943 ) The current default, build-id=fast, is only 8 bytes due to the usage of 64-bit XXH3. This is incompatible with RPM packaging tools which requires >=16 bytes [1]. In Clang the ENABLE_LINKER_BUILD_ID define makes it pass --build-id without a specific hash type. When also defaulting to LLD, this provides a pretty broken default out-of-box. Using XXH3 was a considerable performance advantage when build-id was first implemented, because sha1 was really sha1 and rather slow. Nowadays sha1 is just 160-bit BLAKE3 which is decently fast and not cryptographically broken, so it should be a good default. Note that the default remains "fast" for wasm because sha1 for wasm is still real sha1. Close https://github.com/llvm/llvm-project/issues/43483. [1]: `b7d427728b/build/files.c (L1883)`	2024-06-10 10:14:44 -07:00
Fangrui Song	4d9020ca0b	[ELF] Implement --force-group-allocation GNU ld's relocatable linking behaviors: * Sections with the `SHF_GROUP` flag are handled like sections matched by the `--unique=pattern` option. They are processed like orphan sections and ignored by input section descriptions. * Section groups' (usually named `.group`) content is updated as the section indexes are updated. Section groups can be discarded with `/DISCARD/ : { (.group) }`. `-r --force-group-allocation` discards section groups and allows sections with the `SHF_GROUP` flag to be matched like normal sections. If two section group members are placed into the same output section, their relocation sections (if present) are combined as well. This behavior can be useful when -r output is used as a pseudo shared object (e.g., FreeBSD's amd64 kernel modules, CHERIoT compartments). This patch implements --force-group-allocation: Input SHT_GROUP sections are discarded. * Input sections do not get the SHF_GROUP flag, so `addInputSec` will combine relocation sections if their relocated section group members are combined. The default behavior is: * Input SHT_GROUP sections are retained. * Input SHF_GROUP sections can be matched (unlike GNU ld) * Input SHF_GROUP sections keep the SHF_GROUP flag, so `addInputSec` will create different OutputDesc copies. GNU ld provides the `FORCE_GROUP_ALLOCATION` command, which is not implemented. Pull Request: https://github.com/llvm/llvm-project/pull/94704	2024-06-07 14:19:06 -07:00
Fangrui Song	cfa97699f7	[ELF] Retain uncompressed if compressed content is larger --compress-debug-sections in GNU ld, gas, and LLVM integrated assembler retain the uncompressed content if the compressed content is larger. This patch also updates the manpage (-O2 does not enable zlib level 6) and fixes a crash of --compress-sections when the uncompressed section is empty.	2024-05-22 15:55:21 -07:00
Daniel Thornburgh	66466ff151	Reland: [LLD] Implement --enable-non-contiguous-regions (#90007 ) When enabled, input sections that would otherwise overflow a memory region are instead spilled to the next matching output section. This feature parallels the one in GNU LD, but there are some differences from its documented behavior: - /DISCARD/ only matches previously-unmatched sections (i.e., the flag does not affect it). - If a section fails to fit at any of its matches, the link fails instead of discarding the section. - The flag --enable-non-contiguous-regions-warnings is not implemented, as it exists to warn about such occurrences. The implementation places stubs at possible spill locations, and replaces them with the original input section when effecting spills. Spilling decisions occur after address assignment. Sections are spilled in reverse order of assignment, with each spill naively decreasing the size of the affected memory regions. This continues until the memory regions are brought back under size. Spilling anything causes another pass of address assignment, and this continues to fixed point. Spilling after rather than during assignment allows the algorithm to consider the size effects of unspillable input sections that appear later in the assignment. Otherwise, such sections (e.g. thunks) may force an overflow, even if spilling something earlier could have avoided it. A few notable feature interactions occur: - Stubs affect alignment, ONLY_IF_RO, etc, broadly as if a copy of the input section were actually placed there. - SHF_MERGE synthetic sections use the spill list of their first contained input section (the one that gives the section its name). - ICF occurs oblivious to spill sections; spill lists for merged-away sections become inert and are removed after assignment. - SHF_LINK_ORDER and .ARM.exidx are ordered according to the final section ordering, after all spilling has completed. - INSERT BEFORE/AFTER and OVERWRITE_SECTIONS are explicitly disallowed.	2024-05-13 11:06:54 -07:00
Daniel Thornburgh	81f34afa5c	Revert "[LLD] Implement --enable-non-contiguous-regions" (#92005 ) Reverts llvm/llvm-project#90007 Broke in merging I think.	2024-05-13 10:38:40 -07:00
Daniel Thornburgh	673114447b	[LLD] Implement --enable-non-contiguous-regions (#90007 ) When enabled, input sections that would otherwise overflow a memory region are instead spilled to the next matching output section. This feature parallels the one in GNU LD, but there are some differences from its documented behavior: - /DISCARD/ only matches previously-unmatched sections (i.e., the flag does not affect it). - If a section fails to fit at any of its matches, the link fails instead of discarding the section. - The flag --enable-non-contiguous-regions-warnings is not implemented, as it exists to warn about such occurrences. The implementation places stubs at possible spill locations, and replaces them with the original input section when effecting spills. Spilling decisions occur after address assignment. Sections are spilled in reverse order of assignment, with each spill naively decreasing the size of the affected memory regions. This continues until the memory regions are brought back under size. Spilling anything causes another pass of address assignment, and this continues to fixed point. Spilling after rather than during assignment allows the algorithm to consider the size effects of unspillable input sections that appear later in the assignment. Otherwise, such sections (e.g. thunks) may force an overflow, even if spilling something earlier could have avoided it. A few notable feature interactions occur: - Stubs affect alignment, ONLY_IF_RO, etc, broadly as if a copy of the input section were actually placed there. - SHF_MERGE synthetic sections use the spill list of their first contained input section (the one that gives the section its name). - ICF occurs oblivious to spill sections; spill lists for merged-away sections become inert and are removed after assignment. - SHF_LINK_ORDER and .ARM.exidx are ordered according to the final section ordering, after all spilling has completed. - INSERT BEFORE/AFTER and OVERWRITE_SECTIONS are explicitly disallowed.	2024-05-13 10:30:50 -07:00
Fangrui Song	6d44a1ef55	[ELF] Adjust --compress-sections to support compression level zstd excels at scaling from low-ratio-very-fast to high-ratio-pretty-slow. Some users prioritize speed and prefer disk read speed, while others focus on achieving the highest compression ratio possible, similar to traditional high-ratio codecs like LZMA. Add an optional `level` to `--compress-sections` (#84855) to cater to these diverse needs. While we initially aimed for a one-size-fits-all approach, this no longer seems to work. (https://richg42.blogspot.com/2015/11/the-lossless-decompression-pareto.html) When --compress-debug-sections is used together, make --compress-sections take precedence since --compress-sections is usually more specific. Remove the level distinction between -O/-O1 and -O2 for --compress-debug-sections=zlib for a more consistent user experience. Pull Request: https://github.com/llvm/llvm-project/pull/90567	2024-05-01 11:40:46 -07:00
Fangrui Song	f02a27df2f	[ELF] Add --default-script/-dT GNU ld added --default-script (alias: -dT) in 2007. The option specifies a default script that is processed if --script/-T is not specified. -dT can be used to override GNU ld's internal linker script, but only when the application does not specify -T. In addition, dynamorio's CMakeLists.txt may use -dT. The implementation is simple and the feature can be useful to dabble with different section layouts. Pull Request: https://github.com/llvm/llvm-project/pull/89327	2024-04-19 09:09:41 -07:00

1 2 3 4 5 ...

444 Commits