llvm-project

Author	SHA1	Message	Date
Daniel Bertalan	43f10639a1	[lld-macho] Enable Linker Optimization Hints pass for arm64_32 (#148964) The backend emits `.loh` directives for arm64_32 as well. Our pass already handles 32-bit pointer loads correctly (there was an extraneous sanity check for 8-byte pointer sizes, I removed that here), so we can enable them for all arm64 subtargets, including our upcoming arm64e support.	2025-07-16 21:29:48 +02:00
Daniel Bertalan	fb3972dd06	[lld-macho] Move Linker Optimization Hints pass to a separate file Moving it away from the arm64 `TargetInfo` class will let us enable it more easily for arm64_32 and the soon-to-be-added arm64e target as well. This is the NFC part of #148964	2025-07-16 21:13:54 +02:00
Kazu Hirata	19f00c0570	[lld] Remove unused includes (NFC) (#141421)	2025-05-25 10:55:39 -07:00
Leonard Grey	e385ec90e2	[lld-macho] Don't double emit reexported libraries (#132275) When a library is specified with both `-l` and `-reexport_libraries`, lld will emit two load commands for it, in contrast with ld64. In an upcoming version of macOS, this fails dyld validation; see https://crbug.com/404905688 --------- Co-authored-by: Mark Rowe <markrowe@chromium.org>	2025-03-21 09:31:46 -04:00
Kazu Hirata	5d24341667	[lld] Migrate away from PointerUnion::dyn_cast (NFC) (#124504 ) Note that PointerUnion::dyn_cast has been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn_cast() with // isa<T>, cast<T> and the llvm::dyn_cast<T> This patch migrates uses of PointerUnion::dyn_cast to dyn_cast_if_present (see the definition of PointerUnion::dyn_cast). Note that we cannot use dyn_cast in any of the migrations in this patch; placing assert(!X.isNull()); just before any of dyn_cast_if_present in this patch triggers some failure in check-lld.	2025-01-27 10:34:54 -08:00
Fangrui Song	cc88a5e615	[lld-macho,NFC] Switch to increasing priorities --order_file, call graph profile, and BalancedPartitioning currently build the section order vector by decreasing priority (from SIZE_MAX to 0). However, it's conventional to use an increasing key (see OutputSection::inputOrder). Switch to increasing priorities, remove the global variable highestAvailablePriority, and remove the highestAvailablePriority parameter from BPSectionOrderer. Change size_t to int. This improves consistency with the ELF and COFF ports. The ELF port utilizes negative priorities for --symbol-ordering-file and call graph profile, and non-negative priorities for --shuffle-sections (no Mach-O counterpart yet). Pull Request: https://github.com/llvm/llvm-project/pull/121727	2025-01-10 09:32:03 -08:00
Carlo Cabrera	d668304998	[lld][MachO] Support `-allowable_client` (#117155) Closes #117113. Follow-up to #114638.	2024-11-27 11:23:49 -05:00
Daniel Bertalan	f18fd6e3f9	[lld-macho] Use parallel algorithms in favor of `ThreadPool` (#99471) In https://reviews.llvm.org/D115416, it was decided that an explicit thread pool should be used instead of the simpler fork-join model of the `parallelFor` family of functions. Since then, more parallelism has been added to LLD, but these changes always used the latter strategy, similarly to other ports of LLD. This meant that we ended up spawning twice the requested amount of threads; one set for the `llvm/Support/Parallel.h` executor, and one for the thread pool. Since that decision, 3b4d800911 has landed, which allows us to explicitly enqueue jobs on the executor pool of the parallel algorithms, which should be enough to achieve sharded output writing and parallelized input file parsing. Now only the construction of the map file is left that should be done concurrently with different linking steps; this commit proposes explicitly spawning a dedicated worker thread for it.	2024-07-22 08:13:07 +02:00
Daniel Bertalan	d64efe42eb	[lld-macho] Remove symbols to `__mod_init_func` with `-init_offsets` (#97156 ) When `-fixup_chains`/`-init_offsets` is used, a different section, `__init_offsets` is synthesized from `__mod_init_func`. If there are any symbols defined inside `__mod_init_func`, they are added to the symbol table unconditionally while processing the input files. Later, when querying these symbols' addresses (when constructing the symtab or exports trie), we crash with a null deref, as there is no output section assigned to them. Just making the symbols point to `__init_offsets` is a bad idea, as the new section stores 32-bit integers instead of 64-bit pointers; accessing the symbols would not do what the programmer intended. We should entirely omit them from the output. This is what ld64 and ld-prime do. This patch uses the same mechanism as dead-stripping to mark these symbols as not needed in the output. There might be nicer fixes than the workaround, this is discussed in #97155. Fixes https://github.com/llvm/llvm-project/pull/79894#issuecomment-1944092892 Fixes #94716	2024-07-06 15:41:40 +02:00
alx32	2a3a79ce4c	[lld-macho][NFC] Preserve original symbol isec, unwindEntry and size (#88357) Currently, when moving symbols from one `InputSection` to another (like in ICF) we directly update the symbol's `isec`, `unwindEntry` and `size`. By doing this we lose the original information. This information will be needed in a future change. Since when moving symbols we always set the symbol's `wasCoalesced` and `isec->replacement`, we can just use this info to conditionally get the information we need at access time.	2024-04-18 11:42:22 -07:00
alx32	bbfa50696e	[lld-macho] Fix bug in makeSyntheticInputSection when -dead_strip flag is specified (#86878) Previously, `makeSyntheticInputSection` would create a new `ConcatInputSection` without setting `live` explicitly for it. Without `-dead_strip` this would be OK since `live` would default to `true`. However, with `-dead_strip`, `live` would default to false, and it would remain set to `false`. This hasn't resulted in any issues so far since no code paths that exposed this issue were present. However a recent change - ObjC relative method lists (https://github.com/llvm/llvm-project/pull/86231) exposes this issue by creating relocations to the `SyntheticInputSection`. When these relocations are attempted to be written, this ends up with a crash (assert), since the `SyntheticInputSection` they refer to is marked as dead (`live` = `false`). With this change, we set the correct behavior - `live` will always be `true`. We add a test case that before this change would trigger an assert in the linker.	2024-03-27 17:27:51 -07:00
alx32	742a82a729	[lld-macho] Implement support for ObjC relative method lists (#86231 ) The MachO format supports relative offsets for ObjC method lists. This support is present already in ld64. With this change we implement this support in lld also. Relative method lists can be identified by a specific flag (0x80000000) in the method list header. When this flag is present, the method list will contain 32-bit relative offsets to the current Program Counter (PC), instead of absolute pointers. Additionally, when relative method lists are used, the offset to the selector name will now be relative and point to the selector reference (selref) instead of the name itself.	2024-03-27 14:34:27 -07:00
alx32	e1a003dbbd	[lld-macho][NFC] Refactor ObjCSelRefsSection => ObjCSelRefsHelper (#86456 ) In a previous PR: https://github.com/llvm/llvm-project/pull/83878, the intent was to make no functional changes, just refactor out the code for reuse. However, by creating `ObjCSelRefsSection` as a `SyntheticSection` - this slightly changed the functionality of the application as the `SyntheticSection` constructor registers the `SyntheticSection` as a functional one - with an associated `SyntheticInputSection`. With this change we remove this unintended consequence by making the code not use a `SyntheticSection` as base, but just by having it be a static helper.	2024-03-25 06:55:11 -07:00
alx32	a53401e9df	[lld-macho][NFC] Refactor ObjCSelRefsSection out of ObjCStubsSection (#83878 ) Currently ObjCStubsSection is handling both the logic for the "__objc_stubs" section, as well as the logic for the "__objc_selrefs" section. While this is OK for now, it will be an issue for other features that want to interact with the "__objc_selrefs" section, such as upcoming relative method lists feature - which will also want to create / reference entries in the "__objc_selrefs" section. In this PR we split the logic relating to handling the "__objc_selrefs" section into a new SyntheticSection (ObjCSelRefsSection). Non-functional change - neither the behavior nor implementation changes, the interface is just made more friendly to not have "__objc_selrefs" so bound to "__objc_stubs". --------- Co-authored-by: Alex B <alexborcan@meta.com>	2024-03-10 13:22:31 -07:00
Mehdi Amini	716042a63f	Rename llvm::ThreadPool -> llvm::DefaultThreadPool (NFC) (#83702) The base class llvm::ThreadPoolInterface will be renamed llvm::ThreadPool in a subsequent commit. This is a breaking change: clients who used to create a ThreadPool must now create a DefaultThreadPool instead.	2024-03-05 18:00:46 -08:00
Kyungwoo Lee	391393179a	[lld-macho] icf objc stubs (#79730) This supports icf for objc stubs.	2024-02-01 14:19:11 -08:00
Kyungwoo Lee	cb46c61817	[lld-macho] dead-strip objc stubs (#79726) This supports dead-strip for objc stubs.	2024-01-29 23:29:57 -08:00
Fangrui Song	359f170f5f	[lld-macho] Use fixed chunk size for UUID Chunk size decided by the thread count makes the UUID less deterministic (e.g. across machines with different core counts.) Follow ELF and just use a fixed chunksize. Fixes: https://github.com/llvm/llvm-project/issues/63961 Reviewed By: #lld-macho, keith Differential Revision: https://reviews.llvm.org/D155761	2023-07-19 17:24:36 -07:00
Keith Smiley	f317ce218e	[lld-macho] Implement -no_uuid Since UUID generation in lld is fast this is rarely used but it can be helpful to avoid temporary issues like https://github.com/llvm/llvm-project/issues/63961 Differential Revision: https://reviews.llvm.org/D155735	2023-07-19 16:39:31 -07:00
Fangrui Song	2090d66b23	[lld-macho] Switch to xxh3_64bits xxh3 is substantially faster than xxh64. For lld/ELF, there is substantial speedup in `.debug_str` duplicate elimination (D154813). Use xxh3 for lld-macho as well. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D155677	2023-07-19 09:58:44 -07:00
Keith Smiley	806f5b3019	[lld-macho] Switch to new tool ID As of Xcode 15 there is now a tool ID for LLD, likely driven by Apple's tests with using LLD for their CAS work in clang. This updates LLD to use the correct ID, and updates the object library so that llvm-objdump prints it correctly. Differential Revision: https://reviews.llvm.org/D152929	2023-06-15 09:40:02 -07:00
Vy Nguyen	7e5f4ed556	[lld-macho] Ensure canonicalization happens even for "skipped" referent sections. Details: See bug report: https://github.com/llvm/llvm-project/issues/63039 Differential Revision: https://reviews.llvm.org/D151824	2023-06-06 12:53:03 -04:00
Fangrui Song	8d85c96e0e	[lld] StringRef::{starts,ends}with => {starts,ends}_with. NFC The latter form is now preferred to be similar to C++20 starts_with. This replacement also removes one function call when startswith is not inlined.	2023-06-05 14:36:19 -07:00
Keith Smiley	48e5f704c5	[lld-macho] Remove linking bitcode support Apple deprecated bitcode in the deployment process in Xcode 14.0. Last month Apple started requiring Xcode 14.1+ to submit apps to the App Store. Since there isn't a use for bundling bitcode outside of submitting to the App Store we should be safe to delete this handling entirely from LLD. Differential Revision: https://reviews.llvm.org/D150697	2023-05-30 14:47:11 -07:00
Jez Ng	c4d9df9f78	[lld-macho][nfc] Clean up a bunch of clang-tidy issues	2023-04-05 07:50:28 -04:00
Jez Ng	dd4a9c463b	[lld-macho][nfc] Convert more alignTo() to alignToPowerOf2() Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D145261	2023-03-07 16:59:38 -08:00
Keith Smiley	6578e0d1d0	[lld-macho] Remove duplicate minimum version info At some point PlatformInfo's Target changed types to a type that also has minimum deployment target info. This caused ambiguity if you tried to get the target triple from the Target, as the actual minimum version info was being stored separately. The bulk of this change is changing the parsing of these values to support this. Differential Revision: https://reviews.llvm.org/D145263	2023-03-03 13:47:01 -08:00
Jez Ng	4f2a461793	[lld-macho] Have all load commands aligned to the word size This is what ld64 does, and also what we already do for most of the other load commands. I'm not aware of a good way to test this, but I don't think it really matters. Differential Revision: https://reviews.llvm.org/D141462	2023-01-24 20:11:04 -05:00
Jez Ng	a0c01f05cd	[lld-macho][nfc] Use alignToPowerOf2 instead of alignTo when possible Skips the divide operation which is generally expensive. Not that it matters in this diff, the code changed is not particularly hot, but just for principle & consistency... Reviewed By: #lld-macho, oontvoo, MaskRay Differential Revision: https://reviews.llvm.org/D141461	2023-01-11 17:13:33 -05:00
Keith Smiley	2e5989e814	[lld-macho] Flip string deduplication default Previously by default, when not using `--icf=`, lld would not deduplicate string literals. This reveals reliance on undefined behavior where string literal addresses are compared instead of using string equality checks. While ideally you would be able to easily identify and eliminate the reliance on this UB, this can be difficult, especially for third party code, and increases the friction and risk of users migrating to lld. This flips the default to deduplicate strings unless `--no-deduplicate-strings` is passed, matching ld64's behavior. Differential Revision: https://reviews.llvm.org/D140517	2022-12-22 15:52:46 -08:00
Vy Nguyen	fc7a71890d	[lld-macho][nfc] Clean up includes - remove unused/duplicate includes - reformatting/whitespaces Differential Revision: https://reviews.llvm.org/D136266	2022-10-19 13:56:24 -04:00
Jez Ng	7b45dfc681	[lld-macho] Canonicalize personality pointers in EH frames We already do this for personality pointers referenced from compact unwind entries; this patch extends that behavior to personalities referenced via EH frames as well. This reduces the number of distinct personalities we need in the final binary, and helps us avoid hitting the "too many personalities" error. I renamed `UnwindInfoSection::prepareRelocations()` to simply `prepare` since we now do some non-reloc-specific stuff within. Fixes #58277. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D135728	2022-10-11 23:50:46 -04:00
Daniel Bertalan	0d30e92f59	[lld-macho] Add support for emitting chained fixups This commit adds support for chained fixups, which were introduced in Apple's late 2020 OS releases. This format replaces the dyld opcodes used for supplying rebase and binding information, and encodes most of that data directly in the memory location that will have the fixup applied. This reduces binary size and is a requirement for page-in linking, which will be available starting with macOS 13. A high-level overview of the format and my implementation can be found in SyntheticSections.h. This feature is currently gated behind the `-fixup_chains` flag, and will be enabled by default for supported targets in a later commit. Like in ld64, lazy binding is disabled when chained fixups are in use, and the `-init_offsets` transformation is performed by default. Differential Revision: https://reviews.llvm.org/D132560	2022-10-04 11:48:45 +02:00
Vincent Lee	58edaef3fe	[lld-macho] Do not error out on dead stripped duplicate symbols Builds that error out on duplicate symbols can still succeed if the symbols will be dead stripped. This is the current behavior in ld64. https://github.com/apple-oss-distributions/ld64/blob/main/src/ld/Resolver.cpp#L2018. In order to provide an easier path for adoption, introduce a new flag that will retain compatibility with ld64's behavior (similar to `--deduplicate-literals`). This is turned off by default since we do not encourage this behavior in the linker. Reviewed By: #lld-macho, thakis, int3 Differential Revision: https://reviews.llvm.org/D134794	2022-09-30 15:09:27 -07:00
Daniel Bertalan	f546165754	[lld-macho] Don't create entries in isecPriorities during sorting (NFC) If a value for a given key is not present, `DenseMap::operator[]` default-constructs one, which is wasteful when we don't do anything with it afterwards. Fix it by calling `lookup()` instead which only returns the default value, but does not modify the map. This speeds up linking a fair bit when only a small portion of all sections are specified in the order file, like in the case of Chromium Framework: N Min Max Median Avg Stddev x 25 3.727684 3.8808699 3.753552 3.7702461 0.0397282 + 25 3.6469049 3.7523289 3.6764321 3.6841622 0.025525047 Difference at 95.0% confidence -0.0860839 +/- 0.0189924 -2.28324% +/- 0.503745% (Student's t, pooled s = 0.0333906) Differential Revision: https://reviews.llvm.org/D134811	2022-09-28 16:50:18 +02:00
Daniel Bertalan	d2f3d7bad2	[lld-macho] Force higher alignment for __thread_vars `__thread_vars` contains pointers to `__tlv_bootstrap`, which are fixed up by dyld; however the section's alignment is not specified. This means that the relocations might end up on odd addresses, which is not representable by the soon to be added chained fixups. This is arguably a bug in MC, but this behavior has been there since TLV support was originally added. This patch forces the `__thread_vars` sections to be aligned to the target's pointer size. This is done by ld64 as well. Differential Revision: https://reviews.llvm.org/D134594	2022-09-25 08:02:07 +02:00
Vy Nguyen	016c2f5e32	[lld-macho] Support -dyld_env This arg is undocumented but from looking at the code + experiment, it's used to add additional DYLD_ENVIRONMENT load commands to the output. Differential Revision: https://reviews.llvm.org/D134058	2022-09-20 10:16:45 -04:00
Daniel Bertalan	a8843ec952	[lld-macho] Parallelize linker optimization hint processing This commit moves the parsing of linker optimization hints into `ARM64::applyOptimizationHints`. This lets us avoid allocating memory for holding the parsed information, and moves work out of `ObjFile::parse`, which is not parallelized at the moment. This change reduces the overhead of processing LOHs to 25-30 ms when linking Chromium Framework on my M1 machine; previously it took close to 100 ms. There's no statistically significant change in runtime for a --threads=1 link. Performance figures with all 8 cores utilized: N Min Max Median Avg Stddev x 20 3.8027232 3.8760762 3.8505335 3.8454145 0.026352574 + 20 3.7019017 3.8660538 3.7546209 3.7620371 0.032680043 Difference at 95.0% confidence -0.0833775 +/- 0.019 -2.16823% +/- 0.494094% (Student's t, pooled s = 0.0296854) Differential Revision: https://reviews.llvm.org/D133439	2022-09-16 17:38:46 +02:00
Nico Weber	cd7ffa2e52	lld: Include name of output file in "failed to write output" diag Differential Revision: https://reviews.llvm.org/D133110	2022-09-14 14:57:47 -04:00
Daniel Bertalan	4f688d00f4	[lld-macho] Change constant std::vector to std::array (NFC)	2022-09-04 22:43:02 +02:00
Daniel Bertalan	f7b752d277	[lld-macho] Set the SG_READ_ONLY flag on __DATA_CONST This flag instructs dyld to make the segment read-only after fixups have been performed. I'm not sure why this flag is needed, as on macOS 13 beta at least, __DATA_CONST is read-only even without this flag; but ld64 sets it as well. Differential Revision: https://reviews.llvm.org/D133010	2022-08-31 17:04:20 +02:00
Daniel Bertalan	389e0a81a1	[lld-macho] Support synthesizing __TEXT,__init_offsets This section stores 32-bit `__TEXT` segment offsets of initializer functions, and is used instead of `__mod_init_func` when chained fixups are enabled. Storing the offsets lets us avoid emitting fixups for the initializers. Differential Revision: https://reviews.llvm.org/D132947	2022-08-31 10:13:45 +02:00
Daniel Bertalan	ae5d5426fb	[lld-macho] Rename {StubHelper,ObjCStubs}Section::setup() to setUp (NFC) The phrasal verb is spelled "set up"; "setup" is a noun. Suggested in https://reviews.llvm.org/D132947#inline-1280089	2022-08-30 18:30:14 +02:00
Daniel Bertalan	6b6d1abb10	[lld-macho] Move adding bindings for stub targets out of Writer (NFC) We now re-use the existing needsBinding() helper to determine if a branch has to go through a stub. The logic for determining which type of binding is needed is moved inside StubsSection::addEntry(). This is an NFC refactor that simplifies my diff that adds support for chained fixups. Differential Revision: https://reviews.llvm.org/D132476	2022-08-25 17:37:36 +02:00
Kazu Hirata	9e296584ce	Fix unused variable warnings These warnings came up with gcc-11.3.0.	2022-08-20 00:12:35 -07:00
Keith Smiley	3c24fae398	[lld-macho] Add support for objc_msgSend stubs Apple Clang in Xcode 14 introduced a new feature for reducing the overhead of objc_msgSend calls by deduplicating the setup calls for each individual selector. This works by clang adding undefined symbols for each selector called in a translation unit, such as `_objc_msgSend$foo` for calling the `foo` method on any `NSObject`. There are 2 different modes for this behavior, the default directly does the setup for `_objc_msgSend` and calls it, and the smaller option does the selector setup, and then calls the standard `_objc_msgSend` stub function. The general overview of how this works is: - Undefined symbols with the given prefix are collected - The suffix of each matching undefined symbol is added as a string to `__objc_methname` - A pointer is added for every method name in the `__objc_selrefs` section - A `got` entry is emitted for `_objc_msgSend` - Stubs are emitted pointing to the synthesized locations Notes: - Both `__objc_methname` and `__objc_selrefs` can also exist from object files, so their contents are merged with our synthesized contents - The compiler emits method names for defined methods, but not for undefined symbols you call, but stubs are used for both - This only implements the default "fast" mode currently just to reduce the diff, I also doubt many folks will care to swap modes - This only implements this for arm64 and x86_64, we don't need to implement this for 32 bit iOS archs, but we should implement it for watchOS archs in a later diff Differential Revision: https://reviews.llvm.org/D128108	2022-08-10 17:17:17 -07:00
Nico Weber	241f0e8b76	[lld/mac] Add support for $ld$previous symbols with explicit symbol name A symbol `$ld$previous$/Another$1.2.3$1$3.0$14.0$_xxx$` means "pretend symbol `_xxx` is in dylib `/Another` with version `1.2.3` if the deployment target is between `3.0` and `14.0` and we're targeting platform `1` (ie macOS)". This means dylibs can now inject synthetic dylibs into the link, so DylibFile needs to grow a 3rd constructor. The only other interesting thing is that such an injected dylib counts as a use of the original dylib. This patch gets this mostly right (if _only_ `$ld$previous` symbols are used from a dylib, we don't add a dep on the dylib itself, matching ld64), but one case where we don't match ld64 yet is that ld64 even omits the original dylib when linking it with `-needed-l`. Lld currently still adds a load command for the original dylib in that case. (That's for a future patch.) Fixes #56074. Differential Revision: https://reviews.llvm.org/D130725	2022-07-28 20:35:48 -04:00
Jez Ng	d23da0ec6c	[lld-macho] Fold __objc_imageinfo sections Previously, we treated it as a regular ConcatInputSection. However, ld64 actually parses its contents and uses that to synthesize a single image info struct, generating one 8-byte section instead of `8 * number of object files with ObjC code`. I'm not entirely sure what impact this section has on the runtime, so I just tried to follow ld64's semantics as closely as possible in this diff. My main motivation though was to reduce binary size. No significant perf change on chromium_framework on my 16-core Mac Pro: base diff difference (95% CI) sys_time 1.764 ± 0.062 1.748 ± 0.032 [ -2.4% .. +0.5%] user_time 5.112 ± 0.104 5.106 ± 0.046 [ -0.9% .. +0.7%] wall_time 6.111 ± 0.184 6.085 ± 0.076 [ -1.6% .. +0.8%] samples 30 32 Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D130125	2022-07-23 12:12:01 -04:00
Nico Weber	0ec87addb7	[lld/mac] Add a few TimeTraceScopes Identical literal folding takes ~1.4% of the time, and was missing from the trace. Signature computation still needs ~2.2% of the time, so probably worth explicitly marking its contribution to "Write output file" (9.1%) Differential Revision: https://reviews.llvm.org/D128343	2022-06-23 11:46:57 -04:00
Daniel Bertalan	0eec7e2a89	Reland "[lld-macho] Group undefined symbol diagnostics by symbol". This reverts commit 36e7c9a450db5e22af1ec21412d918ceb2313942. This relands d61341768cf0cff7c with the fix described in https://reviews.llvm.org/D127753#3587390	2022-06-15 19:22:39 -04:00

236 Commits