llvm-project

Author	SHA1	Message	Date
Kyungwoo Lee	391393179a	[lld-macho] icf objc stubs (#79730 ) This supports icf for objc stubs.	2024-02-01 14:19:11 -08:00
Kyungwoo Lee	77e204c7b0	[lld-macho][arm64] implement -objc_stubs_small (#78665 ) This patch implements `-objc_stubs_small` targeting arm64, aiming to align with ld64's behavior. 1. `-objc_stubs_fast`: As previously implemented, this always uses the Global Offset Table (GOT) to invoke `objc_msgSend`. The alignment of the objc stub is 32 bytes. 2. `-objc_stubs_small`: This behavior depends on whether `objc_msgSend` is defined. If it is, it directly jumps to `objc_msgSend`. If not, it creates another stub to indirectly jump to `objc_msgSend`, minimizing the size. The alignment of the objc stub in this case is 4 bytes.	2024-01-23 07:31:34 -08:00
Vy Nguyen	595cd45a66	[lld-macho][nfc]Add bounds on sections and subsections check before attempting to dereferencing iterators. Runnign some tests with asan built of LLD would throw errors similar to the following: AddressSanitizer:DEADLYSIGNAL #0 0x55d8e6da5df7 in operator() /mnt/ssd/repo/lld/llvm-project/lld/MachO/Arch/ARM64.cpp:612 #1 0x55d8e6daa514 in operator() /mnt/ssd/repo/lld/llvm-project/lld/MachO/Arch/ARM64.cpp:650 Differential Revision: https://reviews.llvm.org/D157027	2023-09-11 14:19:25 -04:00
Jez Ng	3df4c5a92f	[NFC] Optimize vector usage in lld By using emplace_back, as well as converting some loops to for-each, we can do more efficient vectorization. Make copy constructor for TemporaryFile noexcept. Reviewed By: #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D139552	2023-01-26 20:31:42 -05:00
Jez Ng	0e8d4980a8	[lld-macho] Standardize error messages Errors / warnings that originate from a particular file should be of the form `$file: $message`. Reviewed By: #lld-macho, keith Differential Revision: https://reviews.llvm.org/D140634	2022-12-23 19:44:56 -05:00
Daniel Bertalan	0d30e92f59	[lld-macho] Add support for emitting chained fixups This commit adds support for chained fixups, which were introduced in Apple's late 2020 OS releases. This format replaces the dyld opcodes used for supplying rebase and binding information, and encodes most of that data directly in the memory location that will have the fixup applied. This reduces binary size and is a requirement for page-in linking, which will be available starting with macOS 13. A high-level overview of the format and my implementation can be found in SyntheticSections.h. This feature is currently gated behind the `-fixup_chains` flag, and will be enabled by default for supported targets in a later commit. Like in ld64, lazy binding is disabled when chained fixups are in use, and the `-init_offsets` transformation is performed by default. Differential Revision: https://reviews.llvm.org/D132560	2022-10-04 11:48:45 +02:00
Daniel Bertalan	a8843ec952	[lld-macho] Parallelize linker optimization hint processing This commit moves the parsing of linker optimization hints into `ARM64::applyOptimizationHints`. This lets us avoid allocating memory for holding the parsed information, and moves work out of `ObjFile::parse`, which is not parallelized at the moment. This change reduces the overhead of processing LOHs to 25-30 ms when linking Chromium Framework on my M1 machine; previously it took close to 100 ms. There's no statistically significant change in runtime for a --threads=1 link. Performance figures with all 8 cores utilized: N Min Max Median Avg Stddev x 20 3.8027232 3.8760762 3.8505335 3.8454145 0.026352574 + 20 3.7019017 3.8660538 3.7546209 3.7620371 0.032680043 Difference at 95.0% confidence -0.0833775 +/- 0.019 -2.16823% +/- 0.494094% (Student's t, pooled s = 0.0296854) Differential Revision: https://reviews.llvm.org/D133439	2022-09-16 17:38:46 +02:00
Daniel Bertalan	a8ec90ad9c	[lld-macho] Simplify linker optimization hint processing This commit removes the `relocTargets` vector, and instead makes the code reconstruct the referent addresses from the relocated instructions. This will allow us to move `applyOptimizationHints` from `ConcatInputSection::writeTo` to a separate pass that parses and applies LOHs in one step, on a per-file basis. This will improve performance, as parsing is currently done serially in `ObjFile::parse`. I opted to remove the sanity check that ensures that all relocations within a LOH point to the same symbol. This completely eliminates the need to search through relocations. It is my understanding that mismatched relocation targets should not be present in valid object files, so it's unlikely that the removal will lead to mislinks. Differential Revision: https://reviews.llvm.org/D133274	2022-09-06 08:19:08 +02:00
Keith Smiley	3c24fae398	[lld-macho] Add support for objc_msgSend stubs Apple Clang in Xcode 14 introduced a new feature for reducing the overhead of objc_msgSend calls by deduplicating the setup calls for each individual selector. This works by clang adding undefined symbols for each selector called in a translation unit, such as `_objc_msgSend$foo` for calling the `foo` method on any `NSObject`. There are 2 different modes for this behavior, the default directly does the setup for `_objc_msgSend` and calls it, and the smaller option does the selector setup, and then calls the standard `_objc_msgSend` stub function. The general overview of how this works is: - Undefined symbols with the given prefix are collected - The suffix of each matching undefined symbol is added as a string to `__objc_methname` - A pointer is added for every method name in the `__objc_selrefs` section - A `got` entry is emitted for `_objc_msgSend` - Stubs are emitting pointing to the synthesized locations Notes: - Both `__objc_methname` and `__objc_selrefs` can also exist from object files, so their contents are merged with our synthesized contents - The compiler emits method names for defined methods, but not for undefined symbols you call, but stubs are used for both - This only implements the default "fast" mode currently just to reduce the diff, I also doubt many folks will care to swap modes - This only implements this for arm64 and x86_64, we don't need to implement this for 32 bit iOS archs, but we should implement it for watchOS archs in a later diff Differential Revision: https://reviews.llvm.org/D128108	2022-08-10 17:17:17 -07:00
Martin Storsjö	59c6f418fa	[LLD] [MachO] Fix GCC build warnings This fixes the following warnings produced by GCC 9: ../tools/lld/MachO/Arch/ARM64.cpp: In member function ‘void {anonymous}::OptimizationHintContext::applyAdrpLdr(const lld::macho::OptimizationHint&)’: ../tools/lld/MachO/Arch/ARM64.cpp:448:18: warning: comparison of integer expressions of different signedness: ‘int64_t’ {aka ‘long int’} and ‘uint64_t’ {aka ‘long unsigned int’} [-Wsign-compare] 448 \| if (ldr.offset != (rel1->referentVA & 0xfff)) \| ~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../tools/lld/MachO/UnwindInfoSection.cpp: In function ‘bool canFoldEncoding(compact_unwind_encoding_t)’: ../tools/lld/MachO/UnwindInfoSection.cpp:404:44: warning: comparison between ‘enum<unnamed>’ and ‘enum<unnamed>’ [-Wenum-compare] 404 \| static_assert(UNWIND_X86_64_MODE_MASK == UNWIND_X86_MODE_MASK, ""); \| ^~~~~~~~~~~~~~~~~~~~ ../tools/lld/MachO/UnwindInfoSection.cpp:405:49: warning: comparison between ‘enum<unnamed>’ and ‘enum<unnamed>’ [-Wenum-compare] 405 \| static_assert(UNWIND_X86_64_MODE_STACK_IND == UNWIND_X86_MODE_STACK_IND, ""); \| ^~~~~~~~~~~~~~~~~~~~~~~~~ Differential Revision: https://reviews.llvm.org/D130970	2022-08-03 00:14:39 +03:00
Daniel Bertalan	d1e40f4d58	[lld-macho] Add LOH_ARM64_ADRP_ADD_LDR optimization hint support This hint instructs the linker to optimize an adrp+add+ldr sequence used for loading from a local symbol's address by loading directly if it's close enough, or with an adrp(p)+ldr sequence if it's not. This transformation is the same as what's done for ADRP_LDR_GOT_LDR when the symbol is local. The logic for acting on this hint is therefore moved to a new function which will be called from the existing applyAdrpLdrGotLdr() function. Differential Revision: https://reviews.llvm.org/D130505	2022-07-28 10:45:28 +02:00
Daniel Bertalan	1fb9466c6a	[lld-macho] Devirtualize TargetInfo::getRelocAttrs This method is called on each relocation when parsing input files, so the overhead of using virtual functions ends up being quite large. We now have a single non-virtual method, which reads from the appropriate array of relocation attributes set in the TargetInfo constructor. This change results in a modest 2.3% reduction in link time for chromium_framework measured on an x86-64 VPS, and 0.7% on an arm64 Mac. N Min Max Median Avg Stddev x 10 11.869417 12.032609 11.935041 11.938268 0.045802324 + 10 11.581526 11.785265 11.649885 11.659507 0.054634834 Difference at 95.0% confidence -0.278761 +/- 0.0473673 -2.33502% +/- 0.396768% (Student's t, pooled s = 0.0504124) Differential Revision: https://reviews.llvm.org/D130000	2022-07-18 19:32:58 +02:00
Daniel Bertalan	ecb14fd872	[lld-macho] Add LOH_ARM64_ADRP_LDR_GOT_LDR optimization hint support This hint instructs the linker to relax a GOT-indirect load. If the referenced symbol is external and its GOT entry is within +/- 1 MiB, the GOT entry can be loaded with a single literal ldr instruction. If the referenced symbol is local, its address may be loaded directly if it's close enough, or with an adr(p) + ldr pair if it's not. This type accounts for more than half of all LOHs in chromium_framework. This commit moves the eligibility checks into helper functions to improve the readability of the LOH processing code. Ho functional changes are intended to the previously implemented LOH types. Differential Revision: https://reviews.llvm.org/D129427	2022-07-13 12:20:14 +02:00
Daniel Bertalan	2028fe6fbc	[lld-macho] Handle LOH_ARM64_ADRP_LDR_GOT optimization hints This hint instructs the linker to perform the AdrpLdr or AdrpAdd transformation depending on whether the GOT load has been relaxed to load a local symbol's address. Differential Revision: https://reviews.llvm.org/D129059	2022-07-05 07:33:13 +02:00
Daniel Bertalan	73b659ff55	[lld-macho] Fix left shift of negative value UB I introduced this mistake in 573c7e6b3c79c7ce80a2221e000fab7dd20c0bb4. Fixes the failure on this UBSan bot: https://lab.llvm.org/buildbot/#/builders/5/builds/25537	2022-07-01 12:00:16 +02:00
Daniel Bertalan	573c7e6b3c	[lld-macho] Handle LOH_ARM64_ADRP_LDR linker optimization hints This linker optimization hint transforms a pair of adrp+ldr (immediate) instructions into an ldr (literal) load from a PC-relative address if it is 4-byte aligned and within +/- 1 MiB, as ldr can encode a signed 19-bit offset that gets multiplied by 4. In the wild, only a small number of these hints are applicable because not many loads end up close enough to the data segment. However, the added helper functions will be useful in implementing the rest of the LOH types. Differential Revision: https://reviews.llvm.org/D128942	2022-07-01 09:44:24 +02:00
Daniel Bertalan	a3f67f0920	[lld-macho] Initial support for Linker Optimization Hints Linker optimization hints mark a sequence of instructions used for synthesizing an address, like ADRP+ADD. If the referenced symbol ends up close enough, it can be replaced by a faster sequence of instructions like ADR+NOP. This commit adds support for 2 of the 7 defined ARM64 optimization hints: - LOH_ARM64_ADRP_ADD, which transforms a pair of ADRP+ADD into ADR+NOP if the referenced address is within +/- 1 MiB - LOH_ARM64_ADRP_ADRP, which transforms two ADRP instructions into ADR+NOP if they reference the same page These two kinds already cover more than 50% of all LOHs in chromium_framework. Differential Review: https://reviews.llvm.org/D128093	2022-06-30 06:28:42 +02:00
Jez Ng	e183bf8e15	[lld-macho][reland] Initial support for EH Frames This reverts commit 942f4e3a7cc9a9f8b2654817cff12907d1276031. The additional change required to avoid the assertion errors seen previously is: --- a/lld/MachO/ICF.cpp +++ b/lld/MachO/ICF.cpp @@ -443,7 +443,9 @@ void macho::foldIdenticalSections() { /relocVA=/0); isec->data = copy; } - } else { + } else if (!isEhFrameSection(isec)) { + // EH frames are gathered as hashables from unwindEntry above; give a + // unique ID to everything else. isec->icfEqClass[0] = ++icfUniqueID; } } Differential Revision: https://reviews.llvm.org/D123435	2022-06-13 07:45:16 -04:00
Douglas Yung	942f4e3a7c	Revert "[lld-macho] Initial support for EH Frames" This reverts commit 826be330af9c0a8553a5b32718ecd2d97e10438e. This was causing a test failure on build bots: - https://lab.llvm.org/buildbot/#/builders/36/builds/21770 - https://lab.llvm.org/buildbot/#/builders/58/builds/23913	2022-06-09 05:25:43 -07:00
Jez Ng	826be330af	[lld-macho] Initial support for EH Frames == Background == `llvm-mc` generates unwind info in both compact unwind and DWARF formats. LLD already handles the compact unwind format; this diff gets us close to handling the DWARF format properly. == Caveats == It's not quite done yet, but I figure it's worth getting this reviewed and landed first as it's shaping up to be a fairly large code change. Known limitations of the current code: * Only works for x86_64, for which `llvm-mc` emits "abs-ified" relocations as described in `618def651b`. `llvm-mc` emits regular relocations for ARM EH frames, which we do not yet handle correctly. Since the feature is not ready for real use yet, I've gated it behind a flag that only gets toggled on during test suite runs. With most of the new code disabled, we see just a hint of perf regression, so I don't think it'd be remiss to land this as-is: base diff difference (95% CI) sys_time 1.926 ± 0.168 1.979 ± 0.117 [ -1.2% .. +6.6%] user_time 3.590 ± 0.033 3.606 ± 0.028 [ +0.0% .. +0.9%] wall_time 7.104 ± 0.184 7.179 ± 0.151 [ -0.2% .. +2.3%] samples 30 31 == Design == Like compact unwind entries, EH frames are also represented as regular ConcatInputSections that get pointed to via `Defined::unwindEntry`. This allows them to be handled generically by e.g. the MarkLive and ICF code. (But note that unlike compact unwind subsections, EH frame subsections do end up in the final binary.) In order to make EH frames "look like" a regular ConcatInputSection, some processing is required. First, we need to split the `__eh_frame` section along EH frame boundaries rather than along symbol boundaries. We do this by decoding the length field of each EH frame. Second, the abs-ified relocations need to be turned into regular Relocs. == Next Steps == In order to support EH frames on ARM targets, we will either have to teach LLD how to handle EH frames with explicit relocs, or we can try to make `llvm-mc` emit abs-ified relocs for ARM as well. I'm hoping to do the latter as I think it will make the LLD implementation both simpler and faster to execute. == Misc == The `obj-file-with-stabs.s` test had to be updated as the previous version would trip assertion errors in the code. It appears that in our attempt to produce a minimal YAML test input, we created a file with invalid EH frame data. I've fixed this by re-generating the YAML and not doing any hand-pruning of it. Reviewed By: #lld-macho, Roger Differential Revision: https://reviews.llvm.org/D123435	2022-06-08 23:40:52 -04:00
Jez Ng	7f3ddf8443	[lld-macho][nfc] Allow Defined symbols to be placed in binding sections Previously, we only allowed this for DylibSymbols. However, in order to properly support `-flat_namespace` as well as `-interposable`, we need to allow this for Defined symbols too. Therefore we hoist the `lazyBindOffset` and the `stubsHelperIndex` into the parent Symbol class. The actual change to support interposition under `-flat_namespace` is in {D119294}; the NFC changes here have been split out for easier review. Perf regression isn't stat sig on my 3.2 GHz 16-Core Intel Xeon W linking chromium_framework: base diff difference (95% CI) sys_time 1.227 ± 0.021 1.234 ± 0.031 [ -0.3% .. +1.5%] user_time 3.665 ± 0.036 3.674 ± 0.035 [ -0.2% .. +0.7%] wall_time 4.596 ± 0.055 4.609 ± 0.064 [ -0.3% .. +0.9%] samples 34 47 Max RSS regression is barely stat sig: base diff difference (95% CI) time 1003664356.324 ± 15404053.912 1010380403.613 ± 10578309.455 [ +0.0% .. +1.3%] samples 37 31 Reviewed By: modimo Differential Revision: https://reviews.llvm.org/D121351	2022-03-14 22:18:32 -04:00
Nico Weber	9721197520	[lld/mac] Set branchRange a bit more carefully - Don't subtract thunkSize from branchRange. Most places care about the actual maximal branch range. Subtract thunkSize in the one place that wants to leave room for a thunk. - Set it to 0x800_0000 instead of 0xFF_FFFF - Subtract 4 for the positive branch direction since it's a two's complement 24bit number sign-extended mutiplied by 4, so its range is -0x800_0000..+0x7FF_FFFC - Make boundary checks include the boundary values This doesn't make a huge difference in practice. It's preparation for a "real" fix for PR51578 -- but it also lets the repro in comment 0 in that bug place one more thunk before hitting the TODO. Differential Revision: https://reviews.llvm.org/D108897	2021-08-30 12:36:06 -04:00
Greg McGary	93c8559baf	[lld-macho] Implement branch-range-extension thunks Extend the range of calls beyond an architecture's limited branch range by first calling a thunk, which loads the far address into a scratch register (x16 on ARM64) and branches through it. Other ports (COFF, ELF) use multiple passes with successively-refined guesses regarding the expansion of text-space imposed by thunk-space overhead. This MachO algorithm places thunks during MergedOutputSection::finalize() in a single pass using exact thunk-space overheads. Thunks are kept in a separate vector to avoid the overhead of inserting into the `inputs` vector of `MergedOutputSection`. FIXME: * arm64-stubs.s test is broken * add thunk tests * Handle thunks to DylibSymbol in MergedOutputSection::finalize() Differential Revision: https://reviews.llvm.org/D100818	2021-05-12 09:44:58 -07:00
Jez Ng	1aa29dffce	[lld-macho] Support subtractor relocations that reference sections The minuend (but not the subtrahend) can reference a section. Note that we do not yet properly validate that the subtrahend isn't referencing a section; I've filed PR50034 to track that. I've also extended the reloc-subtractor.s test to reorder symbols, to make sure that the addends are being associated with the minuend (and not the subtrahend) relocation. Fixes PR49999. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D100804	2021-04-20 16:58:57 -04:00
Jez Ng	4938b090cf	[lld-macho] Don't use arrays as template parameters MSVC from VSCode 2017 appears unhappy with it (causes an internal compiler error.) This also means that we need to avoid doing `sizeof(stubCode)` as `sizeof(int[N])` on function array parameters decays into `sizeof(int *)`. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100605	2021-04-15 21:16:34 -04:00
Jez Ng	3bc88eb392	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-15 21:16:33 -04:00
Jez Ng	8ca366935b	Revert "[lld-macho] Add support for arm64_32" and other stacked diffs This reverts commits: * 8914902b01a3f8bdea9c71a0d9d23e4ee0ae80e4 * 35a745d814e1cde3de25d0d959fddc31e1061a41 * 682d1dfe09436857aa3b64365b5cc6fcbf1f043b	2021-04-13 12:40:58 -04:00
Jez Ng	8914902b01	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-13 10:43:28 -04:00
Jez Ng	88cb786ec2	[lld-macho][nfc] Remove DYSYM8 reloc attribute It's likely redundant, per discussion with @gkm. The BYTE8 attribute covers the bit width requirement already. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100133	2021-04-09 19:48:08 -04:00
Jez Ng	817d98d841	[lld-macho][nfc] Refactor in preparation for 32-bit support The main challenge was handling the different on-disk structures (e.g. `mach_header` vs `mach_header_64`). I tried to strike a balance between sprinkling `target->wordSize == 8` checks everywhere (branchy = slow, and ugly) and templatizing everything (causes code bloat, also ugly). I think I struck a decent balance by judicious use of type erasure. Note that LLD-ELF has a similar architecture, though it seems to use more templating. Linking chromium_framework takes about the same time before and after this change: N Min Max Median Avg Stddev x 20 4.52 4.67 4.595 4.5945 0.044423204 + 20 4.5 4.71 4.575 4.582 0.056344803 No difference proven at 95.0% confidence Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99633	2021-04-02 18:46:39 -04:00
Greg McGary	427d359721	[lld-macho][NFC] Drop unnecessary macho:: namespace prefix on unambiguous references to Symbol Within `lld/macho/`, only `InputFiles.cpp` and `Symbols.h` require the `macho::` namespace qualifier to disambiguate references to `class Symbol`. Add braces to outer `for` of a 5-level single-line `if`/`for` nest. Differential Revision: https://reviews.llvm.org/D99555	2021-03-30 14:58:35 -07:00
Jez Ng	dc8bee9265	[lld-macho] Check address ranges when applying relocations This diff required fixing `getEmbeddedAddend` to apply sign extension to 32-bit values. We were previously passing around wrong 64-bit addend values that became "right" after being truncated back to 32-bit. I've also made `getEmbeddedAddend` return a signed int, which is similar to what LLD-ELF does for its `getImplicitAddend`. `reportRangeError`, `checkUInt`, and `checkInt` are counterparts of similar functions in LLD-ELF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D98387	2021-03-12 17:26:27 -05:00
Jez Ng	a723db92d8	[lld-macho][nfc] Refactor subtractor reloc handling SUBTRACTOR relocations are always paired with UNSIGNED relocations to indicate a pair of symbols whose address difference we want. Functionally they are like a single relocation: only one pointer gets written / relocated. Previously, we would handle these pairs by skipping over the SUBTRACTOR relocation and writing the pointer when handling the UNSIGNED reloc. This diff reverses things, so we write while handling SUBTRACTORs and skip over the UNSIGNED relocs instead. Being able to distinguish between SUBTRACTOR and UNSIGNED relocs in the write phase (i.e. inside `relocateOne`) is useful for the upcoming range check diff: we want to check that SUBTRACTOR relocs write signed values, but UNSIGNED relocs (naturally) write unsigned values. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D98386	2021-03-11 13:28:13 -05:00
Jez Ng	5433a79176	[lld-macho][nfc] Create Relocations.{h,cpp} for relocation-specific code This more closely mirrors the structure of lld-ELF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D98384	2021-03-11 13:28:09 -05:00
Nico Weber	7d26916859	[lld/mac] tweak comment based on feedback on D98053	2021-03-05 16:28:38 -05:00
Nico Weber	210cc0738b	[mac/lld] Fix scale computation for vector ops in PAGEOFF12 relocations With this, llvm-tblgen no longer tries and fails to allocate 7953 petabyte when it runs during the build. Instead, `check-llvm` with lld/mac as host linker now completes without any failures on an m1 mac. This vector op handling code matches what happens in: - ld64's OutputFile::applyFixUps() in OutputFile.cpp for kindStoreARM64PageOff12 - lld.ld64.darwinold's offset12KindFromInstruction() in lld/lib/ReaderWriter/MachO/ArchHandler_arm64.cpp for offset12scale16 - RuntimeDyld's decodeAddend() in llvm/lib/ExecutionEngine/RuntimeDyld/Targets/RuntimeDyldMachOAArch64.h for ARM64_RELOC_PAGEOFF12 Fixes PR49444. Differential Revision: https://reviews.llvm.org/D98053	2021-03-05 12:24:37 -05:00
Jez Ng	82b3da6f6f	[lld-macho] Extract embedded addends for arm64 UNSIGNED relocations On arm64, UNSIGNED relocs are the only ones that use embedded addends instead of the ADDEND relocation. Also ensure that the addend works when UNSIGNED is part of a SUBTRACTOR pair. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D97105	2021-02-27 12:31:34 -05:00
Jez Ng	541390131e	[lld-macho] Don't emit rebase opcodes for subtractor minuend relocs Also add a few asserts to verify that we are indeed handling an UNSIGNED relocation as the minued. I haven't made it an actual user-facing error since I don't think llvm-mc is capable of generating SUBTRACTOR relocations without an associated UNSIGNED. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97103	2021-02-27 12:31:34 -05:00
Jez Ng	cc5c03e109	[lld-macho] Properly test subtractor relocations & fix their attributes `llvm-mc` doesn't generate any relocations for subtractions between local symbols -- they must be global -- so the previous test wasn't actually testing any relocation logic. I've fixed that and extended the test to cover r_length=3 relocations as well as both x86_64 and arm64. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97057	2021-02-27 12:31:34 -05:00
Jez Ng	5e851733c5	[lld-macho] Fix semantics & add tests for ARM64 GOT/TLV relocs I've adjusted the RelocAttrBits to better fit the semantics of the relocations. In particular: 1. _UNSIGNED relocations are no longer marked with the `TLV` bit, even though they can occur within TLV sections. Instead the `TLV` bit is reserved for relocations that can reference thread-local symbols, and _UNSIGNED relocations have their own `UNSIGNED` bit. The previous implementation caused TLV and regular UNSIGNED semantics to be conflated, resulting in rebase opcodes being incorrectly emitted for TLV relocations. 2. I've added a new `POINTER` bit to denote non-relaxable GOT relocations. This distinction isn't important on x86 -- the GOT relocations there are either relaxable or non-relaxable loads -- but arm64 has `GOT_LOAD_PAGE21` which loads the page that the referent symbol is in (regardless of whether the symbol ends up in the GOT). This relocation must reference a GOT symbol (so must have the `GOT` bit set) but isn't itself relaxable (so must not have the `LOAD` bit). The `POINTER` bit is used for relocations that must reference a GOT slot. 3. A similar situation occurs for TLV relocations. 4. ld64 supports both a pcrel and an absolute version of ARM64_RELOC_POINTER_TO_GOT. But the semantics of the absolute version are pretty weird -- it results in the value of the GOT slot being written, rather than the address. (That means a reference to a dynamically-bound slot will result in zeroes being written.) The programs I've tried linking don't use this form of the relocation, so I've dropped our partial support for it by removing the relevant RelocAttrBits. Reviewed By: alexshap Differential Revision: https://reviews.llvm.org/D97031	2021-02-23 22:02:38 -05:00
Mikael Holmen	caff023b77	[lld] Silence compiler warnings by removing always true/false comparisons type is an uint8_t so type >= 0 is always true and type < 0 is always false.	2021-02-17 08:16:02 +01:00
Greg McGary	87104faac4	[lld-macho] Add ARM64 target arch This is an initial base commit for ARM64 target arch support. I don't represent that it complete or bug-free, but wish to put it out for review now that some basic things like branch target & load/store address relocs are working. I can add more tests to this base commit, or add them in follow-up commits. It is not entirely clear whether I use the "ARM64" (Apple) or "AArch64" (non-Apple) naming convention. Guidance is appreciated. Differential Revision: https://reviews.llvm.org/D88629	2021-02-08 18:14:07 -07:00

42 Commits