llvm-project

Author	SHA1	Message	Date
Arthur Eubanks	00647a18ce	[X86] Don't respect large data threshold for globals with an explicit section (#78348 ) If multiple globals are placed in an explicit section, there's a chance that the large data threshold will cause the different globals to be inconsistent in whether they're large or small. Mixing sections with mismatched large section flags can cause undesirable issues like increased relocation pressure because there may be 32-bit references to the section in some TUs, but the section is considered large since input section flags are unioned and other TUs added the large section flag. An explicit code model on the global still overrides the decision. We can do this for globals without any references to them, like what we did with asan_globals in #74514. If we have some precompiled small code model files where asan_globals is not considered large mixed with medium/large code model files, that's ok because the section is considered large and placed farther. However, overriding the code model for globals in some TUs but not others and having references to them from code will still result in the above undesired behavior. This mitigates a whole class of mismatched large section flag issues like what #77986 was trying to fix. This ends up not adding the SHF_X86_64_LARGE section flag on explicit sections in the medium/large code model. This is ok for the large code model since all references from large text must use 64-bit relocations anyway.	2024-01-17 15:38:32 -08:00
Arthur Eubanks	7433b1ca3e	Reapply "[X86] Set SHF_X86_64_LARGE for globals with explicit well-known large section name (#74381 )" This reverts commit 19fff858931bf575b63a0078cc553f8f93cced20. Now that explicit large globals are handled properly in the small code model.	2023-12-21 10:51:30 -08:00
Arthur Eubanks	19fff85893	Revert "[X86] Set SHF_X86_64_LARGE for globals with explicit well-known large section name (#74381 )" This reverts commit 323451ab88866c42c87971cbc670771bd0d48692. Code with these section names in the wild doesn't compile because support for large globals in the small code model is not complete yet.	2023-12-12 16:31:41 -08:00
Arthur Eubanks	ddebce7021	[X86] Respect code_model when determining if a global is small/large (#74498 ) Using the GlobalVariable code_model property added in #72077. code_model = "small" means the global should be treated as small regardless of the TargetMachine code model. code_model = "large" means the global should be treated as large regardless of the TargetMachine code model. Inferring small/large based on a known section name still takes precedence for correctness. The intention is to use this for globals that are accessed very infrequently but also take up a lot of space in the binary to mitigate relocation overflows. Prime examples are globals that go in "__llvm_prf_names" for coverage/PGO instrumented builds and "asan_globals" for ASan builds.	2023-12-05 10:38:02 -08:00
Arthur Eubanks	323451ab88	[X86] Set SHF_X86_64_LARGE for globals with explicit well-known large section name (#74381 ) Globals marked with the .lbss/.ldata/.lrodata should automatically be treated as large. Do this regardless of the code model for consistency when mixing object files compiled with different code models. Basically the other half of #70748. Example in the wild: https://codebrowser.dev/qt5/qtbase/src/testlib/qtestcase.cpp.html#1664	2023-12-05 08:26:08 -08:00
Arthur Eubanks	635756e4f3	[X86] Place data in large sections for large code model (#70265 ) This allows better interoperability mixing small/medium/large code model code since large code model data can be put into separate large sections. And respect large data threshold under large code model. gcc also does this: https://gcc.gnu.org/onlinedocs/gcc/x86-Options.html. See https://groups.google.com/g/x86-64-abi/c/jnQdJeabxiU.	2023-11-17 15:47:28 -08:00
Fangrui Song	5908559c10	[X86] Don't set SHF_X86_64_LARGE for variables with explicit section name of a well-known small data section prefix (#70748 ) Commit f3ea73133f91c1c23596d45680c8f2269c1dd289 allows SHF_X86_64_LARGE for all global variables with an explicit section. For the following variables, their data sections will be annotated as SHF_X86_64_LARGE. ``` const char relro[512] __attribute__((section(".rodata"))) = "a"; const char *const relro __attribute__((section(".data.rel.ro"))) = "a"; char data[512] __attribute__((section(".data"))) = "a"; ``` The typical linker requirement is that we do not create more than one output section with the same name, and the only output section should have the bitwise OR value of all input section flags. Therefore, the output .data section will have the SHF_X86_64_LARGE flag and be moved away from the regular sections. This is undesired but benign. However, .data.rel.ro having the SHF_X86_64_LARGE flag is problematic because dynamic loaders do not support more than one PT_GNU_RELRO program header, and LLD produces the error `error: section: .jcr is not contiguous with other relro sections`. I believe the most appropriate solution is to disallow SHF_X86_64_LARGE on variables with an explicit section of certain prefixes ( .bss/.data/.bss) and allow others (e.g. metadata sections for various instrumentation). Fortunately, global variables with an explicit .bss/.data/.bss section are rare, so they should not cause excessive relocation overflow pressure.	2023-10-30 17:03:04 -07:00
Arthur Eubanks	f3ea73133f	[ELF] Set large section flag for globals with an explicit section (#69396 ) An oversight in https://reviews.llvm.org/D148836 since this is a different code path.	2023-10-18 16:24:23 -07:00
Arthur Eubanks	b915f60678	[CodeGen] Don't treat thread local globals as large data (#67764 ) Otherwise they may mistakenly get the large section flag.	2023-09-29 12:56:53 -07:00
Arthur Eubanks	1feb00a28c	[X86] Introduce a large data threshold for the medium code model Currently clang's medium code model treats all data as large, putting them in a large data section and using more expensive instruction sequences to access them. Following gcc's -mlarge-data-threshold, which allows putting data under a certain size in a normal data section as opposed to a large data section. This allows using cheaper code sequences to access some portion of data in the binary (which will be implemented in LLVM in a future patch). And under the medium codel mode, only put data above the large data threshold into large data sections, not all data. Reviewed By: MaskRay, rnk Differential Revision: https://reviews.llvm.org/D149288	2023-09-14 15:09:25 -07:00
Arthur Eubanks	43249378da	[X86] Use "l" prefix for data sections under medium/large code model And also set the SHF_X86_64_LARGE section flag. gcc only uses the "l" prefix and SHF_X86_64_LARGE in the medium code model for data larger than -mlarge-data-threshold. But it seems more consistent to use it in the large code model as well in case separate parts of the binary aren't compiled with the large code model and also have a .data/.bss/.rodata section. Reviewed By: MaskRay, tkoeppe Differential Revision: https://reviews.llvm.org/D148836	2023-05-31 12:31:08 -07:00

11 Commits