llvm-project

Author	SHA1	Message	Date
Thurston Dang	c85466dcd4	Reapply "[msan] Automatically print shadow for failing outlined checks" (#145611 ) (#145615 ) This reverts commit 5eb5f0d8760c6b757c1da22682b5cf722efee489 i.e., relands 1b71ea411a9d36705663b1724ececbdfec7cc98c. Test case was failing on aarch64 because the long double type is implemented differently on x86 vs aarch64. This reland restricts the test to x86. ---- Original CL description: A commonly used aid for debugging MSan reports is `__msan_print_shadow()`, which requires manual app code annotations (typically of the variable in the UUM report or nearby). This is in contrast to ASan, which automatically prints out the shadow map when a check fails. This patch changes MSan to print the shadow that failed an outlined check (checks are outlined per function after the `-msan-instrumentation-with-call-threshold` is exceeded) if verbosity >= 1. Note that we do not print out the shadow map of "neighboring" variables because this is technically infeasible; see "Caveat" below. This patch can be easier to use than `__msan_print_shadow()` because this does not require manual app code annotations. Additionally, due to optimizations, `__msan_print_shadow()` calls can sometimes spuriously affect whether a variable is initialized. As a side effect, this patch also enables outlined checks for arbitrary-sized shadows (vs. the current hardcoded handlers for {1,2,4,8}-byte shadows). Caveat: the shadow does not necessarily correspond to an individual user variable, because MSan instrumentation may combine and/or truncate multiple shadows prior to emitting a check that the mangled shadow is zero (e.g., `convertShadowToScalar()`, `handleSSEVectorConvertIntrinsic()`, `materializeInstructionChecks()`). OTOH it is arguably a strength that this feature emit the shadow that directly matters for the MSan check, but which cannot be obtained using the MSan API.	2025-06-24 20:33:11 -07:00
Thurston Dang	5eb5f0d876	Revert "[msan] Automatically print shadow for failing outlined checks" (#145611 ) Reverts llvm/llvm-project#145107 Reason: buildbot breakage (https://lab.llvm.org/buildbot/#/builders/51/builds/18512)	2025-06-24 15:53:19 -07:00
Thurston Dang	1b71ea411a	[msan] Automatically print shadow for failing outlined checks (#145107 ) A commonly used aid for debugging MSan reports is `__msan_print_shadow()`, which requires manual app code annotations (typically of the variable in the UUM report or nearby). This is in contrast to ASan, which automatically prints out the shadow map when a check fails. This patch changes MSan to print the shadow that failed an outlined check (checks are outlined per function after the `-msan-instrumentation-with-call-threshold` is exceeded) if verbosity >= 1. Note that we do not print out the shadow map of "neighboring" variables because this is technically infeasible; see "Caveat" below. This patch can be easier to use than `__msan_print_shadow()` because this does not require manual app code annotations. Additionally, due to optimizations, `__msan_print_shadow()` calls can sometimes spuriously affect whether a variable is initialized. As a side effect, this patch also enables outlined checks for arbitrary-sized shadows (vs. the current hardcoded handlers for {1,2,4,8}-byte shadows). Caveat: the shadow does not necessarily correspond to an individual user variable, because MSan instrumentation may combine and/or truncate multiple shadows prior to emitting a check that the mangled shadow is zero (e.g., `convertShadowToScalar()`, `handleSSEVectorConvertIntrinsic()`, `materializeInstructionChecks()`). OTOH it is arguably a strength that this feature emit the shadow that directly matters for the MSan check, but which cannot be obtained using the MSan API.	2025-06-24 15:09:44 -07:00
k-kashapov	271399831b	[MSan] Change overflow_size_tls type to IntPtrTy (#117689 ) As discussed in https://github.com/llvm/llvm-project/pull/109284#discussion_r1838819987: Changed `__msan_va_arg_overflow_size_tls` type from `Int64Ty` to `IntPtrTy`.	2025-04-08 09:51:13 -07:00
Vitaly Buka	a0bb2e21c1	[NFC][sanitizer] Move `InitTlsSize` into `InitializePlatformEarly` (#108921 )	2024-09-18 16:19:35 -07:00
Fangrui Song	ba66d60b1c	[sanitizer] Replace ALIGNED with alignas C++11 `alignas` is already used extensively. `alignas` must precede `static`, so adjust the ordering accordingly. msan.cpp: Clang 15 doesn't allow `__attribute__((visibility("default"))) alignas(16)`. Use the order `alignas(16) SANITIZER_INTERFACE_ATTRIBUTE`. Tested with Clang 7. Pull Request: https://github.com/llvm/llvm-project/pull/98958	2024-07-15 16:12:42 -07:00
Thurston Dang	62c61aa2bf	[msan] Change #ifdef SANITIZER_PPC to #if (#94009 ) `0e96eebc7f` accidentally turned the prior patch (`57a507930b`) into a no-op because this macro is always defined (as either 1 or 0). This patch changes it to correctly use #if.	2024-05-31 14:26:49 -07:00
Thurston Dang	0e96eebc7f	[msan] Reland: Increase k num stack origin descrs (limited to non-PowerPC) (#93117 ) The original pull request (https://github.com/llvm/llvm-project/pull/92838) was reverted due to a PowerPC buildbot breakage (`df626dd11c`). This reland limits the scope of the change to non-PowerPC platforms. I am unaware of any PowerPC use cases that would benefit from a larger kNumStackOriginDescrs constant. Original CL description: This increases the constant size of kNumStackOriginDescrs to 4M (64GB of BSS across two arrays), which ought to be enough for anybody. This is the easier alternative suggested by eugenis@ in https://github.com/llvm/llvm-project/pull/92826.	2024-05-28 12:52:45 -07:00
Thurston Dang	df626dd11c	Revert "[msan] Increase kNumStackOriginDescrs constant (#92838 )" This reverts commit 57a507930b50c445140feb68bffe1c21af53319e. Reason: buildbot breakage (https://lab.llvm.org/buildbot/#/builders/57/builds/35160)	2024-05-21 21:45:37 +00:00
Thurston Dang	57a507930b	[msan] Increase kNumStackOriginDescrs constant (#92838 ) This increases the constant size of kNumStackOriginDescrs to 4M (64GB of BSS across two arrays), which ought to be enough for anybody. This is the easier alternative suggested by eugenis@ in https://github.com/llvm/llvm-project/pull/92826.	2024-05-21 12:41:36 -07:00
Thurston Dang	58f7251820	[msan] Re-exec with no ASLR if memory layout is incompatible on Linux (#85142 ) This ports the change from TSan (`0784b1eefa`). Testing notes: run 'sudo sysctl vm.mmap_rnd_bits=32; ninja check-msan' before and after this patch. N.B. aggressive ASLR may also cause the app to overlap with the allocator region; for MSan, this was fixed in `af2bf86a37`	2024-03-15 09:49:00 -07:00
Vitaly Buka	fcce843227	[msan] Use `pthread_atfork` instead of interceptor (#75398 ) This is done for consistency with other sanitizers. Also lock the allocator.	2023-12-13 15:36:38 -08:00
Leonard Chan	afd170bdd9	[sanitizer] Consolidate some LowLevelAllocators to one This removes and replaces usage of a few LowLevelAllocators with a single one provided by sanitizer_common. Functionally, there should be no difference between using different allocators vs the same one. This works really well with D158783 which controls the size of each allocator mmap to significantly reduce fragmentation. This doesn't remove them all, mainly the ones used by asan and the flag parser. Differential Revision: https://reviews.llvm.org/D158786	2023-08-28 23:12:37 +00:00
Fangrui Song	39b8a27132	[sanitizer] Simplify with GET_CALLER_PC_BP. NFC	2023-02-04 11:30:14 -08:00
Vitaly Buka	ad2b356f85	[msan] Use no-origin functions when possible Saves 1.8% of .text size on CTMark Reviewed By: kda Differential Revision: https://reviews.llvm.org/D133077	2022-09-01 19:18:38 -07:00
Kevin Athey	532564de17	[MSAN] add flag to suppress storage of stack variable names with -sanitize-memory-track-origins Allows for even more savings in the binary image while simultaneously removing the name of the offending stack variable. Depends on D131631 Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D131728	2022-08-12 11:59:53 -07:00
Kevin Athey	ec277b67eb	[MSAN] Separate id ptr from constant string for variable names used in track origins. The goal is to reduce the size of the MSAN with track origins binary, by making the variable name locations constant which will allow the linker to compress them. Follows: https://reviews.llvm.org/D131415 Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D131631	2022-08-12 08:47:36 -07:00
Kevin Athey	4735104f09	[MSAN] remove unused debugging statements (NFC) Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D131748	2022-08-11 20:24:34 -07:00
Vitaly Buka	aca1276160	[msan] Avoid unnecessary PC increment/decrement Reviewed By: kda Differential Revision: https://reviews.llvm.org/D131692	2022-08-11 14:29:07 -07:00
Vitaly Buka	af77e5e4c0	[msan] Extract SetAllocaOrigin	2022-08-10 20:53:02 -07:00
Vitaly Buka	d1040c455f	[msan] Another try for powerpc fix after D131205	2022-08-10 20:39:25 -07:00
Vitaly Buka	05b3374925	[msan] Try to fix powerpc after D131205	2022-08-10 19:28:30 -07:00
Kevin Athey	d7a47a9bb5	Desist from passing function location to __msan_set_alloca_origin4. This is done by calling __msan_set_alloca_origin and providing the location of the variable by using the call stack. This is prepatory work for dropping variable names when track-origins is enabled. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D131205	2022-08-10 09:02:53 -07:00
Vitaly Buka	47a9528fb4	[NFC][msan] Rename SymbolizerScope to UnwinderScope and hide	2022-04-12 18:57:54 -07:00
Dmitry Vyukov	595d340dce	sanitizer_common: make internal/external headers compatible This is a follow up to 4f3f4d672254 ("sanitizer_common: fix __sanitizer_get_module_and_offset_for_pc signature mismatch") which fixes a similar problem for msan build. I am getting the following error compiling a unit test for code that uses sanitizer_common headers and googletest transitively includes sanitizer interface headers: In file included from third_party/gwp_sanitizers/singlestep_test.cpp:3: In file included from sanitizer_common/sanitizer_common.h:19: sanitizer_interface_internal.h:41:5: error: typedef redefinition with different types ('struct __sanitizer_sandbox_arguments' vs 'struct __sanitizer_sandbox_arguments') } __sanitizer_sandbox_arguments; common_interface_defs.h:39:3: note: previous definition is here } __sanitizer_sandbox_arguments; Reviewed By: melver Differential Revision: https://reviews.llvm.org/D119546	2022-02-11 19:39:44 +01:00
Vitaly Buka	b3267bb3af	[NFC][msan] Split ThreadStart and Init	2021-11-08 18:58:33 -08:00
Martin Liska	13a442ca49	Enable -Wformat-pedantic and fix fallout. Differential Revision: https://reviews.llvm.org/D113172	2021-11-05 13:12:35 +01:00
Vitaly Buka	ef85ea9a4f	[msan] Print both shadow and user address before: 00 00 00 00 ff ff ff ff 00 00 00 00 00 00 00 00 Shadow map of [0x211000000005, 0x21100000012e), 297 bytes: now: 0x2f60d213ac10[0x7f60d213ac10] 00 00 00 00 ff ff ff ff 00 00 00 00 00 00 00 00 Shadow map [0x211000000005, 0x21100000012e) of [0x711000000005, 0x711000000135), 297 bytes: Differential Revision: https://reviews.llvm.org/D111261	2021-10-07 17:56:46 -07:00
Dmitry Vyukov	d26d5a0a3d	msan: clean up and enable format string checking Depends on D107981. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107982	2021-08-13 13:45:02 +02:00
Fangrui Song	261d6e05d5	[sanitizer] Simplify __sanitizer::BufferedStackTrace::UnwindImpl implementations Intended to be NFC. D102046 relies on the refactoring for stack boundaries.	2021-05-13 21:26:31 -07:00
Dmitry Vyukov	2721e27c3a	sanitizer_common: deduplicate CheckFailed We have some significant amount of duplication around CheckFailed functionality. Each sanitizer copy-pasted a chunk of code. Some got random improvements like dealing with recursive failures better. These improvements could benefit all sanitizers, but they don't. Deduplicate CheckFailed logic across sanitizers and let each sanitizer only print the current stack trace. I've tried to dedup stack printing as well, but this got me into cmake hell. So let's keep this part duplicated in each sanitizer for now. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102221	2021-05-12 08:50:53 +02:00
Nico Weber	0e92cbd6a6	Revert "[sanitizer] Simplify GetTls with dl_iterate_phdr on Linux" This reverts commit ec575e3b0a462ff7a3d23d0f39a22147606050de. Still doesn't work, see https://crbug.com/1196037	2021-04-05 19:00:18 -04:00
Fangrui Song	ec575e3b0a	[sanitizer] Simplify GetTls with dl_iterate_phdr on Linux This was reverted by f176803ef1f4050a350e01868d64fe09a674d3bf due to Ubuntu 16.04 x86-64 glibc 2.23 problems. This commit additionally calls `__tls_get_addr({modid,0})` to work around the dlpi_tls_data==NULL issues for glibc<2.25 (https://sourceware.org/bugzilla/show_bug.cgi?id=19826) GetTls is the range of * thread control block and optional TLS_PRE_TCB_SIZE * static TLS blocks plus static TLS surplus On glibc, lsan requires the range to include `pthread::{specific_1stblock,specific}` so that allocations only referenced by `pthread_setspecific` can be scanned. This patch uses `dl_iterate_phdr` to collect TLS blocks. Find the one with `dlpi_tls_modid==1` as one of the initially loaded module, then find consecutive ranges. The boundaries give us addr and size. This allows us to drop the glibc internal `_dl_get_tls_static_info` and `InitTlsSize` entirely. Use the simplified method with non-Android Linux for now, but in theory this can be used with *BSD and potentially other ELF OSes. This simplification enables D99566 for TLS Variant I architectures. See https://reviews.llvm.org/D93972#2480556 for analysis on GetTls usage across various sanitizers. Differential Revision: https://reviews.llvm.org/D98926	2021-04-04 15:35:53 -07:00
Nico Weber	f176803ef1	Revert "[sanitizer] Simplify GetTls with dl_iterate_phdr" This reverts commit 9be8f8b34d9b150cd1811e3556fe9d0cd735ae29. This breaks tsan on Ubuntu 16.04: $ cat tiny_race.c #include <pthread.h> int Global; void Thread1(void x) { Global = 42; return x; } int main() { pthread_t t; pthread_create(&t, NULL, Thread1, NULL); Global = 43; pthread_join(t, NULL); return Global; } $ out/gn/bin/clang -fsanitize=thread -g -O1 tiny_race.c --sysroot ~/src/chrome/src/build/linux/debian_sid_amd64-sysroot/ $ docker run -v $PWD:/foo ubuntu:xenial /foo/a.out FATAL: ThreadSanitizer CHECK failed: ../../compiler-rt/lib/tsan/rtl/tsan_platform_linux.cpp:447 "((thr_beg)) >= ((tls_addr))" (0x7fddd76beb80, 0xfffffffffffff980) #0 <null> <null> (a.out+0x4960b6) #1 <null> <null> (a.out+0x4b677f) #2 <null> <null> (a.out+0x49cf94) #3 <null> <null> (a.out+0x499bd2) #4 <null> <null> (a.out+0x42aaf1) #5 <null> <null> (libpthread.so.0+0x76b9) #6 <null> <null> (libc.so.6+0x1074dc) (Get the sysroot from here: https://commondatastorage.googleapis.com/chrome-linux-sysroot/toolchain/500976182686961e34974ea7bdc0a21fca32be06/debian_sid_amd64_sysroot.tar.xz) Also reverts follow-on commits: This reverts commit 58c62fd9768594ec8dd57e8320ba2396bf8b87e5. This reverts commit 31e541e37587100a5b21378380f54c028fda2d04.	2021-04-02 18:19:17 -04:00
Fangrui Song	9be8f8b34d	[sanitizer] Simplify GetTls with dl_iterate_phdr GetTls is the range of * thread control block and optional TLS_PRE_TCB_SIZE * static TLS blocks plus static TLS surplus On glibc, lsan requires the range to include `pthread::{specific_1stblock,specific}` so that allocations only referenced by `pthread_setspecific` can be scanned. This patch uses `dl_iterate_phdr` to collect TLS ranges. Find the one with `dlpi_tls_modid==1` as one of the initially loaded module, then find consecutive ranges. The boundaries give us addr and size. This allows us to drop the glibc internal `_dl_get_tls_static_info` and `InitTlsSize` entirely. Use the simplified method with non-Android Linux for now, but in theory this can be used with *BSD and potentially other ELF OSes. In the future, we can move `ThreadDescriptorSize` code to lsan (and consider intercepting `pthread_setspecific`) to avoid hacks in generic code. See https://reviews.llvm.org/D93972#2480556 for analysis on GetTls usage across various sanitizers. Differential Revision: https://reviews.llvm.org/D98926	2021-03-25 21:55:27 -07:00
Florian Schmaus	b1dd1a0997	[msan] Do not use 77 as exit code, instead use 1 MSan uses 77 as exit code since it appeared with c5033786ba34 ("[msan] MemorySanitizer runtime."). However, Test runners like the one from Meson use the GNU standard approach where a exit code of 77 signals that the test should be skipped [1]. As a result Meson's test runner reports tests as skipped if MSan is enabled and finds issues: build $ meson test ninja: Entering directory `/home/user/code/project/build' ninja: no work to do. 1/1 PROJECT:all / SimpleTest SKIP 0.09s I could not find any rationale why 77 was initially chosen, and I found no other clang sanitizer that uses this value as exit code. Hence I believe it is safe to change this to a safe default. You can restore the old behavior by setting the environment variable MSAN_OPTIONS to "exitcode=77", e.g. export MSAN_OPTIONS="exitcode=77" 1: https://mesonbuild.com/Unit-tests.html#skipped-tests-and-hard-errors Reviewed By: #sanitizers, eugenis Differential Revision: https://reviews.llvm.org/D92490	2020-12-10 14:23:12 -08:00
Vitaly Buka	d48f2d7c02	[sanitizer] Cleanup -Wnon-virtual-dtor warnings	2020-11-02 20:30:50 -08:00
Fangrui Song	2d7fd38cf7	[sanitizers] Remove unneeded MaybeCallDefaultOptions() and nullptr checks D28596 added SANITIZER_INTERFACE_WEAK_DEF which can guarantee `_default_options` are always defined. The weak attributes on the `__{asan,lsan,msan,ubsan}_default_options` declarations can thus be removed. `MaybeCallDefaultOptions` no longer need nullptr checks, so their call sites can just be replaced by `___default_options`. Reviewed By: #sanitizers, vitalybuka Differential Revision: https://reviews.llvm.org/D87175	2020-09-08 10:12:05 -07:00
Justin Cady	1d3ef5f122	[MSAN] Add fiber switching APIs Add functions exposed via the MSAN interface to enable MSAN within binaries that perform manual stack switching (e.g. through using fibers or coroutines). This functionality is analogous to the fiber APIs available for ASAN and TSAN. Fixes google/sanitizers#1232 Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D86471	2020-08-27 19:30:40 -07:00
Gui Andrade	c0b5000bd8	[MSAN RT] Use __sanitizer::mem_is_zero in __msan_test_shadow The former function is particularly optimized for exactly the use case we're interested in: an all-zero buffer. This reduces the overhead of calling this function some 80% or more. This is particularly for instrumenting code heavy with string processing functions, like grep. An invocation of grep with the pattern '[aeiou]k[aeiou]' has its runtime reduced by ~75% with this patch Differential Revision: https://reviews.llvm.org/D84961	2020-08-10 19:22:27 +00:00
Gui Andrade	b0ffa8befe	[MSAN] Pass Origin by parameter to __msan_warning functions Summary: Normally, the Origin is passed over TLS, which seems like it introduces unnecessary overhead. It's in the (extremely) cold path though, so the only overhead is in code size. But with eager-checks, calls to __msan_warning functions are extremely common, so this becomes a useful optimization. This can save ~5% code size. Reviewers: eugenis, vitalybuka Reviewed By: eugenis, vitalybuka Subscribers: hiraditya, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D81700	2020-06-15 17:49:18 -07:00
serge-sans-paille	af38074874	Fix strict aliasing warning in msan.cpp Use internal_memcpy instead. Differential Revision: https://reviews.llvm.org/D80732	2020-06-01 07:42:10 +02:00
Dan Liew	4c39f34199	[SanitizerCommon] Print the current value of options when printing out help. Summary: Previously it wasn't obvious what the default value of various sanitizer options were. A very close approximation of the "default values" for the options are the current value of the options at the time of printing the help output. In the case that no other options are provided then the current values are the default values (apart from `help`). ``` ASAN_OPTIONS=help=1 ./program ``` This patch causes the current option values to be printed when the `help` output is enabled. The original intention for this patch was to append `(Default: <value>)` to an option's help text. However because this is technically wrong (and misleading) I've opted to append `(Current Value: <value>)` instead. When trying to implement a way of displaying the default value of the options I tried another solution where the default value used in `.inc` files were used to create compile time strings that where used when printing the help output. This solution was not satisfactory for several reasons: Stringifying the default values with the preprocessor did not work very well in several cases. Some options contain boolean operators which no amount of macro expansion can get rid of. * It was much more invasive than this patch. Every sanitizer had to be changed. * The settings of `__<sanitizer>_default_options()` are ignored. For those reasons I opted for the solution in this patch. rdar://problem/42567204 Reviewers: kubamracek, yln, kcc, dvyukov, vitalybuka, cryptoad, eugenis, samsonov Subscribers: #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D69546	2019-11-14 14:04:34 -08:00
Vitaly Buka	d39e7e2cf1	[compiler-rt] Use GetNextInstructionPc in signal handlers Summary: All other stack trace callers assume that PC contains return address. HWAsan already use GetNextInstructionPc in similar code. PR43339 Reviewers: eugenis, kcc, jfb Subscribers: dexonsmith, dberris, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D68313 llvm-svn: 373529	2019-10-02 21:20:37 +00:00
Vitaly Buka	c0fa632236	Remove NOLINTs from compiler-rt llvm-svn: 371687	2019-09-11 23:19:48 +00:00
David Carlier	e2ed800d62	[Sanitizer] checks ASLR on FreeBSD - Especially MemorySanitizer fails if those sysctl configs are enabled. Reviewers: vitalybuka, emaste, dim Reviewed By: dim Differential Revision: https://reviews.llvm.org/D66582 llvm-svn: 369708	2019-08-22 21:36:35 +00:00
Nico Weber	60c66db476	compiler-rt: Rename .cc file in lib/msan to .cpp Like r367463, but for msan. llvm-svn: 367562	2019-08-01 14:08:18 +00:00

47 Commits