llvm-project

Author	SHA1	Message	Date
Thurston Dang	4220538e25	[msan] Handle multiply-add-accumulate; apply to AVX Vector Neural Network Instructions (VNNI) (#153927 ) This extends the pmadd handler (recently improved in https://github.com/llvm/llvm-project/pull/153353) to three-operand intrinsics (multiply-add-accumulate), and applies it to the AVX Vector Neural Network Instructions. Updates the tests from https://github.com/llvm/llvm-project/pull/153135	2025-08-18 13:18:27 -07:00
Thurston Dang	ade755d62b	[msan] Add Instrumentation for Avx512 Instructions: pmaddw, pmaddubs (#153919 ) This applies the pmadd handler (recently improved in https://github.com/llvm/llvm-project/pull/153353) to the Avx512 equivalent of the pmaddw and pmaddubs intrinsics: <16 x i32> @llvm.x86.avx512.pmaddw.d.512(<32 x i16>, <32 x i16>) <32 x i16> @llvm.x86.avx512.pmaddubs.w.512(<64 x i8>, <64 x i8>)	2025-08-18 11:31:15 -07:00
Thurston Dang	638bd11c13	[msan] Handle SSE/AVX pshuf intrinsic by applying to shadow (#153895 ) llvm.x86.sse.pshuf.w(<1 x i64>, i8) and llvm.x86.avx512.pshuf.b.512(<64 x i8>, <64 x i8>) are currently handled strictly, which is suboptimal. llvm.x86.ssse3.pshuf.b(<1 x i64>, <1 x i64>), llvm.x86.ssse3.pshuf.b.128(<16 x i8>, <16 x i8>) and llvm.x86.avx2.pshuf.b(<32 x i8>, <32 x i8>) are currently heuristically handled using maybeHandleSimpleNomemIntrinsic, which is incorrect. Since the second argument is the shuffle order, we instrument all these intrinsics using `handleIntrinsicByApplyingToShadow(..., /*trailingVerbatimArgs=*/1)` (https://github.com/llvm/llvm-project/pull/114490).	2025-08-15 20:28:30 -07:00
Thurston Dang	2b75ff192d	[msan] Reland with even more improvement: Improve packed multiply-add instrumentation (#153353 ) This reverts commit cf002847a464c004a57ca4777251b1aafc33d958 i.e., relands ba603b5e4d44f1a25207a2a00196471d2ba93424. It was reverted because it was subtly wrong: multiplying an uninitialized zero should not result in an initialized zero. This reland fixes the issue by using instrumentation analogous to visitAnd (bitwise AND of an initialized zero and an uninitialized value results in an initialized value). Additionally, this reland expands a test case; fixes the commit message; and optimizes the change to avoid the need for horizontalReduce. The current instrumentation has false positives: it does not take into account that multiplying an initialized zero value with an uninitialized value results in an initialized zero value. This change fixes the issue during the multiplication step. The horizontal add step is modeled using bitwise OR. Future work can apply this improved handler to the AVX512 equivalent intrinsics (x86_avx512_pmaddw_d_512, x86_avx512_pmaddubs_w_512.) and AVX VNNI intrinsics.	2025-08-15 16:35:42 -07:00
Thurston Dang	cf002847a4	Revert "[msan] Improve packed multiply-add instrumentation" (#153343 ) Reverts llvm/llvm-project#152941 Buildbot breakage: https://lab.llvm.org/buildbot/#/builders/66/builds/17843	2025-08-12 21:32:07 -07:00
Thurston Dang	ba603b5e4d	[msan] Improve packed multiply-add instrumentation (#152941 ) The current instrumentation has false positives: if there is a single uninitialized bit in any of the operands, the entire output is poisoned. This does not take into account that multiplying an uninitialized value with zero results in an initialized zero value. This step allows elements that are zero to clear the corresponding shadow during the multiplication step. The horizontal add step and accumulation step (if any) are modeled using bitwise OR. Future work can apply this improved handler to the AVX512 equivalent intrinsics (x86_avx512_pmaddw_d_512, x86_avx512_pmaddubs_w_512.) and AVX VNNI intrinsics.	2025-08-12 19:13:48 -07:00
Thurston Dang	9a174518a8	[NFCI][msan] Precommit tests for AVX-VNNI (#153135 ) The tests largely cover AVX-VNNI (Vector Neural Network Instructions): - vpdpbusd, vpdpbusds - vpdpwssd, vpdpwssds AVX-VNNI-INT8: - vpdpbssd, vpdpbssds - vpdpbsud, vpdpbsuds - vpdpbuud, vpdpbuuds AVX-VNNI-INT16: - vpdpwsud, vpdpwsuds - vpdpwusd, vpdpwusds - vpdpwuud, vpdpwuuds These instructions are currently heuristically handled (by OR'ing together the vectors). This is incorrect because: 1) multiplication by a zero should result in an initialized value 2) the addition is horizontal (within vectors, not "vertically" between vectors). Future work can improve the instrumentation by applying the updated handleVectorPmaddIntrinsic() from https://github.com/llvm/llvm-project/pull/152941	2025-08-12 09:12:54 -07:00
Nikita Popov	c23b4fbdbb	[IR] Remove size argument from lifetime intrinsics (#150248 ) Now that #149310 has restricted lifetime intrinsics to only work on allocas, we can also drop the explicit size argument. Instead, the size is implied by the alloca. This removes the ability to only mark a prefix of an alloca alive/dead. We never used that capability, so we should remove the need to handle that possibility everywhere (though many key places, including stack coloring, did not actually respect this).	2025-08-08 11:09:34 +02:00
Nikita Popov	dbfc3ed690	[TypeSanitizer] Use alloca size for lifetime markers (#152154 ) Split out from https://github.com/llvm/llvm-project/pull/150248: Use the size of the alloca instead of the size passed to the lifetime intrinsic. As a bonus, this handles dynamic allocas correctly (see the added test) instead of doing a memset with size -1...	2025-08-07 14:39:32 +02:00
Yussur Mustafa Oraji	ded1f3ec96	[TSan] Add option to ignore capturing behavior when instrumenting (#148156 ) While not needed for most applications, some tools such as [MUST](https://www.i12.rwth-aachen.de/cms/i12/forschung/forschungsschwerpunkte/lehrstuhl-fuer-hochleistungsrechnen/~nrbe/must/) depend on the instrumentation being present. MUST uses the ThreadSanitizer annotation interface to detect data races in MPI programs, where the capture tracking is detrimental as it has no bearing on MPI data races, leading to missed races.	2025-08-06 15:47:33 +02:00
Nikita Popov	ba099c516d	[StackLifetime] Remove handling for lifetime size mismatch (#151965 ) Split out from #150248: Since #150944 the size passed to lifetime.start/end is considered meaningless. The lifetime always applies to the whole alloca. Accordingly remove handling for size mismatch in the StackLifetime analysis.	2025-08-05 09:19:10 +02:00
Nikita Popov	6df66a0683	[TypeSanitizer] Add test with lifetime intrinsics (NFC)	2025-08-04 17:00:47 +02:00
Paul Walker	04f98889ae	[LLVM][NumericalStabilitySanitizer] Add support for vector ConstantFPs. (#151739 )	2025-08-04 13:58:32 +01:00
Nikita Popov	86727fe9a1	[IR] Allow poison argument to lifetime markers (#151148 ) This slightly relaxes the invariant established in #149310, by also allowing the lifetime argument to be poison. This is to support the typical pattern of RAUWing with poison when removing an instruction. It's worth noting that this does not require any conservative assumptions, lifetimes with poison arguments can simply be skipped. Fixes https://github.com/llvm/llvm-project/issues/151119.	2025-08-04 10:02:04 +02:00
shuffle2	7b5a44c605	[hwasan] Add hwasan-all-globals option (#149621 ) hwasan-globals does not instrument globals with custom sections, because existing code may use `__start_`/`__stop_` symbols to iterate over globals in such a way which will cause hwasan assertions. Introduce new hwasan-all-globals option, which instruments all user-defined globals (but not those globals which are generated by the hwasan instrumentation itself), including those with custom sections. fixes #142442	2025-07-31 11:38:42 -07:00
Thurston Dang	56944e606a	[msan] Approximately handle AVX Galois Field Affine Transformation (#150794 ) e.g., <16 x i8> @llvm.x86.vgf2p8affineqb.128(<16 x i8>, <16 x i8>, i8) <32 x i8> @llvm.x86.vgf2p8affineqb.256(<32 x i8>, <32 x i8>, i8) <64 x i8> @llvm.x86.vgf2p8affineqb.512(<64 x i8>, <64 x i8>, i8) Out A x b where A and x are packed matrices, b is a vector, Out = A * x + b in GF(2). Multiplication in GF(2) is equivalent to bitwise AND. However, the matrix computation also includes a parity calculation. For the bitwise AND of bits V1 and V2, the exact shadow is: Out_Shadow = (V1_Shadow & V2_Shadow) | (V1 & V2_Shadow) | (V1_Shadow & V2) We approximate the shadow of gf2p8affine using: Out_Shadow = _mm512_gf2p8affine_epi64_epi8(x_Shadow, A_shadow, 0) | _mm512_gf2p8affine_epi64_epi8(x, A_shadow, 0) | _mm512_gf2p8affine_epi64_epi8(x_Shadow, A, 0) | _mm512_set1_epi8(b_Shadow) This approximation has false negatives: if an intermediate dot-product contains an even number of 1's, the parity is 0. It has no false positives. Updates the test from https://github.com/llvm/llvm-project/pull/149258	2025-07-30 08:06:50 -07:00
Nikita Popov	f34bfd58cb	[HWAsan] Fix incorrect lifetime sizes in tests (NFC) (#150459 ) These tests used a lifetime size larger than the alloca. When fixing that some check calls go away, so I'm putting this up for review to confirm that this is correct.	2025-07-25 09:37:19 +02:00
Nikita Popov	92c55a315e	[IR] Only allow lifetime.start/end on allocas (#149310 ) lifetime.start and lifetime.end are primarily intended for use on allocas, to enable stack coloring and other liveness optimizations. This is necessary because all (static) allocas are hoisted into the entry block, so lifetime markers are the only way to convey the actual lifetimes. However, lifetime.start and lifetime.end are currently allowed to be used on non-alloca pointers. We don't actually do this in practice, but just the mere fact that this is possible breaks the core purpose of the lifetime markers, which is stack coloring of allocas. Stack coloring can only work correctly if all lifetime markers for an alloca are analyzable. * If a lifetime marker may operate on multiple allocas via a select/phi, we don't know which lifetime actually starts/ends and handle it incorrectly (https://github.com/llvm/llvm-project/issues/104776). * Stack coloring operates on the assumption that all lifetime markers are visible, and not, for example, hidden behind a function call or escaped pointer. It's not possible to change this, as part of the purpose of lifetime markers is that they work even in the presence of escaped pointers, where simple use analysis is insufficient. I don't think there is any way to have coherent semantics for lifetime markers on allocas, while also permitting them on arbitrary pointer values. This PR restricts lifetimes to operate on allocas only. As a followup, I will also drop the size argument, which is superfluous if we always operate on an alloca. (This change also renders various code handling lifetime markers on non-alloca dead. I plan to clean up that kind of code after dropping the size argument as well.) In practice, I've only found a few places that currently produce lifetimes on non-allocas: * CoroEarly replaces the promise alloca with the result of an intrinsic, which will later be replaced back with an alloca. 
I think this is the only place where there is some legitimate loss of functionality, but I don't think this is particularly important (I don't think we'd expect the promise in a coroutine to admit useful lifetime optimization.) * SafeStack moves unsafe allocas onto a separate frame. We can safely drop lifetimes here, as SafeStack performs its own stack coloring. * Similar for AddressSanitizer, it also moves allocas into separate memory. * LSR sometimes replaces the lifetime argument with a GEP chain of the alloca (where the offsets ultimately cancel out). This is just unnecessary. (Fixed separately in https://github.com/llvm/llvm-project/pull/149492.) * InferAddrSpaces sometimes makes lifetimes operate on an addrspacecast of an alloca. I don't think this is necessary.	2025-07-21 15:04:50 +02:00
Florian Mayer	a5481e7d5a	[NFCI] [HWASan] add test for custom section global (#149625 )	2025-07-18 21:53:53 -07:00
Nikita Popov	de959569f7	[AddressSanitizer] Generate test checks (NFC)	2025-07-18 17:00:48 +02:00
Thurston Dang	e73d1a5341	[msan] Add tests for avx512-gfni-intrinsics (#149258 ) Gluten-free, nuts included or something	2025-07-17 10:40:04 -07:00
Thurston Dang	5c4877ee0d	[msan] Re-fix disjoint OR instrumentation from #145990 (#148760 ) When disjoint OR was specified and a bit position contained a 1 in both operands, #145990 would set the corresponding shadow bit to uninitialized. However, the output of the operation is (LLVM) 'poison' for the entire result, hence the entire shadow ought to be uninitialized. This patch corrects the issue.	2025-07-15 15:32:15 -07:00
Thurston Dang	66850d0c06	[msan] Fix 'Simplify 'maskedCheckAVXIndexShadow' #147839 ' (#148785 ) https://github.com/llvm/llvm-project/pull/147839/ incorrectly checked the (lower bits of the) concrete value rather than the shadow.	2025-07-15 10:36:27 -07:00
Thurston Dang	6fc3b40b2c	[msan] Model is_int_min_poison to avoid false negative in abs (#148069 ) Note: since this patch reduces false negatives, buggy code that formerly passed with MSan may start failing. The current MSan handler for abs, like Hercules' in New York, ignores is_int_min_poison. This patch fixes the issue by poisoning the shadow corresponding to each int_min input value if is_int_min_poison.	2025-07-10 16:47:53 -07:00
Thurston Dang	7c66099545	[msan] Simplify 'maskedCheckAVXIndexShadow' (#147839 ) The current instrumentation has more or and element extraction than a coal mine: ``` [[TMP10:%.*]] = extractelement <16 x i32> [[TMP9]], i64 0 [[TMP11:%.*]] = and i32 [[TMP10]], 15 [[TMP43:%.*]] = or i32 [[TMP10]], [[TMP11]] [[TMP12:%.*]] = extractelement <16 x i32> [[TMP9]], i64 1 [[TMP13:%.*]] = and i32 [[TMP12]], 15 [[TMP44:%.*]] = or i32 [[TMP12]], [[TMP13]] ... [[TMP40:%.*]] = extractelement <16 x i32> [[TMP9]], i64 15 [[TMP41:%.*]] = and i32 [[TMP40]], 15 [[TMP57:%.*]] = or i32 [[TMP40]], [[TMP41]] [[_MSCMP:%.*]] = icmp ne i32 [[TMP57]], 0 br i1 [[_MSCMP]], label [[TMP102:%.*]], label [[TMP103:%.*]], !prof [[PROF1]] ``` Simplify it to: ``` [[TMP10:%.*]] = trunc <16 x i32> [[T]] to <16 x i4> [[TMP12:%.*]] = bitcast <16 x i4> [[TMP10]] to i64 [[_MSCMP:%.*]] = icmp ne i64 [[TMP12]], 0 br i1 [[_MSCMP]], label %[[BB13:.*]], label %[[BB14:.*]], !prof [[PROF1]] ```	2025-07-09 17:56:16 -07:00
Thurston Dang	702784ca76	[msan] Check mask and rounding mode in handleAVX512VectorConvertFPToInt (#147782 ) The checks were missing in "Add handler for llvm.x86.avx512.mask.cvtps2dq.512" (https://github.com/llvm/llvm-project/pull/147377)	2025-07-09 13:06:45 -07:00
Thurston Dang	cc95e4039b	[msan] Handle AVX512 vector down convert (non-mem) intrinsics (#147606 ) This handles `llvm.x86.avx512.mask.pmov{,s,us}.*.512` using `handleIntrinsicByApplyingToShadow()` where possible, otherwise using a customized slow-path handler, `handleAVX512VectorDownConvert()`. Note that shadow propagation of `pmov{s,us}` (signed/unsigned saturation) are approximated using truncation. Future work could extend `handleAVX512VectorDownConvert()` to use `GetMinMaxUnsigned()` to handle saturation precisely.	2025-07-08 20:51:19 -07:00
Thurston Dang	c8048e78ca	[NFCI][msan] Add avx512bw-intrinsics, avx512bw-intrinsics-upgrade tests (#147566 ) Forked from llvm/test/CodeGen/X86.	2025-07-08 12:37:43 -07:00
Florian Mayer	a3afbd33d8	[MSAN] only require needed bits to be initialized for permilvar (#147407 )	2025-07-07 16:21:55 -07:00
Thurston Dang	556c8467d1	[msan] Add handler for llvm.x86.avx512.mask.cvtps2dq.512 (#147377 ) Propagate the shadow according to the writemask, instead of using the default strict handler. Updates the test added in https://github.com/llvm/llvm-project/pull/123980	2025-07-07 14:49:36 -07:00
Florian Mayer	0032148ea6	[MSAN] handle permi2var (#146437 )	2025-07-07 11:24:17 -07:00
Tomer Shafir	65e11f600d	[Clang][AArch64] Remove redundant tune args to the backend (#146896 ) This change removes unnecessary tune args to the AArch64 backend. The AArch64 backend automatically handles `tune-cpu` and adds the necessary features based on the models from TableGen. It follows this fix: https://github.com/llvm/llvm-project/pull/146260 where updating a subtarget feature didn't fail the frontend test because both the toolchain and the test suffered from a coordinated error.	2025-07-05 09:36:13 +03:00
Thurston Dang	4cf53cd266	[msan] Fix "Add optional flag to improve instrumentation of disjoint OR (#145990 )" (#146799 ) The "V1" and "V2" values were already NOT'ed, hence the calculation of disjoint OR in #145990 was incorrect. This patch fixes the issue, with some refactoring and renaming of variables.	2025-07-02 20:15:58 -07:00
Tomer Shafir	dd02fb3a51	[AArch64] Fix stale +zcm target feature to +zcm-gpr64 (#146260 ) Replaces all the uses of `+zcm` with `+zcm-gpr64`. A fix for: https://github.com/llvm/llvm-project/pull/146051	2025-06-29 15:01:05 +03:00
Thurston Dang	afe6af14ff	[msan] Add optional flag to improve instrumentation of disjoint OR (#145990 ) The disjoint OR (https://github.com/llvm/llvm-project/pull/72583) of two '1's is poison, hence the MSan ought to consider the result uninitialized (rather than initialized - i.e. a false negative - as per the existing instrumentation which ignores disjointedness). This patch adds a flag, `-msan-precise-disjoint-or`, which defaults to false (the legacy behavior). A future patch will default this flag to true. Updates the test from https://github.com/llvm/llvm-project/pull/145982	2025-06-26 22:55:55 -07:00
Thurston Dang	9e4981cf11	[NFCI][msan] Add test for "disjoint" OR (#145982 ) Disjoint OR is an extension to OR that was introduced in https://github.com/llvm/llvm-project/pull/72583. This patch adds a test case that shows MSan does not handle it correctly.	2025-06-26 15:58:05 -07:00
Thurston Dang	5a194c1fd9	[msan] Sharpen instrumentation of Intrinsic::{ctlz,cttz} (#145609 ) The current instrumentation of Intrinsic::{ctlz,cttz} has false positives. For example, consider `ctlz(0001 11??)` whereby `0` and `1` denotes initialized bits (with concrete values of 0 and 1 respectively) and `?` denotes an uninitialized bit. The result (of 3) is well-defined and the shadow ought to be fully initialized, but the current instrumentation marks it as fully uninitialized. This patch improves the fidelity of the instrumentation by comparing the number of leading (for ctlz; trailing for cttz) zeros in the concrete value and the shadow. This patch also renames the function from 'handleCountZeroes' to 'handleLeadingTrailingCountZeros', to clarify that the intrinsics handled do not count all the zeros (unlike `llvm.ctpop`, which counts all the 1s).	2025-06-25 09:29:59 -07:00
Nikita Popov	d95f46ca84	[IR] Fix incorrect writeonly on llvm.allow.ubsan/runtime.check (#145492 ) These intrinsics introduced in #84850 are currently marked as `memory(inaccessiblemem: write)`. This is not correct for the intended purpose of allowing per-block decisions, as such calls may get DCEd across control-flow boundaries (which will start actually happening with #145474). Use `memory(inaccessiblemem: readwrite)` instead, just like all the other control-flow sensitive intrinsics.	2025-06-25 09:12:21 +02:00
Thurston Dang	c85466dcd4	Reapply "[msan] Automatically print shadow for failing outlined checks" (#145611 ) (#145615 ) This reverts commit 5eb5f0d8760c6b757c1da22682b5cf722efee489 i.e., relands 1b71ea411a9d36705663b1724ececbdfec7cc98c. Test case was failing on aarch64 because the long double type is implemented differently on x86 vs aarch64. This reland restricts the test to x86. ---- Original CL description: A commonly used aid for debugging MSan reports is `__msan_print_shadow()`, which requires manual app code annotations (typically of the variable in the UUM report or nearby). This is in contrast to ASan, which automatically prints out the shadow map when a check fails. This patch changes MSan to print the shadow that failed an outlined check (checks are outlined per function after the `-msan-instrumentation-with-call-threshold` is exceeded) if verbosity >= 1. Note that we do not print out the shadow map of "neighboring" variables because this is technically infeasible; see "Caveat" below. This patch can be easier to use than `__msan_print_shadow()` because this does not require manual app code annotations. Additionally, due to optimizations, `__msan_print_shadow()` calls can sometimes spuriously affect whether a variable is initialized. As a side effect, this patch also enables outlined checks for arbitrary-sized shadows (vs. the current hardcoded handlers for {1,2,4,8}-byte shadows). Caveat: the shadow does not necessarily correspond to an individual user variable, because MSan instrumentation may combine and/or truncate multiple shadows prior to emitting a check that the mangled shadow is zero (e.g., `convertShadowToScalar()`, `handleSSEVectorConvertIntrinsic()`, `materializeInstructionChecks()`). OTOH it is arguably a strength that this feature emits the shadow that directly matters for the MSan check, but which cannot be obtained using the MSan API.	2025-06-24 20:33:11 -07:00
Thurston Dang	5eb5f0d876	Revert "[msan] Automatically print shadow for failing outlined checks" (#145611 ) Reverts llvm/llvm-project#145107 Reason: buildbot breakage (https://lab.llvm.org/buildbot/#/builders/51/builds/18512)	2025-06-24 15:53:19 -07:00
Thurston Dang	1b71ea411a	[msan] Automatically print shadow for failing outlined checks (#145107 ) A commonly used aid for debugging MSan reports is `__msan_print_shadow()`, which requires manual app code annotations (typically of the variable in the UUM report or nearby). This is in contrast to ASan, which automatically prints out the shadow map when a check fails. This patch changes MSan to print the shadow that failed an outlined check (checks are outlined per function after the `-msan-instrumentation-with-call-threshold` is exceeded) if verbosity >= 1. Note that we do not print out the shadow map of "neighboring" variables because this is technically infeasible; see "Caveat" below. This patch can be easier to use than `__msan_print_shadow()` because this does not require manual app code annotations. Additionally, due to optimizations, `__msan_print_shadow()` calls can sometimes spuriously affect whether a variable is initialized. As a side effect, this patch also enables outlined checks for arbitrary-sized shadows (vs. the current hardcoded handlers for {1,2,4,8}-byte shadows). Caveat: the shadow does not necessarily correspond to an individual user variable, because MSan instrumentation may combine and/or truncate multiple shadows prior to emitting a check that the mangled shadow is zero (e.g., `convertShadowToScalar()`, `handleSSEVectorConvertIntrinsic()`, `materializeInstructionChecks()`). OTOH it is arguably a strength that this feature emits the shadow that directly matters for the MSan check, but which cannot be obtained using the MSan API.	2025-06-24 15:09:44 -07:00
Florian Mayer	61a969b867	Revert "[MSAN] handle assorted AVX permutations" (#145404 ) Rolling back while investigating an issue that might be caused by this.	2025-06-23 14:15:39 -07:00
Thurston Dang	33a92af1b2	[msan] Add off-by-default flag to fix false negatives from partially undefined constant fixed-length vectors (#143837 ) This patch adds an off-by-default flag which, when enabled via `-mllvm -msan-poison-undef-vectors=true`, fixes a false negative in MSan (partially-undefined constant fixed-length vectors). It is currently off by default since, by fixing the false negative, code/tests that previously passed MSan may start failing. The default will be changed in a future patch. Prior to this patch, MSan computes that partially-undefined constant fixed-length vectors are fully initialized, which leads to false negatives; moreover, benign vector rewriting could theoretically flip MSan's shadow computation from initialized to uninitialized or vice-versa (*). `-msan-poison-undef-vectors=true` calculates the shadow precisely: for each element of the vector, the corresponding shadow is fully uninitialized if the element is undefined/poisoned, otherwise it is fully initialized. Updates the test from https://github.com/llvm/llvm-project/pull/143823 (*) For example: ``` %x = insertelement <2 x i64> <i64 0, i64 poison>, i64 42, i64 0 %y = insertelement <2 x i64> <i64 poison, i64 poison>, i64 42, i64 0 ``` %x and %y are equivalent but, prior to this patch, MSan incorrectly computes the shadow of %x as <0, 0> rather than <0, -1>.	2025-06-20 10:11:12 -07:00
Paul Walker	e478a22d54	[LLVM][IRBuilder] Use NUW arithmetic for Create{ElementCount,TypeSize}. (#143532 ) This put the onus on the caller to ensure the result type is big enough. In the unlikely event a cropped result is required then explicitly truncate a safe value.	2025-06-19 13:24:39 +01:00
Kunqiu Chen	355725a25e	[TSan] Fix missing inst cleanup (#144067 ) Commit 44e875ad5b2ce26826dd53f9e7d1a71436c86212 introduced a change that replaces `ReplaceInstWithInst` with `Instruction::replaceAllUsesWith`, without subsequent instruction cleanup. This results in TSan leaving behind useless `load atomic` instructions after 'replacing' them. This commit adds cleanup back, consistent with the context.	2025-06-18 17:09:32 +08:00
Florian Mayer	d7e64d9594	[MSAN] handle assorted AVX permutations (#143462 )	2025-06-13 15:48:46 -07:00
Nikita Popov	4f8187c0dc	[TSan] Regenerate test checks (NFC)	2025-06-13 14:54:24 +02:00
Florian Mayer	6f3e2c076d	[MSAN] fork avx512vl-intrinsics and x86-vpermi2 tests (#143643 )	2025-06-12 14:08:50 -07:00
Thurston Dang	2efff47363	[NFCI][msan] Show that shadow for partially undefined constant vectors is computed as fully initialized (#143823 ) This happens because `getShadow(Value *V)` has a special case for fully undefined/poisoned values, but partially undefined values fall-through and are given a clean shadow. This leads to false negatives (no false positives). Note: MSan correctly handles InsertElementInst, but the shadow of the initial constant vector may still be wrong and be propagated. Showing that the same approximation happens for other composite types is left as an exercise for the reader.	2025-06-11 22:43:06 -07:00
Florian Mayer	23fd60d996	[MSAN] support vpermilvar AVX instructions (#143053 )	2025-06-09 17:57:19 -07:00
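
The packed multiply-add entries above (#152941, #153353) describe the shadow rule in words: an initialized zero element clears the shadow of its product, an uninitialized zero does not, and the horizontal add is modeled with bitwise OR. The following is only a rough bit-level model of that description for a pmaddwd-style operation (pairs of 16-bit elements summed into 32-bit lanes). The function name `pmaddwd_shadow` and the list-based representation are assumptions made for illustration; this is not the LLVM handler.

```python
MASK16 = 0xFFFF
MASK32 = 0xFFFFFFFF


def pmaddwd_shadow(a, b, sa, sb):
    """Hypothetical model: return one 32-bit shadow per pair of 16-bit elements.

    a, b   -- concrete 16-bit element values
    sa, sb -- their shadows (a nonzero shadow means some bit is uninitialized)
    """
    assert len(a) == len(b) == len(sa) == len(sb) and len(a) % 2 == 0

    poisoned = []
    for x, y, sx, sy in zip(a, b, sa, sb):
        any_uninit = (sx & MASK16) != 0 or (sy & MASK16) != 0
        # Only an *initialized* zero clears the product's shadow;
        # an uninitialized value that happens to be zero does not.
        x_init_zero = (x & MASK16) == 0 and (sx & MASK16) == 0
        y_init_zero = (y & MASK16) == 0 and (sy & MASK16) == 0
        poisoned.append(any_uninit and not (x_init_zero or y_init_zero))

    # Horizontal add of adjacent products, modeled as OR of their shadows.
    return [
        MASK32 if (poisoned[i] or poisoned[i + 1]) else 0
        for i in range(0, len(poisoned), 2)
    ]
```

In this model the first lane of `pmaddwd_shadow([0, 5, 3, 4], [7, 2, 1, 1], [0, 0, 0, 0xFFFF], [0xFFFF, 0, 0, 0])` stays clean because the uninitialized operand is multiplied by an initialized zero, while the second lane is poisoned.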

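The ctlz/cttz entry above (#145609) says the sharpened instrumentation compares the number of leading zeros in the concrete value with the number in the shadow. A minimal sketch of that comparison for ctlz on an 8-bit value follows. The helper names are hypothetical, the early return for an all-clean shadow is an assumption added so that fully initialized inputs stay clean, and the intrinsic's zero-is-poison argument is not modeled.

```python
def ctlz(value, width):
    """Count leading zeros of `value` within `width` bits."""
    value &= (1 << width) - 1
    return width - value.bit_length()


def ctlz_shadow(value, shadow, width=8):
    """Hypothetical model: return 0 if ctlz(value) is well-defined, else all-ones."""
    mask = (1 << width) - 1
    shadow &= mask
    if shadow == 0:
        # Assumption: a fully initialized input always gives a clean result.
        return 0
    # If the first 1 bit of the value sits above every uninitialized bit,
    # the uninitialized bits cannot change the count.
    if ctlz(value, width) >= ctlz(shadow, width):
        return mask
    return 0
```

For the example in that entry, `ctlz(0001 11??)`, the shadow is `0b00000011` and the concrete value has its first 1 at bit 4, so `ctlz_shadow(0b00011100, 0b00000011)` returns 0 (fully initialized) regardless of what the two unknown bits hold.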

1715 Commits