llvm-project

Author	SHA1	Message	Date
Alexis Engelke	2563006f31	[Clang][NFC] Drop uses of BranchInst (#187242 )	2026-03-18 12:40:13 +01:00
David Pagan	88f1ec9a70	[clang][OpenMP] Parse/Sema for OpenMP 6.0 declare_target 'local' clause (#186281 ) Parse and perform semantic checks for declare_target 'local' clause. When compiling for device offloading, generate a warning that 'local' is not yet fully supported. On the host, 'local' is/will be a no-op, so no warning is generated. NOTE: The minimal CodeGen changes allow 'local' to flow through as equivalent to the 'enter' clause after warning is generated. Testing: - Updated messages and ast tests for declare target/declare_target - ninja check-all.	2026-03-16 19:36:25 -07:00
Farzon Lotfi	12fcde1a71	[Matrix] Copy Row data from padded cbuffer offsets before swizzle (#185346 ) fixes https://github.com/llvm/llvm-project/issues/184849 The fix is just to copy the data before a swizzle can happen	2026-03-10 18:20:38 -04:00
Farzon Lotfi	e564ce97b1	[Matrix] Fix cbuffers support for matrix element expr (#185471 ) fixes #184877 This change was threefold. 1. copy the padded cbuffer from memory to a local alloca 2. switch to using the new `getFlattenedIndex` helpers for index generation 3. convert row major to column major indicies in codegen depending on LangOptions	2026-03-10 13:57:22 -04:00
Farzon Lotfi	04f0935cf4	[Matrix] For MatrixRow with constant index use Indices.size() instead of getNumColumns() (#185530 ) fixes #184380 Three changes - Use Indices.size() - Construct a new QualType if Indices.size() != MT->getNumColumns() - Add a new test case	2026-03-10 12:27:30 -04:00
Jason Van Beusekom	2d4c8e0d0f	[OpenMP][clang] Indirect and Virtual function call mapping from host to device (#184412 ) This patch implements the CodeGen logic for calling __llvm_omp_indirect_call_lookup on the device when an indirect function call or a virtual function call is made within an OpenMP target region. --------- Co-authored-by: Youngsuk Kim	2026-03-03 13:20:24 -06:00
Jason Van Beusekom	f95662d159	Revert "[OpenMP][clang] Indirect and Virtual function call mapping from host to device" (#184378 ) Reverts llvm/llvm-project#159857	2026-03-03 17:11:14 +00:00
Jason Van Beusekom	b23438661c	[OpenMP][clang] Indirect and Virtual function call mapping from host to device (#159857 ) This patch implements the CodeGen logic for calling __llvm_omp_indirect_call_lookup on the device when an indirect function call or a virtual function call is made within an OpenMP target region. --------- Co-authored-by: Youngsuk Kim	2026-03-03 02:52:34 +00:00
Nikita Popov	96113ac416	[Clang] Use llvm.ptrmask to mask out thumb bit (#183535 ) Use llvm.ptrmask instead of a ptrtoint + and + inttoptr sequence to mask out the thumb bit.	2026-03-02 10:50:58 +01:00
Deric C.	4f59da55d7	[HLSL][Matrix] EmitFromMemory when emitting load of vector and matrix element LValues (#178315 ) Fixes #177712 The MatrixElt and VectorElt cases of `EmitLoadOfLValue` did not convert the scalar value from its load/store type into its primary IR type like the other cases do, which caused issues with HLSL in particular which requires bools to be converted to and from i32 and i1 forms for its load/store and primary IR types respectively. This PR fixes the issue by applying `EmitFromMemory` to the loaded scalar.	2026-02-23 15:45:35 -08:00
Kees Cook	09a3d830a8	[Clang][CodeGen] Fix __builtin_counted_by_ref for nested struct FAMs (#182575 ) (#182590 ) GetCountedByFieldExprGEP() used getOuterLexicalRecordContext() to find the RecordDecl containing the counted_by count field. This walks up through all lexically enclosing records to find the outermost one, which is wrong when a struct with a counted_by FAM is defined nested inside another named struct. For example, when struct inner (containing the FAM) is defined inside struct outer, getOuterLexicalRecordContext() resolves to struct outer instead of struct inner. The StructAccessBase visitor then fails to match the base expression type (struct inner ) against the expected record (struct outer), returning nullptr. This nullptr propagates back as the GEP result, and the subsequent dereference in __builtin_counted_by_ref() triggers an assertion failure in Address::getBasePointer(). Replace getOuterLexicalRecordContext() with a walk that only traverses anonymous structs and unions, which are transparent in C and must be walked past. Named nested structs are independently-addressable types, so the walk stops at them. Add a regression test for a FAM struct defined nested inside another struct. This also fixes __builtin_dynamic_object_size() for FAMs in nested structs, which was silently returning -1 (unknown) instead of computing the correct size. Update the attr-counted-by-pr88931.c test to reflect the now-correct dynamic object size calculation. Fixes #182575 Signed-off-by: Kees Cook <kees@kernel.org>	2026-02-20 13:34:17 -08:00
Peter Collingbourne	370d7ce580	[Clang] Add pointer field protection feature. Pointer field protection is a use-after-free vulnerability mitigation that works by changing how data structures' pointer fields are stored in memory. For more information, see the RFC: https://discourse.llvm.org/t/rfc-structure-protection-a-family-of-uaf-mitigation-techniques/85555 Reviewers: fmayer, ojhunt Pull Request: https://github.com/llvm/llvm-project/pull/172119	2026-02-19 15:19:35 -08:00
Nathan Gauër	ae402a32c0	[Clang][HLSL] Start emitting structured GEP instruction (#177332 ) StructuredGEP is a new LLVM intrinsic which will allow to emit proper logical SPIR-V or DXIL. To properly stage this change going across FE, BE and optimizations, this commits adds a new flag: - `-fexperimental-emit-sgep` When used, this flag will allow compatible frontends to emit the new instructions. This will also allow us to migrate tests bit by bit, adding the flag to each migrated test as we make progress on the implementation. Once the frontend migration complete, the flag will remain, and work on the backend will start. Compatible backends like SPIR-V will first allow both instructions, but then, depending on a target bit similar to `requiresStructuredCFG`, will declare that they require the SGEP instruction and will start enforcing it. Once the whole chain completed, the flag will be defaulted to true and removed, finishing the migration.	2026-02-19 10:08:20 +01:00
Farzon Lotfi	ca21a65dd8	[HLSL][Cbuffer][Matrix] Add Cbuffer padding and createBufferMatrixTempAddress (#181903 ) fixes #181901 This change detects when a HLSL Cbuffer matrix’s layout differs from its in-memory type and materializes a temporary with the non-padded matrix type. Matrix elements are copied explicitly from the padded buffer layout into the temporary, ensuring correct addressing and avoiding overlapping GEPs or incorrect vector flattening.	2026-02-18 13:42:10 -05:00
Deric C.	22ecdda485	[HLSL][Matrix] Allow flattening matrix types in FlattenAccessAndTypeLValue (#177708 ) Fixes #177487 This PR adds support for flattening HLSL matrix types in code generation via `CodeGenFunction::FlattenAccessAndTypeLValue`. It removes the assertion `assert(!isa<MatrixType>(T) && "Matrix types not yet supported in HLSL");` and adds a branch for handling ConstantMatrixTypes which is similar to the VectorType case.	2026-02-17 13:43:13 -08:00
Peter Collingbourne	55857e14bc	CodeGen: Switch to generating llvm.looptrap instead of llvm.cond.loop. Reviewers: fmayer, vitalybuka Reviewed By: fmayer Pull Request: https://github.com/llvm/llvm-project/pull/181300	2026-02-13 14:15:17 -08:00
Matheus Izvekov	a3b51529d1	[clang] CodeGen: fix ConstantExpr LValue emission (#181057 ) This fixes a regression introduced in #161029, though not the fault of that patch, only by incidental changes regarding the preservation of constant expression nodes. The LValue emission of ConstantExpr was doing strange things with regards to what type corresponds to the result of the constant expression, which are not justified by any tests or in the discussions of the relevant patches. See `09669e6c5f` and https://github.com/llvm/llvm-project/pull/78041 and `51e4aa87e0` This simplifies it to just use the expression type. Since this regression was never released, there are no release notes. Fixes #177807	2026-02-12 22:07:48 -03:00
Peter Collingbourne	a544d3792c	CodeGen, Driver: Add -fsanitize-trap-loop option. This option may be used to opt into infinite loops for failed UBSan and CFI checks. It causes Clang to generate an llvm.cond.loop intrinsic call instead of a conditional branch to a trap instruction when generating code for a conditional trap. Part of this RFC: https://discourse.llvm.org/t/rfc-optimizing-conditional-traps/89456 Reviewers: fmayer, vitalybuka Reviewed By: vitalybuka, fmayer Pull Request: https://github.com/llvm/llvm-project/pull/177688	2026-02-06 17:12:53 -08:00
Farzon Lotfi	1bb916c9af	[HLSL] Represent Matrix as arrays of vectors in memory (#179861 ) fixes https://github.com/llvm/llvm-project/issues/179859 For matrix types we need to check the language mode so we can change the matrix memory layout to arrays of vectors. To make this play nice with how the rest of clang treats matrices we need to modify the MaybeConvertMatrixAddress and the CreateMemTemp function to know how to reconstruct a flattened vector. Rest of changes is just test updates.	2026-02-06 16:25:20 -05:00
Farzon Lotfi	5f4543fbaf	[HLSL][Matrix] introduce MatrixElementExpr as accessor and swizzle operator (#171225 ) fixes #159438 This patch adds `MatrixElementExpr`, a new AST node for HLSL matrix element and swizzle access (e.g. M._m00, M._11_22_33). It introduces a shared `ElementAccessExprBase` used by both matrix and vector swizzle expressions, updates Sema to parse and validate zero-based and one-based accessors, detects duplicates for l-value checks, and emits improved diagnostics. CodeGen is updated to lower scalar and multi-element accesses consistently, and full AST serialization, dumping, and tooling support is included. This implementation reflects the updated [RFC](https://github.com/llvm/wg-hlsl/pull/357/files) for HLSL matrix accessor semantics.	2026-02-06 13:16:49 -05:00
Jameson Nash	d040788af6	[clang] remove unused SrcAddr parameter from performAddrSpaceCast (#179330 ) The conversion code always ended up just getting the type of Src from the Src argument itself, with no virtual users of this, so there is no point in also providing this API hook. Fix the documentation as well, since it seems DestAddr must have been similarly removed at some point in the past from the API but was still documented. Also fixes CIR to actually return the casted value!	2026-02-05 14:03:19 -05:00
Wenju He	8ab29461c3	[OpenCL] Set half-precision Div and Sqrt accuracy (#179621 ) OpenCL spec relaxed half-precision divide to 1 ULP and sqrt to 1.5 ULP in https://github.com/KhronosGroup/OpenCL-Docs/pull/1293 https://github.com/KhronosGroup/OpenCL-Docs/pull/1386 This can enable target to use hardware rcp instruction for half.	2026-02-05 09:32:56 +08:00
Jameson Nash	0dd21ad1c6	[clang] remove addrspace cast from CreateIRTemp (#179327 ) This just added unnecessary work to the IR, since they are only used for load and store, which just causes some IR noise. Tests updated by UTC script to remove the extra lines.	2026-02-04 13:09:32 -05:00
NAKAMURA Takumi	15a81fd494	[Coverage][Single] Enable Branch coverage for CondOp (#113110 ) Depends on: #112730 #113114 https://discourse.llvm.org/t/rfc-integrating-singlebytecoverage-with-branch-coverage/82492	2026-01-28 14:02:02 +09:00
Dan Blackwell	c63a744f3f	[CodeGen][InstCombine][Sanitizers] Emit lifetimes when compiling with memtag-stack (#177130 ) Currently we do not emit lifetimes by default when compiling with memtag-stack - which means we don't catch use-after-scope (when compiling without optimization). This patch fixes that by mirroring ASan, HWASan and MSan, and always emitting lifetime markers. The patch is based on the changes made in aeca569. rdar://163713381	2026-01-22 14:22:44 +00:00
Deric C.	255af949a7	[HLSL][Matrix] Update indexed matrix elements individually (#176216 ) Fixes #174629 This PR is similar to that of #169144 but for matrices. When storing to a matrix element or matrix row, `insertelement` instructions have been replaced by GEPs followed by stores to individual matrix elements. There is no longer storing of the entire matrix to memory all at once, thus avoiding data races when writing to independent matrix elements from multiple threads.	2026-01-16 13:05:52 -08:00
Deric C.	6212c87ab9	[HLSL][Matrix] Add Matrix splat support for booleans (#175809 ) Fixes #175808 This PR adds support for boolean matrix splats by adding tests and fixing a bug in `CodeGenFunction::EmitToMemory` when the type of a boolean matrix already matches the type expected of a load/store. This PR also addresses the todo comment in `clang/lib/Sema/SemaExpr.cpp` regarding support for boolean matrix splats by removing the comment altogether since it is not necessary. --------- Co-authored-by: Farzon Lotfi <farzonlotfi@microsoft.com>	2026-01-14 10:17:13 -08:00
Deric C.	58a9dc01be	[HLSL][Matrix] Add type conversions to support bool matrix single subscript operators (#175633 ) Fixes #172711 Fixes the type mismatch issues preventing single matrix subscript getters and setters from working with boolean matrices. The changes from this PR also happens to make matrix splats work for boolean matrices, but adding tests for that and (re)introducing boolean-matrix-specific sema will be relegated to its own PR.	2026-01-14 08:44:55 -08:00
Deric C.	50c1a69db0	[HLSL][Matrix] Load and store ConstantMatrixBoolTypes as i32 FixedVectorTypes (#175245 ) Fixes #175236 This pull request improves support for HLSL constant matrix types with boolean elements in Clang's code generation. The main changes ensure that boolean (i1) matrices are correctly represented and stored as i32 vectors in LLVM IR. This includes updates to both the code generation logic and related tests. ### Code generation improvements for HLSL boolean matrices * Updated `convertTypeForLoadStore` in `CodeGenTypes.cpp` to represent constant matrix types with boolean elements as `FixedVectorType` of integers, ensuring atomic load/store operations and correct element type conversion for HLSL. * Modified `EmitToMemory` in `CGExpr.cpp` to handle both `ExtVectorBoolType` and `ConstantMatrixBoolType`, improving the handling of boolean matrices during memory emission. ### Test updates for boolean matrix codegen * Adjusted test expectations in `BoolMatrix.hlsl` to reflect the new representation, showing stores and loads of `<N x i32>` instead of `<N x i1>` for boolean matrices, and added zero-extension where necessary. * Added a new test for a 4x4 boolean matrix function to verify correct code generation for initial stores to boolean matrix parameter declaration allocas.	2026-01-12 08:57:51 -08:00
Farzon Lotfi	88788dcd5b	[HLSL][Matrix] Implement dynamic single subscript swizzle (#173201 ) fixes #170777 If we don't use vector type and instead continue to pass on the matrix type when we enter `EmitExtVectorElementExpr` Then we don't need to store the row and column length on the LValue. Using the Matrix type means we can reuse the isMatrixRow() cases in EmitLoadOfLValue and EmitStoreThroughLValue and not have to support a new lValue that is a hybrid between the ExtVectorElt and MatrixRow cases. All we need to do to support this is pass the list of column indices as a `ConstantDataVector` and check the size of this Vector to know how many column iterations we need to do. Further just index into the vector to fetch the right encoded element index value.	2026-01-07 11:57:04 -05:00
Victor Chernyakin	c438773432	[LLVM][ADT] Migrate users of `make_scope_exit` to CTAD (#174030 ) This is a followup to #173131, which introduced the CTAD functionality.	2026-01-02 20:42:56 -08:00
Farzon Lotfi	d48050ab27	[Matrix][HLSL] Implement Matrix single constant index swizzle (#173044 ) fixes #172805 For the constant case if we know the row index then we can compute the offsets via `E->getEncodedElementAccess(Elts)`. We had to also add column and row sizes to LValue so that we could compute the right index. the emitter for `MatrixSingleSubscriptExpr` collects the sizes off the type and passes it to `MakeMatrixRow`. --------- Co-authored-by: Justin Bogner <mail@justinbogner.com>	2025-12-21 14:09:20 -05:00
Farzon Lotfi	21571cf306	[HLSL][Matrix] Support row and column indexing modes for MatrixSubscriptExpr (#171564 ) fixes #167617 In DXC HLSL supports different indexing modes via codegen for its equivalent of the MatrixSubscriptExpr when the /Zpr and /Zpc flags are used see: https://godbolt.org/z/bz5Y5WG36. This change modifies EmitMatrixSubscriptExpr to consider the MatrixRowMajor/MatrixColMajor Layout flags before generating an index. Similarly it introduces `createRowMajorIndex` and `createColumnMajorIndex` in `MatrixBuilder.h` for use in `VisitMatrixSubscriptExpr`.	2025-12-20 17:21:04 -05:00
PiJoules	2f8f75fd39	[clang] Apply cfi_unchecked_callee rules to -fsanitize=function (#170725 ) Allow the normal rules for preventing instrumentation on indirect calls to `cfi_unchecked_callee` function types and `cfi_unchecked_callee` functions when using `-fsanitize=function`. While it's technically separate from `-fsanitize=cfi`, this particular UBSan mode checks for similar control flow bugs so it makes sense to also prevent those control flow checks from being added onto `cfi_unchecked_callee` functions.	2025-12-19 13:05:14 -08:00
Farzon Lotfi	c7b86a625e	[Matrix][HLSL] Revert Constant Matrix single subscript handling (#172887 ) This change creates an XFAIL that will be addressed by #172805 This part of the change was not intended to make it in. It was a way to optimize for constant index setters\getters but had the unintended consequence of doing swizzles without considering the encoded element offsets. We need to be calling `MakeExtVectorElt` in `EmitExtVectorElementExpr` so that we can call `E->getEncodedElementAccess(Elts);` off of the `ExtVectorElementExpr` Node. Calling it in `EmitMatrixSingleSubscriptExpr` was wrong because we doing have the vector encoded elementts yet. But also to get the right index we need a way to pass on the matrix column and row lengths so `MakeMatrixRow` needed to be expanded to store row and column sizes on the LValue.	2025-12-18 14:48:13 -05:00
Farzon Lotfi	60b6c53f25	[HLSL][Matrix] Add support for single subscript accessor (#170779 ) fixes #166206 - Add swizzle support if row index is constant - Add test cases - Add new AST type - Add new LValue for Matrix Row Type - TODO: Make the new LValue a dynamic index version of ExtVectorElt	2025-12-17 17:04:28 -05:00
Nikita Popov	ea9d44defc	[CGExpr] Use getSigned() for negative constant	2025-12-16 12:30:21 +01:00
Farzon Lotfi	491e001627	[HLSL][Matrix] Add Matrix Bool and represent them as i32 elements (#171051 ) fixes #171049 fixes #171050 - Allow Bools for matrix type when in HLSL mode - use ConvertTypeForMem to figure out the bool size - Add Bool matrix types to hlsl_basic_types.h --------- Co-authored-by: Helena Kotas <hekotas@microsoft.com>	2025-12-15 10:21:32 -05:00
Bill Wendling	069d3aa923	[Clang][counted_by] Correct signed counted_by values (#171260 ) If the 'counted_by' value is signed, we will incorrectly allow accesses when the value is negative. This has obvious bad effects as it will allow accessing a huge swath of unallocated memory. Also clarify and rearrange the parameters to make them more perspicuous. Fixes: #170987.	2025-12-10 15:18:09 -08:00
PiJoules	daab9dd161	[NFC][clang] Remove extra `To` in method name (#170723 ) `hasPointeeToToCFIUncheckedCalleeFunctionType` -> `hasPointeeToCFIUncheckedCalleeFunctionType`	2025-12-04 11:46:36 -08:00
Farzon Lotfi	dd1b4abfb7	[HLSL][Matrix] Add support for Matrix element and trunc Casts (#168915 ) fixes #168737 fixes #168755 This change fixes adds support for Matrix truncations via the ICK_HLSL_Matrix_Truncation enum. That ends up being most of the files changed. It also allows Matrix as an HLSL Elementwise cast as long as the cast does not perform a shape transformation ie 3x2 to 2x3. Tests for the new elementwise and truncation behavior were added. As well as sema tests to make sure we error n the shape transformation cast. I am punting right now on the ConstExpr Matrix support. That will need to be addressed later. Will file a seperate issue for that if reviewers agree it can wait.	2025-12-02 19:02:25 -05:00
Helena Kotas	434127b0c1	[HLSL] Static resources (#166880 ) This change fixes couple of issues with static resources: - Enables assignment to static resource or resource array variables (fixes #166458) - Initializes static resources and resource arrays with default constructor that sets the handle to poison	2025-12-02 22:25:17 +00:00
Helena Kotas	4a48740831	[HLSL] Update indexed vector elements individually (#169144 ) When an individual element of a vector is updated via indexing into the vector, it needs to be handled as a store operation on that one vector element. Clang treats vectors as one unit, so a vector element needs to be updated, the whole vector is loaded, the element is modified, and then the whole vector is stored. In HLSL vector elements are handled individually. We need to avoid this load/modify/store sequence to prevent overwriting other vector elements that might be getting updated in parallel. Fixes #167729 Contributes to #160208.	2025-12-01 10:34:29 -08:00
Florian Mayer	e2a29eca56	[UBSan] Use -fsanitize-handler-preserve-all-regs in codegen Pull Request: https://github.com/llvm/llvm-project/pull/168645	2025-11-26 17:29:48 -08:00
Helena Kotas	c475f8e5d8	[HLSL] Update vector swizzle elements individually (#169090 ) When individual elements of a vector are updated via vector swizzle, it needs to be handled as separate store operations to the individual vector elements. Clang treats vectors as one unit, so if a part of a vector needs to be updated, the whole vector is loaded, some elements modified, and then the whole vector is stored. In HLSL vector elements are handled separately. We need to avoid this load/modify/store sequence to prevent overwriting other vector elements that might be getting updated in parallel. Fixes #152815	2025-11-25 12:50:25 -08:00
Vitaly Buka	af098e0096	Revert "[ubsan_minimal] Allow UBSan handler from Minimal runtime to accept arguments (#152192 )" (#168812 ) This partially reverts #152192, keeping updated tests and some code reordering in clang/lib/CodeGen/CGExpr.cpp. compiler-rt/lib/ubsan_minimal/ubsan_minimal_handlers.cpp is exact revert (with followup #152419) We don't have a good use case for that, so revert it before we are stuck maintaining this API. 21.x does not have this patch. This reverts commit a1209d868632b8aea10450cd2323848ab0b6776a.	2025-11-21 01:09:28 -08:00
Aleksandr Nogikh	131cf7d5b2	[AllocToken] Enable alloc token instrumentation for size-returning functions (#168840 ) Consider a newly added "malloc_span" attribute in the allocation token instrumentation to ensure that allocation functions with the "malloc_span" attribute are processed similarly to other memory allocation functions. Update the tests to demonstrate applicability to __size_returning_new.	2025-11-20 10:33:24 +01:00
Justin Bogner	c4898f3f22	[HLSL][DirectX] Use a padding type for HLSL buffers. (#167404 ) This change drops the use of the "Layout" type and instead uses explicit padding throughout the compiler to represent types in HLSL buffers. There are a few parts to this, though it's difficult to split them up as they're very interdependent: 1. Refactor HLSLBufferLayoutBuilder to allow us to calculate the padding of arbitrary types. 2. Teach Clang CodeGen to use HLSL specific paths for cbuffers when generating aggregate copies, array accesses, and structure accesses. 3. Simplify DXILCBufferAccesses such that it directly replaces accesses with dx.resource.getpointer rather than recalculating the layout. 4. Basic infrastructure for SPIR-V handling, but the implementation itself will need work in follow ups. Fixes several issues, including #138996, #144573, and #156084. Resolves #147352.	2025-11-18 13:38:43 -08:00
J. Ryan Stinnett	342e28f8a9	[clang][DebugInfo] Attach `DISubprogram` to additional call variants (#166202 ) `DISubprogram`s are attached to call sites to support various debug info features, including entry values and tail calls. Clang 9.0 (0f6516856670a435461f56a9faeb4aa8a35a6679) was the first version to include this kind of call site `DISubprogram` attachment. This earlier work appears to visit only some call site variants, however. The call site attachment was added to a higher-level `EmitCall` path in Clang's code gen that is only used by some call variants. In particular, some C++ member calls use a different code gen path, which did not include this call site attachment step, and thus the debug info it triggers (e.g. call site entries) was not emitted for such calls. This moves `DISubprogram` attachment to a lower-level call emission path that is used by all call variants. Fixes https://github.com/llvm/llvm-project/issues/161962	2025-11-10 12:45:05 +00:00
Florian Hahn	53785846aa	[Clang] Freeze padded vectors before storing. (#164821 ) Currently Clang usually leaves padding bits uninitialized, which means they are undef at the moment. When expanding stores of vector types to include padding, the padding lanes will be poison, hence the padding bits will be poison. This interacts badly with coercion of arguments and return values, where 3 x float vectors will be loaded as i128 integer; poisoning the padding bits will make the whole value poison. Not sure if there's a better way, but I think we have a number of places that currently rely on the padding being undef, not poison. PR: https://github.com/llvm/llvm-project/pull/164821	2025-10-28 19:03:17 -07:00

1 2 3 4 5 ...

1549 Commits