llvm-project

Author	SHA1	Message	Date
Simon Pilgrim	ac017c8e93	Fix MSVC "not all control paths return a value" warning. NFC. (#182482 )	2026-02-20 12:11:27 +00:00
Justin Stitt	7f631bb523	[Clang] Introduce OverflowBehaviorType for fine-grained overflow control (#148914 ) Introduce `OverflowBehaviorType` (OBT), a new type attribute in Clang that provides developers with fine-grained control over the overflow behavior of integer types. This feature allows for a more nuanced approach to integer safety, achieving better granularity than global compiler flags like `-fwrapv` and `-ftrapv`. Type specifiers are also available as keywords `__ob_wrap` and `__ob_trap`. These can be applied to integer types (both signed and unsigned) as well as typedef declarations, where the behavior is one of the following: * `wrap`: Guarantees that arithmetic operations on the type will wrap on overflow, similar to `-fwrapv`. This suppresses UBSan's integer overflow checks for the attributed type and prevents eager compiler optimizations. * `trap`: Enforces overflow checking for the type, even when global flags like `-fwrapv` would otherwise suppress it. A key aspect of this feature is its interaction with existing mechanisms. `OverflowBehaviorType` takes precedence over global flags and, notably, over entries in the Sanitizer Special Case List (SSCL). This allows developers to "allowlist" critical types for overflow instrumentation, even if they are disabled by a broad rule in an SSCL. Signed-off-by: Justin Stitt <justinstitt@google.com>	2026-02-19 13:54:33 -08:00
Farzon Lotfi	5f4543fbaf	[HLSL][Matrix] introduce MatrixElementExpr as accessor and swizzle operator (#171225 ) fixes #159438 This patch adds `MatrixElementExpr`, a new AST node for HLSL matrix element and swizzle access (e.g. M._m00, M._11_22_33). It introduces a shared `ElementAccessExprBase` used by both matrix and vector swizzle expressions, updates Sema to parse and validate zero-based and one-based accessors, detects duplicates for l-value checks, and emits improved diagnostics. CodeGen is updated to lower scalar and multi-element accesses consistently, and full AST serialization, dumping, and tooling support is included. This implementation reflects the updated [RFC](https://github.com/llvm/wg-hlsl/pull/357/files) for HLSL matrix accessor semantics.	2026-02-06 13:16:49 -05:00
Kiran	93d0c61d4d	[Clang] Fix atomic boolean compound assignment (#33210 ) (#180200 ) This is my second attempt to merge the #33210 fix. My original patch #178220 caused test failures on risc-v, as I committed a new test file that was too target-specific. I reverted the PR (#180183) to avoid causing disruption, as I did not know how long the fix would take. I have now fixed the test so that it behaves correctly on risc-v. I have also changed the run lines so it is tested on aarch64, amd64 and risc-v.	2026-02-06 15:46:19 +00:00
Kiran	56c875a842	Revert "[Clang] Fix atomic boolean compound assignment (#33210 )" (#180183 ) Reverts llvm/llvm-project#178220	2026-02-06 12:48:23 +00:00
Kiran	4af27b06f5	[Clang] Fix atomic boolean compound assignment (#178220 ) Fixes #33210	2026-02-06 09:41:57 +00:00
Jameson Nash	d040788af6	[clang] remove unused SrcAddr parameter from performAddrSpaceCast (#179330 ) The conversion code always ended up just getting the type of Src from the Src argument itself, with no virtual users of this, so there is no point in also providing this API hook. Fix the documentation as well, since it seems DestAddr must have been similarly removed at some point in the past from the API but was still documented. Also fixes CIR to actually return the casted value!	2026-02-05 14:03:19 -05:00
Jameson Nash	0dd21ad1c6	[clang] remove addrspace cast from CreateIRTemp (#179327 ) This just added unnecessary work to the IR, since they are only used for load and store, which just causes some IR noise. Tests updated by UTC script to remove the extra lines.	2026-02-04 13:09:32 -05:00
NAKAMURA Takumi	b6c412814a	[Coverage][Single] Enable Branch coverage for `BinLAnd` and `BinLOr` (#113113 ) Depends on: #113109 #113110 #113111 https://discourse.llvm.org/t/rfc-integrating-singlebytecoverage-with-branch-coverage/82492	2026-01-29 14:53:18 +09:00
NAKAMURA Takumi	15a81fd494	[Coverage][Single] Enable Branch coverage for CondOp (#113110 ) Depends on: #112730 #113114 https://discourse.llvm.org/t/rfc-integrating-singlebytecoverage-with-branch-coverage/82492	2026-01-28 14:02:02 +09:00
Matt Arsenault	856e93713c	clang: Stop using llvm.convert.to.fp16/llvm.convert.from.fp16 (#174494 ) There is no reason to use these over fpext/fptrunc and bitcast. Split out from #174484. The test coverage is also shockingly bad, so adds a new wasm test which shows different contexts the intrinsics are used. I've also reverted this to a more conservative version that leaves the useFP16ConversionIntrinsics configuration in place, and only replaces the exact intrinsic usage. This should be removed, but it seems to have turned into a buggy ABI option. Some contexts which probably meant to check NativeHalfType or NativeHalfArgsAndReturns were relying on this instead. Additionally, some of the SVE intrinsics appear to be using __fp16 but really expect _Float16 treatment.	2026-01-21 10:04:24 +01:00
Deric C.	58a9dc01be	[HLSL][Matrix] Add type conversions to support bool matrix single subscript operators (#175633 ) Fixes #172711 Fixes the type mismatch issues preventing single matrix subscript getters and setters from working with boolean matrices. The changes from this PR also happens to make matrix splats work for boolean matrices, but adding tests for that and (re)introducing boolean-matrix-specific sema will be relegated to its own PR.	2026-01-14 08:44:55 -08:00
NAKAMURA Takumi	50d112c396	[MC/DC] Prune MCDCLogOpStack and use CGF.isMCDCDecisionExpr. NFC (#125410 ) `MCDCLogOpStack` is used only for detection of the Decision root. It can be detected with `MCDC::State::DecisionByStmt`.	2026-01-13 22:09:07 +09:00
Björn Pettersson	ded109c0cf	[clang][CodeGen] Fix ConstantInt::get for i1 in EmitScalarPrePostIncDec (#175152 ) In ScalarExprEmitter::EmitScalarPrePostIncDec we create ConstantInt values that are either 1 or -1. There is a special case when the type is i1 (e.g. for unsigned _BitInt(1)) when we need to be able to create a "i1 true" value for both inc and dec. To avoid triggering the assertions added by the pull request #171456 we now treat the ConstantInt as unsigned for increments and as signed for decrements.	2026-01-09 15:32:13 +01:00
Victor Chernyakin	c438773432	[LLVM][ADT] Migrate users of `make_scope_exit` to CTAD (#174030 ) This is a followup to #173131, which introduced the CTAD functionality.	2026-01-02 20:42:56 -08:00
Farzon Lotfi	21571cf306	[HLSL][Matrix] Support row and column indexing modes for MatrixSubscriptExpr (#171564 ) fixes #167617 In DXC HLSL supports different indexing modes via codegen for its equivalent of the MatrixSubscriptExpr when the /Zpr and /Zpc flags are used see: https://godbolt.org/z/bz5Y5WG36. This change modifies EmitMatrixSubscriptExpr to consider the MatrixRowMajor/MatrixColMajor Layout flags before generating an index. Similarly it introduces `createRowMajorIndex` and `createColumnMajorIndex` in `MatrixBuilder.h` for use in `VisitMatrixSubscriptExpr`.	2025-12-20 17:21:04 -05:00
Farzon Lotfi	60b6c53f25	[HLSL][Matrix] Add support for single subscript accessor (#170779 ) fixes #166206 - Add swizzle support if row index is constant - Add test cases - Add new AST type - Add new LValue for Matrix Row Type - TODO: Make the new LValue a dynamic index version of ExtVectorElt	2025-12-17 17:04:28 -05:00
Nikita Popov	30ce1e9c7a	[CGExprScalar] Allow implicit truncation for CharacterLiteral The value is always stored as an unsigned number, even if the char type is signed, so we have to allow truncation here.	2025-12-17 12:48:50 +01:00
Nikita Popov	5891921801	[CGExprScalar] Use getSigned() for signed value	2025-12-16 12:30:21 +01:00
Nikita Popov	af3f2da253	[CGExprScalar] Fix inc/dec of vector larger than 64-bit (#172301 ) Use getSigned() to create the 1 or -1 constant, so it gets properly sign extended. This miscompile was found while working on https://github.com/llvm/llvm-project/pull/171456.	2025-12-16 09:48:28 +01:00
Farzon Lotfi	dd1b4abfb7	[HLSL][Matrix] Add support for Matrix element and trunc Casts (#168915 ) fixes #168737 fixes #168755 This change fixes adds support for Matrix truncations via the ICK_HLSL_Matrix_Truncation enum. That ends up being most of the files changed. It also allows Matrix as an HLSL Elementwise cast as long as the cast does not perform a shape transformation ie 3x2 to 2x3. Tests for the new elementwise and truncation behavior were added. As well as sema tests to make sure we error n the shape transformation cast. I am punting right now on the ConstExpr Matrix support. That will need to be addressed later. Will file a seperate issue for that if reviewers agree it can wait.	2025-12-02 19:02:25 -05:00
Matheus Izvekov	b516dcc998	[clang] NFC: rename TagType::getOriginalDecl back to getDecl (#163271 ) This rename was made as part of https://github.com/llvm/llvm-project/pull/147835 in order to ease rebasing the PR, and give a nice window for other patches to get rebased as well. It has been a while already, so lets go ahead and rename it back.	2025-10-15 16:11:17 -03:00
Marco Elver	dfa8732d05	[AllocToken, Clang] Infer type hints from sizeof expressions and casts (#156841 ) For the AllocToken pass to accurately calculate token ID hints, we need to attach `!alloc_token` metadata for allocation calls. Unlike new expressions, untyped allocation calls (like `malloc`, `calloc`, `::operator new(..)`, `__builtin_operator_new`, etc.) have no syntactic type associated with them. For -fsanitize=alloc-token, type hints are sufficient, and we can attempt to infer the type based on common idioms. When encountering allocation calls (with `__attribute__((malloc))` or `__attribute__((alloc_size(..))`), attach `!alloc_token` by inferring the allocated type from (a) sizeof argument expressions such as `malloc(sizeof(MyType))`, and (b) casts such as `(MyType*)malloc(4096)`. Note that non-standard allocation functions with these attributes are not instrumented by default. Use `-fsanitize-alloc-token-extended` to instrument them as well. Link: https://discourse.llvm.org/t/rfc-a-framework-for-allocator-partitioning-hints/87434 --- This change is part of the following series: 1. https://github.com/llvm/llvm-project/pull/160131 2. https://github.com/llvm/llvm-project/pull/156838 3. https://github.com/llvm/llvm-project/pull/162098 4. https://github.com/llvm/llvm-project/pull/162099 5. https://github.com/llvm/llvm-project/pull/156839 6. https://github.com/llvm/llvm-project/pull/156840 7. https://github.com/llvm/llvm-project/pull/156841 8. https://github.com/llvm/llvm-project/pull/156842	2025-10-09 09:31:16 +02:00
Sarah Spall	35c57a778b	[HLSL] Add support for elementwise and aggregate splat casting struct types with bitfields (#161263 ) Adds support for elementwise and aggregate splat casting struct types with bitfields. Replacing existing Flattening function which used to produce a list of GEPs representing a flattened object with one that produces a list of LValues representing a flattened object. The LValues can be used by EmitStoreThroughLValue and EmitLoadOfLValue, ensuring bitfields are properly loaded and stored. This also simplifies the code in the elementwise and aggregate splat casting functions. Closes #125986	2025-10-06 08:26:23 -07:00
Timm Baeder	8dac6e28c9	Reapply "[clang] Convert second arg of __builtin_assume_aligned to Co… (#161945 ) …nstantExpr (#161314)" (#161719) This reverts commit f1650cf91b01470ce44f47797663d59f00828493.	2025-10-05 08:35:56 +02:00
Amr Hesham	3aaa58fd90	[Clang] Fix Codegen UO real/imag crash on scalar with type promotion (#160609 ) Fixing codegen crash when compiling real/imag unary operators on scalar with type promotion Ref: #160583	2025-09-26 18:19:21 +02:00
Erich Keane	92dcbf4092	[NFC] Fix commas on assertion from 147514 (#158635 ) As brought up on the review for #147514, the original patch doesn't correctly parenthesize the expression in the assert. This fixes it.	2025-09-15 13:53:08 +00:00
CHANDRA GHALE	2f755c543a	[OpenMP] Conditional modifier on lastprivate clause is producing incorrect result in C mode (#156004 ) Conditional modifier on lastprivate clause is producing incorrect result when compiled with clang( C compiler). IR is not emitting while compilation with C compiler. However it is working correctly with clang++ The OpenMP hook that emits the conditional modifier (checkAndEmitLastprivateConditional) is skipped in C because assignment is a prvalue and takes the scalar path. Original Codegen Support : [eddb8](`a58da1a2ff (diff-629e03f730f901cdf96b6b48fb0aed8ef156590aaff37857b8e5ad0694beddb8)`) ``` C = → prvalue → EmitAnyExpr(TEK_Scalar) → ScalarExprEmitter::VisitBinAssign (hook not reached) C++ = → lvalue → EmitBinaryOperatorLValue ``` ``` Failing Test Case : #include <stdio.h> #define N 10 int A[N]; void condlastprivate() { int x, y, z, k; x = y = z = k = 0; #pragma omp parallel for lastprivate(conditional: x,y, z) lastprivate(k) for( k=0; k<N; k++){ if ((k >2 ) && (k <6)) { x = A[k]; z = A[k]+111; } else { y = A[k]+222; } } printf("Expecting: x=5, y=231, z=116 k=10 Got: x=%d y=%d z=%d k=%d \n", x,y,z,k); } int main() { for( int i=0; i<N; i++) { A[i] = i; } condlastprivate(); return 0; } ``` ``` #>./clang -fopenmp cond_c.c #> ./a.out Expecting: x=5, y=231, z=116 k=10 Got: x=-1376379760 y=231 z=631465600 k=10 ``` --------- Co-authored-by: Chandra Ghale <ghale@pe31.hpc.amslabs.hpecorp.net>	2025-09-11 16:31:38 +05:30
Dan Liew	f1ee047320	[UBSan][BoundsSafety] Implement support for more expressive "trap reasons" (#154618 ) In 29992cfd628ed5b968ccb73b17ed0521382ba317 (#145967) support was added for "trap reasons" on traps emitted in UBSan in trapping mode (e.g. `-fsanitize-trap=undefined`). This improved the debugging experience by attaching the reason for trapping as a string on the debug info on trap instructions. Consumers such as LLDB can display this trap reason string when the trap is reached. A limitation of that patch is that the trap reason string is hard-coded for each `SanitizerKind` even though the compiler actually has much more information about the trap available at compile time that could be shown to the user. This patch is an incremental step in fixing that. It consists of two main steps. 1. Introduce infrastructure for building trap reason strings To make it convenient to construct trap reason strings this patch re-uses Clang's powerful diagnostic infrastructure to provide a convenient API for constructing trap reason strings. This is achieved by: * Introducing a new `Trap` diagnostic kind to represent trap diagnostics in TableGen files. * Adding a new `Trap` diagnostic component. While this part probably isn't technically necessary it seemed like I should follow the existing convention used by the diagnostic system. * Adding `DiagnosticTrapKinds.td` to describe the different trap reasons. * Add the `TrapReasonBuilder` and `TrapReason` classes to provide an interface for constructing trap reason strings and the trap category. Note this API while similar to `DiagnosticBuilder` has different semantics which are described in the code comments. In particular the behavior when the destructor is called is very different. * Adding `CodeGenModule::BuildTrapReason()` as a convenient constructor for the `TrapReasonBuilder`. This use of the diagnostic system is a little unusual in that the emitted trap diagnostics aren't actually consumed by normal diagnostic consumers (e.g. the console). Instead the `TrapReasonBuilder` is just used to format a string, so in effect the builder is somewhat analagous to "printf". However, re-using the diagnostics system in this way brings a several benefits: * The powerful diagnostic templating languge (e.g. `%select`) can be used. * Formatting Clang data types (e.g. `Type`, `Expr`, etc.) just work out-of-the-box. * Describing trap reasons in tablegen files opens the door for translation to different languages in the future. * The `TrapReasonBuilder` API is very similar to `DiagnosticBuilder` which makes it easy to use by anyone already familiar with Clang's diagnostic system. While UBSan is the first consumer of this new infrastructure the intent is to use this to overhaul how trap reasons are implemented in the `-fbounds-safety` implementation (currently exists downstream). 2. Apply the new infrastructure to UBSan checks for arithmetic overflow To demonstrate using `TrapReasonBuilder` this patch applies it to UBSan traps for arithmetic overflow. The intention is that we would iteratively switch to using the `TrapReasonBuilder` for all UBSan traps where it makes sense in future patches. Previously for code like ``` int test(int a, int b) { return a + b; } ``` The trap reason string looked like ``` Undefined Behavior Sanitizer: Integer addition overflowed ``` now the trap message looks like: ``` Undefined Behavior Sanitizer: signed integer addition overflow in 'a + b' ``` This string is much more specific because * It explains if signed or unsigned overflow occurred * It actually shows the expression that overflowed One possible downside of this approach is it may blow up Debug info size because now there can be many more distinct trap reason strings. To allow users to avoid this a new driver/cc1 flag `-fsanitize-debug-trap-reasons=` has been added which can either be `none` (disable trap reasons entirely), `basic` (use the per `SanitizerKind` hard coded strings), and `detailed` (use the new expressive trap reasons implemented in this patch). The default is `detailed` to give the best out-of-the-box debugging experience. The existing `-fsanitize-debug-trap-reasons` and `-fno-sanitize-debug-trap-reasons` have been kept for compatibility and are aliases of the new flag with `detailed` and `none` arguments passed respectively. rdar://158612755	2025-08-27 13:07:15 -07:00
Matheus Izvekov	88438ba1f3	[clang] AST: fix getAs canonicalization of leaf types (#155028 )	2025-08-27 06:20:14 -03:00
Matheus Izvekov	2ec71d93ad	[clang] NFC: introduce Type::getAsEnumDecl, and cast variants for all TagDecls (#155463 ) And make use of those. These changes are split from prior PR #155028, in order to decrease the size of that PR and facilitate review.	2025-08-26 16:05:59 -03:00
Yingwei Zheng	5569bf26f0	[Clang][CodeGen] Preserve alignment information for pointer arithmetics (#152575 ) Previously, the alignment of pointer arithmetics was inferred from the pointee type, losing the alignment information from its operands: `503c0908c3/clang/lib/CodeGen/CGExpr.cpp (L1446-L1449)` This patch preserves alignment information for pointer arithmetics `P +/- C`, to match the behavior of identical array subscript `&P[C]`: https://godbolt.org/z/xx1hfTrx4. Closes https://github.com/llvm/llvm-project/issues/152330. Although the motivating case can be fixed by https://github.com/llvm/llvm-project/pull/145733, the alignment cannot be recovered without a dominating memory access with larger alignment.	2025-08-25 20:08:32 +08:00
Joseph Huber	5a929a4249	[Clang] Support using boolean vectors in ternary operators (#154145 ) Summary: It's extremely common to conditionally blend two vectors. Previously this was done with mask registers, which is what the normal ternary code generation does when used on a vector. However, since Clang 15 we have supported boolean vector types in the compiler. These are useful in general for checking the mask registers, but are currently limited because they do not map to an LLVM-IR select instruction. This patch simply relaxes these checks, which are technically forbidden by the OpenCL standard. However, general vector support should be able to handle these. We already support this for Arm SVE types, so this should be make more consistent with the clang vector type.	2025-08-20 07:49:26 -05:00
Matheus Izvekov	91cdd35008	[clang] Improve nested name specifier AST representation (#147835 ) This is a major change on how we represent nested name qualifications in the AST. * The nested name specifier itself and how it's stored is changed. The prefixes for types are handled within the type hierarchy, which makes canonicalization for them super cheap, no memory allocation required. Also translating a type into nested name specifier form becomes a no-op. An identifier is stored as a DependentNameType. The nested name specifier gains a lightweight handle class, to be used instead of passing around pointers, which is similar to what is implemented for TemplateName. There is still one free bit available, and this handle can be used within a PointerUnion and PointerIntPair, which should keep bit-packing aficionados happy. * The ElaboratedType node is removed, all type nodes in which it could previously apply to can now store the elaborated keyword and name qualifier, tail allocating when present. * TagTypes can now point to the exact declaration found when producing these, as opposed to the previous situation of there only existing one TagType per entity. This increases the amount of type sugar retained, and can have several applications, for example in tracking module ownership, and other tools which care about source file origins, such as IWYU. These TagTypes are lazily allocated, in order to limit the increase in AST size. This patch offers a great performance benefit. It greatly improves compilation time for [stdexec](https://github.com/NVIDIA/stdexec). For one datapoint, for `test_on2.cpp` in that project, which is the slowest compiling test, this patch improves `-c` compilation time by about 7.2%, with the `-fsyntax-only` improvement being at ~12%. This has great results on compile-time-tracker as well: ![image](https://github.com/user-attachments/assets/700dce98-2cab-4aa8-97d1-b038c0bee831) This patch also further enables other optimziations in the future, and will reduce the performance impact of template specialization resugaring when that lands. It has some other miscelaneous drive-by fixes. About the review: Yes the patch is huge, sorry about that. Part of the reason is that I started by the nested name specifier part, before the ElaboratedType part, but that had a huge performance downside, as ElaboratedType is a big performance hog. I didn't have the steam to go back and change the patch after the fact. There is also a lot of internal API changes, and it made sense to remove ElaboratedType in one go, versus removing it from one type at a time, as that would present much more churn to the users. Also, the nested name specifier having a different API avoids missing changes related to how prefixes work now, which could make existing code compile but not work. How to review: The important changes are all in `clang/include/clang/AST` and `clang/lib/AST`, with also important changes in `clang/lib/Sema/TreeTransform.h`. The rest and bulk of the changes are mostly consequences of the changes in API. PS: TagType::getDecl is renamed to `getOriginalDecl` in this patch, just for easier to rebasing. I plan to rename it back after this lands. Fixes #136624 Fixes https://github.com/llvm/llvm-project/issues/43179 Fixes https://github.com/llvm/llvm-project/issues/68670 Fixes https://github.com/llvm/llvm-project/issues/92757	2025-08-09 05:06:53 -03:00
Yingwei Zheng	ac8295550b	[Clang][CodeGen] Move `EmitPointerArithmetic` into `CodeGenFunction`. NFC. (#152634 ) `CodeGenFunction::EmitPointerArithmetic` is needed by https://github.com/llvm/llvm-project/pull/152575. Separate the NFC changes into a new PR for smooth review.	2025-08-08 21:41:03 +08:00
Corentin Jabot	5adb9a2936	[Clang] Fix crash on `void{}` (#147514 ) Caused by an incorrect assertion. Fixes #116440	2025-07-08 17:40:43 +02:00
Kazu Hirata	efc561c061	[CodeGen] Remove an unnecessary cast (NFC) (#146380 ) E is already of Expr * and shares the same declaration among all these cases.	2025-06-30 10:11:07 -07:00
Thurston Dang	428afa62b0	[ubsan] Add more -fsanitize-annotate-debug-info checks (#141997 ) This extends https://github.com/llvm/llvm-project/pull/138577 to more UBSan checks, by changing SanitizerDebugLocation (formerly SanitizerScope) to add annotations if enabled for the specified ordinals. Annotations will use the ordinal name if there is exactly one ordinal specified in the SanitizerDebugLocation; otherwise, it will use the handler name. Updates the tests from https://github.com/llvm/llvm-project/pull/141814. --------- Co-authored-by: Vitaly Buka <vitalybuka@google.com>	2025-06-06 14:59:32 -07:00
Aaron Ballman	d1a6327b2b	[C2y] Fix _Countof handling of VLAs (#141621 ) It turns out that getVLASize() does not get you the size of a single dimension of the VLA, it gets you the full count of all elements. This caused _Countof to return invalid values on VLA ranks. Now switched to using getVLAElements1D() instead, which only gets a single dimension. Fixes #141409	2025-05-27 14:59:30 -04:00
Orlando Cazalet-Hyams	8f1d1dde8a	[KeyInstr][Clang] Assignment atom group (#134637 ) This patch is part of a stack that teaches Clang to generate Key Instructions metadata for C and C++. RFC: https://discourse.llvm.org/t/rfc-improving-is-stmt-placement-for-better-interactive-debugging/82668 The feature is only functional in LLVM if LLVM is built with CMake flag LLVM_EXPERIMENTAL_KEY_INSTRUCTIONs. Eventually that flag will be removed.	2025-05-23 09:40:09 +01:00
Craig Topper	bf0655f208	[RISCV] Improve casting between i1 scalable vectors and i8 fixed vectors for -mrvv-vector-bits (#139190 ) For i1 vectors, we used an i8 fixed vector as the storage type. If the known minimum number of elements of the scalable vector type is less than 8, we were doing the cast through memory. This used a load or store from a fixed vector alloca. If is less than 8, DataLayout indicates that the load/store reads/writes vscale bytes even if vscale is known and vscale*X is less than or equal to 8. This means the load or store is outside the bounds of the fixed size alloca as far as DataLayout is concerned leading to undefined behavior. This patch avoids this by widening the i1 scalable vector type with zero elements until it is divisible by 8. This allows it be bitcasted to/from an i8 scalable vector. We then insert or extract the i8 fixed vector into this type. Hopefully this enables #130973 to be accepted.	2025-05-14 10:27:00 -07:00
Timm Baeder	9ca8248a91	[clang] Save ShuffleVectorExpr args as ConstantExpr (#139709 ) The passed indices have to be constant integers anyway, which we verify before creating the ShuffleVectorExpr. Use the value we create there and save the indices using a ConstantExpr instead. This way, we don't have to evaluate the args every time we call getShuffleMaskIdx().	2025-05-14 16:13:01 +02:00
Oliver Hunt	65a6cbde5b	[clang] Add support for `__ptrauth` being applied to integer types (#137580 ) Allows the __ptrauth qualifier to be applied to pointer sized integer types, updates Sema to ensure trivially copyable, etc correctly handle address discriminated integers, and updates codegen to perform authentication around arithmetic on the types.	2025-05-09 13:12:09 -07:00
Matt Arsenault	5ae2aed218	clang: Remove dest LangAS argument from performAddrSpaceCast (#138866 ) It isn't used and is redundant with the result pointer type argument. A more reasonable API would only have LangAS parameters, or IR parameters, not both. Not all values have a meaningful value for this. I'm also not sure why we have this at all, it's not overridden by any targets and further simplification is possible.	2025-05-09 14:24:54 +02:00
Yingwei Zheng	d2b012e391	[Clang][CodeGen] Enable pointer overflow check for GCC workaround (#137849 ) Do not suppress the pointer overflow check for the `(i8*) nullptr + N` idiom. Related issue: https://github.com/llvm/llvm-project/issues/137833	2025-05-09 14:53:00 +08:00
Sarah Spall	02e0a954a0	[HLSL] Handle init list with OpaqueValueExprs in CGExprScalar (#138541 ) When an HLSL Init list is producing a Scalar, handle OpaqueValueExprs in the Init List with 'emitInitListOpaqueValues' Copied from 'AggExprEmitter::VisitCXXParenListOrInitListExpr' Closes #136408 --------- Co-authored-by: Chris B <beanz@abolishcrlf.org>	2025-05-05 13:04:41 -07:00
Craig Topper	123758b1f4	[IRBuilder] Add versions of createInsertVector/createExtractVector that take a uint64_t index. (#138324 ) Most callers want a constant index. Instead of making every caller create a ConstantInt, we can do it in IRBuilder. This is similar to createInsertElement/createExtractElement.	2025-05-02 16:10:18 -07:00
Akira Hatanaka	a3283a92ae	[PAC] Add support for __ptrauth type qualifier (#100830 ) The qualifier allows programmer to directly control how pointers are signed when they are stored in a particular variable. The qualifier takes three arguments: the signing key, a flag specifying whether address discrimination should be used, and a non-negative integer that is used for additional discrimination. ``` typedef void (my_callback)(const void); my_callback __ptrauth(ptrauth_key_process_dependent_code, 1, 0xe27a) callback; ``` Co-Authored-By: John McCall rjmccall@apple.com	2025-04-15 12:54:25 -07:00
Farzon Lotfi	589e1c73d0	[HLSL] Add support for modulo of floating point scalar and vectors (#135125 ) fixes #135122 SemaExpr.cpp - Make all doubles fail. Add sema support for float scalars and vectors when language mode is HLSL. CGExprScalar.cpp - Allow emit frem when language mode is HLSL.	2025-04-10 14:27:49 -04:00
Jakub Ficek	a5509d62a7	[clang] fp options fix for __builtin_convertvector (#134102 ) Add missing CGFPOptionsRAII for fptoi and itofp cases	2025-04-08 11:36:48 +02:00

1 2 3 4 5 ...

1053 Commits