llvm-project

Author	SHA1	Message	Date
Renaud Kauffmann	3856bb6bbf	[flang] [acc] Adding allocation to the recipe of scalar allocatables (#154643 ) Currently the privatization recipe of a scalar allocatable is as follow: ``` acc.private.recipe @privatization_ref_box_heap_i32 : !fir.ref<!fir.box<!fir.heap<i32>>> init { ^bb0(%arg0: !fir.ref<!fir.box<!fir.heap<i32>>>): %0 = fir.alloca !fir.box<!fir.heap<i32>> %1:2 = hlfir.declare %0 {uniq_name = "acc.private.init"} : (!fir.ref<!fir.box<!fir.heap<i32>>>) -> (!fir.ref<!fir.box<!fir.heap<i32>>>, !fir.ref<!fir.box<!fir.heap<i32>>>) acc.yield %1#0 : !fir.ref<!fir.box<!fir.heap<i32>>> } ``` This change adds the allocation for the scalar.	2025-08-20 16:04:57 -07:00
Mehdi Amini	62b29d9f76	[MLIR] Adopt LDBG() debug macro in BytecodeWriter.cpp (NFC) (#154642 )	2025-08-20 22:45:39 +00:00
Mehdi Amini	908eebcb93	[MLIR] Adopt LDBG() macro in PDL ByteCodeExecutor (NFC) (#154641 )	2025-08-20 22:40:52 +00:00
Ely Ronnen	8b64cd8be2	[lldb-dap] Add module symbol table viewer to VS Code extension #140626 (#153836 ) - VS Code extension: - Add module symbol table viewer using [Tabulator](https://tabulator.info/) for sorting and formatting rows. - Add context menu action to the modules tree. - lldb-dap - Add `DAPGetModuleSymbolsRequest` to get symbols from a module. Fixes #140626 [Screencast From 2025-08-15 19-12-33.webm](https://github.com/user-attachments/assets/75e2f229-ac82-487c-812e-3ea33a575b70)	2025-08-21 00:31:48 +02:00
Joseph Huber	27fc9671f9	Revert "[libc] Enable wide-read memory operations by default on Linux (#154602 )" This reverts commit c80d1483c6d787edf62ff9e86b1e97af5eb5abf9.	2025-08-20 17:27:13 -05:00
Craig Topper	2cb7c46bf0	[RISCV] Add missing 'OrP' to comment in RISCVInstrInfoZb.td. NFC	2025-08-20 15:27:03 -07:00
Joseph Huber	c80d1483c6	[libc] Enable wide-read memory operations by default on Linux (#154602 ) Summary: This patch changes the linux build to use the wide reads on the memory operations by default. These memory functions will now potentially read outside of the bounds explicitly allowed by the current function. While technically undefined behavior in the standard, plenty of C library implementations do this. it will not cause a segmentation fault on linux as long as you do not cross a page boundary, and because we are only reading memory it should not have atomic effects.	2025-08-20 17:17:12 -05:00
Craig Topper	ac8f0bb070	[RISCV] Reduce ManualCodeGen for segment load/store intrinsics. NFC Operate directly on the existing Ops vector instead of copying to a new vector. This is similar to what the autogenerated codegen does for other intrinsics. This reduced the clang binary size by ~96kb on my local Release+Asserts build.	2025-08-20 15:02:24 -07:00
Sergei Barannikov	46343ca374	[TableGen][DecoderEmitter] Add DecoderMethod to InstructionEncoding (NFC) (#154477 ) We used to abuse Operands list to store instruction encoding's DecoderMethod there. Let's store it in the InstructionEncoding class instead, where it belongs.	2025-08-20 21:59:59 +00:00
Mehdi Amini	dbbd3f0d07	[MLIR] Adopt LDBG() macro in Affine/Analysis/Utils.cpp (NFC) (#154626 )	2025-08-20 21:56:03 +00:00
Alan Zhao	904b4f5a27	[clang][timers][modules] Fix a timer being started when it's running (#154231 ) `ASTReader::FinishedDeserializing()` calls `adjustDeductedFunctionResultType(...)` [0], which in turn calls `FunctionDecl::getMostRecentDecl()`[1]. In modules builds, `getMostRecentDecl()` may reach out to the `ASTReader` and start deserializing again. Starting deserialization starts `ReadTimer`; however, `FinishedDeserializing()` doesn't call `stopTimer()` until after it's call to `adjustDeductedFunctionResultType(...)` [2]. As a result, we hit an assert checking that we don't start an already started timer [3]. To fix this, we simply don't start the timer if it's already running. Unfortunately I don't have a test case for this yet as modules builds are notoriously difficult to reduce. [0]: `4d2288d318/clang/lib/Serialization/ASTReader.cpp (L11053)` [1]: `4d2288d318/clang/lib/AST/ASTContext.cpp (L3804)` [2]: `4d2288d318/clang/lib/Serialization/ASTReader.cpp (L11065-L11066)` [3]: `4d2288d318/llvm/lib/Support/Timer.cpp (L150)`	2025-08-20 21:53:43 +00:00
Kazu Hirata	55551da200	[lldb] Add missing case statements for SubstBuiltinTemplatePack (#154606 ) This patch fixes: lldb/source/Plugins/TypeSystem/Clang/TypeSystemClang.cpp:4148:11: error: enumeration value 'SubstBuiltinTemplatePack' not handled in switch [-Werror,-Wswitch] 4148 \| switch (qual_type->getTypeClass()) { \| ^~~~~~~~~~~~~~~~~~~~~~~~~ lldb/source/Plugins/TypeSystem/Clang/TypeSystemClang.cpp:4852:11: error: enumeration value 'SubstBuiltinTemplatePack' not handled in switch [-Werror,-Wswitch] 4852 \| switch (qual_type->getTypeClass()) { \| ^~~~~~~~~~~~~~~~~~~~~~~~~ lldb/source/Plugins/TypeSystem/Clang/TypeSystemClang.cpp:5153:11: error: enumeration value 'SubstBuiltinTemplatePack' not handled in switch [-Werror,-Wswitch] 5153 \| switch (qual_type->getTypeClass()) { \| ^~~~~~~~~~~~~~~~~~~~~~~~~	2025-08-20 14:53:28 -07:00
Gang Chen	575fad2892	[AMDGPU] Upstream the Support for array of named barriers (#154604 )	2025-08-20 14:53:03 -07:00
Mehdi Amini	d20a74e631	[MLIR] Adopt LDBG() macro in BasicPtxBuilderInterface.cpp (NFC) (#154625 )	2025-08-20 21:51:17 +00:00
Mehdi Amini	4be19e27b5	[MLIR] Adopt LDBG() debug macros in Affine LoopAnalysis.cpp (NFC) (#154621 )	2025-08-20 21:45:42 +00:00
Steven Wu	deab049b5c	[CAS] Add ActionCache to LLVMCAS Library (#114097 ) ActionCache is used to store a mapping from CASID to CASID. The current implementation of the ActionCache can only be used to associate the key/value from the same hash context. ActionCache has two operations: `put` to store the key/value and `get` to lookup the key/value mapping. ActionCache uses the same TrieRawHashMap data structure to store the mapping, where is CASID of the key is the hash to index the map. While CASIDs for key/value are often associcate with actual CAS ObjectStore, it doesn't provide the guarantee of the existence of such object in any ObjectStore.	2025-08-20 14:42:44 -07:00
Ramkumar Ramachandra	0db57ab586	[VPlan] Improve code using onlyScalarValuesUsed (NFC) (#154564 )	2025-08-20 22:38:00 +01:00
Julian Lettner	484d0408f9	[lldb] Fix source line annotations for libsanitizers traces (#154247 ) When providing allocation and deallocation traces, the ASan compiler-rt runtime already provides call addresses (`TracePCType::Calls`). On Darwin, system sanitizers (libsanitizers) provides return address. It also discards a few non-user frames at the top of the stack, because these internal libmalloc/libsanitizers stack frames do not provide any value when diagnosing memory errors. Introduce and add handling for `TracePCType::ReturnsNoZerothFrame` to cover this case and enable libsanitizers traces line-level testing. rdar://157596927 --- Commit 1 is a mechanical refactoring to introduce and adopt `TracePCType` enum to replace `pcs_are_call_addresses` bool. It preserve the current behavior: ``` pcs_are_call_addresses: false -> TracePCType::Returns (default) true -> TracePCType::Calls ``` Best reviewed commit by commit.	2025-08-20 14:33:27 -07:00
Florian Hahn	7d33743324	[LV] Add tests for narrowing interleave groups with scalable vectors.	2025-08-20 22:31:24 +01:00
Andy Kaylor	59b33242af	[CIR][NFC] Fix warning in MemOrder lowering (#154609 ) This fixes a warning about having a default case in a fully covered enum switch statement.	2025-08-20 14:30:22 -07:00
Mehdi Amini	6445a75c98	[MLIR] Update MLIRContext to use the LDBG() style debug macro (NFC) (#154619 )	2025-08-20 21:30:11 +00:00
Mehdi Amini	ffbc8da8b5	[MLIR] Migrate LICM utils to the LDBG() macro style logging (NFC) (#154615 )	2025-08-20 21:29:50 +00:00
Mehdi Amini	780750bbf9	[MLIR] Adopt LDBG() debug macro in ConvertToLLVMPass (NFC) (#154616 )	2025-08-20 21:29:35 +00:00
YongKang Zhu	5c4f506cca	[BOLT] Validate extra entry point by querying data marker symbols (#154611 ) Look up marker symbols and decide whether candidate is really extra entry point in `adjustFunctionBoundaries()`.	2025-08-20 14:18:56 -07:00
Mehdi Amini	5683baea6d	[MLIR] Adopt LDBG() debug macro in bufferization (NFC) (#154614 )	2025-08-20 21:14:02 +00:00
Isaac Nudelman	c6fa115b2d	[clang][analyzer] Relax assertion for non-default address spaces in the cstring checker (#153498 ) Prevent an assertion failure in the cstring checker when library functions like memcpy are defined with non-default address spaces. Adds a test for this case.	2025-08-20 16:07:54 -05:00
David Majnemer	0a7eabcc56	Reapply "[APFloat] Fix getExactInverse for DoubleAPFloat" The previous implementation of getExactInverse used the following check to identify powers of two: // Check that the number is a power of two by making sure that only the // integer bit is set in the significand. if (significandLSB() != semantics->precision - 1) return false; This condition verifies that the only set bit in the significand is the integer bit, which is correct for normal numbers. However, this logic is not correct for subnormal values. APFloat represents subnormal numbers by shifting the significand right while holding the exponent at its minimum value. For a power of two in the subnormal range, its single set bit will therefore be at a position lower than precision - 1. The original check would consequently fail, causing the function to determine that these numbers do not have an exact multiplicative inverse. The new logic calculated this correctly but it seems that test/CodeGen/Thumb2/mve-vcvt-fixed-to-float.ll expected the old behavior. Seeing as how getExactInverse does not have tests or documentation, we conservatively maintain (and document) this behavior. This reverts commit 47e62e846beb267aad50eb9195dfd855e160483e.	2025-08-20 14:02:36 -07:00
Rolf Morel	cbfa265e98	[MLIR][LLVMIR][DLTI] Add `LLVM::TargetAttrInterface` and `#llvm.target` attr (#145899 ) Adds the `#llvm.target<triple = $TRIPLE, chip = $CHIP, features = $FEATURES>` attribute and along with a `-llvm-target-to-data-layout` pass to derive a MLIR data layout from the LLVM data layout string (using the existing `DataLayoutImporter`). The attribute implements the relevant DLTI-interfaces, to expose the `triple`, `chip` (AKA `cpu`) and `features` on `#llvm.target` and the full `DataLayoutSpecInterface`. The pass combines the generated `#dlti.dl_spec` with an existing `dl_spec` in case one is already present, e.g. a `dl_spec` which is there to specify size of the `index` type. Adds a `TargetAttrInterface` which can be implemented by all attributes representing LLVM targets. Similar to the Draft PR https://github.com/llvm/llvm-project/pull/78073. RFC on which this PR is based: https://discourse.llvm.org/t/mandatory-data-layout-in-the-llvm-dialect/85875	2025-08-20 22:00:30 +01:00
Shafik Yaghmour	2a66ce5edb	[Clang][NFC] Clarify some SourceManager related code (#153527 ) Static analysis flagged the columns - 1 code, it was correct but the assumption was not obvious. I document the assumption w/ assertions. While digging through related code I found getColumnNumber that looks wrong at first inspection and adding parentheses makes it clearer.	2025-08-20 13:57:37 -07:00
Philip Reames	e6b4a21849	[IR] Add utilities for manipulating length of MemIntrinsic [nfc] (#153856 ) Goal is simply to reduce direct usage of getLength and setLength so that if we end up moving memset.pattern (whose length is in elements) there are fewer places to audit.	2025-08-20 13:50:11 -07:00
Valentin Clement (バレンタインクレメン)	a4e8ec9de9	[flang][cuda][NFC] Add getDataAttr helper (#154586 )	2025-08-20 13:46:29 -07:00
Dan Salvato	45e2c50256	[M68k] Fix reverse BTST condition causing opposite failure/success logic (#153086 ) Given the test case: ```llvm define fastcc i16 @testbtst(i16 %a) nounwind { entry: switch i16 %a, label %no [ i16 11, label %yes i16 10, label %yes i16 9, label %yes i16 4, label %yes i16 3, label %yes i16 2, label %yes ] yes: ret i16 1 no: ret i16 0 } ``` We currently get this result: ```asm testbtst: ; @testbtst ; %bb.0: ; %entry move.l %d0, %d1 and.l #65535, %d1 sub.l #11, %d1 bhi .LBB0_3 ; %bb.1: ; %entry and.l #65535, %d0 move.l #3612, %d1 btst %d0, %d1 bne .LBB0_3 ; <------- Erroneous condition ; %bb.2: ; %yes moveq #1, %d0 rts .LBB0_3: ; %no moveq #0, %d0 rts ``` The cause of this is a line that explicitly reverses the `btst` condition code. But on M68k, `btst` sets condition codes the same as `and` with a bitmask, meaning `EQ` indicates failure (bit is zero) and not success, so the condition does not need to be reversed. In my testing, I've only been able to get switch statements to lower to `btst`, so I wasn't able to explicitly test other options for lowering. But (if possible to trigger) I believe they have the same logical error. For example, in `LowerAndToBTST()`, a comment specifies that it's lowering a case where the `and` result is compared against zero, which means the corresponding `btst` condition should also not be reversed. This patch simply flips the ternary expression in `getBitTestCondition()` to match the ISD condition code with the same M68k code, instead of the opposite.	2025-08-20 13:45:01 -07:00
Florian Hahn	4e6c88be7c	[TTI] Remove Args argument from getOperandsScalarizationOverhead (NFC). (#154126 ) Remove the ArrayRef<const Value> Args operand from getOperandsScalarizationOverhead and require that the callers de-duplicate arguments and filter constant operands. Removing the Value based Args argument enables callers where no Value * operands are available to use the function in a follow-up: computing the scalarization cost directly for a VPlan recipe. It also allows more accurate cost-estimates in the future: for example, when vectorizing a loop, we could also skip operands that are live-ins, as those also do not require scalarization. PR: https://github.com/llvm/llvm-project/pull/154126	2025-08-20 21:09:08 +01:00
David Green	4875553f4c	[AArch64][GlobalISel] Port unmerge KnownBits tests to print<gisel-value-tracking>. NFC This takes the known-bits tests added in #112172 and ports them over to be a new print<gisel-value-tracking> test.	2025-08-20 20:57:14 +01:00
David Green	bd94aabfb6	[AArch64][GlobalISel] Remove Selection code for s/uitofp. NFC (#154488 ) These are already handled by tablegen patterns.	2025-08-20 20:52:42 +01:00
Florian Hahn	b0d0e04693	[LV] Add test where we choose VF * IC is larger than trip count.	2025-08-20 20:40:49 +01:00
Matheus Izvekov	e1dbe093c4	[clang] build UnresolvedUsingType for constructor initializers (#154592 ) When building the base type for constructor initializer, the case of an UnresolvedUsingType was not being handled. For the non-dependent case, we are also skipping adding the UsingType, but this is just missing information in the AST. A FIXME for this is added. This fixes a regression introduced in #147835, which was never released, so there are no release notes. Fixes #154436	2025-08-20 16:24:41 -03:00
Gang Chen	60dbde69cd	[AMDGPU] report named barrier cnt part2 (#154588 )	2025-08-20 12:00:45 -07:00
Tom Honermann	8ed9c6101f	[NFC] Remove unneeded forward declaration of diagnoseUncapturableValueReferenceOrBinding() (#154591 ) The only (remaining) use of this forward declaration was removed in commit 127bf44385424891eb04cff8e52d3f157fc2cb7c.	2025-08-20 14:57:53 -04:00
Kevin McAfee	691ccf263a	[NVPTX] Implement computeKnownBitsForTargetNode for LoadV (#154165 ) Remove AND combines as they are no longer needed after this.	2025-08-20 18:57:15 +00:00
Leandro Lacerda	8d7b50e572	[Offload][Conformance] Add `RandomGenerator` for large input spaces (#154252 ) This patch implements the `RandomGenerator`, a new input generator that enables conformance testing for functions with large input spaces (e.g., double-precision math functions). Architectural Refactoring To support different generation strategies in a clean and extensible way, the existing `ExhaustiveGenerator` was refactored into a new class hierarchy: * A new abstract base class, `RangeBasedGenerator`, was introduced using the Curiously Recurring Template Pattern (CRTP). It contains the common logic for generators that operate on a sequence of ranges. * `ExhaustiveGenerator` now inherits from this base class, simplifying its implementation. New Components * The new `RandomGenerator` class also inherits from `RangeBasedGenerator`. It implements a strategy that randomly samples a specified number of points from the total input space. * Random number generation is handled by a new, self-contained `RandomState` class (a `xorshift64` PRNG seeded with `splitmix64`) to ensure deterministic and reproducible random streams for testing. Example Usage* As a first use case and demonstration of this new capability, this patch also adds the first double-precision conformance test for the `log` function. This test uses the new `RandomGenerator` to validate the implementations from the `llvm-libm`, `cuda-math`, and `hip-math` providers.	2025-08-20 13:37:01 -05:00
Joseph Huber	9888f0c3c4	[Clang] Add builtins for masked vector loads / stores (#154464 ) Summary: Clang has support for boolean vectors, these builtins expose the LLVM instruction of the same name. This differs from a manual load and select by potentially suppressing traps from deactivated lanes. Fixes: https://github.com/llvm/llvm-project/issues/107753	2025-08-20 13:33:32 -05:00
Joseph Huber	2f6b747997	[Clang] Add queryable feature 'ext_vector_type_boolean' for SIMD masks (#154227 ) Summary: We added boolean vectors in clang 15 and wish to extend them further in clang-22. However, there's no way to query for their support as they are separate to the normal extended vector type. This adds a feature so we can check for it as a feature directly.	2025-08-20 13:33:02 -05:00
Alexandre Ganea	410a1341b5	[clang][bytecode] Silence unused variable warning	2025-08-20 14:10:05 -04:00
Finn Plummer	15babbaf5d	[DirectX] Add boilerplate integration of `objcopy` for `DXContainerObjectFile` (#153079 ) This pr implements the boiler plate required to use `llvm-objcopy` for `DXContainer` object files. It defines a minimal structure `object` to represent the `DXContainer` header and the following parts. This structure is a simple representation of the object data to allow for simple modifications at the granularity of each part. It follows similarily to how the respective `object`s are defined for `ELF`, `wasm`, `XCOFF`, etc. This is the first step to implement https://github.com/llvm/llvm-project/issues/150275 and https://github.com/llvm/llvm-project/issues/150277 as compiler actions that invoke `llvm-objcopy` for functionality.	2025-08-20 10:58:42 -07:00
Michał Górny	d76bb2bb89	[mlir] Fix missing `mlir-capi-global-constructors-test` on standalone build (#154576 ) Add `mlir-capi-global-constructors-test` to `MLIR_TEST_DEPENDS` when `MLIR_ENABLE_EXECUTION_ENGINE` is enabled, to ensure that it is also built during standalone builds, and therefore fix test failure due to the executable being missing. I don't understand the purpose of `LLVM_ENABLE_PIC AND TARGET ${LLVM_NATIVE_ARCH}` block, but the condition is not true in standalone builds. Fixes 7610b1372955da55e3dc4e2eb1440f0304a56ac8.	2025-08-20 19:50:07 +02:00
Krzysztof Parzyszek	9f1679190e	[flang][OpenMP] Update GetOmpObjectList, move to parser utils (#154389 ) `GetOmpObjectList` takes a clause, and returns the pointer to the contained OmpObjectList, or nullptr if the clause does not contain one. Some clauses with object list were not recognized: handle all clauses, and move the implementation to flang/Parser/openmp-utils.cpp.	2025-08-20 12:41:26 -05:00
Yitzhak Mandelbaum	2be52f309e	[clang][dataflow] Fix uninitialized memory bug. (#154575 ) Commit #3ecfc03 introduced a bug involving an uninitialized field in `exportLogicalContext`. This patch initializes the field properly.	2025-08-20 13:36:42 -04:00
Akash Banerjee	d69ccded4f	[MLIR] Add cpow support in ComplexToROCDLLibraryCalls (#153183 ) This PR adds support for complex power operations (`cpow`) in the `ComplexToROCDLLibraryCalls` conversion pass, specifically targeting AMDGPU architectures. The implementation optimises complex exponentiation by using mathematical identities and special-case handling for small integer powers. - Force lowering to `complex.pow` operations for the `amdgcn-amd-amdhsa` target instead of using library calls - Convert `complex.pow(z, w)` to `complex.exp(w * complex.log(z))` using mathematical identity	2025-08-20 17:18:30 +00:00
Kazu Hirata	65de318d18	[Sema] Fix a warning This patch fixes: clang/lib/Sema/SemaTemplateVariadic.cpp:1069:22: error: variable 'TST' set but not used [-Werror,-Wunused-but-set-variable]	2025-08-20 09:56:27 -07:00

... 4 5 6 7 8 ...

549622 Commits