llvm-project

Author	SHA1	Message	Date
Matt Arsenault	28a0e1d0f0	MSP430: Add test for llvm.sincos intrinsic Mostly to test libcall behavior	2025-08-22 15:09:10 +09:00
Craig Topper	dee25a8a8e	[TableGen] Validate the shift amount for !srl, !shl, and !sra operators. (#132492 ) The C operator has undefined behavior for out of bounds shifts so we should check this.	2025-08-21 22:41:36 -07:00
Frederik Harwath	d6fae7f921	Reapply "[Clang] Take libstdc++ into account during GCC detection" #145056 (#154487 ) The Generic_GCC::GCCInstallationDetector class picks the GCC installation directory with the largest version number. Since the location of the libstdc++ include directories is tied to the GCC version, this can break C++ compilation if the libstdc++ headers for this particular GCC version are not available. Linux distributions tend to package the libstdc++ headers separately from GCC. This frequently leads to situations in which a newer version of GCC gets installed as a dependency of another package without installing the corresponding libstdc++ package. Clang then fails to compile C++ code because it cannot find the libstdc++ headers. Since libstdc++ headers are in fact installed on the system, the GCC installation continues to work, the user may not be aware of the details of the GCC detection, and the compiler does not recognize the situation and emit a warning, this behavior can be hard to understand - as witnessed by many related bug reports over the years. The goal of this work is to change the GCC detection to prefer GCC installations that contain libstdc++ include directories over those which do not. This should happen regardless of the input language since picking different GCC installations for a build that mixes C and C++ might lead to incompatibilities. Any change to the GCC installation detection will probably have a negative impact on some users. For instance, for a C user who relies on using the GCC installation with the largest version number, it might become necessary to use the --gcc-install-dir option to ensure that this GCC version is selected. This seems like an acceptable trade-off given that the situation for users who do not have any special demands on the particular GCC installation directory would be improved significantly. This patch does not yet change the automatic GCC installation directory choice. Instead, it does introduce a warning that informs the user about the future change if the chosen GCC installation directory differs from the one that would be chosen if the libstdc++ headers are taken into account. See also this related Discourse discussion: https://discourse.llvm.org/t/rfc-take-libstdc-into-account-during-gcc-detection/86992. This patch reapplies #145056. The test in the original PR did not specify a target in the clang RUN line and used a wrong way of piping to FileCheck.	2025-08-22 07:39:11 +02:00
Craig Topper	630712f4c1	[RISCV] Add a helper class to reduce PseudoAtomicLoadNand* pattern duplication. NFC (#154838 )	2025-08-21 22:35:28 -07:00
Matt Arsenault	b1b5102624	AMDGPU: Start considering new atomicrmw metadata on integer operations (#122138 ) Start considering !amdgpu.no.remote.memory.access and !amdgpu.no.fine.grained.host.memory metadata when deciding to expand integer atomic operations. This does not yet attempt to accurately handle fadd/fmin/fmax, which are trickier and require migrating the old "amdgpu-unsafe-fp-atomics" attribute.	2025-08-22 05:29:36 +00:00
Lang Hames	c1625fad02	[orc-rt] Rename unique_function to move_only_function. (#154888 ) This will allow the ORC runtime and its clients to easily adopt the c++-23 std::move_only_function type.	2025-08-22 15:26:10 +10:00
Craig Topper	c346f4079a	[RISCV] Use llvm_anyint_ty instead of llvm_any_ty for scalar intrinsics. NFC (#154816 )	2025-08-21 22:18:39 -07:00
Matt Arsenault	fc5fcc0c95	AMDGPU: Start using AV_MOV_B64_IMM_PSEUDO (#154500 )	2025-08-22 13:59:36 +09:00
Matt Arsenault	01f785cac4	AMDGPU: Expand remaining system atomic operations (#122137 ) System scope atomics need to use cmpxchg loops if we know nothing about the allocation the address is from. aea5980e26e6a87dab9f8acb10eb3a59dd143cb1 started this, this expands the set to cover the remaining integer operations. Don't expand xchg and add, those theoretically should work over PCIe. This is a pre-commit which will introduce performance regressions. Subsequent changes will add handling of new atomicrmw metadata, which will avoid the expansion. Note this still isn't conservative enough; we do need to expand some device scope atomics if the memory is in fine-grained remote memory.	2025-08-22 13:55:04 +09:00
Sergei Barannikov	6a7ade03d1	[TableGen][DecoderEmitter] Remove redundant variable (NFC) (#154880 ) `NumFiltered` is the number of elements in all vectors in a map. It is ever compared to 1, which is equivalent to checking if the map contains exactly one vector with exactly one element.	2025-08-22 04:42:06 +00:00
Craig Topper	586a7131d3	[RISCV][LoongArch] Prefix tablegen class names for intrinsics with 'RISCV'. NFC (#154821 ) All targets are included by Intrinsics.td so we should name things carefully to avoid interfering with other targets. Copy one class that LoongArch was also using.	2025-08-21 21:40:35 -07:00
Jordan Rupprecht	49d4712129	[bazel] Port #154774 : unroll vector.from_elements (#154882 )	2025-08-22 04:33:50 +00:00
Lang Hames	6df9a13e40	[orc-rt] Use LLVM-style header naming scheme. (#154881 ) This is more consistent with the rest of the LLVM project, and the resulting names are closer to the types defined in each of the headers.	2025-08-22 14:28:02 +10:00
dpalermo	d26ea02060	Revert "Fix Debug Build Using GCC 15" (#154877 ) Reverts llvm/llvm-project#152223	2025-08-21 21:54:58 -05:00
Yang Bai	f1f194bf10	[mlir][vector] fix: unroll vector.from_elements in gpu pipelines (#154774 ) ### Problem PR #142944 introduced a new canonicalization pattern which caused failures in the following GPU-related integration tests: - mlir/test/Integration/GPU/CUDA/TensorCore/sm80/transform-mma-sync-matmul-f16-f16-accum.mlir - mlir/test/Integration/GPU/CUDA/TensorCore/sm80/transform-mma-sync-matmul-f32.mlir The issue occurs because the new canonicalization pattern can generate multi-dimensional `vector.from_elements` operations (rank > 1), but the GPU lowering pipelines were not equipped to handle these during the conversion to LLVM. ### Fix This PR adds `vector::populateVectorFromElementsLoweringPatterns` to the GPU lowering passes that are integrated in `gpu-lower-to-nvvm-pipeline`: - `GpuToLLVMConversionPass`: the general GPU-to-LLVM conversion pass. - `LowerGpuOpsToNVVMOpsPass`: the NVVM-specific lowering pass. Co-authored-by: Yang Bai <yangb@nvidia.com>	2025-08-21 21:46:06 -05:00
Sergei Barannikov	418fb50301	[TableGen][DecoderEmitter] Calculate encoding bits once (#154026 ) Parse the `Inst` and `SoftField` fields once and store them in `InstructionEncoding` so that we don't parse them every time `getMandatoryEncodingBits()` is called.	2025-08-22 05:19:35 +03:00
Lang Hames	273b6f2911	[orc-rt] Add orc_rt::unique_function. (#154874 ) A bare-bones version of LLVM's unique_function: this behaves like a std::unique_function, except that it supports move only callable types.	2025-08-22 12:19:15 +10:00
Muhammad Bassiouni	4d323206ed	[libc][math] Refactor cospif16 implementation to header-only in src/__support/math folder. (#154222 ) Part of #147386 in preparation for: https://discourse.llvm.org/t/rfc-make-clang-builtin-math-functions-constexpr-with-llvm-libc-to-support-c-23-constexpr-math-functions/86450	2025-08-22 05:04:13 +03:00
Lang Hames	3ce25abd4a	[orc-rt] Add error.h: structured error support. (#154869 ) Adds support for the Error class, Expected class template, and related APIs that will be used for error propagation and handling in the new ORC runtime. The implementations of these types are cut-down versions of similar APIs in llvm/Support/Error.h. Most advice on llvm::Error and llvm::Expected (e.g. from the LLVM Programmer's manual) applies equally to orc_rt::Error and orc_rt::Expected. Ported from the old ORC runtime at compiler-rt/lib/orc.	2025-08-22 11:53:47 +10:00
Muhammad Bassiouni	783859b2a0	[libc][math] Refactor cospif implementation to header-only in src/__support/math folder. (#154215 ) Part of #147386 in preparation for: https://discourse.llvm.org/t/rfc-make-clang-builtin-math-functions-constexpr-with-llvm-libc-to-support-c-23-constexpr-math-functions/86450	2025-08-22 04:53:18 +03:00
Aiden Grossman	9d2a66fb32	[Clang] Slightly clean up __cpuidex_conflict.c This was intended to be fixed in #154217, but given that didn't land, it still needs to be done. I think it still makes sense to have this change in.	2025-08-22 01:37:17 +00:00
Anthony Latsis	0bc02096f6	[clang] Upstream `clang::CodeGen::getConstantSignedPointer` (#154453 ) This function was introduced to Swift's fork in https://github.com/swiftlang/llvm-project/commit/a9dd959e60c32#diff-db27b2738ad84e3f1093f9174710710478f853804d995a6de2816d1caaad30d1. The Swift compiler cannot use `CodeGenModule::getConstantSignedPointer`, to which it forwards, because that is a private interface.	2025-08-21 17:55:57 -07:00
Craig Topper	6167b1e6e9	[TableGen] Remove unnecessary use of utostr when writing to raw_ostream. NFC (#154800 ) raw_ostream is capable of printing unsigned or uint64_t directly.	2025-08-21 17:44:53 -07:00
Sergei Barannikov	b3f04bf44c	[M68k] Rename a generated file to be consistent with other targets (NFC)	2025-08-22 03:38:30 +03:00
Rahul Joshi	4eeeb8a01e	[NFC][MC][Decoder] Fix off-by-one indentation in generated code (#154855 )	2025-08-21 17:20:05 -07:00
Luke Lau	c97c6869b6	[VPlan] Allow folding not (cmp eq) -> icmp ne with other select users (#154497 ) Currently we only allow folding not (cmp eq) -> icmp ne if the not is the only user of the compare. However a common scenario is that some select might also use the compare. We can still fold the not if we also swizzle the arms of the selects. This helps avoid regressions in #150368	2025-08-22 07:59:14 +08:00
Wenju He	e6d095e89c	[libclc] Only create a target per each compile command for cmake MSVC generator (#154479 ) libclc sequential build issue addressed in commit 0c21d6b4c8ad is specific to cmake MSVC generator. Therefore, this PR avoids creating a large number of targets when a non-MSVC generator is used, such as the Ninja generator, which is used in pre-merge CI on Windows in llvm-project repo. We plan to migrate from MSVC generator to Ninja generator in our downstream CI to fix flaky cmake bug `Cannot restore timestamp`, which might be related to the large number of targets.	2025-08-22 07:45:42 +08:00
Med Ismail Bennani	05b1ec3724	[lldb/API] Add setters to SBStructuredData (#154445 ) This patch adds setters to the SBStruturedData class to be able to initialize said object from the client side directly. Signed-off-by: Med Ismail Bennani <ismail@bennani.ma>	2025-08-21 16:40:01 -07:00
Peter Collingbourne	ff85dbdf6b	ThinLTOBitcodeWriter: Emit __cfi_check to full LTO part of bitcode file. The CrossDSOCFI pass runs on the full LTO module and fills in the body of __cfi_check. This function must have the correct attributes in order to be compatible with the rest of the program. For example, when building with -mbranch-protection=standard, the function must have the branch-target-enforcement attribute, which is normally added by Clang. When __cfi_check is missing, CrossDSOCFI will give it the default set of attributes, which are likely incorrect. Therefore, emit __cfi_check to the full LTO part, where CrossDSOCFI will see it. Reviewers: efriedma-quic, vitalybuka, fmayer Reviewed By: efriedma-quic Pull Request: https://github.com/llvm/llvm-project/pull/154833	2025-08-21 16:31:32 -07:00
David Majnemer	f961b61f88	[APFloat] Properly implement DoubleAPFloat::compareAbsoluteValue The prior implementation would treat X+Y and X-Y as having equal magnitude. Rework the implementation to be more resilient.	2025-08-21 15:42:07 -07:00
Sergei Barannikov	c74afaac6c	[TableGen][DecoderEmitter] Use KnownBits for filters/encodings (NFCI) (#154691 ) `KnownBits` is faster and smaller than `std::vector<BitValue>`. It is also more convenient to use.	2025-08-22 01:37:47 +03:00
Alex MacLean	a3ed96b899	[NVPTX] Legalize aext-load to zext-load to expose more DAG combines (#154251 )	2025-08-21 15:33:23 -07:00
Patrick Simmons	304373fb6d	Fix Debug Build Using GCC 15 (#152223 ) Flang currently doesn't build in debug mode on GCC 15 due to missing dynamic libraries in some CMakeLists.txt files, and OpenMP doesn't link in debug mode due to the atomic library pulling in libstdc++ despite an incomplete attempt in the CMakeLists.txt to disable glibcxx assertions. This PR fixes these issues and allows Flang and the OpenMP runtime to build and link on GCC 15 in debug mode. --------- Co-authored-by: ronlieb <ron.lieberman@amd.com>	2025-08-21 18:28:01 -04:00
John Harrison	36d07ad83b	Reapply "[lldb-dap] Re-land refactor of DebugCommunication. (#147787 )" (#154832 ) This reverts commit 0f33b90b6117bcfa6ca3779c641c1ee8d03590fd and includes a fix for the added test that was submitted between my last update and pull.	2025-08-21 15:26:52 -07:00
Sergei Barannikov	33f6b10c17	[TableGen][DecoderEmitter] Resolve a FIXME in emitDecoder (#154649 ) As the FIXME says, we might generate the wrong code to decode an instruction if it had an operand with no encoding bits. An example is M68k's `MOV16ds` that is defined as follows: ``` dag OutOperandList = (outs MxDRD16:$dst); dag InOperandList = (ins SRC:$src); list<Register> Uses = [SR]; string AsmString = "move.w\t$src, $dst" dag Inst = (descend { 0, 1, 0, 0, 0, 0, 0, 0, 1, 1 }, (descend { 0, 0, 0 }, (operand "$dst", 3))); ``` The `$src` operand is not encoded, but what we see in the decoder is: ```C++ tmp = fieldFromInstruction(insn, 0, 3); if (!Check(S, DecodeDR16RegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; } if (!Check(S, DecodeSRCRegisterClass(MI, insn, Address, Decoder))) { return MCDisassembler::Fail; } return S; ``` This calls DecodeSRCRegisterClass passing it `insn` instead of the value of a field that doesn't exist. DecodeSRCRegisterClass has an unconditional llvm_unreachable inside it. New decoder looks like: ```C++ tmp = fieldFromInstruction(insn, 0, 3); if (!Check(S, DecodeDR16RegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; } return S; ``` We're still not disassembling this instruction right, but at least we no longer have to provide a weird operand decoder method that accepts instruction bits instead of operand bits. See #154477 for the origins of the FIXME.	2025-08-21 22:22:16 +00:00
Dave Lee	d4b9acad58	[lldb] Fix TestSettings.py (#154849 ) Fixes a few test failures on windows. See https://github.com/llvm/llvm-project/pull/153233	2025-08-21 15:19:44 -07:00
Shubham Sandeep Rastogi	7f0e70fd2d	[CMake] Update CMAKE_OSX_DEPLOYMENT_TARGET to 10.13. The greendragon standalone bot is complaining about the deployment target needing to be 10.13 or above. This change makes it 10.13	2025-08-21 15:09:31 -07:00
Rahul Joshi	22f8693248	[NFC][MC][Decoder] Extract fixed pieces of decoder code into new header file (#154802 ) Extract fixed functions generated by decoder emitter into a new MCDecoder.h header.	2025-08-21 15:06:43 -07:00
Kazu Hirata	628280b597	[memprof] Tidy up #includes (NFC) (#154684 ) We've reorganized some code within memprof, but #indludes haven't quite followed the code that moved.	2025-08-21 15:03:40 -07:00
Kazu Hirata	67e95c6f60	[llvm] Proofread DebuggingCoroutines.rst (#154681 )	2025-08-21 15:03:32 -07:00
Kazu Hirata	4e98641451	[ADT] Use SmallPtrSet or SmallSet flexibly (NFC) (#154680 ) I'm trying to remove the redirection in SmallSet.h: template <typename PointeeType, unsigned N> class SmallSet<PointeeType, N> : public SmallPtrSet<PointeeType, N> {}; to make it clear that we are using SmallPtrSet. There are only handful places that rely on this redirection. Now, this unit test is unique in that supply multiple key types via TYPED_TESTS. This patch adds UniversalSmallSet to work around the problem.	2025-08-21 15:03:24 -07:00
Kazu Hirata	ec07d8e941	[clang-tidy] Use SmallPtrSet directly instead of SmallSet (NFC) (#154679 ) I'm trying to remove the redirection in SmallSet.h: template <typename PointeeType, unsigned N> class SmallSet<PointeeType, N> : public SmallPtrSet<PointeeType, N> {}; to make it clear that we are using SmallPtrSet. There are only handful places that rely on this redirection. This patch replaces SmallSet to SmallPtrSet where the element type is a pointer.	2025-08-21 15:03:16 -07:00
Kazu Hirata	f5f6613af6	[Scalar] Use SmallSetVector instead of SmallVector (NFC) (#154678 ) insertParsePoints collects live variables and then deduplicate them while retaining the original insertion order, which is exactly what SetVector is designed for. This patch replaces SmallVector with SetSmallVector while deleting unique_unsorted. While we are at it, this patch reduces the number of inline elements to a reasonable level for linear search.	2025-08-21 15:03:08 -07:00
Thurston Dang	e45210afe2	[msan] Handle AVX512 VCVTPS2PH (#154460 ) This extends handleAVX512VectorConvertFPToInt() from 556c8467d15a131552e3c84478d768bafd95d4e6 (https://github.com/llvm/llvm-project/pull/147377) to handle AVX512 VCVTPS2PH.	2025-08-21 15:03:01 -07:00
Valentin Clement (バレンタインクレメン)	1d05d693a1	[flang][cuda] Fix offset with multiple assumed size shared array (#154844 ) When multiple assumed size variable are used in a kernel with dynamic shared memory, each variable use the 0 offset. Update the pass to account for that. ``` attributes(global) subroutine testany( a ) real(4), shared :: smasks() real(8), shared :: dmasks() end subroutine ```	2025-08-21 21:51:43 +00:00
Justin Riddell	fa67855c99	[CIR] Handle FunctionToPointerDecay casts (#153657 ) (#154060 ) Add upstream support for handling implicit FunctionToPointerDecay casts	2025-08-21 14:40:14 -07:00
Sergei Barannikov	2421929ca6	[TableGen][DecoderEmitter] Infer encoding's HasCompleteDecoder earlier (NFCI) (#154644 ) If an encoding has a custom decoder, the decoder is assumed to be "complete" (always succeed) if hasCompleteDecoder field is true. We determine this when constructing InstructionEncoding. If the decoder for an encoding is generated, it always succeeds if none of the operand decoders can fail. The latter is determined based on the value of operands' DecoderMethod/hasCompleteDecoder. This happens late, at table construction time, making the code harder to follow. This change moves this logic to the InstructionEncoding constructor.	2025-08-21 21:35:30 +00:00
Rahul Joshi	d38a5afa5a	[NFC][MC][ARM] Fix formatting for `ITStatus` and `VPTStatus` (#154815 )	2025-08-21 14:26:18 -07:00
Craig Topper	04a271adf8	[RISCV] Reorder atomic pseudo instructions and isel patterns. NFC (#154835 ) Instead of interleaving the pseudo definitions and their patterns, define all the pseudos together and all the patterns together. Add IsRV32 predicate to the patterns.	2025-08-21 14:22:02 -07:00
Joseph Huber	c704dabe88	[Clang] Fix incorrect return type for `__builtin_shufflevector` (#154817 ) Summary: The `__builtin_shufflevector` call would return a GCC vector in all cases where the vector type was increased. Change this to preserve whether or not this was an extended vector. Fixes: https://github.com/llvm/llvm-project/issues/107981	2025-08-21 16:13:52 -05:00

1 2 3 4 5 ...

549572 Commits