llvm-project

Author	SHA1	Message	Date
Anutosh Bhat	8c2dc1b5aa	[clang-repl] Implement LoadDynamicLibrary for clang-repl wasm use cases (#133037 ) Currently we don't make use of the JIT for the wasm use cases so the approach using the execution engine won't work in these cases. Rather if we use dlopen. We should be able to do the following (demonstrating through a toy project) 1) Make use of LoadDynamicLibrary through the given implementation ``` extern "C" EMSCRIPTEN_KEEPALIVE int load_library(const char *name) { auto Err = Interp->LoadDynamicLibrary(name); if (Err) { llvm::logAllUnhandledErrors(std::move(Err), llvm::errs(), "load_library error: "); return -1; } return 0; } ``` 2) Add a button to call load_library once the library has been added in our MEMFS (currently we have symengine built as a SIDE MODULE and we are loading it) (cherry picked from commit 8f56394487a4d454be0637667267ad37bd636d0f)	2025-04-29 15:54:13 -07:00
Jonas Paulsson	02afcbf63f	[SystemZ] Fix compile time regression in adjustInliningThreshold(). (#137527 ) Instead of always iterating over all GlobalVariable:s in the Module to find the case where both Caller and Callee is using the same GV heavily, first scan Callee (only if less than 200 instructions) for all GVs used more than 10 times, and then do the counting for the Caller for just those relevant GVs. The limit of 200 instructions makes sense as this aims to inline a relatively small function using a GV +10 times. This resolves the compile time problem with zig where it is on main (compared to removing the heuristic) a 380% increase, but with this change <0.5% increase (total user compile time with opt). Fixes #134714. (cherry picked from commit 98b895da30c03b7061b8740d91c0e7998e69d091)	2025-04-29 15:42:30 -07:00
Anutosh Bhat	c877757659	[clang-repl] : Fix clang-repl crash with --cuda flag (#136404 ) `clang-repl --cuda` was previously crashing with a segmentation fault, instead of reporting a clean error ``` (base) anutosh491@Anutoshs-MacBook-Air bin % ./clang-repl --cuda #0 0x0000000111da4fbc llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/opt/local/libexec/llvm-20/lib/libLLVM.dylib+0x150fbc) #1 0x0000000111da31dc llvm::sys::RunSignalHandlers() (/opt/local/libexec/llvm-20/lib/libLLVM.dylib+0x14f1dc) #2 0x0000000111da5628 SignalHandler(int) (/opt/local/libexec/llvm-20/lib/libLLVM.dylib+0x151628) #3 0x000000019b242de4 (/usr/lib/system/libsystem_platform.dylib+0x180482de4) #4 0x0000000107f638d0 clang::IncrementalCUDADeviceParser::IncrementalCUDADeviceParser(std::__1::unique_ptr<clang::CompilerInstance, std::__1::default_delete<clang::CompilerInstance>>, clang::CompilerInstance&, llvm::IntrusiveRefCntPtr<llvm::vfs::InMemoryFileSystem>, llvm::Error&, std::__1::list<clang::PartialTranslationUnit, std::__1::allocator<clang::PartialTranslationUnit>> const&) (/opt/local/libexec/llvm-20/lib/libclang-cpp.dylib+0x216b8d0) #5 0x0000000107f638d0 clang::IncrementalCUDADeviceParser::IncrementalCUDADeviceParser(std::__1::unique_ptr<clang::CompilerInstance, std::__1::default_delete<clang::CompilerInstance>>, clang::CompilerInstance&, llvm::IntrusiveRefCntPtr<llvm::vfs::InMemoryFileSystem>, llvm::Error&, std::__1::list<clang::PartialTranslationUnit, std::__1::allocator<clang::PartialTranslationUnit>> const&) (/opt/local/libexec/llvm-20/lib/libclang-cpp.dylib+0x216b8d0) #6 0x0000000107f6bac8 clang::Interpreter::createWithCUDA(std::__1::unique_ptr<clang::CompilerInstance, std::__1::default_delete<clang::CompilerInstance>>, std::__1::unique_ptr<clang::CompilerInstance, std::__1::default_delete<clang::CompilerInstance>>) (/opt/local/libexec/llvm-20/lib/libclang-cpp.dylib+0x2173ac8) #7 0x000000010206f8a8 main (/opt/local/libexec/llvm-20/bin/clang-repl+0x1000038a8) #8 0x000000019ae8c274 Segmentation fault: 11 ``` The underlying issue was that the `DeviceCompilerInstance` (used for device-side CUDA compilation) was never initialized with a `Sema`, which is required before constructing the `IncrementalCUDADeviceParser`. `89687e6f38/clang/lib/Interpreter/DeviceOffload.cpp (L32)` `89687e6f38/clang/lib/Interpreter/IncrementalParser.cpp (L31)` Unlike the host-side `CompilerInstance` which runs `ExecuteAction` inside the Interpreter constructor (thereby setting up Sema), the device-side CI was passed into the parser uninitialized, leading to an assertion or crash when accessing its internals. To fix this, I refactored the `Interpreter::create` method to include an optional `DeviceCI` parameter. If provided, we know we need to take care of this instance too. Only then do we construct the `IncrementalCUDADeviceParser`. (cherry picked from commit 21fb19f3b5d572f608e959af895d781b9b24fbbd)	2025-04-29 15:41:00 -07:00
Yingwei Zheng	f4779c3898	[InstCombine] Preserve signbit semantics of NaN with fold to fabs (#136648 ) As per the LangRef and IEEE 754-2008 standard, the sign bit of NaN is preserved if there is no floating-point operation being performed. See also `862e35e25a` for reference. Alive2: https://alive2.llvm.org/ce/z/QYtEGj Closes https://github.com/llvm/llvm-project/issues/136646 (cherry picked from commit 3e1e4062e1e95031c32c0ed9786647ef1a4141aa)	2025-04-29 15:33:30 -07:00
Yingwei Zheng	57a31e183d	[InstCombine] Do not fold logical is_finite test (#136851 ) This patch disables the fold for logical is_finite test (i.e., `and (fcmp ord x, 0), (fcmp u* x, inf) -> fcmp o* x, inf`). It is still possible to allow this fold for several logical cases (e.g., `stripSignOnlyFPOps(RHS0)` does not strip any operations). Since this patch has no real-world impact, I decided to disable this fold for all logical cases. Alive2: https://alive2.llvm.org/ce/z/aH4LC7 Closes https://github.com/llvm/llvm-project/issues/136650. (cherry picked from commit 8abc917fe04140b6c6088a67e0398f637efde808)	2025-04-29 15:31:38 -07:00
Nikita Popov	1cf8c7797d	[GlobalOpt] Do not promote malloc if there are atomic loads/stores (#137158 ) When converting a malloc stored to a global into a global, we will introduce an i1 flag to track whether the global has been initialized. In case of atomic loads/stores, this will result in verifier failures, because atomic ops on i1 are illegal. Even if we changed this to i8, I don't think it is a good idea to change atomic types in that way. Instead, bail out of the transform is we encounter any atomic loads/stores of the global. Fixes https://github.com/llvm/llvm-project/issues/137152. (cherry picked from commit 57530c23a53b5e003d389437637f61c5b9814e22)	2025-04-29 15:22:30 -07:00
Yuval Deutscher	24805c2e08	[lldb] Use correct path for lldb-server executable (#131519 ) Hey, This solves an issue where running lldb-server-20 with a non-absolute path (for example, when it's installed into `/usr/bin` and the user runs it as `lldb-server-20 ...` and not `/usr/bin/lldb-server-20 ...`) fails with `error: spawn_process failed: execve failed: No such file or directory`. The underlying issue is that when run that way, it attempts to execute a binary named `lldb-server-20` from its current directory. This is also a mild security hazard because lldb-server is often being run as root in the directory /tmp, meaning that an unprivileged user can create the file /tmp/lldb-server-20 and lldb-server will execute it as root. (although, well, it's a debugging server we're talking about, so that may not be a real concern) I haven't previously contributed to this project; if you want me to change anything in the code please don't hesitate to let me know. (cherry picked from commit 945c494e2c3c078e26ff521ef3e9455e0ff764ac)	2025-04-29 15:16:48 -07:00
Owen Pan	182e8b7f8a	[clang-format] Correctly annotate kw_operator in using decls (#136545 ) Fix #136541 (cherry picked from commit 037657de7e5ccd4a37054829874a209b82fb8be7)	2025-04-25 16:49:42 -07:00
Luke Lau	425d1aad29	[RISCV] Handle scalarized reductions in getArithmeticReductionCost This fixes a crash reported at https://github.com/llvm/llvm-project/pull/114250#issuecomment-2813686061 If the vector type isn't legal at all, e.g. bfloat with +zvfbfmin, then the legalized type will be scalarized. So use getScalarType() instead of getVectorElement() when checking for f16/bf16. (cherry picked from commit 053451cb3502144564b4d0b30a9046045d1820d4)	2025-04-25 16:46:30 -07:00
Pavel Skripkin	2d7ad98ec0	[clang][analyzer] Fix error path of builtin overflow (#136345 ) According to https://clang.llvm.org/docs/LanguageExtensions.html#checked-arithmetic-builtins, result of builtin_*_overflow functions will be initialized even in case of overflow. Align analyzer logic to docs and always initialize 3rd argument of such builtins. Closes #136292 (cherry picked from commit 060f9556a2f6ef4669f1c2cd8c4a4d76748a440f)	2025-04-25 16:42:52 -07:00
Gedare Bloom	e7ae5532bc	[clang-format] Fix mismatched break in BlockIndent (#124998 ) Near the ColumnLimit a break could be inserted before a right parens with BlockIndent without a break after the matching left parens. Avoid these hanging right parens by disallowing breaks before right parens unless there was a break after the left parens. Fixes #103306 (cherry picked from commit b8734797a3f605c4aaa37fcb5007baa273565460)	2025-04-25 16:39:37 -07:00
David Spickett	8f288eb619	[lldb][test] Adjust TestTargetReadInstructionsFlavor skipIfs Original in https://github.com/llvm/llvm-project/pull/134626 was written as if it was "this or this" but it's "this and this". So the test ran on AArch64 Linux, because Linux is not Windows. Split out the Windows check to fix that.	2025-04-25 16:35:39 -07:00
Ebuka Ezike	d76ec6a75d	[lldb] Fix SBTarget::ReadInstruction with flavor (#134626 ) When you call the `SBTarget::ReadInstructions` with flavor from lldb crashes. This is because the wrong order of the `DisassemblyBytes` constructor this fixes that --------- Signed-off-by: Ebuka Ezike <yerimyah1@gmail.com>	2025-04-25 16:35:39 -07:00
Nathan Ridge	62072e7f87	[clang][AST] Handle implicit first argument in CallExpr::getBeginLoc()	2025-04-25 16:29:08 -07:00
leecheechen	581772ed07	[LoongArch] Don't crash on instruction prefetch intrinsics (#135760 ) Instead of failing to select during isel, drop the intrinsic in lowering. Similar as the X86's PR. Seeing: https://reviews.llvm.org/D151050. Fixes #134624 (cherry picked from commit dfb5b6e27ca3f8b79ebd3346d11b3088c1600b81)	2025-04-25 16:25:42 -07:00
aankit-ca	89adc2d4f9	[HEXAGON] Fix corner cases for hwloops pass (#135439 ) Add check to make sure Dist > 0 or Dist < 0 for appropriate cmp cases to hexagon hardware loops pass. The change modifies the HexagonHardwareLoops pass to add runtime checks to make sure that end_value > initial_value for less than comparisons and end_value < initial_value for greater than comparisons. Fix for https://github.com/llvm/llvm-project/issues/133241 @androm3da @iajbar PTAL --------- Co-authored-by: aankit-quic <aankit@quicinc.com> (cherry picked from commit da8ce56c53fe6e34809ba0b310fa90257e230a89)	2025-04-25 16:22:32 -07:00
Nikita Popov	78f6719ca9	[GlobalMerge][PPC] Don't merge globals in llvm.metadata section (#131801 ) The llvm.metadata section is not emitted and has special semantics. We should not merge globals in it, similarly to how we already skip merging of `llvm.xyz` globals. Fixes https://github.com/llvm/llvm-project/issues/131394. (cherry picked from commit 9356091a98c24718572f99b51553838ed664b67a)	2025-04-25 16:19:10 -07:00
Tom Stellard	ab0074fe30	Bump version to 20.1.4	2025-04-15 17:31:50 -07:00
Alex Rønne Petersen	923a5c4f83	Revert "[ARM][ConstantIslands] Correct MinNoSplitDisp calculation (#114590 )" This reverts commit e48916f615e0ad2b994b2b785d4fe1b8a98bc322. llvmorg-20.1.3	2025-04-15 17:23:49 -07:00
Ikhlas Ajbar	86f5891c59	[llvm][Hexagon] Promote operand v2i1 to v2i32 (#135409 ) Fixes #118879 (cherry picked from commit 32c39092eab3a401d9d028c21f4707102fd70e32)	2025-04-15 17:22:27 -07:00
Louis Dionne	d55c3c2052	[libc++] Fix misplaced _LIBCPP_POP_MACROS (#134874 ) Fixes #134681 (cherry picked from commit 4ab86edb560a2e1152e22700b0cb386759286c37)	2025-04-15 17:20:58 -07:00
Younan Zhang	9420327ad7	[Clang] Fix a lambda pattern comparison mismatch after ecc7e6ce4 (#133863 ) In ecc7e6ce4, we tried to inspect the `LambdaScopeInfo` on stack to recover the instantiating lambda captures. However, there was a mismatch in how we compared the pattern declarations of lambdas: the constraint instantiation used a tailored `getPatternFunctionDecl()` which is localized in SemaLambda that finds the very primal template declaration of a lambda, while `FunctionDecl::getTemplateInstantiationPattern` finds the latest template pattern of a lambda. This difference causes issues when lambdas are nested, as we always want the primary template declaration. This corrects that by moving `Sema::addInstantiatedCapturesToScope` from SemaConcept to SemaLambda, allowing it to use the localized version of `getPatternFunctionDecl`. It is also worth exploring to coalesce the implementation of `getPatternFunctionDecl` with `FunctionDecl::getTemplateInstantiationPattern`. But I’m leaving that for the future, as I’d like to backport this fix (ecc7e6ce4 made the issue more visible in clang 20, sorry!), and changing Sema’s ABI would not be suitable in that regards. Hence, no release note. Fixes https://github.com/llvm/llvm-project/issues/133719 (cherry picked from commit dcc2182bce3d2ef0e0a991664c51b4b3bfcf7197)	2025-04-15 15:53:53 -07:00
Aaron Ballman	4da7285e63	Silence -Wcast-function-type warnings on idiomatic Windows code (#135660 ) On Windows, GetProcAddress() is the API used to dynamically load function pointers (similar to dlsym on Linux). This API returns a function pointer (a typedef named FARPROC), which means that casting from the call to the eventual correct type is technically a function type mismatch on the cast. However, because this is idiomatic code on Windows, we should accept it unless -Wcast-function-type-strict is passed. This was brought up in post-commit review feedback on https://github.com/llvm/llvm-project/pull/86131	2025-04-15 14:31:14 -07:00
Florian Hahn	c5109be53b	[LV] Disable epilogue vectorization for FindLastIV if start is poison. Back-porting properly freezing of the start value during epilogue vectorization (2bdc1a1337) is non-trivial. For the 20.x release, just disable epilogue vectorization for FindLastIV reductions where the start value may be poison or undef. Fixes #126836.	2025-04-15 14:14:15 -07:00
Florian Hahn	91a3f14d94	[LV] Add tests with FindLastIV and epilogue vectorization. Tests for #126836.	2025-04-15 14:14:15 -07:00
Dominik Adamski	2131242240	[LLVM][MemCpyOpt] Unify alias tags if we optimize allocas (#129537 ) Optimization of alloca instructions may lead to invalid alias tags. Incorrect alias tags can result in incorrect optimization outcomes for Fortran source code compiled by Flang with flags: `-O3 -mmlir -local-alloc-tbaa -flto`. This commit removes alias tags when memcpy optimization replaces two arrays with one array, thus ensuring correct compilation of Fortran source code using flags: `-O3 -mmlir -local-alloc-tbaa -flto`. This commit is also a proposal to fix the reported issue: https://github.com/llvm/llvm-project/issues/133984 --------- Co-authored-by: Shilei Tian <i@tianshilei.me> (cherry picked from commit 716b02d8c575afde7af1af13df145019659abca2)	2025-04-15 13:17:35 -07:00
Louis Dionne	86c9853638	[libc++] Fix deployment targets that were incorrectly bumped (#134278 ) When I introduced the various `_LIBCPP_INTRODUCED_IN_LLVM_XY_ATTRIBUTE` macros in 182f5e9b2f03, I tried to correlate them to the right OS versions, but it seems that I made a few mistakes. This wasn't caught in the CI because we don't test back-deployment that far. rdar://148405946 (cherry picked from commit a97f73405f8e074263a0ed2dd2b8c87c014f46d9)	2025-04-15 13:15:42 -07:00
Louis Dionne	dfd6f12336	[libc++] Guard additional headers with _LIBCPP_HAS_LOCALIZATION (#131921 ) There were some remaining headers that were not guarded with _LIBCPP_HAS_LOCALIZATION, leading to errors when trying to use modules on platforms that don't support localization (since all the headers get pulled in when building the 'std' module). This patch brings these headers in line with what we do for every other header that depends on localization. This patch also requires including <picolibc.h> from <__configuration/platform.h> in order to define _NEWLIB_VERSION. In the long term, we should use a better approach for doing that, such as defining a macro in the __config_site header. (cherry picked from commit 4090910a695efcba4b484e9f8ad2b564e9a4e7ed)	2025-04-15 13:14:13 -07:00
Pavel Labath	dc9d4f9a70	[lldb] Respect LaunchInfo::SetExecutable in ProcessLauncherPosixFork (#133093 ) Using argv[0] for this was incorrect. I'm ignoring LaunchInfo::SetArg0, as that's what darwin and windows launchers do (they use the first element of the args vector instead). I picked up the funny unit test re-exec method from the llvm unit tests. (cherry picked from commit 39e7efe1e4304544289d8d1b45f4d04d11b4a791)	2025-04-15 13:11:58 -07:00
ZhaoQi	9c7d728698	[LoongArch] Move fix-tle-le-sym-type test to test/MC. NFC (#133839 ) (cherry picked from commit 46968310cb837e4b32859edef2107080b828b117)	2025-04-15 13:10:22 -07:00
Phoebe Wang	0c30835a63	[X86][AVX10] Remove VAES and VPCLMULQDQ feature from AVX10.1 (#135489 ) According to SDM, they require both VAES/VPCLMULQDQ and AVX10.1 CPUID bits. Fixes: #135394 (cherry picked from commit ebba554a3211b0b98d3ae33ba70f9d6ceaab6ad4)	2025-04-14 12:45:29 -07:00
Wang, Phoebe	2e09664082	[X86] Backport saturate-convert intrinsics renaming & YMM rounding intrinsics removal in AVX10.2 AVX10.2 YMM rounding instructions are removed from latest AVX10 technical paper. Remove all intrinsics from compiler. AVX10.2 saturate-convert intrinsics are modified to use "s_" in the name. It is a consensus made with GCC folks to avoid future ambiguity.	2025-04-14 12:43:46 -07:00
Yingwei Zheng	0dd4235473	[SCEV] Use ashr to adjust constant multipliers (#135534 ) SCEV converts "-2 nsw (i32 V)" into "2148473647 nsw (i32 V)". But we cannot preserve the nsw flag when the constant multiplier is negative. This patch changes lshr to ashr so that we can preserve both nsw and nuw flags. Alive2 proof: https://alive2.llvm.org/ce/z/LZVSEa Closes https://github.com/llvm/llvm-project/issues/135531. (cherry picked from commit bb9580a02b393683ff0b6c360df684f33c715a1f)	2025-04-14 12:38:12 -07:00
Hua Tian	a141e58685	[llvm][CodeGen] avoid repeated interval calculation in window scheduler (#132352 ) Some new registers are reused when replacing some old ones in certain use case of ModuloScheduleExpander. It is necessary to avoid repeated interval calculations for these registers. (cherry picked from commit 7e65944292278cc245e36cc6ca971654d584012d)	2025-04-14 12:34:31 -07:00
Hua Tian	d88cd35023	[llvm][CodeGen] Fix the empty interval issue in Window Scheduler (#129204 ) The interval of newly generated reg in ModuloScheduleExpander is empty. This will cause crash at some corner case. This patch recalculate the live intervals of these regs. (cherry picked from commit b09b9ac1081d19c8021df8e55e96cd1325f0eed0)	2025-04-14 12:34:31 -07:00
Kazu Hirata	73d1e8598e	[CodeGen] Avoid repeated hash lookups (NFC) (#130237 ) (cherry picked from commit 616f277640129ff37d4005737a62acbd60d06ca1)	2025-04-14 12:34:31 -07:00
Mariya Podchishchaeva	7034995f10	[clang] Handle Binary StingLiteral kind in one more place (#132201 ) The bots are upset by https://github.com/llvm/llvm-project/pull/127629 . Fix that.	2025-04-14 12:26:02 -07:00
Mariya Podchishchaeva	2e7710eaff	[clang] Introduce "binary" StringLiteral for #embed data (#127629 ) StringLiteral is used as internal data of EmbedExpr and we directly use it as an initializer if a single EmbedExpr appears in the initializer list of a char array. It is fast and convenient, but it is causing problems when string literal character values are checked because #embed data values are within a range [0-2^(char width)] but ordinary StringLiteral is of maybe signed char type. This PR introduces new kind of StringLiteral to hold binary data coming from an embedded resource to mitigate these problems. The new kind of StringLiteral is not assumed to have signed char type. The new kind of StringLiteral also helps to prevent crashes when trying to find StringLiteral token locations since these simply do not exist for binary data. Fixes https://github.com/llvm/llvm-project/issues/119256	2025-04-14 12:26:02 -07:00
Marc Auberer	e0db588f3d	[IR] Fix assertion error in User new/delete edge case (#129914 ) Fixes #129900 If `operator delete` was called after an unsuccessful constructor call after `operator new`, we ran into undefined behaviour. This was discovered by our malfunction tests while preparing an upgrade to LLVM 20, that explicitly check for such kind of bugs. (cherry picked from commit 8d38906d08f0189a7a7f865b267f47cab0a3790f)	2025-04-11 13:05:07 -07:00
Miro Hrončok	d5bb7b866e	Avoid a race condition in opt-viewer/optrecord (#131214 ) See https://bugzilla.redhat.com/2336915 See https://reviews.llvm.org/D41784?id= See https://github.com/androm3da/optviewer-demo/issues/4#issuecomment-718787822 Fixes https://github.com/llvm/llvm-project/issues/62403. The race condition happened when the demangler_proc was being set. The locking mechanism itself happened too late. This way, the lock always exists (to avoid a race when creating it) and is always used when creating demangler_proc. (cherry picked from commit e0f8898e1d432ab4fdcaf353bf87a1d3cf36491a)	2025-04-11 12:29:56 -07:00
Yingwei Zheng	d15fef4209	[IndVarSimplify] Handle the case where both operands are the same when widening IV (#135207 ) `WidenIV::widenWithVariantUse` assumes that exactly one of the binop operands is the IV to be widened. This miscompilation happens when it tries to sign-extend the "NonIV" operand while the IV is zero-extended. Closes https://github.com/llvm/llvm-project/issues/135182. (cherry picked from commit d14acb78065bdd331019924feaaef52e5e744529)	2025-04-11 11:51:02 -07:00
Stefan Schmidt	91647ae0df	[X86][SSE] Don't emit SSE2 load instructions in SSE1-only mode (#134547 ) This fixes a regression I traced back to `8b43c1be23` / https://github.com/llvm/llvm-project/pull/79000 The regression caused an SSE2 instruction, `movsd`, to be emitted as a replacement for an SSE instruction, `movaps` despite the target potentially not supporting this instruction, such as when building with clang using `-march=pentium3`. Fixes #134607 (cherry picked from commit 08e080ee98832c2aec6f379b04f486bea18730cc)	2025-04-11 11:37:58 -07:00
Owen Pan	d05543ed07	[clang-format] Keep the space between `not` and a unary operator (#135035 ) Also keep the space between `not` and `::`. Based on the [documentation](https://releases.llvm.org/20.1.0/tools/clang/docs/ClangFormatStyleOptions.html#spaceafterlogicalnot), it can be argued that SpaceAfterLogicalNot doesn't cover the alternative operator `not`. Closes #125465 (cherry picked from commit f34483838937b1a01ee11ee22bdd6e13c81e9fff)	2025-04-11 11:33:19 -07:00
Paul Kirth	81220e68a4	[fatlto] Add coroutine passes when using FatLTO with ThinLTO (#134434 ) When coroutines are used w/ both -ffat-lto-objects and -flto=thin, the coroutine passes are not added to the optimization pipelines. Ensure they are added before ModuleOptimization to generate a working ELF object. Fixes #134409. (cherry picked from commit 268c065eab06b81a0d7256ac62c0865b3781e236)	2025-04-11 11:30:09 -07:00
Owen Pan	edb54a7821	Release/20.x: [clang-format] Set C11 instead of C17 for LK_C Backport d71ee7d23048ca64d14a7536927a006867cea39a	2025-04-11 11:17:54 -07:00
Tiezhu Yang	4181e829d1	[LLDB][LoongArch] Fix build errors about NT_LOONGARCH_HW_{BREAK,WATCH} (#126020 ) On some OS distros such as LoongArch Fedora 38 mate-5 [1], there are no macro definitions NT_LOONGARCH_HW_BREAK and NT_LOONGARCH_HW_WATCH in the system header, then there exist some errors when building LLDB on LoongArch. (1) Description of Problem: ``` llvm-project/lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_loongarch64.cpp:529:16: error: 'NT_LOONGARCH_HW_WATCH' was not declared in this scope; did you mean 'NT_LOONGARCH_LBT'? 529 \| int regset = NT_LOONGARCH_HW_WATCH; \| ^~~~~~~~~~~~~~~~~~~~~ \| NT_LOONGARCH_LBT llvm-project/lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_loongarch64.cpp:543:12: error: 'NT_LOONGARCH_HW_BREAK' was not declared in this scope; did you mean 'NT_LOONGARCH_CSR'? 543 \| regset = NT_LOONGARCH_HW_BREAK; \| ^~~~~~~~~~~~~~~~~~~~~ \| NT_LOONGARCH_CSR ``` (2) Steps to Reproduce: ``` git clone https://github.com/llvm/llvm-project.git mkdir -p llvm-project/llvm/build && cd llvm-project/llvm/build cmake .. -G "Ninja" \ -DCMAKE_BUILD_TYPE=Release \ -DLLVM_BUILD_RUNTIME=OFF \ -DLLVM_ENABLE_PROJECTS="clang;lldb" \ -DCMAKE_INSTALL_PREFIX=/usr/local/llvm \ -DLLVM_TARGETS_TO_BUILD="LoongArch" \ -DLLVM_HOST_TRIPLE=loongarch64-redhat-linux ninja ``` (3) Additional Info: Maybe there are no problems on the OS distros with newer glibc devel library, so this issue is related with OS distros. (4) Root Cause Analysis: This is because the related Linux kernel commit [2] was merged in 2023-02-25 and the glibc devel library has some delay with kernel, the glibc version of specified OS distros is not updated in time. (5) Final Solution: One way is to ask the maintainer of OS distros to update glibc devel library, but it is better to not depend on the glibc version. In order to avoid the build errors, just define NT_LOONGARCH_HW_BREAK and NT_LOONGARCH_HW_WATCH in LLDB if there are no these definitions in the system header. By the way, in order to fit within 80 columns, use C++-style comments for the new added NT_LOONGARCH_HW_BREAK and NT_LOONGARCH_HW_WATCH. While at it, for consistency, just modify the current NT_LOONGARCH_LSX and NT_LOONGARCH_LASX to C++-style comments too. [1] https://mirrors.wsyu.edu.cn/fedora/linux/development/rawhide/Everything/loongarch64/iso/livecd-fedora-mate-5.loongarch64.iso [2] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=1a69f7a161a7 Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn> (cherry picked from commit 50ae1c7bf40ba50aaf3132fa869eda8f06648155)	2025-04-11 11:15:57 -07:00
Martin Storsjö	7436329bfe	Revert "[clang] [ARM] Explicitly enable NEON for Windows/Darwin targets (#122095 )" This reverts commit 8fa0f0efce5fb81eb422e6d7eec74c66dafef4a3. This change broke assembling for e.g. "armv7s-apple-darwin" triples, which should enable VFPv4 by default (and did that before this change), but after this change, only NEON/VFPv3 were available. This is being fixed properly in latest git main as part of https://github.com/llvm/llvm-project/pull/130623 (possibly as a split out change), but any proper fix here seems to have too much potential surprises for an existing release branch.	2025-04-11 11:09:26 -07:00
Aaron Puchert	a0c8959cc8	[X86] When expanding LCMPXCHG16B_SAVE_RBX, substitute RBX in base (#134109 ) The pseudo-instruction LCMPXCHG16B_SAVE_RBX is used when RBX serves as frame base pointer. At a very late stage it is then translated into a regular LCMPXCHG16B, preceded by copying the actual argument into RBX, and followed by restoring the register to the base pointer. However, in case the `cmpxchg` operates on a local variable, RBX might also be used as a base for the memory operand in frame finalization, and we've overwritten RBX with the input operand for `cmpxchg16b`. So we have to rewrite the memory operand base to use the saved value of RBX. Fixes #119959. (cherry picked from commit 9e0ca5720bee96f4b19eeb69a119b5eda3ab5528)	2025-04-11 11:06:54 -07:00
Louis Dionne	a8b5fe017a	[libc++] Add missing release note for LLVM 20 about zip_view (#134144 ) We should have had a release note in LLVM 20 about implementing P2165R4 since that is technically an ABI and API break for zip_view. We don't expect anyone to actually hit the ABI issue, but we've come across some (fairly small) breakage due to the API change, so this should at least be mentioned in the release notes. (cherry picked from commit 2cd8edd1fff2a9d82902f70270b4209463a34cba)	2025-04-11 11:00:14 -07:00
Austin Schuh	41aefdbebe	cuda clang: Fix argument order for __reduce_max_sync (#132881 ) Fixes: https://github.com/llvm/llvm-project/issues/131415 --------- Signed-off-by: Austin Schuh <austin.linux@gmail.com> (cherry picked from commit 2d1517d257fcbd0c9bce14badc7646e94d81ea2b)	2025-04-11 10:58:46 -07:00

1 2 3 4 5 ...

525882 Commits