llvm-project

Author	SHA1	Message	Date
YAMAMOTO Takashi	9247be8ca0	[lld][WebAssembly] Do not relocate ABSOLUTE symbols (#153763 ) Fixes https://github.com/llvm/llvm-project/issues/153759	2025-08-18 21:56:48 -07:00
Timm Baeder	25c137e43b	[clang][bytecode] Save a per-block dynamic allocation ID (#154094 ) This fixes an old todo item about wrong allocation counting and some diagnostic differences.	2025-08-19 06:52:21 +02:00
Sergei Barannikov	f84ce1e1d0	[TableGen][DecoderEmitter] Extract a couple of loop invariants (NFC)	2025-08-19 07:47:15 +03:00
Craig Topper	da19383ae7	[RISCV] Fold (X & -4096) == 0 -> (X >> 12) == 0 (#154233 ) This is a more general form of the recently added isel pattern (seteq (i64 (and GPR:$rs1, 0x8000000000000000)), 0) -> (XORI (i64 (SRLI GPR:$rs1, 63)), 1) We can use a shift right for any AND mask that is a negated power of 2. But for every other constant we need to use seqz instead of xori. I don't think there is a benefit to xori over seqz as neither are compressible. We already do this transform from target independent code when the setcc constant is a non-zero subset of the AND mask that is not a legal icmp immediate. I don't believe any of these patterns comparing MSBs to 0 are canonical according to InstCombine. The canonical form is (X < 4096). I'm curious if these appear during SelectionDAG and if so, how. My goal here was just to remove the special case isel patterns.	2025-08-18 21:24:35 -07:00
Craig Topper	2817873082	[RISCV] Fold (sext_inreg (setcc), i1) -> (sub 0, (setcc). (#154206 ) This helps the 3 vendor extensions that make sext_inreg i1 legal. I'm delaying this until after LegalizeDAG since we normally have sext_inreg i1 up until LegalizeDAG turns it into and+neg. I also delayed the recently added (sext_inreg (xor (setcc), -1), i1) combine. Though the xor isn't likely to appear before LegalizeDAG anyway.	2025-08-18 21:24:03 -07:00
Anutosh Bhat	00ffd8b8aa	[lldb] Fix error : unknown error while starting lldb's C/C++ repl (#153560 ) Fixes https://github.com/llvm/llvm-project/issues/153157 The proposed solution has been discussed here (https://github.com/llvm/llvm-project/issues/153157#issue-3313379242) This is what we would be seeing now ``` base) anutosh491@Anutoshs-MacBook-Air bin % ./lldb /Users/anutosh491/work/xeus-cpp/a.out (lldb) target create "/Users/anutosh491/work/xeus-cpp/a.out" Current executable set to '/Users/anutosh491/work/xeus-cpp/a.out' (arm64). (lldb) b main Breakpoint 1: where = a.out`main, address = 0x0000000100003f90 (lldb) r Process 71227 launched: '/Users/anutosh491/work/xeus-cpp/a.out' (arm64) Process 71227 stopped * thread #1, queue = 'com.apple.main-thread', stop reason = breakpoint 1.1 frame #0: 0x0000000100003f90 a.out`main a.out`main: -> 0x100003f90 <+0>: sub sp, sp, #0x10 0x100003f94 <+4>: str wzr, [sp, #0xc] 0x100003f98 <+8>: str w0, [sp, #0x8] 0x100003f9c <+12>: str x1, [sp] (lldb) expression --repl -l c -- 1> 1 + 1 (int) $0 = 2 2> 2 + 2 (int) $1 = 4 ``` ``` base) anutosh491@Anutoshs-MacBook-Air bin % ./lldb /Users/anutosh491/work/xeus-cpp/a.out (lldb) target create "/Users/anutosh491/work/xeus-cpp/a.out" Current executable set to '/Users/anutosh491/work/xeus-cpp/a.out' (arm64). (lldb) b main Breakpoint 1: where = a.out`main, address = 0x0000000100003f90 (lldb) r Process 71355 launched: '/Users/anutosh491/work/xeus-cpp/a.out' (arm64) Process 71355 stopped * thread #1, queue = 'com.apple.main-thread', stop reason = breakpoint 1.1 frame #0: 0x0000000100003f90 a.out`main a.out`main: -> 0x100003f90 <+0>: sub sp, sp, #0x10 0x100003f94 <+4>: str wzr, [sp, #0xc] 0x100003f98 <+8>: str w0, [sp, #0x8] 0x100003f9c <+12>: str x1, [sp] (lldb) expression --repl -l c -- 3 + 3 Warning: trailing input is ignored in --repl mode 1> 1 + 1 (int) $0 = 2 ```	2025-08-19 09:51:20 +05:30
Luke Lau	144736b07e	[VPlan] Don't fold live ins with both scalar and vector operands (#154067 ) If we end up with a extract_element VPInstruction where both operands are live-ins, we will try to fold the live-ins even though the first operand is a vector whilst the live-in is scalar. This fixes it by just returning the vector live-in instead of calling the folder, and removes the handling for insertelement where we aren't able to do the fold. From some quick testing we previously never hit this fold anyway, and were probably just missing test coverage. Fixes #154045	2025-08-19 04:10:53 +00:00
Md Asghar Ahmad Shahid	c24c23d9ab	[NFC][mlir][vector] Handle potential static cast assertion. (#152957 ) In FoldArithToVectorOuterProduct pattern, static cast to vector type causes assertion when a scalar type was encountered. It seems the author meant to have a dyn_cast instead. This NFC patch handles it by using dyn_cast.	2025-08-19 09:27:20 +05:30
Henrik G. Olsson	e1ff432eb6	Reland "[Utils] Add new --update-tests flag to llvm-lit" (#153821 ) This reverts commit `e495231238` to reland the --update-tests feature, originally landed in https://github.com/llvm/llvm-project/pull/108425.	2025-08-18 20:24:27 -07:00
Sergei Barannikov	c8c2218c00	[TableGen][DecoderEmitter] Synthesize decoder table name in emitTable (#154255 ) Previously, HW mode name was appended to decoder namespace name when enumerating encodings, and then emitTable appended the bit width to it to form the final table name. Let's do this all in one place. A nice side effect is that this allows us to avoid having to deal with std::string. The changes in the tests are caused by the different order of tables.	2025-08-19 06:19:54 +03:00
Shilei Tian	1f6e13a161	Revert "[AMDGPU][Attributor] Infer inreg attribute in `AMDGPUAttributor` (#146720 )" This reverts commit 84ab301554f8b8b16b94263a57b091b07e9204f2 because it breaks several AMDGPU test bots.	2025-08-18 22:59:52 -04:00
Sudharsan Veeravalli	8495018a85	[RISCV] Use sd_match in trySignedBitfieldInsertInMask (#154152 ) This keeps everything in APInt and makes it easier to understand and maintain.	2025-08-19 08:22:06 +05:30
slavek-kucera	5b55899781	[clangd] Clangd running with `--experimental-modules-support` crashes when the compilation database is unavailable (#153802 ) fixes llvm/llvm-project#132413	2025-08-19 10:19:13 +08:00
Shilei Tian	84ab301554	[AMDGPU][Attributor] Infer inreg attribute in `AMDGPUAttributor` (#146720 ) This patch introduces `AAAMDGPUUniformArgument` that can infer `inreg` function argument attribute. The idea is, for a function argument, if the corresponding call site arguments are always uniform, we can mark it as `inreg` thus pass it via SGPR. In addition, this AA is also able to propagate the inreg attribute if feasible.	2025-08-18 22:01:47 -04:00
Iris Shi	cc68e45343	[CIR] Implement codegen for inline assembly without input and output operands (#153546 ) - Part of #153267 https://github.com/llvm/clangir/blob/main/clang/lib/CIR/CodeGen/CIRAsm.cpp	2025-08-18 18:54:24 -07:00
ZhaoQi	be3fd6ae25	[LoongArch] Use section-relaxable check instead of relax feature from STI (#153792 ) In some cases, such as using `lto` or `llc`, relax feature is not available from this `SubtargetInfo` (`LoongArchAsmBackend` is instantiated too early), causing loss of relocations. This commit modifiy the condition to check whether the section which contains the two symbols is relaxable. If not relaxable, no need to record relocations.	2025-08-19 09:48:51 +08:00
Ye Tian	db843e5d09	[DAG] Add ISD::FP_TO_SINT_SAT/FP_TO_UINT_SAT handling to SelectionDAG::canCreateUndefOrPoison (#154244 ) Related to https://github.com/llvm/llvm-project/issues/153366	2025-08-19 10:45:11 +09:00
Jianjian Guan	1eb5b18a04	[mlir][emitc] Support dense as init value for ShapedType (#144826 )	2025-08-19 09:41:15 +08:00
Matt Arsenault	19ebfa6d0b	RuntimeLibcalls: Move exception call config to tablegen (#151948 ) Also starts pruning out these calls if the exception model is forced to none. I worked backwards from the logic in addPassesToHandleExceptions and the pass content. There appears to be some tolerance for mixing and matching exception modes inside of a single module. As far as I can tell _Unwind_CallPersonality is only relevant for wasm, so just add it there. As usual, the arm64ec case makes things difficult and is missing test coverage. The set of calls in list form is necessary to use foreach for the duplication, but in every other context a dag is more convenient. You cannot use foreach over a dag, and I haven't found a way to flatten a dag into a list. This removes the last manual setLibcallImpl call in generic code.	2025-08-19 10:35:59 +09:00
Matt Arsenault	fe67267d19	MSP430: Move __mspabi_mpyll calling conv config to tablegen (#153988 ) There are several libcall choices for MUL_I64 which depend on the subtarget, but this is the base case. The manual custom ISelLowering is still overriding the decision until we have a way to control lowering choices, but we can still get the calling convention set for now.	2025-08-19 10:25:10 +09:00
Helena Kotas	eb3d88423d	[HLSL] Global resource arrays element access (#152454 ) Adds support for accessing individual resources from fixed-size global resource arrays. Design proposal: https://github.com/llvm/wg-hlsl/blob/main/proposals/0028-resource-arrays.md Enables indexing into globally scoped, fixed-size resource arrays to retrieve individual resources. The initialization logic is primarily handled during codegen. When a global resource array is indexed, the codegen translates the `ArraySubscriptExpr` AST node into a constructor call for the corresponding resource record type and binding. To support this behavior, Sema needs to ensure that: - The constructor for the specific resource type is instantiated. - An implicit binding attribute is added to resource arrays that lack explicit bindings (#152452). Closes #145424	2025-08-18 18:20:46 -07:00
Mariusz Borsa	0c512f7897	[Sanitizers][Test] narrower constraint for XFAIL (#154245 ) It's a followup to https://github.com/llvm/llvm-project/pull/154189 , which broke test on android bot. Making sure XFAIL only happen on darwin bots rdar://158543555 Co-authored-by: Mariusz Borsa <m_borsa@apple.com>	2025-08-18 18:17:25 -07:00
ZhaoQi	3acb7093c2	[LoongArch][NFC] Add tests for fixing missed addsub relocs when enabling relax (#154108 )	2025-08-19 09:15:23 +08:00
Lang Hames	ee7a6a45bd	[ORC-RT] Initial check-in for a new, top-level ORC runtime project. (#113499 ) Includes CMake files and placeholder header, library, test tool, regression test and unit test. The aim for this project is to create a replacement for the existing ORC Runtime that currently resides in `llvm-project/compiler-rt/lib/orc`. The new project will provide a superset of the original features, and the old runtime will be removed once the new runtime is sufficiently developed. See discussion at https://discourse.llvm.org/t/rfc-move-orc-executor-support-into-top-level-project/81049	2025-08-19 10:56:18 +10:00
Mel Chen	1dac302ce7	[LV] Explicitly disallow interleaved access requiring gap mask for scalable VFs. nfc (#154122 ) Currently, VPInterleaveRecipe::execute does not support generating LLVM IR for interleaved accesses that require a gap mask for scalable VFs. It would be better to detect and prevent such groups from being vectorized as interleaved accesses in LoopVectorizationCostModel::interleavedAccessCanBeWidened, rather than relying on the TTI function getInterleavedMemoryOpCost to return an invalid cost.	2025-08-19 08:42:39 +08:00
Jordan Rupprecht	fb4450c459	[bazel] Add more load statements for C++ rules (#154207 ) Same thing as #149584, but for more elements of cc_rules (e.g. `CcInfo`), and applying it to some files that were added since then (build files under mlir/examples). Command: `buildifier --lint=fix --warnings=native-cc-binary,native-cc-import,native-cc-library,native-cc-objc-import,native-cc-objc-library,native-cc-shared-library,native-cc-test,native-cc-toolchain,native-cc-toolchain-suite,native-cc-fdo-prefetch-hints,native-cc-fdo-profile,native-cc-memprof-profile,native-cc-propeller-optimize,native-cc-common,native-cc-debug-package-info,native-cc-info,native-cc-shared-library-info,native-cc-shared-library-hint-info`	2025-08-18 19:41:49 -05:00
Muhammad Bassiouni	523c3a0197	[libc][math] fix coshf16 build errors. (#154226 )	2025-08-19 03:39:24 +03:00
Oliver Hunt	62d2a8e682	[NFC][Clang][Docs] Update Pointer Authentication documentation (#152596 ) This updates the pointer authentication documentation to include a complete description of the existing functionaliy and behaviour, details of the more complex aspects of the semantics and security properties, and the Apple arm64e ABI design. Co-authored-by: Ahmed Bougacha Co-authored-by: Akira Hatanaka Co-authored-by: John Mccall --------- Co-authored-by: Ahmed Bougacha <ahmed@bougacha.org> Co-authored-by: Akira Hatanaka <ahatanak@gmail.com> Co-authored-by: John Mccall <rjmccall@apple.com>	2025-08-18 17:27:36 -07:00
Craig Topper	6c0518a88f	[RISCV] Prioritize zext.h/zext.w over XTheadBb th.extu. (#154186 ) Fixes #154125.	2025-08-18 16:56:57 -07:00
Craig Topper	396dfdf7be	[RISCV] Remove -O0 from rv32xandesperf.ll. NFC (#154224 ) Invert icmp conditions in test input to make the tests generate the expected branch instructions.	2025-08-18 16:56:38 -07:00
Wenju He	a450dc80bf	[libclc] Implement __clc_get_local_size/__clc_get_max_sub_group_size for amdgcn (#153785 ) This simplifies downstream refactoring of libspirv workitem function in https://github.com/intel/llvm/tree/sycl/libclc/libspirv/lib/generic	2025-08-19 07:51:17 +08:00
Jesse Schwartzentruber	4d2288d318	[compiler-rt] [test] Add test for frame counter out of order. (#154190 ) Add test for #148278. This was written with the aide of ChatGPT 5 and tested on Linux x86_64.	2025-08-18 16:22:57 -07:00
Stanislav Mekhanoshin	3ef3b30c3c	Revert "[AMDGPU] Fold copies of constant physical registers into their uses (#154183 )" (#154219 ) This reverts commit 3395676a18ab580f21ebcd4324feaf1294a8b6d9. Fails libc/test/src/string/libc.test.src.string.memmove_test.__hermetic__	2025-08-18 16:22:47 -07:00
Florian Mayer	19474672e5	[UBSan] [min-rt] make minimal runtime handlers weak (#154220 ) This allows people to override them, if they want.	2025-08-18 16:19:31 -07:00
Muhammad Bassiouni	2c79dc1082	[libc][math] Refactor coshf16 implementation to header-only in src/__support/math folder. (#153582 ) Part of #147386 in preparation for: https://discourse.llvm.org/t/rfc-make-clang-builtin-math-functions-constexpr-with-llvm-libc-to-support-c-23-constexpr-math-functions/86450	2025-08-19 02:08:07 +03:00
Andy Kaylor	7ac4d9bd53	[CIR] Add support for calling virtual functions (#153893 ) This change adds support for calling virtual functions. This includes adding the cir.vtable.get_virtual_fn_addr operation to lookup the address of the function being called from an object's vtable.	2025-08-18 15:56:33 -07:00
Sergei Barannikov	61a859bf6f	Use llvm::copy instead of append_range to work around MacOS build failure	2025-08-19 01:43:22 +03:00
Mariusz Borsa	4dc32df3ca	[Sanitizers][Test] XFAIL suppressions/fread_fwrite (#154189 ) These tests are failing on current macOS version installed on Apple bot rdar://158543555 Co-authored-by: Mariusz Borsa <m_borsa@apple.com>	2025-08-18 15:37:06 -07:00
Mariusz Borsa	0d9346e5e9	[Sanitizers][Test] XFAIL array cookie tests on arm (#154031 ) Poisoning of C++ array redzones is only implemented for x86 CPUs. New arm64 bots are being brought up, where these test fail. Original C++ arrays shadow poisoning change: http://reviews.llvm.org/D4774 rdar://158025391 Co-authored-by: Mariusz Borsa <m_borsa@apple.com>	2025-08-18 15:36:24 -07:00
Sergei Barannikov	0cd4ae9be0	Reland "[TableGen][DecoderEmitter] Store HW mode ID instead of name (NFC) (#154052 )" (#154212 ) This reverts commit 5612dc533a9222a0f5561b2ba7c897115f26673f. Reland with MacOS build fixed.	2025-08-18 22:28:20 +00:00
Stanislav Mekhanoshin	a26c3e9491	[AMDGPU] User SGPR count increased to 32 on gfx1250 (#154205 )	2025-08-18 15:04:56 -07:00
Daniel Paoliello	a4cff34f3f	[win][x64] Permit lea to adjust the stack when using unwind v2 (#154171 ) In some cases `leaq` may be used to adjust the stack in an epilog, this is permitted by unwind v2 and shouldn't raise an error.	2025-08-18 15:00:59 -07:00
Peter Klausler	50a40738d6	[flang] Catch semantic error with LBOUND/UBOUND (#154184 ) The "ARRAY=" argument to these intrinsics cannot be scalar, whether "DIM=" is present or not. (Allowing the "ARRAY=" argument to be scalar when "DIM=" is absent would be a conceivable extension returning an empty result array, like SHAPE() does with extents, but it doesn't seem useful in a programming language without compilation-time rank polymorphism apart from assumed-rank dummy arguments, and those are supported.) Fixes https://github.com/llvm/llvm-project/issues/154044.	2025-08-18 14:45:38 -07:00
Peter Klausler	638f8636df	[flang][runtime] Account for missing READ(SIZE=) characters (#153967 ) One of the two formatted real input paths was failing to call GotChar() to account for the characters that it consumes. Fixes https://github.com/llvm/llvm-project/issues/153830.	2025-08-18 14:45:14 -07:00
Peter Klausler	9a7a16c8d5	[flang][runtime] Allow child NAMELIST input to advance records (#153963 ) NAMELIST input in child I/O is rare, and it's not clear in the standard whether it should be allowed to advance to later records in the parent unit. But GNU Fortran supports it, and there's no good reason not to do so since a NAMELIST input group that isn't terminated on the same line is otherwise going to be a fatal error. Fixes https://github.com/llvm/llvm-project/issues/153416.	2025-08-18 14:44:48 -07:00
Peter Klausler	ffec266980	[flang][runtime] Catch bad OPEN specifiers for unformatted files (#153707 ) When an OPEN statement has specifiers that are allowed only for formatted files, detect an error when the file turns out to be unformatted. Fixes https://github.com/llvm/llvm-project/issues/153480.	2025-08-18 14:44:23 -07:00
Peter Klausler	c53792b278	[flang][runtime] OPEN(existingUnit,POSITION=) (#153688 ) Ensure that when a connected unit is reopened with POSITION='REWIND' or 'APPEND', and a STATUS='OLD' or unspecified, that it is actually repositioned as requested. Fixes https://github.com/llvm/llvm-project/issues/153426.	2025-08-18 14:44:00 -07:00
Peter Klausler	2cf982c0f5	[flang] Don't duplicate impure function call for UBOUND() (#153648 ) Because the per-dimension information in a descriptor holds an extent and a lower bound, but not an upper bound, the calculation of the upper bound sometimes requires that the extent and lower bound be extracted from a descriptor and added together, minus 1. This shouldn't be attempted when the NamedEntity of the descriptor is something that shouldn't be duplicated and used twice; specifically, it shouldn't apply to NamedEntities containing references to impure functions as parts of subscript expressions. Fixes https://github.com/llvm/llvm-project/issues/153031.	2025-08-18 14:43:13 -07:00
Matthias Braun	48232594a0	llvm-profgen: Options cleanup / fixes (#147632 ) - Add `cl::cat(ProfGenCategory)` to non-hidden options so they show up in `--help` output. - Introduce `Options.h` for options referenced in multiple files.	2025-08-18 21:42:55 +00:00
Peter Klausler	50b55a5ee9	[flang][runtime] Fix AllocateAssignmentLHS for monomorphic LHS (#153073 ) When the left-hand side of an assignment statement is an allocatable that has a monomorphic derived type, and the right-hand side of the assignment has a type that is an extension of that type, don't change the incoming type or element size of the descriptor before allocating it. Fixes https://github.com/llvm/llvm-project/issues/152758.	2025-08-18 14:42:16 -07:00

1 2 3 4 5 ...

549069 Commits