llvm-project

Author	SHA1	Message	Date
Tom Stellard	e6c3289804	[CMake][Release] Disable PGO (#88465 ) (#89000 ) Running the test-release.sh script with PGO enabled causes build errors like: ld.lld: error: Function Import: link error: linking module flags 'ProfileSummary': IDs have conflicting values I believe this is a build system bug due to the PGO profile data being generated unconditionally. If you run `ninja check-all` and then `ninja install` like we do in test-release.sh, then the profile data is regenerated during `ninja install` and some of the clang tools which are not test dependencies get built during the ninja install step with different profile data. When these tools link against the LLVM libraries, like libSupport, we end up with these errors. (cherry picked from commit 0d2bb7f017f13ceae793fab7d83d3e67e8d8d8f8) llvmorg-18.1.4	2024-04-16 17:21:15 -07:00
Cinhi Young	028e425f86	[MIPS] Fix the opcode of max.fmt and mina.fmt (#85609 ) - The opcode of the mina.fmt and max.fmt is documented wrong, the object code compiled from the same assembly with LLVM behaves differently than one compiled with GCC and Binutils. - Modify the opcodes to match Binutils. The actual opcodes are as follows (row: bits {5,3} of func = 010; columns: bits {2,0} of func): 100 = min, 101 = mina, 110 = max, 111 = maxa. (cherry picked from commit 8b859c6e4a8e9ab9969582267bbdc04ed6bfa535)	2024-04-16 17:07:04 -07:00
Giuliano Belinassi	e3c832b37b	Fix override keyword being print to the left side Previously, the `override` keyword in C++ was being printed on the left side of a method declaration, which the C++ standard does not support. This commit fixes that by setting the `CanPrintOnLeft` field to 0, forcing it to be printed on the right side of the declaration. Signed-off-by: Giuliano Belinassi <gbelinassi@suse.de>	2024-04-16 16:09:49 -07:00
Nikita Popov	1deeee3f5d	Revert "[Mips] Fix missing sign extension in expansion of sub-word atomic max (#77072 )" These changes caused correctness regressions observed in Rust, see https://github.com/llvm/llvm-project/pull/77072#issuecomment-2049009507. This reverts commit 0e501dbd932ef1c6f4e747c83bf33beef0a09ecf. This reverts commit fbb27d16fa12aa595cbd20a1fb5f1c5b80748fa4.	2024-04-16 14:42:59 -07:00
Martin Storsjö	995539ce05	[LLD] [COFF] Don't add pseudo relocs for dangling references (#88487 ) When doing GC, we normally won't have dangling references, because such a reference would keep the other section alive, keeping it from being eliminated. However, references within DWARF sections are ignored for the purposes of GC (because otherwise, they would essentially keep everything alive, defeating the point of the GC), see c579a5b1d92a9bc2046d00ee2d427832e0f5ddec for more context. Therefore, dangling relocations against discarded symbols are ignored within DWARF sections (see maybeReportRelocationToDiscarded in Chunks.cpp). Consequently, we also shouldn't create any pseudo relocations for these cases, as we run into a null pointer dereference when trying to generate the pseudo relocation info for it. This fixes the downstream bug https://github.com/mstorsjo/llvm-mingw/issues/418, fixing crashes on combinations with -ffunction-sections, -fdata-sections, -Wl,--gc-sections and debug info. (cherry picked from commit 9c970d5ecd6a85188cd2b0a941fcd4d60063ef81)	2024-04-16 14:40:47 -07:00
Louis Dionne	db67e6fb9a	[libc++] Fix -Wgnu-include-next in stddef.h (#88214 ) As reported in #86843, we must have #pragma GCC system_header before we use #include_next, otherwise the compiler may not understand that we're in a system header and may issue a diagnostic for our usage of (cherry picked from commit 3c4b673af05f53e8a4d1a382b5c86367ea512c9e)	2024-04-16 14:36:31 -07:00
XChy	647fbc7108	[SelectionDAG] Prevent combination on inconsistent type in `combineCarryDiamond` (#84888 ) Fixes #84831 When matching carry pattern with `getAsCarry`, it may produce different type of carryout. This patch checks such case and does early exit. I'm new to DAG, any suggestion is appreciated. (cherry picked from commit cb4453dc69d75064c9a82e9a6a9bf0d0ded4b204)	2024-04-16 12:43:13 -07:00
Cyrill Leutwiler	eaae766a20	[RISCV] Support rv{32, 64}e in the compiler builtins (#88252 ) Register spills (save/restore) in RISC-V embedded work differently because there are less registers and different stack alignment. [GCC equivalent ](https://github.com/gcc-mirror/gcc/blob/master/libgcc/config/riscv/save-restore.S#L298C16-L336) Follow up from #76777. --------- Signed-off-by: xermicus <cyrill@parity.io> (cherry picked from commit bd32aaa8c9ec2094f605315b3989adc2a567ca98)	2024-04-15 16:18:14 -07:00
Tom Stellard	c24b41d71f	github-upload-release.py: Fix bug preventing release creation (#84571 ) After aa02002491333c42060373bc84f1ff5d2c76b4ce we started passing the user name to the create_release function and this was being interpreted as the git tag. (cherry picked from commit 0b9ce71a256d86c08f2b52ad2e337395b8f54b41)	2024-04-15 16:13:26 -07:00
Alexandre Ganea	c837970dd7	[Codegen][X86] Fix /HOTPATCH with clang-cl and inline asm (#87639 ) This fixes an edge case where functions starting with inline assembly would assert while trying to lower that inline asm instruction. After this PR, for now we always add a no-op (xchgw in this case) without considering the size of the next inline asm instruction. We might want to revisit this in the future. This fixes Unreal Engine 5.3.2 compilation with clang-cl and /HOTPATCH. Should close https://github.com/llvm/llvm-project/issues/56234 (cherry picked from commit ec1af63dde58c735fe60d6f2aafdb10fa93f410d)	2024-04-15 16:02:49 -07:00
Yingwei Zheng	d0ddcce21d	[InstSimplify] Make sure the simplified value doesn't generate poison in threadBinOpOverSelect (#87075 ) Alive2: https://alive2.llvm.org/ce/z/y_Jmdn Fix https://github.com/llvm/llvm-project/issues/87042. (cherry picked from commit 3197f9d8b0efc3efdc531421bd11c16305d9b1ff)	2024-04-15 16:00:46 -07:00
Daniel Paoliello	4056cc29df	Prepend all library intrinsics with `#` when building for Arm64EC (#87542 ) While attempting to build some Rust code, I was getting linker errors due to missing functions that are implemented in `compiler-rt`. Turns out that when `compiler-rt` is built for Arm64EC, all its function names are mangled with the leading `#`. This change removes the hard-coded list of library-implemented intrinsics to mangle for Arm64EC, and instead assumes that they all must be mangled.	2024-04-15 15:57:42 -07:00
Alexey Bataev	6e071cf305	[SLP]Fix a crash if the argument of call was affected by minbitwidth analysis. Need to support proper type conversion for function arguments to avoid compiler crash.	2024-04-15 15:33:32 -07:00
Brian Cain	d89da2ac88	[libcxx] coerce formatter precision to int (#87738 ) __precision_ is declared as an int32_t which on some hexagon platforms is defined as a long. This change fixes errors like the ones below: In file included from /local/mnt/workspace/hex/llvm-project/libcxx/test/libcxx/diagnostics/format.nodiscard_extensions.compile.pass.cpp:19: In file included from /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/format:202: In file included from /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__format/format_functions.h:29: /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__format/formatter_floating_point.h:700:17: error: no matching function for call to 'max' 700 \| int __p = std::max(1, (__specs.__has_precision() ? __specs.__precision_ : 6)); \| ^~~~~~~~ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__format/formatter_floating_point.h:771:25: note: in instantiation of function template specialization 'std::__formatter::__format_floating_point<float, char, std::format_context>' requested here 771 \| return __formatter::__format_floating_point(__value, __ctx, __parser_.__get_parsed_std_specifications(__ctx)); \| ^ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__format/format_functions.h:284:42: note: in instantiation of function template specialization 'std::__formatter_floating_point<char>::format<float, std::format_context>' requested here 284 \| __ctx.advance_to(__formatter.format(__arg, __ctx)); \| ^ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__format/format_functions.h:429:15: note: in instantiation of function template specialization 'std::__vformat_to<std::back_insert_iterator<std::string>, char, std::back_insert_iterator<std::__format::__output_buffer<char>>>' requested here 429 \| return std::__vformat_to(std::move(__out_it), __fmt, __args); \| ^ 
/local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__format/format_functions.h:462:8: note: in instantiation of function template specialization 'std::vformat_to<std::back_insert_iterator<std::string>>' requested here 462 \| std::vformat_to(std::back_inserter(__res), __fmt, __args); \| ^ /local/mnt/workspace/hex/llvm-project/libcxx/test/libcxx/diagnostics/format.nodiscard_extensions.compile.pass.cpp:29:8: note: in instantiation of function template specialization 'std::vformat<void>' requested here 29 \| std::vformat("", std::make_format_args()); \| ^ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__algorithm/max.h:35:1: note: candidate template ignored: deduced conflicting types for parameter '_Tp' ('int' vs. 'int32_t' (aka 'long')) 35 \| max(_LIBCPP_LIFETIMEBOUND const _Tp& __a, _LIBCPP_LIFETIMEBOUND const _Tp& __b) { \| ^ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__algorithm/max.h:43:1: note: candidate template ignored: could not match 'initializer_list<_Tp>' against 'int' 43 \| max(initializer_list<_Tp> __t, _Compare __comp) { \| ^ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__algorithm/max.h:48:86: note: candidate function template not viable: requires single argument '__t', but 2 arguments were provided 48 \| _LIBCPP_NODISCARD_EXT inline _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_SINCE_CXX14 _Tp max(initializer_list<_Tp> __t) { \| ^ ~~~~~~~~~~~~~~~~~~~~~~~~~ /local/mnt/workspace/hex/obj_runtimes_hex88_qurt_v75_ON_ON_shared/include/c++/v1/__algorithm/max.h:29:1: note: candidate function template not viable: requires 3 arguments, but 2 were provided 29 \| max(_LIBCPP_LIFETIMEBOUND const _Tp& __a, _LIBCPP_LIFETIMEBOUND const _Tp& __b, _Compare __comp) { \| ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ (cherry picked from commit e1830f586ac4c504f632bdb69aab49234256e899)	2024-04-15 15:31:46 -07:00
Koakuma	b6ebea7972	[SPARC] Implement L and H inline asm argument modifiers (#87259 ) This adds support for using the L and H argument modifiers for twinword operands in inline asm code, such as in: ``` %1 = tail call i64 asm sideeffect "rd %pc, ${0:L} ; srlx ${0:L}, 32, ${0:H}", "={o4}"() ``` This is needed by the Linux kernel. (cherry picked from commit 697dd93ae30f489e5bcdac74c2ef2d876e3ca064)	2024-04-10 15:20:57 -07:00
Louis Dionne	bffecba7ce	[libc++] Simplify the implementation of <stddef.h> (#86843 ) Libc++'s own <stddef.h> is complicated by the need to handle various platform-specific macros and to support duplicate inclusion. In reality, we only need to add a declaration of nullptr_t to it, so we can simply include the underlying <stddef.h> outside of our guards to let it handle re-inclusion itself. (cherry picked from commit 2950283dddab03c183c1be2d7de9d4999cc86131)	2024-04-10 15:09:27 -07:00
Schuyler Eldridge	9899a2d76c	[lit][ci] Publish lit wheels (#88072 ) Add wheel publishing in addition to existing source distribution publishing of lit. Fixes #63369. This also uses the exact fix proposed by @EFord36 in #63369. Signed-off-by: Schuyler Eldridge <schuyler.eldridge@sifive.com> (cherry picked from commit 8a8ab8f70cbb5507d1aa55efcd9c6e61ad4e891c)	2024-04-10 15:08:01 -07:00
Owen Pan	3ceccbdb19	[clang-format] Correctly annotate braces of empty ctors/dtors (#82097 ) Also reformat Polly. Fixes #79834. (cherry picked from commit 8de230093f585b64fcd642b46e6560131e95b394)	2024-04-10 15:00:37 -07:00
Ian Anderson	429d628725	[Headers] Don't declare unreachable() from stddef.h in C++ (#86748 ) Even if __need_unreachable is set, stddef.h should not declare unreachable() in C++ because it conflicts with the declaration in \<utility>. (cherry picked from commit df69a305253f1d1b4a4066055a07101a4cc03e55)	2024-04-10 14:59:00 -07:00
Craig Topper	feba8727f8	[ConstantRange] Fix off by 1 bugs in UIToFP and SIToFP handling. (#86041 ) We were passing the min and max values of the range to the ConstantRange constructor, but the constructor expects the upper bound to be 1 more than the max value so we need to add 1. We also need to use getNonEmpty so that passing 0, 0 to the constructor creates a full range rather than an empty range. And passing smin, smax+1 doesn't cause an assertion. I believe this fixes at least some of the reason #79158 was reverted. (cherry picked from commit 12836467b76c56872b4c22a6fd44bcda696ea720)	2024-04-10 14:55:38 -07:00
Craig Topper	e4259b583c	[Float2Int] Pre-commit test for SIToFP/UIToFP ConstantRange bug. NFC The range for these operations is being constructed without the maximum value for the range due to an incorrect usage of the ConstantRange constructor. This causes Float2Int to think the range for 'uitofp i1' only contains 0 instead of 0 and 1. (cherry picked from commit 6295e677220bb6ec1fa8abe2f4a94b513b91b786)	2024-04-10 14:55:38 -07:00
Tom Stellard	daca56d8e1	Bump version to 18.1.4 (#87715 )	2024-04-10 14:18:52 -07:00
Mariya Podchishchaeva	c13b7485b8	[clang] Avoid -Wshadow warning when init-capture named same as class field (#74512 ) Shadowing warning doesn't make much sense since field is not available in lambda's body without capturing this. Fixes https://github.com/llvm/llvm-project/issues/71976 llvmorg-18.1.3	2024-04-02 15:10:59 -07:00
Fangrui Song	f249092ef2	[tsan] Refine fstat{,64} interceptors (#86625 ) In glibc versions before 2.33, `libc_nonshared.a` defines `__fxstat/__fxstat64` but there is no `fstat/fstat64`. glibc 2.33 added `fstat/fstat64` and obsoleted `__fxstat/__fxstat64`. Ports added after 2.33 do not provide `__fxstat/__fxstat64`, so our `fstat/fstat64` interceptors using `__fxstat/__fxstat64` interceptors would lead to runtime failures on such ports (LoongArch and certain RISC-V ports). Similar to https://reviews.llvm.org/D118423, refine the conditions that we define fstat{,64} interceptors. `fstat` is supported by musl/*BSD while `fstat64` is glibc only. (cherry picked from commit d5224b73ccd09a6759759791f58426b6acd4a2e2)	2024-04-01 22:10:23 +00:00
Younan Zhang	76c721994a	[clang][Sema] Fix a CTAD regression after 42239d2e9 (#86914 ) The most recent declaration of a template as a friend can introduce a different template parameter depth compared to what we anticipate from a CTAD guide. Fixes https://github.com/llvm/llvm-project/issues/86769	2024-04-01 14:47:15 -07:00
Congcong Cai	e0f0c463b5	[mlir][transform] replace original op to loop ops (#83537 ) (cherry picked from commit 0597644a6466ae9148b0b41cb8f95d5022e045c2)	2024-04-01 14:01:03 -07:00
Xing Xue	47aac495c8	[OpenMP][AIX] Affinity implementation for AIX (#84984 ) This patch implements `affinity` for AIX, which is quite different from platforms such as Linux. - Setting CPU affinity through masks and related functions are not supported. System call `bindprocessor()` is used to bind a thread to one CPU per call. - There are no system routines to get the affinity info of a thread. The implementation of `get_system_affinity()` for AIX gets the mask of all available CPUs, to be used as the full mask only. - Topology is not available from the file system. It is obtained through system SRAD (Scheduler Resource Allocation Domain). This patch has run through the libomp LIT tests successfully with `affinity` enabled. (cherry picked from commit d394f3a162b871668d0c8e8bf6a94922fa8698ae)	2024-03-30 09:57:06 -04:00
Andrew Ng	a385a917da	[Support] Fix color handling in formatted_raw_ostream (#86700 ) The color methods in formatted_raw_ostream were forwarding directly to the underlying stream without considering existing buffered output. This would cause incorrect colored output for buffered uses of formatted_raw_ostream. Fix this issue by applying the color to the formatted_raw_ostream itself and temporarily disabling scanning of any color related output so as not to affect the position tracking. This fix means that workarounds that forced formatted_raw_ostream buffering to be disabled can be removed. In the case of llvm-objdump, this can improve disassembly performance when redirecting to a file by more than an order of magnitude on both Windows and Linux. This improvement restores the disassembly performance when redirecting to a file to a level similar to before color support was added. (cherry picked from commit c9db031c48852af491747dab86ef6f19195eb20d)	2024-03-28 14:23:36 -07:00
Martin Storsjö	0cd4babe03	Reapply [libcxx] [modules] Fix relative paths with absolute LIBCXX_INSTALL_MODULES_DIR (#86020 ) This reapplies 272d1b44efdedb68c194970a610f0ca1b7b769c5 (from #85756), which was reverted in 407937036fa7640f61f225474b1ea6623a40dbdd. In the previous attempt, empty CMAKE_INSTALL_PREFIX was handled by quoting them, in d209d1340b99d4fbd325dffb5e13b757ab8264ea. That made the calls to cmake_path(ABSOLUTE_PATH) succeed, but the output paths of that weren't actually absolute, which was required by file(RELATIVE_PATH). Avoid this issue by constructing a non-empty base directory variable to use for calculating the relative path. (cherry picked from commit 50801f1095d33e712c3a51fdeef82569bd09007f)	2024-03-28 14:18:41 -07:00
Ulrich Weigand	cfaeee6a89	release/18.x: [SystemZ] Fix overflow flag for i128 USUBO (#86491 ) We use the VSCBIQ/VSBIQ/VSBCBIQ family of instructions to implement USUBO/USUBO_CARRY for the i128 data type. However, these instructions use an inverted sense of the borrow indication flag (a value of 1 indicates no borrow, while a value of 0 indicates borrow). This does not match the semantics of the boolean "overflow" flag of the USUBO/USUBO_CARRY ISD nodes. Fix this by generating code to explicitly invert the flag. These cancel out if the result of USUBO feeds into a USUBO_CARRY. To avoid unnecessary zero-extend operations, also improve the DAGCombine handling of ZERO_EXTEND to optimize (zext (xor (trunc))) sequences where appropriate. Fixes: https://github.com/llvm/llvm-project/issues/83268	2024-03-27 16:22:38 -07:00
Arthur Eubanks	767b61c159	[lld/ELF][X86] Respect outSecOff when checking if GOTPCREL can be relaxed (#86334 ) The existing implementation didn't handle when the input text section was some offset from the output section. This resulted in an assert in relaxGot() with an lld built with asserts for some large binaries, or even worse, a silently broken binary with an lld without asserts. (cherry picked from commit 48048051323d5dd74057dc5f32df8c3c323afcd5)	2024-03-27 16:16:02 -07:00
Andrei Golubev	7c7c3d4a3a	[ODS][NFC] Cast range.size() to int32_t in accumulation (#85629 ) Using range.size() "as is" means we accumulate 'size_t' values into 'int32_t' variable. This may produce narrowing conversion warnings (particularly, on MSVC). The surrounding code seems to cast <x>.size() to 'int32_t' so following this practice seems safe enough. Co-authored-by: Ovidiu Pintican <ovidiu.pintican@intel.com> (cherry picked from commit bce17034157fdfe4d898d30366c1eeca3442fa3d)	2024-03-27 16:14:17 -07:00
yingopq	0e501dbd93	[Mips] Restore wrong deletion of instruction 'and' in unsigned min/max processing. (#85902 ) Fix #61881 (cherry picked from commit 5d7fd6a04a6748936dece9d90481b2ba4ec97e53)	2024-03-27 16:09:19 -07:00
yingopq	fbb27d16fa	[Mips] Fix missing sign extension in expansion of sub-word atomic max (#77072 ) Add sign extension "SEB/SEH" before compare. Fix #61881 (cherry picked from commit 755b439694432d4f68e20e979b479cbc30602bb1)	2024-03-27 16:09:19 -07:00
George Koehler	e74c167823	[PowerPC] provide CFI for ELF32 to unwind cr2, cr3, cr4 (#83098 ) Delete the code that skips the CFI for the condition register on ELF32. The code checked !MustSaveCR, which happened only when Subtarget.is32BitELFABI(), where spillCalleeSavedRegisters is spilling cr in a different way. The spill was missing CFI. After deleting this code, a spill of cr2 to cr4 gets CFI in the same way as a spill of r14 to r31. Fixes #83094 (cherry picked from commit 6b70c5d79fe44cbe01b0443454c6952c5b541585)	2024-03-27 16:07:07 -07:00
Chen Zheng	1c7c16e582	[NFC][PowerPC] use script to regenerate the CHECK lines (cherry picked from commit 3196005f6bedbed61a86626a9e4f8fee7437a914)	2024-03-27 16:07:07 -07:00
Thurston Dang	c2a57034ef	[msan] Re-exec with no ASLR if memory layout is incompatible on Linux (#85142 ) This ports the change from TSan (`0784b1eefa`). Testing notes: run 'sudo sysctl vm.mmap_rnd_bits=32; ninja check-msan' before and after this patch. N.B. aggressive ASLR may also cause the app to overlap with the allocator region; for MSan, this was fixed in `af2bf86a37` (cherry picked from commit 58f7251820b14c93168726a24816d8a094599be5)	2024-03-27 16:02:37 -07:00
Thurston Dang	b74f615846	[msan] Add 'MappingDesc::ALLOCATOR' type and check it is available (#85153 ) MSan divides the virtual address space into APP, INVALID, SHADOW and ORIGIN memory. The allocator usually just steals a bit of the APP address space: typically the bottom portion of the PIE binaries section, which works because the Linux kernel maps from the top of the PIE binaries section. However, if ASLR is very aggressive, the binary may end up mapped in the same location where the allocator wants to live; this results in a segfault. This patch adds in a MappingDesc::ALLOCATOR type and enforces that the memory range for the allocator is not occupied by anything else. Since the allocator range information is not readily available in msan.h, we duplicate the information from msan_allocator.cpp. Note: aggressive ASLR can also lead to a different type of failure, where the PIE binaries/libraries are mapped entirely outside of the APP/ALLOCATOR sections; that will be addressed in a separate patch (https://github.com/llvm/llvm-project/pull/85142). (cherry picked from commit af2bf86a372cacf5f536bae06e2f2d3886eefb7b)	2024-03-27 16:02:37 -07:00
Antonio Frighetto	98a15dd2d7	[clang][CodeGen] Allow `memcpy` replace with trivial auto var init When emitting the storage (or memory copy operations) for constant initializers, the decision whether to split a constant structure or array store into a sequence of field stores or to use `memcpy` is based upon the optimization level and the size of the initializer. In afe8b93ffdfef5d8879e1894b9d7dda40dee2b8d, we extended this by allowing constants to be split when the array (or struct) type does not match the type of data the address to the object (constant) is expected to contain. This may happen when `emitStoresForConstant` is called by `EmitAutoVarInit`, as the element type of the address gets shrunk. When this occurs, let the initializer be split into a bunch of stores only under `-ftrivial-auto-var-init=pattern`. Fixes: https://github.com/llvm/llvm-project/issues/84178.	2024-03-27 15:53:00 -07:00
hev	2498e3a07f	[LoongArch] Assume no-op addrspacecasts by default (#82332 ) This PR indicates that `addrspacecasts` are always no-ops on LoongArch. Fixes #82330 (cherry picked from commit dd3e0a4643670f33850278ad281a358bbdd04e92)	2024-03-23 18:44:31 -07:00
Jeffrey Byrnes	368dc8553c	[SROA]: Only defer trying partial sized ptr or ptr vector types Change-Id: Ic77f87290905addadd5819dff2d0c62f031022ab (cherry picked from commit 1e828f838cc0f15074f3dbbb04929c06ef0c9729)	2024-03-23 15:39:23 -07:00
ostannard	0e16af8e4c	[ARM] Update IsRestored for LR based on all returns (#82745 ) PR #75527 fixed ARMFrameLowering to set the IsRestored flag for LR based on all of the return instructions in the function, not just one. However, there is also code in ARMLoadStoreOptimizer which changes return instructions, but it set IsRestored based on the one instruction it changed, not the whole function. The fix is to factor out the code added in #75527, and also call it from ARMLoadStoreOptimizer if it made a change to return instructions. Fixes #80287. (cherry picked from commit 749384c08e042739342c88b521c8ba5dac1b9276)	2024-03-23 15:34:54 -07:00
Oliver Stannard	6f8016fda7	Pre-commit test showing bug #80287 This test shows the bug where LR is used as a general-purpose register on a code path where it is not spilled to the stack. (cherry picked from commit 8779cf68e80dcc0b15e8034f39e6ce18b08352b6)	2024-03-23 15:34:54 -07:00
Tom Stellard	ef6d1ec07c	Bump version to 18.1.3 (#86168 )	2024-03-22 11:44:29 -07:00
Phoebe Wang	26a1d6601d	[X86] Add missing subvector_subreg_lowering for BF16 (#83720 ) llvmorg-18.1.2	2024-03-19 14:06:42 -07:00
Chuanqi Xu	0bf7ff1028	[C++20] [Moduls] Avoid computing odr hash for functions from comparing constraint expression Previously we disabled computing the ODR hash for declarations from the global module fragment (GMF). However, we missed the case where the function lives in a concept's requirements (see the attached test files for an example), and the mismatch can cause a crash. After we deserialize a function, we set its body as lazy and only take the body when needed. However, we don't allow taking the body during deserialization. So it is problematic if we set the body as lazy first and then compute the hash value of the function, which requires deserializing its body; this is where the crash occurs. This patch tries to solve the issue by not taking the body of the function from the GMF. Note that we can't skip comparing the constraint expression from the GMF directly, since it is a key part of function selection, and that may be the reason why we can't return 0 directly for `FunctionDecl::getODRHash()` from the GMF.	2024-03-19 14:03:55 -07:00
Luke Lau	a2c93b34df	[RISCV] Fix mgather -> riscv.masked.strided.load combine not extending indices (#82506 ) This fixes the miscompile reported in #82430 by telling isSimpleVIDSequence to sign extend to XLen instead of the width of the indices, since the "sequence" of indices generated by a strided load will be at XLen. This was the simplest way I could think of getting isSimpleVIDSequence to treat the indexes as if they were zero extended to XLenVT. Another way we could do this is by refactoring out the "get constant integers" part from isSimpleVIDSequence and handle them as APInts so we can separately zero extend it. Fixes #82430 (cherry picked from commit 815644b4dd882ade2e5649d4f97c3dd6f7aea200)	2024-03-19 13:57:52 -07:00
Luke Lau	a9d4ed7170	[RISCV] Adjust test case to show wrong stride. NFC See https://github.com/llvm/llvm-project/pull/82506#discussion_r1498080785 (cherry picked from commit 11d115d0569b212dfeb7fe6485be48070e068e19)	2024-03-19 13:57:52 -07:00
Luke Lau	42f511c95c	[RISCV] Add test case for miscompile in gather -> strided load combine. NFC This shows the issue in #82430, but triggers it via the widening SEW combine rather than a GEP that RISCVGatherScatterLowering doesn't detect. (cherry picked from commit 2cd59bdc891ab59a1abfe5205feb45791a530a47)	2024-03-19 13:57:52 -07:00
Nikita Popov	53ea0de61d	[TSan] Fix atomicrmw xchg with pointer and floats (#85228 ) atomicrmw xchg also accepts pointer and floating-point values. To handle those, insert necessary casts to and from integer. This is what we do for cmpxchg as well. Fixes https://github.com/llvm/llvm-project/issues/85226. (cherry picked from commit ff2fb2a1d78585944dcdb9061c8487fe1476dfa4)	2024-03-19 13:26:11 -07:00

487707 Commits