llvm-project

Author	SHA1	Message	Date
Matt Arsenault	2502e3b7ba	IR: Promote "denormal-fp-math" to a first class attribute (#174293 ) Convert "denormal-fp-math" and "denormal-fp-math-f32" into a first class denormal_fpenv attribute. Previously the query for the effective denormal mode involved two string attribute queries with parsing. I'm introducing more uses of this, so it makes sense to convert this to a more efficient encoding. The old representation was also awkward since it was split across two separate attributes. The new encoding just stores the default and float modes as bitfields, largely avoiding the need to consider if the other mode is set. The syntax in the common cases looks like this: `denormal_fpenv(preservesign,preservesign)` `denormal_fpenv(float: preservesign,preservesign)` `denormal_fpenv(dynamic,dynamic float: preservesign,preservesign)` I wasn't sure about reusing the float type name instead of adding a new keyword. It's parsed as a type but only accepts float. I'm also debating switching the name to subnormal to match the current preferred IEEE terminology (also used by nofpclass and other contexts). This has a behavior change when using the command flag debug options to set the denormal mode. The behavior of the flag ignored functions with an explicit attribute set, per the default and f32 version. Now that these are one attribute, the flag logic can't distinguish which of the two components were explicitly set on the function. Only one test appeared to rely on this behavior, so I just avoided using the flags in it. This also does not perform all the code cleanups this enables. In particular the attributor handling could be cleaned up. I also guessed at how to support this in MLIR. I followed MemoryEffects as a reference; it appears bitfields are expanded into arguments to attributes, so the representation there is a bit uglier with the 2 2-element fields flattened into 4 arguments.	2026-02-05 13:31:26 +00:00
Trevor Gross	3920bc61ca	[TargetLowering] Change the `softPromoteHalfType` default to `true` (#175149 ) The default `f16` lowering has some issues that result in incorrect float behavior, so over time most targets have switched to use `softPromoteHalfType`. Swap to soft promotion by default and add overrides for SystemZ and AMDGPU, which are the two remaining backends that still depend on this behavior. All basic `f16` op tests now pass on all remaining experimental arches. Fixes: https://github.com/llvm/llvm-project/issues/97981 Fixes: https://github.com/llvm/llvm-project/issues/97975	2026-01-11 12:48:26 +01:00
Trevor Gross	1be04b7edf	[XCore] Use `softPromoteHalfType` (#175142 ) Follow suite from other targets. Fixes the XCore portion of https://github.com/llvm/llvm-project/issues/97975 Fixes the XCore portion of https://github.com/llvm/llvm-project/issues/97981	2026-01-09 10:31:01 +00:00
Trevor Gross	054ee2f870	[CSKY] Use `softPromoteHalfType` (#175138 ) Follow suite from other targets. Fixes the C-SKY portion of https://github.com/llvm/llvm-project/issues/97975 Fixes the C-SKY portion of https://github.com/llvm/llvm-project/issues/97981	2026-01-09 11:10:54 +01:00
Trevor Gross	a06bf00fd1	[VE] Use `softPromoteHalfType` (#175141 ) Follow suite from other targets. Fixes the (unlisted) VE portion of https://github.com/llvm/llvm-project/issues/97975 Fixes the (unlisted) VE portion of https://github.com/llvm/llvm-project/issues/97981	2026-01-09 11:10:07 +01:00
Trevor Gross	2a254d4200	[M68k] Use `softPromoteHalfType` (#175140 ) Follow suite from other targets. Fixes the M68k portion of https://github.com/llvm/llvm-project/issues/97975 Fixes the M68k portion of https://github.com/llvm/llvm-project/issues/97981	2026-01-09 11:09:06 +01:00
Trevor Gross	88575340d4	[MSP430] Use `softPromoteHalfType` (#175139 ) Follow suite from other targets. Fixes the MSP430 portion of https://github.com/llvm/llvm-project/issues/97975 Fixes the MSP430 portion of https://github.com/llvm/llvm-project/issues/97981	2026-01-09 11:08:34 +01:00
Trevor Gross	a218940d39	[Lanai] Use `softPromoteHalfType` (#175137 ) There are currently no other tests checking `half` so I am unsure how well supported the type is, but the patch here resolves the op tests. Fixes the (unlisted) Lanai portion of https://github.com/llvm/llvm-project/issues/97975 Fixes the (unlisted) Lanai portion of https://github.com/llvm/llvm-project/issues/97981	2026-01-09 11:07:36 +01:00
Trevor Gross	db26ce5c55	[PowerPC] Change `half` to use soft promotion rather than `PromoteFloat` (#152632 ) On PowerPC targets, `half` uses the default legalization of promoting to a `f32`. However, this has some fundamental issues related to inability to round trip. Resolve this by switching to the soft legalization, which passes `f16` as an `i16`. The PowerPC ABI Specification does not define a `_Float16` type, so the calling convention changes are acceptable. Fixes the PowerPC part of https://github.com/llvm/llvm-project/issues/97975 Fixes the PowerPC part of https://github.com/llvm/llvm-project/issues/97981	2026-01-08 15:35:01 +01:00
Trevor Gross	4903c6260c	[WebAssembly] Change `half` to use soft promotion rather than `PromoteFloat` (#152833 ) The default `half` legalization, which Wasm currently uses, does not respect IEEE conventions: for example, casting to bits may invoke a lossy libcall, meaning soft float operations cannot be correctly implemented. Change to the soft promotion legalization which passes `f16` as an `i16` and treats each `half` operation as an individual f16->f32->libcall->f32->f16 sequence. Of note in the test updates are that `from_bits` and `to_bits` are now libcall-free, and that chained operations now round back to `f16` after each step. Fixes the wasm portion of https://github.com/llvm/llvm-project/issues/97981 Fixes the wasm portion of https://github.com/llvm/llvm-project/issues/97975 Fixes: https://github.com/llvm/llvm-project/issues/96437 Fixes: https://github.com/llvm/llvm-project/issues/96438	2026-01-08 15:07:59 +01:00
Nikita Popov	8ea8f682f7	Revert "[SelectionDAG] Fix null pointer dereference in resolveDanglingDebugInfo" (#173925 ) Reverts llvm/llvm-project#173500. Test fails depending on the host system.	2025-12-29 22:05:17 +00:00
MetalOxideSemi	7a3bbf724d	[SelectionDAG] Fix null pointer dereference in resolveDanglingDebugInfo (#173500 ) ## Summary Fix null pointer dereference in `SelectionDAGBuilder::resolveDanglingDebugInfo`. ## Problem `Val.getNode()->getIROrder()` is called before checking if `Val.getNode()` is null, causing crashes when compiling code with debug info that contains aggregate constants with nested empty structs. ## Solution Move the `ValSDNodeOrder` declaration inside the `if (Val.getNode())` block. ## Test Case Reproduces with aggregate types containing nested empty structs: ```llvm %3 = insertvalue { { i1, {} }, ptr, { { {} }, { {} } }, i64 } { { i1, {} } zeroinitializer, ptr null, { { {} }, { {} } } zeroinitializer, i64 2 }, ptr %2, 1, !dbg !893 ## Crash stack 0. Program arguments: llc-20 -O3 -mcpu=native -relocation-model=pic -filetype=obj /cloudide/workspace/temp/sf.ll -o /dev/null 1. Running pass 'Function Pass Manager' on module '/cloudide/workspace/temp/sf.ll'. 2. Running pass 'X86 DAG->DAG Instruction Selection' on function '@filter_create' Stack dump without symbol names (ensure you have llvm-symbolizer in your PATH or set the environment var `LLVM_SYMBOLIZER_PATH` to point to it): 0 libLLVM.so.20.1 0x00007ff87ebbdf86 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) + 54 1 libLLVM.so.20.1 0x00007ff87ebbbb90 llvm::sys::RunSignalHandlers() + 80 2 libLLVM.so.20.1 0x00007ff87ebbe640 3 libpthread.so.0 0x00007ff87db79140 4 libLLVM.so.20.1 0x00007ff87f3fd2ff llvm::SelectionDAGBuilder::resolveDanglingDebugInfo(llvm::Value const, llvm::SDValue) + 303 5 libLLVM.so.20.1 0x00007ff87f3fda5e llvm::SelectionDAGBuilder::getValue(llvm::Value const) + 142 6 libLLVM.so.20.1 0x00007ff87f3fe79f llvm::SelectionDAGBuilder::getValueImpl(llvm::Value const) + 3343 7 libLLVM.so.20.1 0x00007ff87f3fda34 llvm::SelectionDAGBuilder::getValue(llvm::Value const) + 100 8 libLLVM.so.20.1 0x00007ff87f3fc1ab llvm::SelectionDAGBuilder::visitInsertValue(llvm::InsertValueInst const&) + 603 9 libLLVM.so.20.1 0x00007ff87f3eeaf7 llvm::SelectionDAGBuilder::visit(llvm::Instruction const&) + 327 10 libLLVM.so.20.1 0x00007ff87f4904b8 llvm::SelectionDAGISel::SelectBasicBlock(llvm::ilist_iterator_w_bits<llvm::ilist_detail::node_options<llvm::Instruction, false, false, void, true, llvm::BasicBlock>, false, true>, llvm::ilist_iterator_w_bits<llvm::ilist_detail::node_options<llvm::Instruction, false, false, void, true, llvm::BasicBlock>, false, true>, bool&) + 72 11 libLLVM.so.20.1 0x00007ff87f490304 llvm::SelectionDAGISel::SelectAllBasicBlocks(llvm::Function const&) + 5956 12 libLLVM.so.20.1 0x00007ff87f48e2b4 llvm::SelectionDAGISel::runOnMachineFunction(llvm::MachineFunction&) + 372 13 libLLVM.so.20.1 0x00007ff87f48c689 llvm::SelectionDAGISelLegacy::runOnMachineFunction(llvm::MachineFunction&) + 169 14 libLLVM.so.20.1 0x00007ff87efb8e32 llvm::MachineFunctionPass::runOnFunction(llvm::Function&) + 610 15 libLLVM.so.20.1 0x00007ff87ed104be llvm::FPPassManager::runOnFunction(llvm::Function&) + 638 16 libLLVM.so.20.1 0x00007ff87ed15ff3 llvm::FPPassManager::runOnModule(llvm::Module&) + 51 17 libLLVM.so.20.1 0x00007ff87ed10c11 llvm::legacy::PassManagerImpl::run(llvm::Module&) + 1105 18 llc-20 0x000055972ce77dc1 main + 9649 19 libc.so.6 0x00007ff87d68ad7a __libc_start_main + 234 20 llc-20 0x000055972ce7247a _start + 42	2025-12-28 18:00:46 +00:00
Folkert de Vries	a587ccd87d	fix `llvm.fma.f16` double rounding issue when there is no native support (#171904 ) fixes https://github.com/llvm/llvm-project/issues/98389 As the issue describes, promoting `llvm.fma.f16` to `llvm.fma.f32` does not work, because there is not enough precision to handle the repeated rounding. `f64` does have sufficient space. So this PR explicitly promotes the 16-bit fma to a 64-bit fma. I could not find examples of a libcall being used for fma, but that's something that could be looked in separately to work around code size issues.	2025-12-17 22:03:01 +01:00
David Green	7db97696a2	[ARM][AArch64] Replace ".f16(bfloat" with ".bf16(bfloat" in intrinsics. NFC It looks like these were copied from fp16 tests, and forgot to update the intrinsic types. Also remove some old definitions that are no longer required.	2025-12-14 04:39:41 +00:00
Daniel Thornburgh	e7f6038e89	[LLVM] Mark reloc-none test unsupported on Hexagon (#171205 ) Prevents infinite loop issue recorded in #147427. More work will be required to make @llvm.reloc_none work correctly on Hexagon.	2025-12-08 21:36:23 +00:00
Fabian Parzefall	a82b97c524	[CodeGen] Fix lpad padding at section start after empty block (#112595 ) If a landing pad is at the very start of a split section, it has to be padded by a nop instruction. Otherwise its offset is marked as zero in the LSDA, which means no landing pad (leading it to be skipped). LLVM already handles this. If a landing pad is the first machine block in a section, a nop is inserted to ensure a non-zero offset. However, if the landing pad is preceeded by an empty block, the nop would be omitted. To fix this, this patch adds a field to machine blocks indicating whether this block contains the first instruction in its section. This variable is then used to determine whether to emit the padding. Co-authored-by: Jinjie Huang <huangjinjie@bytedance.com>	2025-12-04 10:29:24 +08:00
Daniel Thornburgh	5f08fb4d72	[IR] llvm.reloc.none intrinsic for no-op symbol references (#147427 ) This intrinsic emits a BFD_RELOC_NONE relocation at the point of call, which allows optimizations and languages to explicitly pull in symbols from static libraries without there being any code or data that has an effectual relocation against such a symbol. See issue #146159 for context.	2025-11-06 08:52:46 -08:00
Grigory Pastukhov	7398591148	[CodeGen] Add skipFunction() check to MachineFunctionSplitter (#166260 ) MachineFunctionSplitter was missing a skipFunction() check, causing it to incorrectly split functions that should be skipped (e.g., functions with optnone attribute). This patch adds an early skipFunction() check in runOnMachineFunction() to ensure these functions are never split, regardless of profile data availability or other splitting conditions.	2025-11-04 11:01:50 -08:00
beetrees	11571a005a	Fix legalizing `FNEG` and `FABS` with `TypeSoftPromoteHalf` (#156343 ) Based on top of #157211. `FNEG` and `FABS` must preserve signalling NaNs, meaning they should not convert to f32 to perform the operation. Instead legalize to `XOR` and `AND`. Fixes almost all of #104915	2025-10-11 11:08:26 +09:00
Alex MacLean	9b24ccca73	[NVPTX] Allow more argument integer types, such as i256 and i96 (#154824 ) The refactoring of ComputePTXValueVTs in #154476 caused the complier to no longer crash when lowering i256 and i96. This has caused a few tests to unexpectedly pass. Update these tests and tweak how we emit parameter declarations to correctly lower these types.	2025-08-21 13:54:38 -07:00
Trevor Gross	549d7c4f35	[SPARC] Change `half` to use soft promotion rather than `PromoteFloat` (#152727 ) `half` currently uses the default legalization of promoting to a `f32`; however, this implementation implements math in a way that results in incorrect rounding. Switch to the soft promote implementation, which does not have this problem. The SPARC ABI does not specify a `_Float16` type, so there is no concern with keeping interface compatibility. Fixes the SPARC part of https://github.com/llvm/llvm-project/issues/97975 Fixes the SPARC part of https://github.com/llvm/llvm-project/issues/97981	2025-08-18 20:56:24 +02:00
Trevor Gross	919021b0df	[Arm64EC] Add support for `half` (#152843 ) `f16` is passed and returned in vector registers on both x86 on AArch64, the same calling convention as `f32`, so it is a straightforward type to support. The calling convention support already exists, added as part of a6065f0fa55a ("Arm64EC entry/exit thunks, consolidated. (#79067)"). Thus, add mangling and remove the error in order to make `half` work. MSVC does not yet support `_Float16`, so for now this will remain an LLVM-only extension. Fixes the `f16` portion of https://github.com/llvm/llvm-project/issues/94434	2025-08-12 14:15:52 -07:00
Trevor Gross	314458197e	[Test] Add cross-platform smoke tests for `half` support (NFC) (#152616 ) There are a number of platforms affected by [1]. It is easy enough to check in a cross-platform way that bitcasts aren't using f16<->f32 libcalls; thus, add a generic test covering most supported architectures, with an XFAIL for targets that are currently broken. As they get fixed, this test will fail and can be updated. [1]: https://github.com/llvm/llvm-project/issues/97981	2025-08-08 11:28:33 +02:00
Daniel Paoliello	07da480614	[win][arm64ec] More fixes for building and testing Arm64EC Windows (#151409 ) * `tools/llvm-objcopy/MachO/update-section-object.test` was failing on Windows since the input file (`macho_sections.s`) might be checked out with the wrong line ending, resulting in difference in the size of sections being checked. * Removed the check for Windows in `AArch64Arm64ECCallLowering`: when `llc` is run without an explicit target, the module's target triple is unknown so this assert fires. * Expect `llvm/test/CodeGen/Generic/allow-check.ll` to fail for Arm64EC: Global ISel is not supported.	2025-08-05 14:54:08 -07:00
Matt Arsenault	d0d3f15c38	RuntimeLibcalls: Stop opting out of exp10 (#148604 )	2025-08-04 00:08:46 +09:00
Trevor Gross	d214f07f09	[IR] Add a test for `f128` libm libcall lowering (NFC) (#148308 ) `f128` intrinsic functions from libm sometimes lower to `long double` library calls when they instead need to be `f128` versions. Add a generic test demonstrating current behavior.	2025-07-14 14:29:20 +02:00
Orlando Cazalet-Hyams	1dc46d45fc	[RemoveDIs] Fix rotten --implicit-check-not lines (#144711 )	2025-06-24 12:32:50 +01:00
Jacek Caban	be5c96bfac	[CodeGen][COFF] Always emit CodeView compiler info on Windows targets (#142970 ) MSVC always emits minimal CodeView metadata with compiler information, even when debug info is otherwise disabled. Other tools may rely on this metadata being present. For example, linkers use it to determine whether hotpatching is enabled for the object file.	2025-06-13 22:48:29 +02:00
Jeremy Morse	c84f2c79da	[DebugInfo][RemoveDIs] Delete experimental-iterator test-flags from tests (#140045 ) Over in 6a45fce, this flag (experimental-debuginfo-iterators) was switched to do nothing, to flush out anything that depended on the debug-intrinsics way of doing things. It's been a month and nothing's super-broken, so we'll start to rip things out. This commit deletes MergeFunc's debuginfo-iterators test: in d2942a86d7 it's documented that that test is specifically because of differences between intrinsic/non-intrinsic data structures, and we're deleting the possibility of that difference.	2025-06-02 18:20:12 +01:00
Matt Arsenault	36b710a7e5	CodeGen: Convert some assorted errors to use reportFatalUsageError (#142031 ) The test coverage is lacking for many of these errors.	2025-05-30 08:06:53 +02:00
Paul Walker	01813e8929	[LLVM][VecLib] Refactor LIBMVEC integration to be target neutral. (#138262 ) Renames LIBMVEC-X86 to LIBMVEC and updates TLI to only add the existing x86 specific mapping when targeting x86.	2025-05-07 11:05:25 +01:00
Jeremy Morse	1ebc308bba	[DebugInfo][RemoveDIs] Remove debug-intrinsic printing cmdline options (#131855 ) During the transition from debug intrinsics to debug records, we used several different command line options to customise handling: the printing of debug records to bitcode and textual could be independent of how the debug-info was represented inside a module, whether the autoupgrader ran could be customised. This was all valuable during development, but now that totally removing debug intrinsics is coming up, this patch removes those options in favour of a single flag (experimental-debuginfo-iterators), which enables autoupgrade, in-memory debug records, and debug record printing to bitcode and textual IR. We need to do this ahead of removing the experimental-debuginfo-iterators flag, to reduce the amount of test-juggling that happens at that time. There are quite a number of weird test behaviours related to this -- some of which I simply delete in this commit. Things like print-non-instruction-debug-info.ll , the test suite now checks for debug records in all tests, and we don't want to check we can print as intrinsics. Or the update_test_checks tests -- these are duplicated with write-experimental-debuginfo=false to ensure file writing for intrinsics is correct, but that's something we're imminently going to delete. A short survey of curious test changes: * free-intrinsics.ll: we don't need to test that debug-info is a zero cost intrinsic, because we won't be using intrinsics in the future. * undef-dbg-val.ll: apparently we pinned this to non-RemoveDIs in-memory mode while we sorted something out; it works now either way. * salvage-cast-debug-info.ll: was testing intrinsics-in-memory get salvaged, isn't necessary now * localize-constexpr-debuginfo.ll: was producing "dead metadata" intrinsics for optimised-out variable values, dbg-records takes the (correct) representation of poison/undef as an operand. Looks like we didn't update this in the past to avoid spurious test differences. * Transforms/Scalarizer/dbginfo.ll: this test was explicitly testing that debug-info affected codegen, and we deferred updating the tests until now. This is just one of those silent gnochange issues that get fixed by RemoveDIs. Finally: I've added a bitcode test, dbg-intrinsics-autoupgrade.ll.bc, that checks we can autoupgrade debug intrinsics that are in bitcode into the new debug records.	2025-04-01 14:27:11 +01:00
Congcong Cai	a80aad2812	[YAML] fix output incorrect format for block scalar string (#132897 ) After outputting block scalar string, the indent will be wrong. This patch fixes Padding after block scalar string to ensure the correct format of yaml. The new added ut will fail in main. ```diff @@ -3,4 +3,4 @@ Just a block scalar doc -scalar: a + scalar: a ...\n ```	2025-03-27 02:16:27 +08:00
Jeremy Morse	792a6f8119	[RemoveDIs] Remove "try-debuginfo-iterators..." test flags (#130298 ) These date back to when the non-intrinsic format of variable locations was still being tested and was behind a compile-time flag, so not all builds / bots would correctly run them. The solution at the time, to get at least some test coverage, was to have tests opt-in to non-intrinsic debug-info if it was built into LLVM. Nowadays, non-intrinsic format is the default and has been on for more than a year, there's no need for this flag to exist. (I've downgraded the flag from "try" to explicitly requesting non-intrinsic format in some places, so that we can deal with tests that are explicitly about non-intrinsic format in their own commit).	2025-03-14 15:50:49 +00:00
antangelo	73f087b331	[NFC][SelectionDAG] Replace generic @llvm.expect.with.probability codegen test with X86 test (#117848 ) Adds test case for X86 to check that the output of @llvm.expect.with.probability's generic lowering is reasonable. This replaces a generic test which only asserts that llc does not crash.	2024-12-01 17:46:08 -05:00
Sergei Barannikov	61a23646c9	[SjLjEHPrepare] Configure call sites correctly (#117656 ) After 9fe78db4, the pass inserts `store volatile i32 -1, ptr %call_site` before all invoke instruction except the one in the entry block, which has the effect of bypassing landing pads on exceptions. When configuring the call site for a potentially throwing instruction check that it is not `InvokeInst` -- they are handled by earlier code.	2024-11-27 08:03:47 +03:00
antangelo	dd4844722d	[SelectionDAG] Add generic implementation for @llvm.expect.with.probability when optimizations are disabled (#117459 ) Handle \@llvm.expect.with.probability in SelectionDAGBuilder, FastISel, and IntrinsicLowering in the same way \@llvm.expect is handled, where the value is passed through as-is. This can be reached if the intrinsic is used without optimizations, where it would otherwise be properly transformed out. Fixes #115411 for SelectionDAG. A similar patch is likely needed for GlobalISel.	2024-11-26 20:22:25 -05:00
Kyungwoo Lee	fe69a20cc1	Reland [CGData][GMF] Skip No Params (#116548 ) This update follows up on change #112671 and is mostly a NFC, with the following exceptions: - Introduced `-global-merging-skip-no-params` to bypass merging when no parameters are required. - Parameter count is now calculated based on the unique hash count. - Added `-global-merging-inst-overhead` to adjust the instruction overhead, reflecting the machine instruction size. - Costs and benefits are now computed using the double data type. Since the finalization process occurs offline, this should not significantly impact build time. - Moved a sorting operation outside of the loop. This is a patch for https://discourse.llvm.org/t/rfc-global-function-merging/82608.	2024-11-25 13:55:02 -08:00
Kyungwoo Lee	fe3c23b439	Revert "[CGData][GMF] Skip No Params (#116548 )" This reverts commit fdf1f69c57ac3667d27c35e097040284edb1f574.	2024-11-25 11:09:29 -08:00
Kyungwoo Lee	fdf1f69c57	[CGData][GMF] Skip No Params (#116548 ) This update follows up on change #112671 and is mostly a NFC, with the following exceptions: - Introduced `-global-merging-skip-no-params` to bypass merging when no parameters are required. - Parameter count is now calculated based on the unique hash count. - Added `-global-merging-inst-overhead` to adjust the instruction overhead, reflecting the machine instruction size. - Costs and benefits are now computed using the double data type. Since the finalization process occurs offline, this should not significantly impact build time. - Moved a sorting operation outside of the loop. This is a patch for https://discourse.llvm.org/t/rfc-global-function-merging/82608.	2024-11-25 10:57:41 -08:00
Rahman Lavaee	68f7b075c0	[BasicBlockSections] Allow mixing of -basic-block-sections with MFS. (#117076 ) This PR allows mixing `-basic-block-sections` with `-enable-machine-function-splitter`. The strategy is to let `-basic-block-sections` take precedence over functions with profiles.	2024-11-22 22:23:29 -08:00
Kyungwoo Lee	816c975ea7	Fix crash from [CGData] Global Merge Functions (#112671 ) (#116241 ) Module summary index is optional for this pass, and we shouldn't run it, but import it as necessary.	2024-11-15 14:57:17 -08:00
Koakuma	23d209f350	[SPARC] Allow overaligned `alloca`s (#107223 ) SPARC ABI doesn't use stack realignment, so let LLVM know about it in `SparcFrameLowering`. This has the side effect of making all overaligned allocations go through `LowerDYNAMIC_STACKALLOC`, so implement the missing logic there too for overaligned allocations. This makes the SPARC backend not crash on overaligned `alloca`s and fix https://github.com/llvm/llvm-project/issues/89569.	2024-11-03 22:53:03 +07:00
Afanasyev Ivan	4e1b9d34f9	[mir-strip-debug] Fix debug location info strip for bundled instructions (#113676 ) Fix bug that `mir-strip-debug` pass does not remove debug location from bundled instructions. Problem arises during testing that debug info does not affect optimization passes output (`llvm-lit` with ` -Dllc="llc -debugify-and-strip-all-safe"`), when pass operates on MIR with bundled instructions + memory operands. Let mir test check looks like: ``` CHECK-NEXT: BUNDLE { CHECK-NEXT: $r3 = LD $r1, $r2 :: (load (s64) from %ir.a, !tbaa !2) CHECK-NEXT: } ``` So as `mir-strip-debug` pass does not process bundled instructions, running `llc -debugify-and-strip-all-safe` on the test will produce the following output: ``` BUNDLE { $r3 = LD $r1, $r2, debug-location !DILocation(line: 3, column: 1, scope: <0x608cb2b99b10>) :: (load (s64) from %ir.a, !tbaa !2) } ``` And test will fail, but it shouldn't. Seems like the root cause is that `mir-strip-debug` pass should remove debug location from bundled instructions.	2024-10-29 10:26:15 -07:00
Alex Bradbury	2fe1f84db3	[test] Fix llc-start-stop.ll when the default target enables the loop terminator folding pass Previously this would fail if the default target enabled the loop terminator folding pass (currently just RISC-V), as it runs after loop strength reduction.	2024-10-07 16:06:44 +01:00
Sean Perry	27b5dc422c	Add target-byteorder for cases where endian in target triple is what matters (#107915 ) I came across the subtly when setting up lit for z/OS and running it on a Linux on Power machine. Linux on Power is little endian. This was resulting in all of these tests being run even though the target triple was z/OS which is big endian. The lit should really be checking if the target is little endian not the host. The previous way didn't handle cross compilation while running lit.	2024-09-23 13:00:44 -04:00
Alexis Engelke	fa92d51f9e	[VP] Merge ExpandVP pass into PreISelIntrinsicLowering (#101652 ) Similar to #97727; avoid an extra pass over the entire IR by performing the lowering as part of the pre-isel-intrinsic-lowering pass.	2024-08-06 09:27:59 +02:00
Jonas Paulsson	f231d3dab3	Fix some X86 tests (#101944 ) extractelement-shuffle.ll: Test for bugfix in DAGCombiner, moved to Generic. 2010-07-06-DbgCrash.ll and 2006-10-02-BoolRetCrash.ll: Bugfixes in X86, run tests with X86 backend.	2024-08-05 11:50:05 +02:00
Matt Arsenault	af1d2b9fb1	CodeGen: Remove -disable-debug-info-print cl::opt (#100319 ) This was first introduced way back in in 2010 by 6c74a872a8d34d41b751efb68e335cbe91b5a5cc, and has little evidence of use. Only one test attempts to make use of this, but it's also redundant since it's also using strip to drop debug info anyway (and that also makes the test buggy, since it's intended to test with and without debug info). The other tests using it were only added to test the option after discovering it was untested and moved, in later commits.	2024-07-25 16:39:39 +04:00
paperchalice	ab58b6d58e	Revert "[CodeGen][NewPM] Port machine-branch-prob to new pass manager" (#96858 ) Reverts llvm/llvm-project#96389 Some ppc bots failed.	2024-06-27 15:00:17 +08:00

1 2 3 4 5 ...

809 Commits