llvm-project

Author	SHA1	Message	Date
yingopq	d6e0798a2a	[Mips] Add the missing judgment when processing function handleMFLOSlot (#121463 ) In function handleMFLOSlot, we may get a variable LastInstInFunction with a value of true from function getNextMachineInstr and IInSlot may be null which would trigger an assert. So we need to skip this case. Fix #118223.	2025-01-24 20:03:03 +08:00
Cinhi Young	6735d527f9	[MIPS] [MSA] Widen v2i8, v216 and v2i32 vectors (#123040 ) - Widen v2i8, v2i16 and v2i32 vectors so they don't cast back and forth, and make sure that instructions with correct data unit is being used. - Handle undef indices for VSHF when lowering VECTOR_SHUFFLE (it crashes if such index is present).	2025-01-24 11:23:34 +08:00
Hervé Poussineau	26b87aad9e	[Mips] Handle declspec(dllimport) on mipsel-windows-* triples (#120912 ) On Windows, imported symbols must be searched with '__imp_' prefix. Support imported global variables and imported functions.	2025-01-21 16:18:02 +08:00
Cinhi Young	385f776b63	[MIPS][MSA] Invert operand order of `ILVOD` when lowering `VECTOR_SHUFFLE` (#123555 ) This PR fixes operand order of `ILVOD.df` when lowering `VECTOR_SHUFFLE`, the result was `<y[1], x[1]>` while it should be `<x[1], y[1]>`. * This PR is split from #123040.	2025-01-21 15:54:10 +08:00
yingopq	754ed95b66	[Mips] Fix compiler crash when returning fp128 after calling a functi… (#117525 ) …on returning { i8, i128 } Fixes https://github.com/llvm/llvm-project/issues/96432.	2025-01-20 16:47:40 +08:00
Hervé Poussineau	d8a5fae691	[MC][Mips] Add MipsWinCOFFObjectWriter/MipsWinCOFFStreamer (#114611 ) llc is now able to create MIPS COFF files for simple cases.	2024-12-20 17:31:38 +08:00
Fangrui Song	8b02d809d2	[test] Remove redundant -march= in llc -mtriple=	2024-12-14 17:46:30 -08:00
Craig Topper	7ece560a50	[GISel] Support narrowing G_ICMP with more than 2 parts. (#119335 ) This allows us to support i128 G_ICMP on RV32. I'm not sure how to test the "left over" part of this as RISC-V always widens to a power of 2 before narrowing.	2024-12-12 09:50:26 -08:00
Fangrui Song	ae26f50aea	[test] Change llc -march=mips* to -mtriple=mips* Similar to 806761a7629df268c8aed49657aeccffa6bca449	2024-12-10 22:14:06 -08:00
Florian Hahn	ef102b4a63	[MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987 ) The improvements in 63917e1 / #70796 do not check for memory barriers/unmodelled sideeffects, which means we may incorrectly hoist loads across memory barriers. Fix this by checking any machine instruction in the loop is a load-fold barrier. PR: https://github.com/llvm/llvm-project/pull/116987	2024-11-21 10:25:04 +00:00
Davide	8cd348c96a	[MIPS] Updated MIPS N calling conventions so that fp16 arguments no longer cause a crash (#116569 ) This PR fixes a bug introduced by #110199, which causes any half float argument to crash the compiler on MIPS64. Currently compiling this bit of code with `llc -mtriple=mips64`: ``` define void @half_args(half %a) nounwind { entry: ret void } ``` Crashes with the following log: ``` LLVM ERROR: unable to allocate function argument #0 PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace. Stack dump: 0. Program arguments: llc -mtriple=mips64 1. Running pass 'Function Pass Manager' on module '<stdin>'. 2. Running pass 'MIPS DAG->DAG Pattern Instruction Selection' on function '@half_args' #0 0x000055a3a4013df8 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x32d0df8) #1 0x000055a3a401199e llvm::sys::RunSignalHandlers() (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x32ce99e) #2 0x000055a3a40144a8 SignalHandler(int) Signals.cpp:0:0 #3 0x00007f00bde558c0 __restore_rt libc_sigaction.c:0:0 #4 0x00007f00bdea462c __pthread_kill_implementation ./nptl/pthread_kill.c:44:76 #5 0x00007f00bde55822 gsignal ./signal/../sysdeps/posix/raise.c:27:6 #6 0x00007f00bde3e4af abort ./stdlib/abort.c:81:7 #7 0x000055a3a3f80e3c llvm::report_fatal_error(llvm::Twine const&, bool) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x323de3c) #8 0x000055a3a2e20dfa (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x20dddfa) #9 0x000055a3a2a34e20 llvm::MipsTargetLowering::LowerFormalArguments(llvm::SDValue, unsigned int, bool, llvm::SmallVectorImpl<llvm::ISD::InputArg> const&, llvm::SDLoc const&, llvm::SelectionDAG&, llvm::SmallVectorImpl<llvm::SDValue>&) const MipsISelLowering.cpp:0:0 #10 0x000055a3a3d896a9 llvm::SelectionDAGISel::LowerArguments(llvm::Function const&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x30466a9) #11 0x000055a3a3e0b3ec llvm::SelectionDAGISel::SelectAllBasicBlocks(llvm::Function const&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x30c83ec) #12 0x000055a3a3e09e21 llvm::SelectionDAGISel::runOnMachineFunction(llvm::MachineFunction&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x30c6e21) #13 0x000055a3a2aae1ca llvm::MipsDAGToDAGISel::runOnMachineFunction(llvm::MachineFunction&) MipsISelDAGToDAG.cpp:0:0 #14 0x000055a3a3e07706 llvm::SelectionDAGISelLegacy::runOnMachineFunction(llvm::MachineFunction&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x30c4706) #15 0x000055a3a3051ed6 llvm::MachineFunctionPass::runOnFunction(llvm::Function&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x230eed6) #16 0x000055a3a35a3ec9 llvm::FPPassManager::runOnFunction(llvm::Function&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x2860ec9) #17 0x000055a3a35ac3b2 llvm::FPPassManager::runOnModule(llvm::Module&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x28693b2) #18 0x000055a3a35a499c llvm::legacy::PassManagerImpl::run(llvm::Module&) (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x286199c) #19 0x000055a3a262abbb main (/home/davide/Ps2/rps2-tools/prefix/bin/llc+0x18e7bbb) #20 0x00007f00bde3fc4c __libc_start_call_main ./csu/../sysdeps/nptl/libc_start_call_main.h:74:3 #21 0x00007f00bde3fd05 call_init ./csu/../csu/libc-start.c:128:20 #22 0x00007f00bde3fd05 __libc_start_main@GLIBC_2.2.5 ./csu/../csu/libc-start.c:347:5 #23 0x000055a3a2624921 _start /builddir/glibc-2.39/csu/../sysdeps/x86_64/start.S:117:0 ``` This is caused by the fact that after the change, `f16`s are no longer lowered as `f32`s in calls. Two possible fixes are available: - Update calling conventions to properly support passing `f16` as integers. - Update `useFPRegsForHalfType()` to return `true` so that `f16` are still kept in `f32` registers, as before #110199. This PR implements the first solution to not introduce any more ABI changes as #110199 already did. As of what is the correct ABI for halfs, I don't think there is a correct answer. GCC doesn't support halfs on MIPS, and I couldn't find any information on old MIPS ABI manuals either.	2024-11-19 10:23:32 +01:00
yingopq	86e4beb702	[MIPS] LLVM data layout give i128 an alignment of 16 for mips64 (#112084 ) Fix parts of #102783.	2024-11-06 16:14:30 +01:00
Simon Pilgrim	aef0e77c76	[DAG] visitAND - Fold (and (srl X, C), 1) -> (srl X, BW-1) for signbit extraction (#114992 ) If we're masking the LSB of a SRL node result and that is shifting down an extended sign bit, see if we can change the SRL to shift down the MSB directly. These patterns can occur during legalisation when we've sign extended to a wider type but the SRL is still shifting from the subreg. Alternative to #114967 Fixes the remaining regression in #112588	2024-11-05 14:42:15 +00:00
yingopq	f0231b6164	[MIPS] Use softPromoteHalf legalization for fp16 rather than PromoteFloat (#110199 ) Fix part of #97975.	2024-11-05 14:41:02 +01:00
Ying Huang	a256e89fd1	[Mips] Add additional half float tests (NFC) For https://github.com/llvm/llvm-project/pull/110199.	2024-11-05 10:15:51 +01:00
Hervé Poussineau	6fa1647a47	[MC][Mips] Rename MipsMCAsmInfo to MipsELFMCAsmInfo (#112592 ) Also change MipsAsmPrinter::emitStartOfAsmFile to emit ELF-related sections only when using ELF output file format.	2024-11-01 08:42:34 +08:00
Vladimir Radosavljevic	401d123a1f	[MCP] Optimize copies when src is used during backward propagation (#111130 ) Before this patch, redundant COPY couldn't be removed for the following case: ``` $R0 = OP ... ... // Read of %R0 $R1 = COPY killed $R0 ``` This patch adds support for tracking the users of the source register during backward propagation, so that we can remove the redundant COPY in the above case and optimize it to: ``` $R1 = OP ... ... // Replace all uses of %R0 with $R1 ```	2024-10-23 13:37:02 +02:00
Alex Rønne Petersen	5785cbb405	[llvm] Ensure that soft float targets don't emit `fma()` libcalls. (#106615 ) The previous behavior could be harmful in some edge cases, such as emitting a call to `fma()` in the `fma()` implementation itself. Do this by just being more accurate in `isFMAFasterThanFMulAndFAdd()`. This was already done for PowerPC; this commit just extends that to Arm, z/Arch, and x86. MIPS and SPARC already got it right, but I added tests for them too, for good measure. Note: I don't have commit access.	2024-10-19 06:13:15 -07:00
Alex Rønne Petersen	ad4a582fd9	[llvm] Consistently respect `naked` fn attribute in `TargetFrameLowering::hasFP()` (#106014 ) Some targets (e.g. PPC and Hexagon) already did this. I think it's best to do this consistently so that frontend authors don't run into inconsistent results when they emit `naked` functions. For example, in Zig, we had to change our emit code to also set `frame-pointer=none` to get reliable results across targets. Note: I don't have commit access.	2024-10-18 09:35:42 +04:00
Nikita Popov	9f81acf4ef	[Mips] Regenerate test checks (NFC) Some of these check lines are insufficient to determine correctness. Generate full check lines instead. To reduce noise, add nounwind and use static relocation model.	2024-10-01 14:49:14 +02:00
yingopq	debc325bb1	[MIPS] Fix failing to legalize load+call with vector of non-p2 integer (#109625 ) Add a condition to check whether the vector element type is a power of 2. Fixes #102870.	2024-09-24 09:38:38 +02:00
yingopq	677177bb60	[Mips] Fix mfhi/mflo hazard miscompilation about div and mult (#91449 ) Fix issue1: In mips1-4, require a minimum of 2 instructions between a mflo/mfhi and the next mul/dmult/div/ddiv/divu/ddivu instruction. Fix issue2: In mips1-4, should not put mflo into the delay slot for the return. Fix https://github.com/llvm/llvm-project/issues/81291	2024-09-23 19:07:13 +08:00
futog	3e0a76b1fd	[Codegen][LegalizeIntegerTypes] Improve shift through stack (#96151 ) Minor improvement on cc39c3b17fb2598e20ca0854f9fe6d69169d85c7. Use an aligned stack slot to store the shifted value. Use the native register width as shifting unit, so the load of the shift result is aligned. If the shift amount is a multiple of the native register width, there is no need to do a follow-up shift after the load. I added new tests for these cases. Co-authored-by: Gergely Futo <gergely.futo@hightec-rt.com>	2024-09-23 11:45:43 +02:00
yingopq	72cacf1d99	[MIPS] Fix -msingle-float doesn't work with double on O32 (#107543 ) Skip the following function 'CustomLowerNode' when the operand had done `SoftenFloatResult`. Fix #93052	2024-09-20 07:37:18 +08:00
anbbna	b847076f55	[Mips] Add test file for 'xor' and 'and' instructions (#106679 ) Part of #99783 This test is meant to reflect the oncoming change as this test shows the unoptimized result with unnecessary SLLs.	2024-09-20 07:34:38 +08:00
yingopq	1ad84d7961	[Mips] Optimize `or (and $src1, mask), (shl $src2, shift)` to `ins` (#103017 ) Optimize `$dst = or (and $src1, (2**size0 - 1)), (shl $src2, size0)` to `ins $src1, $src2, pos, size`, where `pos = size0, size = 32 - pos`. Fix #90325	2024-09-13 00:05:54 +08:00
Alex Rønne Petersen	c0b3e491cc	[llvm][Mips] Bail on underaligned loads/stores in FastISel. (#106231 ) We encountered this problem in Zig, causing all of our `mips(el)-linux-gnueabi*` tests to fail: https://github.com/ziglang/zig/issues/21215 For these unusual cases, let's just bail in `MipsFastISel` since `MipsTargetLowering` can handle them fine. Note: I don't have commit access.	2024-09-12 22:10:19 +08:00
YunQiang Su	c641b611f8	MIPSr6: Add llvm.is.fpclasss intrinsic support (#107857 ) MIPSr6 has class.s/class.d instructions. Let's use them for llvm.is.fpclass intrinsic.	2024-09-11 09:37:12 +08:00
YunQiang Su	1e153461c6	MIPS: Add fcanonicalize for pre-R6 (#104554 ) MIPSr6 has max.s/max.d/min.s/min.d instructions, which can be used as fcanonicalize. For pre-R6, we have no instructions that can fcanonicalize an float, so let's use `fadd Y,X,X` to quiet it if it is NaN. IEEE754-2008 requires that the result of general-computational and quiet-computational operation shouldn't be signal NaN.	2024-08-27 17:13:46 +08:00
Craig Topper	ebe7265b14	[Mips] Fix fast isel for i16 bswap. (#103398 ) We need to mask the SRL result to 8 bits before ORing in the SLL. This is needed in case bits 23:16 of the input aren't zero. They will have been shifted into bits 15:8. We don't need to AND the result with 0xffff. It's ok if the upper 16 bits of the register are garbage. Fixes #103035.	2024-08-16 14:54:51 -07:00
YunQiang Su	fb9e685fc4	Intrinsic: introduce minimumnum and maximumnum for IR and SelectionDAG (#96649 ) C23 introduced new functions fminimum_num and fmaximum_num, and they follow the minimumNumber and maximumNumber of IEEE754-2019. Let's introduce new intrinsics to support them. This patch introduces support only support for scalar values. The support of vector (vp, vp.reduce, vector.reduce), experimental.constrained will be added in future patches. With this patch, MIPSr6 and LoongArch can work out of box with fcanonical and fmax/fmin. Aarch64/PowerPC64 can use the same login as MIPSr6 and LoongArch, while they have no fcanonical support yet. I will add it in future patches. The FMIN/FMAX of RISC-V instructions follows the minimumNumber/maximumNumber of IEEE754-2019. We can just add it in future patch. Background https://discourse.llvm.org/t/rfc-fix-llvm-min-f-and-llvm-max-f-intrinsics/79735 Currently we have fminnum/fmaxnum, which have different behavior on different platform for NUM vs sNaN: 1) Fallback to fmin(3)/fmax(3): return qNaN. 2) ARM64/ARM32+Neon: same as libc. 3) MIPSr6/LoongArch/RISC-V: return NUM. And the fix of fminnum/fmaxnum to follow minNUM/maxNUM of IEEE754-2008 will submit as separated patches.	2024-08-15 14:09:36 +08:00
Craig Topper	abc1acf8df	[TargetLowering][AMDGPU][ARM][RISCV][X86] Teach SimplifyDemandedBits to combine (srl (sra X, C1), ShAmt) -> sra(X, C1+ShAmt) (#101751 ) If the upper bits of the shr aren't demanded. This helps with cases where the outer srl was originally an sra and was converted to a srl by SimplifyDemandedBits before it had a chance to combine with the inner sra. This can occur when the inner sra was part of a sign_extend_inreg expansion. There are some regressions in ARM and Thumb2.	2024-08-14 08:44:57 -07:00
Craig Topper	91c3a718b2	[Mips] ISel zext nneg the same as sext for Mips64. (#102852 ) Fixes #62587.	2024-08-12 13:47:27 -07:00
yingopq	e711a0c80f	[MIPS] Fix missing ANDI optimization (#97689 ) 1. Add MipsPat to optimize (andi (srl (truncate i64 $1), x), y) to (andi (truncate (dsrl i64 $1, x)), y). 2. Add MipsPat to optimize (ext (truncate i64 $1), x, y) to (truncate (dext i64 $1, x, y)). The assembly result is the same as gcc. Fixes https://github.com/llvm/llvm-project/issues/42826	2024-08-09 18:55:21 +01:00
yingopq	5fb20024e2	[Mips] Add test for AND optimization (#102278 ) See https://github.com/llvm/llvm-project/issues/42826	2024-08-07 20:55:13 +01:00
Nikita Popov	f2f18459d4	Revert "Intrinsic: introduce minimumnum and maximumnum (#93841 )" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 89881480030f48f83af668175b70a9798edca2fb. This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da.	2024-06-21 08:34:04 +02:00
YunQiang Su	8988148003	Intrinsic: introduce minimumnum and maximumnum (#93841 ) Currently, on different platform, the behaivor of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AArch64/PowerPC: follow the IEEE754-2008's minNUM: return qNaN. RISC-V/Hexagon follow the IEEE754-2019's minimumNumber: return NUM. X86: Returns NUM but not same with IEEE754-2019's minimumNumber as +0.0 is not always greater than -0.0. MIPS/LoongArch/Generic: return NUM. LIBCALL: returns qNaN. So, let's introduce llvm.minmumnum/llvm.maximumnum, which always follow IEEE754-2019's minimumNumber/maximumNumber. Half-fix: #93033	2024-06-21 11:53:08 +08:00
Thorsten Schütt	b1f9440fa9	[GlobalIsel] Import GEP flags (#93850 ) https://github.com/llvm/llvm-project/pull/90824	2024-06-14 20:56:43 +02:00
Nikita Popov	deab451e7a	[IR] Remove support for icmp and fcmp constant expressions (#93038 ) Remove support for the icmp and fcmp constant expressions. This is part of: https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179 As usual, many of the updated tests will no longer test what they were originally intended to -- this is hard to preserve when constant expressions get removed, and in many cases just impossible as the existence of a specific kind of constant expression was the cause of the issue in the first place.	2024-06-04 08:31:03 +02:00
paperchalice	9b0e1c2ca2	[NewPM][CodeGen] Port `finalize-isel` to new pass manager (#94214 ) It should preserve more analysis results, but it happens immediately after instruction selection.	2024-06-04 09:23:52 +08:00
Sergei Barannikov	3fee8b3469	[GISel] LegalizationArtifactCombiner: Elide redundant G_SEXT_INREG (#93687 ) This is similar to 373c343a, but for targets with zero-or-negative-one booleans. The difference in tests is mostly due to G_SEXT_INREG being illegal for some targets, in which case it gets expanded into G_SHL/G_ASHR pair, which is not currently optimized by the combiner.	2024-05-30 12:40:42 +03:00
YunQiang Su	0bf181eb34	MIPS: Fix llvm.{min,max}num for R6 (#93125 ) MIPS max.fmt/min.fmt instructions is IEEE2008 compatiable. If either argument is sNaN, the result will be NaN. So we define fminnum_ieee instead of fminnum in Mips32r6InstrInfo.td. We also should define fcanonicalize. So that we can define fminnum as expand to fcanonicalize and fminnum_ieee.	2024-05-23 22:27:17 +08:00
YunQiang Su	eac743d1b0	MIPS: Support '%w' token in inline asm template for MSA (#91920 ) MSA registers share the FPRs as its bottom half. So that we can use MSA instructions to work with normal float/double: double a, b, c; asm volatile ("fmadd.d %w0, %w1, %w2" : "+f"(a) : "f"(b), "f"(c)); GCC has support it for quite long time.	2024-05-20 14:46:47 +08:00
YunQiang Su	8f21294897	MIPS: Use pcrel\|sdata4 for eh_frame (#91291 ) Gas uses encoding DW_EH_PE_absptr for PIC, and gnu ld converts it to DW_EH_PE_sdata4\|DW_EH_PE_pcrel. LLD doesn't have this workarounding, thus complains ``` relocation R_MIPS_32 cannot be used against local symbol; recompile with -fPIC relocation R_MIPS_64 cannot be used against local symbol; recompile with -fPIC ``` So, let's generates asm/obj files with `DW_EH_PE_sdata4\|DW_EH_PE_pcrel` encoding. In fact, GNU ld supports such OBJs well. For N64, maybe we should use sdata8, while GNU ld doesn't support it well, and in fact sdata4 is enough now. So we just ignore the `Large` for `MCObjectFileInfo::initELFMCObjectFileInfo`. Maybe we should switch back to sdata8 once GNU LD supports it well. Fixes: #58377.	2024-05-08 17:30:14 +08:00
Cinhi Young	715219482b	[MIPS] match llvm.{min,max}num with {min,max}.fmt for R6 (#89021 ) - The behavior is similar to UCOMISD on x86, which is also used to compare two fp values, specifically on handling of NaNs. - Update related tests regarding this change. - The further goal is to implement `llvm.minimum` and `llvm.maximum` intrinsics for MIPS R6 and Pre-R6. Part of https://github.com/llvm/llvm-project/issues/64207	2024-04-27 15:53:02 +08:00
yingopq	e1aa16299f	[Mips] Use ANDi in for zero-extend in subword atomic umax/umin for both r2 and pre-R2 (#89881 ) About unsigned max/min, ANDi is available for all ISA revisions in extend before slt insn. So that we can reduce one instruction.	2024-04-24 22:31:51 +08:00
YunQiang Su	758d97dce0	[MIPS]: Rework atomic max/min expand for subword (#89575 ) The current code is so buggy: it can work for few cases. The problems include: 1. ll/sc works on a whole word, while other parts other than we rmw are dropped. 2. The oprands are not well zero-extended for unsigned ops. 3. It doesn't work for big-endian, as the postion of subword differs with little endian. And in fact, we can set the return value correct in ll/sc scope, so we can skip the sinkMBB.	2024-04-23 02:08:12 +08:00
Shilei Tian	3a106e5b2c	[GlobalISel] Fold G_ICMP if possible (#86357 ) This patch tries to fold `G_ICMP` if possible.	2024-03-29 15:59:50 -04:00
Wang Pengcheng	610b9e23c5	[SDAG] Use shifts if ISD::MUL is illegal when lowering ISD::CTPOP (#86505 ) We can avoid libcalls. Fixes #86205	2024-03-29 15:38:39 +08:00
Simon Pilgrim	5b544b511c	[Mips] ctpop.mir - regenerate checks to improve codegen diff in #86505	2024-03-26 10:43:29 +00:00

1 2 3 4 5 ...

1801 Commits