llvm-project

Author	SHA1	Message	Date
yingopq	debc325bb1	[MIPS] Fix failing to legalize load+call with vector of non-p2 integer (#109625 ) Add a condition to check whether the vector element type is a power of 2. Fixes #102870.	2024-09-24 09:38:38 +02:00
yingopq	677177bb60	[Mips] Fix mfhi/mflo hazard miscompilation about div and mult (#91449 ) Fix issue1: In mips1-4, require a minimum of 2 instructions between a mflo/mfhi and the next mul/dmult/div/ddiv/divu/ddivu instruction. Fix issue2: In mips1-4, should not put mflo into the delay slot for the return. Fix https://github.com/llvm/llvm-project/issues/81291	2024-09-23 19:07:13 +08:00
Alex Rønne Petersen	72a218056d	[llvm][Triple] Add `Environment` members and parsing for glibc/musl parity. (#107664 ) This adds support for: * `muslabin32` (MIPS N32) * `muslabi64` (MIPS N64) * `muslf32` (LoongArch ILP32F/LP64F) * `muslsf` (LoongArch ILP32S/LP64S) As we start adding glibc/musl cross-compilation support for these targets in Zig, it would make our life easier if LLVM recognized these triples. I'm hoping this'll be uncontroversial since the same has already been done for `musleabi`, `musleabihf`, and `muslx32`. I intentionally left out a musl equivalent of `gnuf64` (LoongArch ILP32D/LP64D); my understanding is that Loongson ultimately settled on simply `gnu` for this much more common case, so there doesn't seem to be a particularly compelling reason to add a `muslf64` that's basically deprecated on arrival. Note: I don't have commit access.	2024-09-20 08:53:03 +08:00
yingopq	72cacf1d99	[MIPS] Fix -msingle-float doesn't work with double on O32 (#107543 ) Skip the following function 'CustomLowerNode' when the operand had done `SoftenFloatResult`. Fix #93052	2024-09-20 07:37:18 +08:00
Lei Huang	4b524088a8	[NFC] Update function names in MCTargetAsmParser.h (#108643 ) Update function names to adhere to LLVM coding standard.	2024-09-18 11:43:49 -04:00
yingopq	1ad84d7961	[Mips] Optimize `or (and $src1, mask), (shl $src2, shift)` to `ins` (#103017 ) Optimize `$dst = or (and $src1, (2**size0 - 1)), (shl $src2, size0)` to `ins $src1, $src2, pos, size`, where `pos = size0, size = 32 - pos`. Fix #90325	2024-09-13 00:05:54 +08:00
Alex Rønne Petersen	c0b3e491cc	[llvm][Mips] Bail on underaligned loads/stores in FastISel. (#106231 ) We encountered this problem in Zig, causing all of our `mips(el)-linux-gnueabi*` tests to fail: https://github.com/ziglang/zig/issues/21215 For these unusual cases, let's just bail in `MipsFastISel` since `MipsTargetLowering` can handle them fine. Note: I don't have commit access.	2024-09-12 22:10:19 +08:00
YunQiang Su	c641b611f8	MIPSr6: Add llvm.is.fpclass intrinsic support (#107857 ) MIPSr6 has class.s/class.d instructions. Let's use them for llvm.is.fpclass intrinsic.	2024-09-11 09:37:12 +08:00
Craig Topper	aafaa69434	[Target] Use templated MachineFunction::getSubtarget in *CallingConv.td. NFC (#107311 ) This hides away the static_cast needed to get the target specific Subtarget object.	2024-09-04 23:15:25 -07:00
Jesse D	0ba006daf5	[MIPS] Fix error messages when rejecting certain assembly not supported by ISA (#94695 ) … instructions. This is a fix I stumbled upon while working on something else. I decided to break it out since it seems like a good "first issue" to submit. I updated the comments in the "wrong error" test files to indicate that the messages are no longer incorrect, but I left the names of the test files alone. I was not sure what to do with those, so I would appreciate thoughts or guidance.	2024-09-03 06:08:51 +08:00
YunQiang Su	1e153461c6	MIPS: Add fcanonicalize for pre-R6 (#104554 ) MIPSr6 has max.s/max.d/min.s/min.d instructions, which can be used as fcanonicalize. For pre-R6, we have no instructions that can fcanonicalize a float, so let's use `fadd Y,X,X` to quiet it if it is NaN. IEEE754-2008 requires that the result of general-computational and quiet-computational operations shouldn't be a signaling NaN.	2024-08-27 17:13:46 +08:00
Piyou Chen	b01c006f73	[TII][RISCV] Add renamable bit to copyPhysReg (#91179 ) The renamable flag is useful during MachineCopyPropagation, but it is dropped after lowerCopy in some cases. This patch introduces extra arguments to pass the renamable flag to copyPhysReg.	2024-08-27 10:08:43 +08:00
Craig Topper	4b0c0ec6b8	[CodeGen] Use MCRegister for CCState::AllocateReg and CCValAssign::getReg. NFC (#106032 )	2024-08-26 11:40:25 -07:00
Craig Topper	c1b3ebba79	[MC] Update MCOperand::getReg/setReg/createReg and MCInstBuilder::addReg to use MCRegister. (#106015 ) Replace unsigned with MCRegister. Update some ternary operators that started giving errors.	2024-08-26 09:37:49 -07:00
Kazu Hirata	762cb44581	[Mips] Use a range-based for loop (NFC) (#106004 )	2024-08-26 08:47:49 -07:00
Kazu Hirata	675c748bb6	[Mips] clang-format prescanForConstants (NFC) I'm planning to change the inner loop to a range-based for loop.	2024-08-25 12:18:33 -07:00
Kazu Hirata	a6f87abf73	[Mips] Remove a trivial variable (NFC) (#105940 ) We assign I->getNumOperands() to J and immediately print that out as a debug message. We don't need to keep J across iterations.	2024-08-24 16:20:38 -07:00
Fangrui Song	59721f2326	[MIPS] Optimize sortRelocs for o32 The o32 ABI specifies: > Each relocation type of R_MIPS_HI16 must have an associated R_MIPS_LO16 entry immediately following it in the list of relocations. [...] the addend AHL is computed as (AHI << 16) + (short)ALO In practice, the high-part and low-part relocations may not be adjacent in assembly files, requiring the assembler to reorder relocations. http://reviews.llvm.org/D19718 performed the reordering, but did not optimize for the common case where a %lo immediately follows its matching %hi. The quadratic time complexity could make sections with many relocations very slow to process. This patch implements the fast path, simplifies the code, and makes the behavior more similar to GNU assembler (for the .rel.mips_hilo_8b test). We also remove `OriginalSymbol`, removing overhead for other targets. Fix #104562 Pull Request: https://github.com/llvm/llvm-project/pull/104723	2024-08-23 00:05:20 -07:00
Fangrui Song	1a6bf94407	[MC] Remove ELFRelocationEntry::OriginalAddend For MIPS's o32 ABI (REL), https://reviews.llvm.org/D19718 introduced `OriginalAddend` to find the matching R_MIPS_LO16 relocation for R_MIPS_GOT16 when STT_SECTION conversion is applicable. lw $2, %lo(local1) lui $2, %got(local1) However, we could just store the original `Addend` in `ELFRelocationEntry` and remove `OriginalAddend`. Note: The relocation ordering algorithm in https://reviews.llvm.org/D19718 is inefficient (#104562), which will be addressed by another patch.	2024-08-18 15:47:38 -07:00
Fangrui Song	bf5cd4220d	[MIPS] Remove expensive LLVM_DEBUG relocation dump The input is usually ordered by offset, so inspecting the output is sufficient. The super expensive relocation dump is not conventional.	2024-08-18 11:24:44 -07:00
Craig Topper	ebe7265b14	[Mips] Fix fast isel for i16 bswap. (#103398 ) We need to mask the SRL result to 8 bits before ORing in the SLL. This is needed in case bits 23:16 of the input aren't zero. They will have been shifted into bits 15:8. We don't need to AND the result with 0xffff. It's ok if the upper 16 bits of the register are garbage. Fixes #103035.	2024-08-16 14:54:51 -07:00
Craig Topper	91c3a718b2	[Mips] ISel zext nneg the same as sext for Mips64. (#102852 ) Fixes #62587.	2024-08-12 13:47:27 -07:00
yingopq	e711a0c80f	[MIPS] Fix missing ANDI optimization (#97689 ) 1. Add MipsPat to optimize (andi (srl (truncate i64 $1), x), y) to (andi (truncate (dsrl i64 $1, x)), y). 2. Add MipsPat to optimize (ext (truncate i64 $1), x, y) to (truncate (dext i64 $1, x, y)). The assembly result is the same as gcc. Fixes https://github.com/llvm/llvm-project/issues/42826	2024-08-09 18:55:21 +01:00
Alexis Engelke	da0e66e64c	[CodeGen][NFC] Add wrapper method for MBBMap (#101893 ) This is a preparation for changing the data structure of MBBMap.	2024-08-04 18:34:26 +02:00
Sergei Barannikov	25bea3eb03	[MC] Forward declare ELFObjectWriter (#100989 )	2024-07-30 10:40:40 +03:00
Sergei Barannikov	7a2a36f952	[AsmPrinter] Don't EmitToStreamer instructions lowered by tblgenned code (#100803 ) This allows lowering individual instructions in a bundle before a single call to EmitToStreamer for VLIW targets.	2024-07-29 19:18:18 +03:00
Kai Nacke	a79db96ec0	[GISel][TableGen] Generate getRegBankFromRegClass (#99896 ) Generating the mapping from a register class to a register bank is complex: - there can be lots of register classes - the mapping may be ambiguous - a register class can span several register banks (e.g. a register class containing all registers) - the type information is not enough to decide which register bank to map to (e.g. a register class containing floating point and vector registers, and all registers can represent a f64 value) The approach taken here is to encode the register banks in an array indexed by the ID of the register class. To save space, the entries are packed into chunks of size 2^n.	2024-07-25 09:41:55 -04:00
Fangrui Song	c473e75ade	MCAssembler: Move ELFHeaderEFlags to ELFObjectWriter Now that MCELFStreamer can access ELFObjectWriter (commit 70c52b62c5669993e341664a63bfbe5245e32884), we can move ELFHeaderEFlags there.	2024-07-22 18:20:18 -07:00
Fangrui Song	6717dc5c47	*AsmBackend.cpp: Include StringSwitch.h They currently get the header from MCLinkerOptimizationHint.h, which will be removed from MCAssembler.h.	2024-07-21 11:17:19 -07:00
Fangrui Song	8f14e39e59	[MC] Remove unnecessary isVerboseAsm from Target::AsmTargetStreamerCtorTy The parameter is confusing as it duplicates MCStreamer::isVerboseAsm (initialized from MCTargetOptions::AsmVerbose). After 233cca169237b91d16092c82bd55ee6a283afe98, no in-tree target uses the parameter.	2024-07-21 10:19:17 -07:00
Joseph Huber	615b7eeaa9	Reapply "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit 740161a9b98c9920dedf1852b5f1c94d0a683af5. I moved the `ISD` dependencies into the CodeGen portion of the handling, it's a little awkward but it's the easiest solution I can think of for now.	2024-07-20 09:29:31 -05:00
NAKAMURA Takumi	740161a9b9	Revert "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit c05126bdfc3b02daa37d11056fa43db1a6cdef69. (llvmorg-19-init-17714-gc05126bdfc3b) See #99610	2024-07-20 12:36:57 +09:00
Matt Arsenault	0f0cfcff2c	CodeGen: Avoid some references to MachineFunction's getMMI (#99652 ) MachineFunction's probably should not include a backreference to the owning MachineModuleInfo. Most of these references were used just to query the MCContext, which MachineFunction already directly stores. Other contexts are using it to query the LLVMContext, which can already be accessed through the IR function reference.	2024-07-19 22:09:05 +04:00
Kazu Hirata	3e47f6ba4a	Reapply "[Target] Use range-based for loops (NFC) (#98844 )" This iteration drops hunks where the loop body adds more elements.	2024-07-17 19:39:04 -07:00
Amara Emerson	f270a4dd66	[AArch64] Don't tail call memset if it would convert to a bzero. (#98969 ) Well, not quite that simple. We can tc memset since it returns the first argument but bzero doesn't do that and therefore we can end up miscompiling. This patch also refactors the logic out of isInTailCallPosition() into the callers. As a result memcpy and memmove are also modified to do the same thing for consistency. rdar://131419786	2024-07-17 01:31:52 -07:00
Joseph Huber	c05126bdfc	[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 ) Summary: The LTO pass and LLD linker have logic in them that forces extraction and prevents internalization of needed runtime calls. However, these currently take all RTLibcalls into account, even if the target does not support them. The target opts out of a libcall if it sets its name to nullptr. This patch pulls this logic out into a class in the header so that LTO / lld can use it to determine if a symbol actually needs to be kept. This is important for targets like AMDGPU that want to be able to use `lld` to perform the final link step, but do not want the overhead of uncalled functions. (This adds like a second to the link time trivially)	2024-07-16 06:22:09 -05:00
Kazu Hirata	515618e245	Revert "[Target] Use range-based for loops (NFC) (#98844 )" This reverts commit 3614f65a7ba9d925010e3316a1d93bcebc632178. fixupImmediateBr seems to resize ImmBranches.	2024-07-15 20:39:49 -07:00
Kazu Hirata	3614f65a7b	[Target] Use range-based for loops (NFC) (#98844 )	2024-07-15 17:23:11 -07:00
Joseph Huber	3f1a767572	[LLVM] Factor disabled Libcalls into the initializer (#98421 ) Summary: These Libcalls represent which functions are available to the backend. If a runtime call is not available, the target sets the name to `nullptr`. Currently, this logic is spread around the various targets. This patch pulls all of the locations that disable libcalls into the initializer. This patch is effectively NFC. The motivation behind this patch is that currently the LTO handling uses the list of all runtime calls to determine which functions cannot be internalized and must be extracted from static libraries. We do not want this to happen for libcalls that are not emitted by the backend. A follow-up patch will move out this logic so the LTO pass can know which rtlib calls are actually used by the backend.	2024-07-11 12:59:25 -05:00
Craig Topper	0df714364a	[ARM][Mips][PowerPC] Remove unnecessary static_cast creating GISel InstructionSelector. NFC Some targets only pass a TargetMachine & to their subtarget constructor and require a static_cast to their target-specific TargetMachine subclass to create *InstructionSelector. These 3 targets already have the correct TargetMachine subclass reference so no cast is needed.	2024-07-10 10:28:32 -07:00
Michael Maitland	9b95d08ef6	[GISel] Make create.*InstructionSelector arguments const (#98243 ) The InstructionSelector objects all take these arguments in as `const`. This function does not modify the object. Therefore we can mark them as `const` here.	2024-07-10 09:01:27 -04:00
Fangrui Song	057f28be3e	[MC] Remove unused MCAsmLayout declarations and includes	2024-07-01 17:47:13 -07:00
Fangrui Song	e25e8003ca	MCExpr::evaluateAsRelocatable: replace the MCAsmLayout parameter with MCAssembler Continue the MCAsmLayout removal work started by 67957a45ee1ec42ae1671cdbfa0d73127346cc95.	2024-07-01 16:23:43 -07:00
Fangrui Song	88c0a82588	[MC] Make MCAsmBackend::fixupNeedsRelaxation not pure virtual This hook only needs to be implemented if mayNeedRelaxation may return true.	2024-07-01 13:46:30 -07:00
Nikita Popov	4169338e75	[IR] Don't include Module.h in Analysis.h (NFC) (#97023 ) Replace it with a forward declaration instead. Analysis.h is pulled in by all passes, but not all passes need to access the module.	2024-06-28 14:30:47 +02:00
Nikita Popov	9df71d7673	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919 ) Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern.	2024-06-28 08:36:49 +02:00
Fangrui Song	0c454df448	[MC] Make changeSection private Using changeSection externally would cause `CurFrag` to be out of sync of `SectionStack`. Remove some uses from MipsTargetStreamer.cpp.	2024-06-27 21:01:52 -07:00
paperchalice	d38b518e04	Reapply "[CodeGen][NewPM] Port machine-branch-prob to new pass manager" (#96858 ) (#96869 ) This reverts commit ab58b6d58edf6a7c8881044fc716ca435d7a0156. In `CodeGen/Generic/MachineBranchProb.ll`, `llc` crashed with dumped MIR when targeting PowerPC. Move test to `llc/new-pm`, which is X86 specific.	2024-06-28 10:59:23 +08:00
paperchalice	ab58b6d58e	Revert "[CodeGen][NewPM] Port machine-branch-prob to new pass manager" (#96858 ) Reverts llvm/llvm-project#96389 Some ppc bots failed.	2024-06-27 15:00:17 +08:00
paperchalice	73e46c2bb4	[CodeGen][NewPM] Port machine-branch-prob to new pass manager (#96389 ) Like IR version `print<branch-prob>`, there is also a `print<machine-branch-prob>`.	2024-06-27 14:04:51 +08:00
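
The i16 bswap fix listed above (ebe7265b14) can be illustrated with a small model of a 32-bit register. This is only a sketch of the arithmetic the commit message describes, not the FastISel code itself: the function names are hypothetical, and the "unmasked" variant is an assumed reconstruction of the failure mode (SRL result ORed in without masking, then the result ANDed with 0xffff), not a copy of the previous implementation.

```python
MASK32 = 0xFFFFFFFF  # model a 32-bit MIPS general-purpose register


def bswap16_unmasked(reg: int) -> int:
    """Assumed failure mode: OR the raw SRL result, then AND with 0xffff."""
    sll = (reg << 8) & MASK32
    srl = reg >> 8  # bits 23:16 of reg land in bits 15:8 here
    return (sll | srl) & 0xFFFF


def bswap16_masked(reg: int) -> int:
    """Described fix: mask the SRL result to 8 bits before ORing in the SLL."""
    sll = (reg << 8) & MASK32
    srl = (reg >> 8) & 0xFF
    # No final AND with 0xffff: the upper 16 bits are allowed to be garbage.
    return (sll | srl) & MASK32


# An i16 value 0x1234 held in a register whose bits 23:16 are not zero.
reg = 0x00AB1234
unmasked_low16 = bswap16_unmasked(reg) & 0xFFFF
masked_low16 = bswap16_masked(reg) & 0xFFFF
print(hex(unmasked_low16), hex(masked_low16))
```

Only the low 16 bits of the result are meaningful for an i16, which is why the comparison looks at `& 0xFFFF` of each result rather than the whole register.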

4995 Commits