llvm-project

Author	SHA1	Message	Date
Albion Fung	2fd1520247	[PowerPC] Implemented mtmsr, mfspr, mtspr Builtins Implemented builtins for mtmsr, mfspr, mtspr on PowerPC; the patch is intended for XL Compatibility. Differential revision: https://reviews.llvm.org/D106130	2021-07-20 17:51:00 -05:00
Albion Fung	3434ac9e39	[PowerPC] Store, load, move from and to registers related builtins This patch implements store, load, move from and to registers related builtins, as well as the builtin for stfiw. The patch aims to provide feature parady with xlC on AIX. Differential revision: https://reviews.llvm.org/D105946	2021-07-20 15:46:14 -05:00
Victor Huang	1a762f93f8	[PowerPC] Add PowerPC cmpb builtin and emit target indepedent code for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch add the builtin and emit target independent code for __cmpb. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D105194	2021-07-20 13:06:22 -05:00
Stefan Pintilie	1a6dc92be7	[PowerPC] Inefficient register allocation of ACC registers results in many copies. ACC registers are a combination of four consecutive vector registers. If the vector registers are assigned first this often forces a number of copies to appear just before the ACC register is created. If the ACC register is assigned first then fewer copies are generated when the vector registers are assigned. This patch tries to force the register allocator to assign the ACC registers first and then the UACC registers and then the vector pair registers. It does this by changing the priority of the register classes. This patch also adds hints to help the register allocator assign UACC registers from known ACC registers and vector pair registers from known UACC registers. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105854	2021-07-20 10:53:40 -05:00
Kai Luo	e2ee27b20b	[PowerPC] Fallback to base's implementation of shouldExpandAtomicCmpXchgInIR and shouldExpandAtomicCmpXchgInIR If we can't decide `shouldExpandAtomicCmpXchgInIR` or `shouldExpandAtomicCmpXchgInIR` in PPC's implementation after https://reviews.llvm.org/rGb9c3941cd61de1e1b9e4f3311ddfa92394475f4b, resort to base's implementation. This fixes internal build of OpenMP which uses atomic operations on float. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D106234	2021-07-20 06:14:24 +00:00
Nemanja Ivanovic	35a18a981f	[PowerPC] Implement intrinsics for mtfsf[i] This provides intrinsics for emitting instructions that set the FPSCR (`mtfsf/mtfsfi`). The patch also conservatively marks the rounding mode as an implicit def for both since they both may set the rounding mode depending on the operands. Reviewed By: #powerpc, qiucf Differential Revision: https://reviews.llvm.org/D105957	2021-07-16 16:26:11 -05:00
Lei Huang	c8937b6cb9	[PowerPC] Implement XL compact math builtins Implement a subset of builtins required for compatiblilty with AIX XL compiler. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105930	2021-07-16 13:21:13 -05:00
Masoud Ataei	ee2068b30e	[PowerPC] Updated the error message of MASSV pass to mention vectorization is needed be enable on P8 and later targets. Differential Revision: https://reviews.llvm.org/D106091	2021-07-16 14:45:09 +00:00
Amy Kwan	ba627a32e1	[PowerPC] Update Refactored Load/Store Implementation, XForm VSX Patterns, and Tests This patch includes the following updates to the load/store refactoring effort introduced in D93370: - Update various VSX patterns that use to "force" an XForm, to instead just XForm. This allows the ability for the patterns to compute the most optimal addressing mode (and to produce a DForm instruction when possible) - Update pattern and test case for the LXVD2X/STXVD2X intrinsics - Update LIT test cases that use to use the XForm instruction to use the DForm instruction Differential Revision: https://reviews.llvm.org/D95115	2021-07-16 09:28:48 -05:00
Victor Huang	4eb107ccba	[PowerPC] Add PowerPC population count, reversed load and store related builtins and instrinsics for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtins and instrisics for population count, reversed load and store related operations. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D106021	2021-07-15 17:23:56 -05:00
Quinn Pham	de3956605a	[PowerPC] Fix popcntb XL Compat Builtin for 32bit This patch implements the `__popcntb` XL compatibility builtin for 32bit in the frontend and backend. This patch also updates tests for `__popcntb` and other XL Compat sync related builtins. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D105360	2021-07-15 13:19:47 -05:00
Bogdan Graur	442123cada	Fixes memory sanitizer 'use-of-uninitialized-value' diagnostic. Differential Revision: https://reviews.llvm.org/D106047	2021-07-15 11:17:04 +02:00
Kai Luo	b9c3941cd6	[PowerPC] Generate inlined quadword lock free atomic operations via AtomicExpand This patch uses AtomicExpandPass to implement quadword lock free atomic operations. It adopts the method introduced in https://reviews.llvm.org/D47882, which expand atomic operations post RA to avoid spilling that might prevent LL/SC progress. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D103614	2021-07-15 01:12:09 +00:00
Jinsong Ji	fe52296a34	[AIX] Enable dollar sign as PC in inlineasm $ is used as PC for PowerPC inlineasm, ELF use it, enable it for AIX XCOFF as well. Reviewed By: #powerpc, amyk, nemanjai Differential Revision: https://reviews.llvm.org/D105956	2021-07-14 13:37:52 +00:00
Victor Huang	18c19414eb	[PowerPC] Add PowerPC compare and multiply related builtins and instrinsics for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtins and instrisics for compare and multiply related operations. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D102875	2021-07-13 16:55:09 -05:00
Victor Huang	781929b423	[PowerPC][NFC] Power ISA features for Semachecking [NFC] This patch adds features for pwr7, pwr8, and pwr9 that can be used for semachecking builtin functions that are only valid for certain versions of ppc. Reviewed By: nemanjai, #powerpc Authored By: Quinn Pham <Quinn.Pham@ibm.com> Differential revision: https://reviews.llvm.org/D105501	2021-07-13 13:13:34 -05:00
Victor Huang	e4585d3f4e	Revert "[PowerPC][NFC] Power ISA features for Semachecking" This reverts commit 10e0cdfc6526578c8892d895c0448e77cb9ba876.	2021-07-13 13:13:34 -05:00
Amy Kwan	b5f4ac4c11	[PowerPC] Add FI alignment check if the addressing mode is DS/DQ-Form, emit X-Form if necessary. This patch adds a function that checks whether or not the frame index is aligned when the computed addressing mode is an aligned D-Form (DS, or DQ-Form). If the frame index appears to be unaligned, within these two modes, reset the mode to X-Form in order to fall back to selection X-Form loads. A test case is added to ensure that the test emits X-Form loads and not DQ-Form loads since the frame index is not aligned within the test case. Differential Revision: https://reviews.llvm.org/D105661	2021-07-13 12:31:52 -05:00
Arthur Eubanks	693bc04bf6	[OpaquePtr] Use GlobalValue::getValueType() more	2021-07-13 09:34:34 -07:00
Albion Fung	f1aca5ac96	[PowerPC] Fix L[D\|W]ARX Implementation LDARX and LWARX sometimes gets optimized out by the compiler when it is critical to the correctness of the code. This inline asm generation ensures that it preserved. Differential Revision: https://reviews.llvm.org/D105754	2021-07-13 11:02:07 -05:00
Victor Huang	10e0cdfc65	[PowerPC][NFC] Power ISA features for Semachecking [NFC] This patch adds features for pwr7, pwr8, and pwr9 that can be used for semachecking builtin functions that are only valid for certain versions of ppc. Reviewed By: nemanjai, #powerpc Authored By: Quinn Pham <Quinn.Pham@ibm.com> Differential revision: https://reviews.llvm.org/D105501	2021-07-13 10:51:25 -05:00
Qiu Chaofan	6fd9c1901f	[PowerPC] Fix typo in vector shuffle combining a22ecb4 fixed a crash on big endian subtargets. This commit fixes a typo in that commit which may cause miscompile.	2021-07-13 14:35:47 +08:00
Amy Kwan	35909ff6cf	[PowerPC] Fix the splat immediate in PPCMIPeephole depending on if we have an Altivec and VSX splat instruction. An assertion of the following can occur because Altivec and VSX splats use a different operand number for the immediate: ``` int64_t llvm::MachineOperand::getImm() const: Assertion `isImm() && "Wrong MachineOperand accessor"' failed. ``` This patch updates PPCMIPeephole.cpp assign the correct splat immediate. Differential Revision: https://reviews.llvm.org/D105790	2021-07-12 16:20:11 -05:00
Jinsong Ji	2377eca93c	[PowerPC] Custom Lowering BUILD_VECTOR for v2i64 for P7 as well The lowering for v2i64 is now guarded with hasDirectMove, however, the current lowering can handle the pattern correctly, only lowering it when there is efficient patterns and corresponding instructions. The original guard was added in D21135, and was for Legal action. The code has evloved now, this guard is not necessary anymore. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D105596	2021-07-12 17:56:10 +00:00
Albion Fung	ef49d925e2	[PowerPC] Implement trap and conversion builtins for XL compatibility This patch implements trap and FP to and from double conversions. The builtins generate code that mirror what is generated from the XL compiler. Intrinsics are named conventionally with builtin_ppc, but are aliased to provide the same builtin names as the XL compiler. Differential Revision: https://reviews.llvm.org/D103668	2021-07-12 11:04:17 -05:00
zhijian	841077a7e9	[AIX][XCOFF] Use bit order of has_vec and longtbtable bits as defined in AIX header debug.h Summary: The bit order of the has_vec and longtbtable bits in the traceback table generated by the XL compiler flipped at some point after v12.1. This is different from the definition is the AIX header debug.h. The change in the XL compiler that caused the deviation from the OS header definition was unintentional. Since both orderings are extant and the XL compiler runtime also expects the ordering defined by the OS, we will correct the output from LLVM to match the defined ordering given by the OS (which is also consistent with the Assembler Language Reference). Mitigation for traceback tables encoded with the wrong ordering is required for either ordering. Reviewers: XingXue, HubertTong Differential Revision: https://reviews.llvm.org/D105487	2021-07-09 11:06:46 -04:00
Kai Luo	55bd12d4b7	[PowerPC] Remove implicit use register after transformToImmFormFedByLI() When the instruction has imm form and fed by LI, we can remove the redundat LI instruction. Below is an example: ``` renamable $x5 = LI8 2 renamable $x4 = exact SRD killed renamable $x4, killed renamable $r5, implicit $x5 ``` will be converted to: ``` renamable $x5 = LI8 2 renamable $x4 = exact RLDICL killed renamable $x4, 62, 2, implicit killed $x5 ``` But when we do this optimization, we forget to remove implicit killed $x5 This bug has caused a lnt case error. This patch is to fix above bug. Reviewed By: #powerpc, shchenz Differential Revision: https://reviews.llvm.org/D85288	2021-07-09 04:42:54 +00:00
Matt Arsenault	9b057f647d	GlobalISel: Track original argument index in ArgInfo SelectionDAG's equivalents in ISD::InputArg/OutputArg track the original argument index. Mips relies on this, and its currently reinventing its own parallel CallLowering infrastructure which tracks these indexes on the side. Add this to help move towards deleting the custom mips handling.	2021-07-08 13:39:02 -04:00
Qiu Chaofan	a22ecb4508	[PowerPC] Fix i64 to vector lowering on big endian Lowering for scalar to vector would skip if current subtarget is big endian and the scalar is larger or equal than 64 bits. However there's some issue in implementation that SToVRHS may refer to SToVLHS's scalar size if SToVLHS is present, which leads to some crash.o Reviewed By: nemanjai, shchenz Differential Revision: https://reviews.llvm.org/D105094	2021-07-08 11:05:09 +08:00
Nemanja Ivanovic	6a06dbafa1	[PowerPC] Disable permuted SCALAR_TO_VECTOR on LE without direct moves There are some patterns involving the permuted scalar to vector node for which we don't have patterns without direct moves on little endian subtargets. This causes selection errors. While we can of course add the missing patterns, any additional effort to make this work is not useful since there is no support for any CPU that can run in little endian mode and does not support direct moves.	2021-07-07 13:50:49 -05:00
Zarko Todorovski	ee6ca9c7df	[AIX] Use VSSRC/VSFRC Register classes for f32/f64 callee arguments on P8 and above Adding usage of VSSRC and VSFRC when adding the live in registers on AIX. This matches the behaviour of the rest of PPC Subtargets. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D104396	2021-07-07 09:18:20 -04:00
Nemanja Ivanovic	3553698de7	[PowerPC] Re-enable combine for i64 BSWAP on targets without LDBRX The combine was disabled in 4e22c7265d86 as it caused failures in the ppc64be-multistage (bootstrap) bot. It turns out that the combine did not correctly update the MMO for the high load which caused aliased stores to be reported as unaliased. This patch fixes that problem and re-enables the combine.	2021-07-06 20:42:01 -05:00
Albion Fung	7d10dd60ce	[PowerPC] Implament Load and Reserve and Store Conditional Builtins This patch implaments the load and reserve and store conditional builtins for the PowerPC target, in order to have feature parody with xlC on AIX. Differential revision: https://reviews.llvm.org/D105236	2021-07-05 21:35:41 -05:00
Kai Luo	c063946476	[AIX] Adjust CSR order to avoid breaking ABI regarding traceback Allocate non-volatile registers in order to be compatible with ABI, regarding gpr_save. Quoted from https://www.ibm.com/docs/en/ssw_aix_72/assembler/assembler_pdf.pdf page55, > The preferred method of using GPRs is to use the volatile registers first. Next, use the nonvolatile registers > in descending order, starting with GPR31. This patch is based on @jsji 's initial draft. Tested on test-suite and SPEC, found no degradation. Reviewed By: jsji, ZarkoCA, xingxue Differential Revision: https://reviews.llvm.org/D100167	2021-07-03 04:45:26 +00:00
Matt Arsenault	99c7e918b5	GlobalISel: Use LLT in call lowering callbacks This preserves the memory type so the lowerings can rely on them.	2021-07-01 12:15:54 -04:00
Qiu Chaofan	07f0faed11	[NFC][Scheduler] Refactor tryCandidate to return boolean This patch changes return type of tryCandidate from void to bool: 1. Methods in some targets already follow this convention. 2. This would help if some target wants to re-use generic code. 3. It looks more intuitive if these try-method returns the same type. We may need to change return type of them from bool to some enum further, to make it less confusing. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D103951	2021-07-01 14:31:47 +08:00
zhijian	9a9e6189d7	[AIX][XCOFF][BUG-Fixed] need to switch back to text section after emit a dumy eh structure Summary: in the patch https://reviews.llvm.org/D103651 [AIX][XCOFF] generate eh_info when vector registers are saved according to the traceback table. when generate eh_info, it switch to other section, when it done, it need to switch back to text section again. Reviewers: Jason Liu Differential Revision: https://reviews.llvm.org/105195	2021-06-30 13:56:37 -04:00
Nemanja Ivanovic	4e22c7265d	[PowerPC] Disable combine 64-bit bswap(load) without LDBRX This causes failures on the big endian bootstrap bot. Disabling this combine temporarily until I can get a proper fix.	2021-06-25 15:11:22 -05:00
Qiu Chaofan	a08fc1361a	[PowerPC] Change VSRpRC allocation order On PowerPC, VSRpRC represents the pairs of even and odd VSX register, and VRRC corresponds to higher 32 VSX registers. In some cases, extra copies are produced when handling incoming VRRC arguments with VSRpRC. This patch changes allocation order of VSRpRC to eliminate this kind of copy. Stack frame sizes may increase if allocating non-volatile registers, and some other vector copies happen. They need fix in future changes. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D104855	2021-06-25 16:04:41 +08:00
Kai Luo	b904574b3d	[PowerPC] Move PPCBranchSelector as close to asm printer as possible Currently, PPCBranchSelector is not immediately preceding asm printer pass. `-debug-pass=Structure` gives ``` PowerPC Branch Selector Contiguously Lay Out Funclets StackMap Liveness Analysis Live DEBUG_VALUE analysis Lazy Machine Block Frequency Analysis Machine Optimization Remark Emitter Linux PPC Assembly Printer ``` After the patch ``` Contiguously Lay Out Funclets StackMap Liveness Analysis Live DEBUG_VALUE analysis PowerPC Branch Selector Lazy Machine Block Frequency Analysis Machine Optimization Remark Emitter Linux PPC Assembly Printer ``` Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D104762	2021-06-25 02:05:19 +00:00
Nemanja Ivanovic	dcccb2f594	[PowerPC] Fix bswap combine for big endian systems Commit 0464586ac515e8cfebe4c7615387fd625c8869f5 added a combine for a 64-bit load feeding a bswap but the implementation is only correct for little endian systems. This fixes it for big endian systems.	2021-06-24 18:04:50 -05:00
Martin Storsjö	42f74e8249	[llvm] Rename StringRef _lower() method calls to _insensitive() This is a mechanical change. This actually also renames the similarly named methods in the SmallString class, however these methods don't seem to be used outside of the llvm subproject, so this doesn't break building of the rest of the monorepo.	2021-06-25 00:22:01 +03:00
Nemanja Ivanovic	0464586ac5	[PowerPC] Combine 64-bit bswap(load) without LDBRX When targeting CPUs that don't have LDBRX, we end up producing code that is very inefficient and large for this common idiom. This patch just optimizes it two 32-bit LWBRX instructions along with a merge. This fixes https://bugs.llvm.org/show_bug.cgi?id=49610 Differential revision: https://reviews.llvm.org/D104836	2021-06-24 15:11:47 -05:00
zhijian	bd240b3d77	[AIX][XCOFF] generate eh_info when vector registers are saved according to the traceback table. Summary: generate eh_info when vector registers are saved according to the traceback table. struct eh_info_t { unsigned version; /* EH info version 0 / #if defined(64BIT) char _pad[4]; / padding / #endif unsigned long lsda; / Pointer to Language Specific Data Area / unsigned long personality; / Pointer to the personality routine */ }; the value of lsda and personality is zero when the number of vector registers saved is large zero and there is not personality of the function Reviewers: Jason Liu Differential Revision: https://reviews.llvm.org/D103651	2021-06-22 13:01:31 -04:00
Fangrui Song	59d90fe817	Simplify some typedef struct	2021-06-19 11:36:44 -07:00
David Spickett	e4ecd83fe9	[llvm][AArch64] Handle arrays of struct properly (from IR) This only applies to FastIsel. GlobalIsel seems to sidestep the issue. This fixes https://bugs.llvm.org/show_bug.cgi?id=46996 One of the things we do in llvm is decide if a type needs consecutive registers. Previously, we just checked if it was an array or not. (plus an SVE specific check that is not changing here) This causes some confusion when you arbitrary IR like: ``` %T1 = type { double, i1 }; define [ 1 x %T1 ] @foo() { entry: ret [ 1 x %T1 ] zeroinitializer } ``` We see it is an array so we call CC_AArch64_Custom_Block which bails out when it sees the i1, a type we don't want to put into a block. This leaves the location of the double in some kind of intermediate state and leads to odd codegen. Which then crashes the backend because it doesn't know how to implement what it's been asked for. You get this: ``` renamable $d0 = FMOVD0 $w0 = COPY killed renamable $d0 ``` Rather than this: ``` $d0 = FMOVD0 $w0 = COPY $wzr ``` The backend knows how to copy 64 bit to 64 bit registers, but not 64 to 32. It can certainly be taught how but the real issue seems to be us even trying to assign a register block in the first place. This change makes the logic of AArch64TargetLowering::functionArgumentNeedsConsecutiveRegisters a bit more in depth. If we find an array, also check that all the nested aggregates in that array have a single member type. Then CC_AArch64_Custom_Block's assumption of a type that looks like [ N x type ] will be valid and we get the expected codegen. New tests have been added to exercise these situations. Note that some of the output is not ABI compliant. The aim of this change is to simply handle these situations and not to make our processing of arbitrary IR ABI compliant. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D104123	2021-06-16 13:56:01 +00:00
Nemanja Ivanovic	821a8f680e	[PowerPC] Fix spilling of paired VSX registers We have added STXVP/LXVP for spilling and restoring the registers but we neglected to add FI elimination code for these. The result is that we end up producing impossible MachineInstr's that have register operands in place of immediates.	2021-06-15 14:13:17 -05:00
Arthur Eubanks	be5d454f3f	[NFC][OpaquePtr] Avoid calling getPointerElementType() Pointee types are going away soon. For this, we mostly just care about store/load types, which are already available without the pointee types. The other intrinsics always use i8*. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D103719	2021-06-15 09:53:12 -07:00
Arthur Eubanks	25b2126b9e	[NFC] Remove redundant variable Differential Revision: https://reviews.llvm.org/D103706	2021-06-15 09:53:11 -07:00
Kai Luo	1c450c3d7e	[PowerPC] Export 16 byte load-store instructions Export `lq`, `stq`, `lqarx` and `stqcx.` in preparation for implementing 16-byte lock free atomic operations on AIX. Add a new register class `g8prc` for these instructions, since these instructions require even-odd register pair. Reviewed By: nemanjai, jsji, #powerpc Differential Revision: https://reviews.llvm.org/D103010	2021-06-15 01:56:10 +00:00

1 2 3 4 5 ...

6613 Commits