llvm-project

Author	SHA1	Message	Date
Kazu Hirata	7ada7bbee1	[Target] Use *{Set,Map}::contains (NFC)	2023-03-14 18:06:55 -07:00
Simon Pilgrim	da570ef1b4	[DAG] Match select(icmp(x,y),sub(x,y),sub(y,x)) -> abd(x,y) patterns Pulled out of PowerPC, and added ABDS support as well (hence the additional v4i32 PPC matches) Differential Revision: https://reviews.llvm.org/D144789	2023-03-14 15:10:30 +00:00
Chen Zheng	a3b57bca97	[PowerPC] remove side effect for some cases for saturate instructions Fixes #60684 Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D145353	2023-03-13 21:37:56 -04:00
Simon Pilgrim	c7d844ea0f	[DAG] Use ISD::isBitwiseLogicOp in AND/OR/XOR checks. NFCI. There's additional cases we can cleanup (mainly in target code), but this tries to cleanup generic code and PPC which had an equivalent helper.	2023-03-13 13:39:02 +00:00
Yuanfang Chen	9aae408d55	[NFC] fix typo `funciton` -> `function` credits to @jmagee	2023-03-10 18:05:25 -08:00
esmeyi	5541f47326	[PowerPC] Check if the latch block is in the value list for the PHI before get the incoming value. Summary: Fixes #60990. There is a crash reported during Running pass 'Prepare loop for ppc preferred instruction forms'. The crash occurs in 32bit PowerPC. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D145350	2023-03-08 02:19:35 -05:00
Ting Wang	bd4562976c	[PowerPC][NFC] cleanup isEligibleForTCO The input parameter IsByValArg to isEligibleForTCO() is false in all cases, so it is considered redundant and should be removed. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D145028	2023-03-02 23:04:19 -05:00
Ting Wang	65f68812d3	[PowerPC] update PPCTTIImpl::supportsTailCallFor() check conditions This patch reuse `PPCTargetLowering::isEligibleForTCO()` to check `PPCTTIImpl::supportsTailCallFor()`. Fixes #59315 Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D140369	2023-02-28 22:29:16 -05:00
Simon Pilgrim	8757ce4901	[PowerPC] Replace PPCISD::VABSD cases with generic ISD::ABDU(X,Y) node A move towards using the generic ISD::ABDU nodes on more backends Also support ISD::ABDS for v4i32 types using the existing signbit flip trick PowerPC has a select(icmp_ugt(x,y),sub(x,y),sub(y,x)) -> abdu(x,y) combine that I intend to move to DAGCombiner in a future patch. The ABS(SUB(X,Y)) -> PPCISD::VABSD(X,Y,1) v4i32 combine wasn't legal (https://alive2.llvm.org/ce/z/jc2hLU) - so I've removed it, having already added the legal sub nsw tests equivalent. Differential Revision: https://reviews.llvm.org/D142313	2023-02-25 20:17:17 +00:00
Stefan Pintilie	b47473908b	[PowerPC] Add Binary Coded Decimal Assist Instructions This patch adds three instructions for Binary Coded Decimal (BCD). They are: cdtbcd, cbcdtd, addg6s. Reviewed By: amyk Differential Revision: https://reviews.llvm.org/D144068	2023-02-24 15:59:49 -05:00
Luke Lau	b02b1e0ed6	[LV][NFC] Use ElementCount for getMaxInterleaveFactor In order to allow targets to disable interleaving for scalable vectors, pass the entire VF's ElementCount to getMaxInterleaveFactor. This is based off of the approach used here: `8d36708507` The plan would then be to disable interleaving on scalable VFs on RISC-V in a follow up patch. See https://reviews.llvm.org/D143723#4132349 Reviewed By: reames Differential Revision: https://reviews.llvm.org/D144474	2023-02-22 10:15:05 +00:00
Brad Smith	5d585c9dd0	[PowerPC] Use member function to determine PowerPC Secure PLT Add a member function isPPC32SecurePlt() to determine whether Secure PLT is used by the target 32-bit PowerPC operating environment. Reviewed By: dim, maskray Differential Revision: https://reviews.llvm.org/D144444	2023-02-21 14:08:25 -05:00
Ting Wang	d567e06946	[PowerPC][NFC] refactor eligible check for tail call optimization The check logic for TCO is scattered in two functions: IsEligibleForTailCallOptimization_64SVR4() IsEligibleForTailCallOptimization(), and serves instruction selection phase only at this moment. This patch aims to refactor existing logic to export an API for TCO eligible query before instruction selection phase. Reviewed By: shchenz, nemanjai Differential Revision: https://reviews.llvm.org/D141673	2023-02-21 06:14:47 -05:00
esmeyi	fd226142fc	[AIX] Lower some memory intrinsics to millicode functions on AIX Summary: Currently we lower MEMCPY/MEMMOVE/MEMSET/BZERO to the corresponding libc functions. And the libc functions call the millicode functions on AIX. We can lower these intrinsics directly to save one call layer. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D143997	2023-02-20 22:25:49 -05:00
Brad Smith	4b09cb2b16	[PowerPC] Correctly use ELFv2 ABI on all OS's that use the ELFv2 ABI Add a member function isPPC64ELFv2ABI() to determine what ABI is used on the 64-bit PowerPC big endian operating environment. Reviewed By: nemanjai, dim, pkubaj Differential Revision: https://reviews.llvm.org/D144321	2023-02-20 18:11:24 -05:00
Kazu Hirata	f8f3db2756	Use APInt::count{l,r}_{zero,one} (NFC)	2023-02-19 22:04:47 -08:00
Fangrui Song	432caca39a	Simplify with hasFeature. NFC	2023-02-17 18:22:24 -08:00
Ting Wang	52a774fd4c	[PowerPC] remove XXSWAPD after load from CP which is a splat value If the value from constant-pool is a splat value of vector type, do not need swap after load from constant-pool. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D139491	2023-02-16 19:21:35 -05:00
Nemanja Ivanovic	56e41fcf50	[PowerPC] Bail out of FISel when lowering long calls We currently don't handle tail calls in fast-isel but we continue with the lowering when -mlongcall is specified and lower the calls normally. We should defer to SDISel for this so that it is lowered correctly. Differential revision: https://reviews.llvm.org/D123997	2023-02-16 16:15:32 -05:00
Kazu Hirata	7e6e636fb6	Use llvm::has_single_bit<uint32_t> (NFC) This patch replaces isPowerOf2_32 with llvm::has_single_bit<uint32_t> where the argument is wider than uint32_t.	2023-02-15 22:17:27 -08:00
Matt Arsenault	09dd4d870e	DAG: Remove hasBitPreservingFPLogic This doesn't make sense as an option. fneg and fabs are bit preserving by definition. If a target has some fneg or fabs instruction that are not bitpreserving it's incorrect to lower fneg/fabs to use it.	2023-02-14 10:25:24 -04:00
Kazu Hirata	64dad4ba9a	Use llvm::bit_cast (NFC)	2023-02-14 01:22:12 -08:00
Chen Zheng	6ee2f770ef	[PowerPC][GISel] add support for fpconstant Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D133340	2023-02-14 02:39:22 +00:00
Stefan Pintilie	2e47aafb02	[PowerPC] Fix float materialization patterns. Two of the float materialization patterns use the VSSRC regsiter class. This register class is not available before Power 8. The patterns will stay the same for Power 8 and up but must use the class F4RC for Power 7 and earlier. This patch fixes those patterns. Reviewed By: nemanjai, amyk, #powerpc Differential Revision: https://reviews.llvm.org/D142120	2023-02-13 10:18:53 -05:00
Samuel Parker	2a58be4239	[HardwareLoops] NewPM support. With the NPM, we're now defaulting to preserving LCSSA, so a couple of tests have changed slightly. Differential Revision: https://reviews.llvm.org/D140982	2023-02-13 09:46:31 +00:00
Kai Luo	96aaebd12e	[MachineCopyPropagation] Eliminate spillage copies that might be caused by eviction chain Remove spill-reload like copy chains. For example ``` r0 = COPY r1 r1 = COPY r2 r2 = COPY r3 r3 = COPY r4 <def-use r4> r4 = COPY r3 r3 = COPY r2 r2 = COPY r1 r1 = COPY r0 ``` will be folded into ``` r0 = COPY r1 r1 = COPY r4 <def-use r4> r4 = COPY r1 r1 = COPY r0 ``` Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D122118	2023-02-08 03:34:25 +00:00
Philip Reames	3be1ae24fb	[CodeGen] Add standard print/debug utilities to MVT Doing so makes it easier to do printf style debugging in idiomatic manner. I followed the code structure of Value with only the definition of dump being #ifdef out in non-debug builds. Not sure if this is the "right" option; we don't seem to have any single consistent scheme on how dump is handled. Note: This is a follow up to D143454 which did the same for EVT. Differential Revision: https://reviews.llvm.org/D143511	2023-02-07 10:50:14 -08:00
Archibald Elliott	62c7f035b4	[NFC][TargetParser] Remove llvm/ADT/Triple.h I also ran `git clang-format` to get the headers in the right order for the new location, which has changed the order of other headers in two files.	2023-02-07 12:39:46 +00:00
Ting Wang	1d8f13ae45	[PowerPC] add a peephole to remove redundant swap instructions after vector splats on P8 Vector store on P8 little endian will have swap instruction added before the store in PPCISelLowring. If the vector is generated by splat, the swap instruction can be eliminated. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D139691	2023-02-02 20:52:52 -05:00
James Y Knight	0be684ed97	[PowerPC] Switch to by-name matching for instructions (part 2 of 2). This is a follow-on to https://reviews.llvm.org/D134073. Currently, all of the "memri"-style complex operands, which contain both a register and an immediate, are encoded into a single field in the instruction definition. This requires complex encoders/decoders, and instruction definitions that insert and extract the correct parts of the bits. Now, switch to naming and encoding/decoding the sub-operands separately. Thus, we can now disable useDeprecatedPositionallyEncodedOperands. Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D137670	2023-02-02 15:28:45 -05:00
James Y Knight	4b43ef3e5c	[PowerPC] Switch to by-name matching for instructions (part 1 of 2). This is a follow-on to https://reviews.llvm.org/D134073. After https://reviews.llvm.org/D137653 we can now switch the PPC target away from positional operand matching. This patch fixes all of the "easy" cases. While this changes a large number of lines of tablegen source, it results in only a single non-comment change in the code generated by tablegen: the (unused) codegen-only "MTVRSAVEv" instruction was previously incorrectly encoding operand 0, and now encodes (correctly) operand 1. Changes which result in generated-code changes have been split off into the next (smaller) patch, for ease of review. Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D137661	2023-02-02 15:28:45 -05:00
Nemanja Ivanovic	c86f8d4276	[PowerPC] Don't crash when disassembling invalid immediate There is an assert in the disassembler functions to ensure that the immediate is the appropriate width. However, sometimes what is being disassembled is not instructions but data that happens to have the bit pattern of an existing instruction but invalid operands. It is valid for such things to exist in the text section so we don't want to crash when disassembling such a thing. This patch removes the asserts and produces a disassembler failure for such cases.	2023-02-02 12:39:49 -06:00
Nemanja Ivanovic	19311e0a2e	[PowerPC] Do not convert lwz to lwa if the offset is not a multiple of 4 The transform that converts this checks the alignment of the global object being accessed. However, there was no check for the offset within the global object which caused the compiler to produce a DS relocation for an unaligned address.	2023-01-31 09:54:29 -06:00
esmeyi	2224b53f06	[PowerPC] Improve materialization for immediates which is almost a 32 bit splat. Summary: Some 64 bit constants can be materialized with fewer instructions than we currently use. We consider a 64 bit immediate value divided into four parts, Hi16OfHi32 (bits 48...63), Lo16OfHi32 (bits 32...47), Hi16OfLo32 (bits 16...31), Lo16OfLo32 (bits 0...15). When any three parts are equal, the immediate can be treated as "almost" a splat of a 32 bit value in a 64 bit register. For such case, we can use 3 instructions to generate the splat and use 1 instruction to modify the different part: Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D139813	2023-01-31 06:02:17 -05:00
Nemanja Ivanovic	f68fc8d9d2	[PowerPC] Fix incorrect shift amount for build_vector The pattern for a build_vector node was incorrect for big endian subtargets.	2023-01-30 16:36:08 -06:00
Kazu Hirata	e078201835	[Target] Use llvm::count{l,r}_{zero,one} (NFC)	2023-01-28 09:23:07 -08:00
Kazu Hirata	f20b5071f3	[llvm] Use llvm::bit_floor instead of llvm::PowerOf2Floor (NFC)	2023-01-28 09:06:31 -08:00
Matt Arsenault	778cf5431c	IR: Add atomicrmw uinc_wrap and udec_wrap These are essentially add/sub 1 with a clamping value. AMDGPU has instructions for these. CUDA/HIP expose these as atomicInc/atomicDec. Currently we use target intrinsics for these, but those do no carry the ordering and syncscope. Add these to atomicrmw so we can carry these and benefit from the regular legalization processes.	2023-01-24 17:55:11 -04:00
Guillaume Chatelet	8b1d86aedf	[NFC] Deprecate SelectionDag::getLoad that takes alignment as unsigned	2023-01-24 09:42:36 +00:00
Jay Foad	073401e59c	[MC] Define and use MCInstrDesc implicit_uses and implicit_defs. NFC. The new methods return a range for easier iteration. Use them everywhere instead of getImplicitUses, getNumImplicitUses, getImplicitDefs and getNumImplicitDefs. A future patch will remove the old methods. In some use cases the new methods are less efficient because they always have to scan the whole uses/defs array to count its length, but that will be fixed in a future patch by storing the number of implicit uses/defs explicitly in MCInstrDesc. At that point there will be no need to 0-terminate the arrays. Differential Revision: https://reviews.llvm.org/D142215	2023-01-23 14:44:58 +00:00
Jay Foad	768aed1378	[MC] Make more use of MCInstrDesc::operands. NFC. Change MCInstrDesc::operands to return an ArrayRef so we can easily use it everywhere instead of the (IMHO ugly) opInfo_begin and opInfo_end. A future patch will remove opInfo_begin and opInfo_end. Also use it instead of raw access to the OpInfo pointer. A future patch will remove this pointer. Differential Revision: https://reviews.llvm.org/D142213	2023-01-23 11:31:41 +00:00
ShihPo Hung	5fb3a57ea7	[Cost] Add CostKind to getVectorInstrCost and its related users LoopUnroll estimates the loop size via getInstructionCost(), but getInstructionCost() cannot pass CostKind to getVectorInstrCost(). And so does getShuffleCost() to getBroadcastShuffleOverhead(), getPermuteShuffleOverhead(), getExtractSubvectorOverhead(), and getInsertSubvectorOverhead(). To address this, this patch adds an argument CostKind to these functions. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D142116	2023-01-21 05:29:24 -08:00
Sergei Barannikov	6ae84d668f	[MC] Use MCRegister instead of unsigned in MCInstPrinter (NFC) Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140654	2023-01-17 22:39:39 +03:00
Lei Huang	ee559b21b9	[P10] Fix the implementation for BRH Fixes the patterns for the brh instruction to include a clrldi when emitted. Reviewed By: amyk Differential Revision: https://reviews.llvm.org/D141697	2023-01-16 13:53:43 -06:00
Craig Topper	79858d1908	[CodeGen][Target] Remove uses of Register::isPhysicalRegister/isVirtualRegister. NFC Use isPhysical/isVirtual methods.	2023-01-13 23:12:48 -08:00
Dominik Adamski	6809af1a23	Revert "[OpenMP][OMPIRBuilder] Move SIMD alignment calculation to LLVM Frontend" This reverts commit ed01de67433174d3157e9d239d59dd465d52c6a5.	2023-01-13 14:38:17 -06:00
Dominik Adamski	ed01de6743	[OpenMP][OMPIRBuilder] Move SIMD alignment calculation to LLVM Frontend Currently default simd alignment is specified by Clang specific TargetInfo class. This class cannot be reused for LLVM Flang. If we move the default alignment field into TargetMachine class then we can create TargetMachine objects and query them to find SIMD alignment. Scope of changes: 1) Added information about maximal allowed SIMD alignment to TargetMachine classes. 2) Removed getSimdDefaultAlign function from Clang TargetInfo class. 3) Refactored createTargetMachine function. Reviewed By: jsjodin Differential Revision: https://reviews.llvm.org/D138496	2023-01-13 14:07:29 -06:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::geFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Kai Luo	d9630c34f4	[PowerPC][GISel] Select sync instructions required by atomic operations This is part of selecting `G_ATOMIC*` instructions. Select `isync`, `sync` and `lwsync` in GISel. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D141360	2023-01-11 16:25:46 +08:00
Nick Desaulniers	b50327eea6	[llvm][PPCISelDAGToDAG] rename ppc-codegen to ppc-isel Every other subclass of SelectionDAGISel calls this pass "<arch>-isel". No existing tests refer to ppc-codegen so this is purely a cosmetic change to bring the pass name in line with other architecture's SelectionDAGISel subclasses. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D140497	2023-01-09 15:24:25 -08:00

1 2 3 4 5 ...

7057 Commits