llvm-project

Author	SHA1	Message	Date
Kazu Hirata	7ada7bbee1	[Target] Use *{Set,Map}::contains (NFC)	2023-03-14 18:06:55 -07:00
Ben Shi	cb45be2b4f	[RISCV][NFC] Combine identical switch cases in TTI Reviewed By: craig.topper, asb Differential Revision: https://reviews.llvm.org/D146008	2023-03-15 08:27:58 +08:00
Yeting Kuo	9637e950cb	[RISCV] Support ISD::STRICT_FADD/FSUB/FMUL/FDIV for vector types. The patch handles fixed type strict-fp by new RISCVISD::STRICT_ prefixed isd nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D145900	2023-03-15 07:47:16 +08:00
Fangrui Song	2f5fe16e6d	[RISCV][MC] Adjust conditions to emit R_RISCV_ADD/R_RISCV_SUB pairs D132262 tried to simplify `IsMetadataOrEHFrameSection` originally introduced in D127549 but caused a regression as `.quad` directives in ``` .section .note,"a",@note; note: .quad extern-note # extern is undefined .section .rodata,"a",@progbits; rodata: .quad extern-rodata # extern is undefined .section .nonalloc,"",@progbits; nw: .quad extern-nw ``` are incorrectly rejected: these differences may be link-time constants and are allowed in GNU assembler and LLVM MC's non-RISC-V ports. Relax the conditions to allow these cases. For A-B, A may be defined later, but this requiresFixups call has to eagerly make a decision. For now, emit ADD/SUB unless A is `.L`. This euristic handles many temporary label differences for .debug_ and .apple_types sections. Ideally we should delay the decision of PC-relative vs ADD/SUB until A is defined. Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D145474	2023-03-14 15:17:38 -07:00
Fangrui Song	0ddc283a29	[RISCV] A@plt-B+C: emit R_RISCV_PLT32 even if A is defined Follow-up to D143226 Currently we incorrectly emit R_RISCV_ADD32/R_RISCV_SUB32. Emit R_RISCV_PLT32 instead. The new behavior matches x86-64 and AArch64.	2023-03-14 15:16:58 -07:00
David Green	180865a500	[AArch64] Add FP16 broadcast and transpose costs The FP16 broadcast and transpose can always use the same instructions as are used for i16 vectors, with or without +fullfp16. This fills in some extra costs to make sure we get them right. Differential Revision: https://reviews.llvm.org/D146035	2023-03-14 21:25:18 +00:00
Craig Topper	a1e39f35c5	[RISCV] Merge getLoadFP*Imm into a single function. We currently have 3 functions and 3 lookup tables. This was the most expediant and obvious way to fix several bugs. This patch uses a single function and single lookup table. It uses APFloat::convert to convert from the half or double to single precision. If the conversion doesn't have any errors or lose any information we use the f32 table to finish the lookup. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D145897	2023-03-14 13:11:11 -07:00
Craig Topper	81a150656b	[TableGen][RISCV][Hexagon][LoongArch] Add a list of Predicates to HwMode. Use the predicate condition instead of checkFeatures in *GenDAGISel.inc. This makes the code similar to isel pattern predicates. checkFeatures is still used by code created by SubtargetEmitter so we can't remove the string. Backends need to be careful to keep the string and predicates in sync, but I don't think that's a big issue. I haven't measured it, but this should be a compile time improvement for isel since we don't have to do any of the string processing that's inside checkFeatures. Reviewed By: kparzysz Differential Revision: https://reviews.llvm.org/D146012	2023-03-14 13:00:38 -07:00
Jonas Paulsson	f8803919ad	[SystemZ] Clear NW flags on an ISD::SUB when reused as comparison. The SystemZ backend will try to reuse an existing subtraction of two values whenever they are to be compared for equality. This depends on the SystemZ subtraction instruction setting the condition code, which can also signal overflow. A later pass will remove the compare and reuse the CC from the subtraction directly. However, if that subtraction has the NSW flag set it will not include the overflow bit in the updated CC user. That was a bug which can lead to wrong results, as shown by a csmith program. Fixes: https://github.com/llvm/llvm-project/issues/61268 Reviewed By: nikic, uweigand Differential Revision: https://reviews.llvm.org/D145811	2023-03-14 19:46:41 +01:00
Alex Bradbury	084e413893	[RISCV] Fix regression due to interaction of MachineOutliner and MachineCopyPropagation D144535 enabled machine copy propagation for RISC-V and added it to the pass pipeline in addPreEmitPass2 (after the MachineOutliner). Unfortunately, the MachineCopyPropagation pass is unable to correctly analyse outlined functions, and will delete copy instructions where a register is set that is intended to be live-out. RISCVInstrInfo::buildOutlinedFrame will directly insert a JALR, while a similar function going through the normal codegen path would have a PseudoRet with operands indicating registers that are live-out. This patch does the simplest fix, which is to run MachineCopyPropagation before the MachineOutliner. Differential Revision: https://reviews.llvm.org/D146037	2023-03-14 17:55:11 +00:00
Simon Pilgrim	da570ef1b4	[DAG] Match select(icmp(x,y),sub(x,y),sub(y,x)) -> abd(x,y) patterns Pulled out of PowerPC, and added ABDS support as well (hence the additional v4i32 PPC matches) Differential Revision: https://reviews.llvm.org/D144789	2023-03-14 15:10:30 +00:00
pvanhout	1f1fea6c38	Reland: [DAG/AMDGPU] Use UniformityAnalysis in DAGISel Switch DAGISel over to UniformityAnalysis, which was one of the last remaining users of the DivergenceAnalysis. No explosions seen during internal testing so this looks like a smooth transition. Reviewed By: sameerds Differential Revision: https://reviews.llvm.org/D145918	2023-03-14 14:38:45 +01:00
LiaoChunyu	eb54254b6e	[RISCV] Return false from shouldFormOverflowOp when type is i8 and i16 i8 and i16 are not using overflow. Reduce the number of zero extension instructions. To reduce the uncertainty of the unknown, most of the checks of the virtual function are kept Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D143646	2023-03-14 20:42:55 +08:00
Nabeel Omer	2ebbcfa07e	[X86] Fix encoding for ATOMIC_LOGIC_OP Fixes PR#61384. Differential Revision: https://reviews.llvm.org/D145930	2023-03-14 11:39:43 +00:00
Luke Lau	a9d9616c0d	[RISCV][NFC] Share interleave mask checking logic This adds two new methods to ShuffleVectorInst, isInterleave and isInterleaveMask, so that the logic to check if a shuffle mask is an interleave can be shared across the TTI, codegen and the interleaved access pass. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D145971	2023-03-14 11:02:52 +00:00
pvanhout	0e79106fc9	Revert "[DAG/AMDGPU] Use UniformityAnalysis in DAGISel" This reverts commit 0022b5803fd4f5a4e9fcf233267c0ffa1b88f763.	2023-03-14 11:48:58 +01:00
pvanhout	0022b5803f	[DAG/AMDGPU] Use UniformityAnalysis in DAGISel Switch DAGISel over to UniformityAnalysis, which was one of the last remaining users of the DivergenceAnalysis. No explosions seen during internal testing so this looks like a smooth transition. Reviewed By: sameerds Differential Revision: https://reviews.llvm.org/D145918	2023-03-14 11:18:28 +01:00
Jay Foad	dc3882eace	[AMDGPU] Fix .amdhsa_shared_vgpr_count error checking for GFX11 Differential Revision: https://reviews.llvm.org/D145936	2023-03-14 09:05:32 +00:00
Chen Zheng	a3b57bca97	[PowerPC] remove side effect for some cases for saturate instructions Fixes #60684 Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D145353	2023-03-13 21:37:56 -04:00
Philip Reames	ca0cd670dc	[RISCV] Improve SK_Reverse shuffle costs for fixed length vectors As noted by @luke (https://reviews.llvm.org/D145953#inline-1409312), we were accounting for the cost of vector element size using vlenb whereas the expression can be constant folded for fixed length vectors. Differential Revision: https://reviews.llvm.org/D145973	2023-03-13 15:17:42 -07:00
Michal Paszkowski	dc4330a925	[SPIR-V] Promote arbitrary width ints to regular width After this patch all arbitrary size integers (smaller than 64 bits) in LLVM IR will be promoted to regular size type in SPIR-V (OpTypeInt 8/16/32/64). Differential Revision: https://reviews.llvm.org/D145137	2023-03-13 22:44:47 +01:00
Felipe de Azevedo Piovezan	dde08c9c48	[AArchExpandPseudo] Preserve instruction debug number in expansions This is an initial attempt at preserving debug information in the pseudo instruction expansion of the AArch backend. In particular, we preserve the instruction number required by the InstrRef implementation of live debug values. There are many other expansions that need to be considered, but the ones addressed in this commit should be extremely common, as they handle most arithmetic and logical instructions. Differential Revision: https://reviews.llvm.org/D145943	2023-03-13 15:29:15 -04:00
Hassnaa Hamdi	40a51e1afc	[AArch64][SVE]: custom lower AVGFloor/AVGCeil. -Lower AVGFloor(A, B) to: SRL(A) + SRL(B) + (A&B)&1. -Lower AVGCeil(A, B) to: SRL(A) + SRL(B) + (A\|B)&1. Differential Revision: https://reviews.llvm.org/D143283	2023-03-13 19:00:57 +00:00
Philip Reames	64fc41ad82	[RISCV] Extend SK_Broadcast costing to scalable vectors The existing scalable costing was just bad. No LMUL cost, no i1 specific costing, etc.. We had updated the fixed cost model, but none of the code is actually fixed length specific. Moving it down handles the scalable cases too.	2023-03-13 11:07:26 -07:00
Jonas Paulsson	a5b4291911	[SystemZ] NFC minor cleanup Don't create local pointers to Subtarget in places in SystemZTargetLowering as the class already has a Subtarget member. Review: Ulrich Weigand	2023-03-13 18:55:24 +01:00
Philip Reames	a37dfbb79c	[RISCV] Fallback to scalable lowering costs for fixed length vectors Fixed vector costs may be more precise, but the actual lowering will use scalable vectors if nothing better is available. During review, we noticed a case where fixed vector reverse can be improved cost model wise, that will follow seperately. Differential Revision: https://reviews.llvm.org/D145953	2023-03-13 10:07:57 -07:00
Yonghong Song	db3d2adecb	[BPF] Improve pruning to avoid generate more types in BTF Commit 3671bdbcd214("[BPF] Fix a BTF type pruning bug") fixed a pruning bug to allow generate more types. But the commit has a bug which permits to generate more types than necessary. The following is an example to illustrate the problem. struct t1 { int a; }; struct t2 { struct t1 p1; struct t1 p2; int b; }; int foo(struct t2 arg) { return arg->b; } The following is the part of BTF generation sequence: (1). 'struct t2 arg' -> 'struct t1 p1' In this step, the type 'struct t1' will be generated as a forward decl and the ptr type (to 'struct t1') will be stored in the internal type table. (2). now the second field 'struct t1 p2' will be processed. Since the ptr type (to 'struct t1') already in the type table, the existing logic strips out ptr modifier and is able to generate BTF type for 'struct t1'. In the above step (2), if CheckPointer is true (the type traversal chain including a struct member), 'ptr' modifier should be checked and the subsequent type generation should be skipped since the same case has been processed in visitDerivedType(). The issue is exposed when I am trying to use llvm15 to compile some internal bpf programs. The bpf skeleton put the whole ELF section (after striping some sections like dwarf) as a string. The large BTF section triggered the following error: bpf_object_with_struct_ops_test_prog_bpf/BpfObjectWithStructOpsTestProg.skel.h:222:23: error: string literal of length 140144 exceeds maximum length 65536 that C++ compilers are required to support [-Werror,-Woverlength-strings] return (const void *)"\ ^~ 1 error generated. Although adding -Wno-overlength-strings could workaround the issue, improving llvm BTF generation sounds better esp. for users using vmlinux.h. Differential Revision: https://reviews.llvm.org/D145816	2023-03-13 09:34:37 -07:00
Phoebe Wang	7ff0b9735d	[X86] Use llvm::Align for passing the alignment This should be a typo in `emitConstantSizeRepmov`. Both its caller and callee store the alignment in a 64-bit variables, no reason to truncate it to 32-bit. It results in alignment turns into 0 when larger than 0x100000000. Fixes #61348 Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D145863	2023-03-14 00:06:21 +08:00
Philip Reames	cfcf274245	[RISCV] Inline and delete RISCVTTIImpl::getSpliceCost [nfc] The code structure was copied from AArch64 which has a much more complicated splice cost model.	2023-03-13 08:55:32 -07:00
Philip Reames	21bca796d7	[RISCV] Use switch in RISCVTargetTransformInfo::getShuffleCost [nfc] Refactoring in advance of a semantic change.	2023-03-13 08:40:47 -07:00
Sjoerd Meijer	775451b66a	[AArch64] Cost-model vector splat LD1Rs to avoid unprofitable SLP vectorisation This slightly increases the costs of InsertElement instructions that are part of a vector splat sequence, i.e. a load, InsertElement and a shuffle (load + dup). The resulting LD1R is a high latency instruction, and this slight increase in costs avoids SLP vectorisation for a couple of cases where this isn't profitable. Fixes: https://github.com/llvm/llvm-project/issues/61047 Differential Revision: https://reviews.llvm.org/D145578	2023-03-13 14:52:09 +00:00
Simon Pilgrim	c7d844ea0f	[DAG] Use ISD::isBitwiseLogicOp in AND/OR/XOR checks. NFCI. There's additional cases we can cleanup (mainly in target code), but this tries to cleanup generic code and PPC which had an equivalent helper.	2023-03-13 13:39:02 +00:00
Paul Walker	3870857226	[SVE] Restrict cmp+and->pred_cmp isel to instances where the and is the sole user of the compare. Without the single use restriction we may replace the and with a more costly duplicated compare. Differential Revision: https://reviews.llvm.org/D145755	2023-03-13 12:42:00 +00:00
Jay Foad	23b0df72d2	[AMDGPU] Remove BoolToList class Replace all: foreach _ = BoolToList<cond>.ret in with: if cond then Thanks to Philip Reames for D145711 which enabled this.	2023-03-13 09:22:52 +00:00
Craig Topper	c0c4c725e9	[RISCV] Return false for unsupported VTs in isFPImmLegal. I don't have a test case that fails for this, but it seemed like we should only handle legal types. The callers I looked at in DAGCombine either check the type is legal or don't even call isFPImmLegal unless LegalOperations is true. Written in a slightly odd way because switches on EVT require an additional isSimple check so an if/else chain is easier. Used a bool to shorten the code instead of having multiple ifs and returns. AArch64 uses a similarish structure.	2023-03-12 23:34:13 -07:00
Craig Topper	bf4b9857d0	Recommit "[RISCV] Add separate lookup tables for fli.h and fli.d." Fix mistake in f16 table in previous patch. Original commit message: Use separate lookup tables instead of trying to reuse the fli.s table. We were missing the 2 denormal cases for fli.h. We also had an issue where fli.d was only checking 8 bits of the 11 bit exponent.	2023-03-12 22:17:56 -07:00
Craig Topper	ffc18c3339	Revert "[RISCV] Add separate lookup tables for fli.h and fli.d." This reverts commit ebc11b68412cdcf2a0e6e2c50df262cfd9b8f481. I made a mistake in the f16 table. Will fix and recommit.	2023-03-12 22:06:48 -07:00
Craig Topper	c3cb4e4105	Revert "[RISCV] Remove unused function declaration. NFC" This reverts commit 5589c3cf752383fa42eaacc13fb985653ae528b6. I need to revert an earlier patch to fix a mistake.	2023-03-12 22:03:07 -07:00
Craig Topper	5589c3cf75	[RISCV] Remove unused function declaration. NFC	2023-03-12 21:23:39 -07:00
Vitaly Buka	ea4cbbbfa4	[SPARC] Attempt to fix bug introduced by D142458 Reported https://lab.llvm.org/buildbot/#/builders/5/builds/32113	2023-03-12 18:32:24 -07:00
Michal Paszkowski	147ff1b494	[SPIR-V] Fix llvm deprecated warnings Fixes warnings related to getAllOnesValue and isNullValue being deprecated. Differential Revision: https://reviews.llvm.org/D145709	2023-03-12 20:33:42 +01:00
Craig Topper	ebc11b6841	[RISCV] Add separate lookup tables for fli.h and fli.d. Use separate lookup tables instead of trying to reuse the fli.s table. We were missing the 2 denormal cases for fli.h. We also had an issue where fli.d was only checking 8 bits of the 11 bit exponent.	2023-03-12 11:28:49 -07:00
Craig Topper	40582493f0	[RISCV] Add overrides of isLoadFromStackSlot/isStoreFromStackSlot signatures that don't have MemBytes. D145471 added overrides of the other signature to return MemBytes, but shouldn't have removed these overrides. These signatures will now call the MemBytes signature and ignore the MemBytes. This matches X86.	2023-03-12 09:52:49 -07:00
Alex Bradbury	989f3f080e	[RISCV][NFCI] Use common MCELFStreamer code for attribute emission D102894 introduced common code for the emission of ELF attributes. Our implementation in RISC-V predates this, and basically copies the Arm logic at the time. This patch removes that duplication and uses the shared logic instead. Differential Revision: https://reviews.llvm.org/D145570	2023-03-12 16:11:12 +00:00
Simon Pilgrim	9041682d2c	[DAG] Remove redundant isZExtFree(SDValue,VT) overrides. NFC. These implementations both match the TargetLoweringBase.isZExtFree implementation	2023-03-12 15:56:04 +00:00
Jon Chesterfield	d3dda422bf	[amdgpu][nfc] Replace ad hoc LDS frame recalculation with absolute_symbol MD Post ISel, LDS variables are absolute values. Representing them as such is simpler than the frame recalculation currently used to build assembler tables from their addresses. This is a precursor to lowering dynamic/external LDS accesses from non-kernel functions. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D144221	2023-03-12 13:47:48 +00:00
Paul Walker	2f887c9a76	[InstCombine] Extend SVEVectorFuseMulAddSub to support newly added "undef" intrinsics. D143767 will change the intrinsics used to lower floating-point svadd_x, svmul_x and svsub_x builtins. This will result in the combines added as part of D140200 to no longer fire in all cases. This patch extends the existing combines for contraction to cover fadd_u, fmul_u and fsub_u intrinsics. Differential Revision: https://reviews.llvm.org/D144413	2023-03-12 10:39:24 +00:00
Simon Pilgrim	47208f8d34	[X86] matchAddressRecursively - support zext(and(shl(x,c1)),c2) -> shl(zext(and(x, c2 >> c1),c1) This came about while investigating ways to handle D145468 in a more generic manner, which involves trying harder to fold and(zext(x),c) -> zext(and(x,c)) Alive2: https://alive2.llvm.org/ce/z/7fXtDt (generic fold) Differential Revision: https://reviews.llvm.org/D145855	2023-03-12 09:49:11 +00:00
Brad Smith	eee590ca4b	Revert "[SPARC] Lower BR_CC to BPr on 64-bit target whenever possible" This reverts commit 6590a372fa3f4582c04b4b179f90a3c728e75025.	2023-03-12 04:20:25 -04:00
Koakuma	6590a372fa	[SPARC] Lower BR_CC to BPr on 64-bit target whenever possible On 64-bit target, when doing i64 BR_CC where one of the comparison operands is a constant zero, try to fold the compare and BPcc into a BPr instruction. For all integers, EQ and NE comparison are available, additionally for signed integers, GT, GE, LT, and LE is also available. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D142461	2023-03-11 17:47:53 -05:00

1 2 3 4 5 ...

71427 Commits