llvm-project

Author	SHA1	Message	Date
Manish Kausik H	69192e0193	[LegalizeDAG] Optimize CodeGen for `ISD::CTLZ_ZERO_UNDEF` (#83039 ) Previously we had the same instructions being generated for `ISD::CTLZ` and `ISD::CTLZ_ZERO_UNDEF` which did not take advantage of the fact that zero is an invalid input for `ISD::CTLZ_ZERO_UNDEF`. This commit separates codegen for the two cases to allow for the optimization for the latter case. The details of the optimization are outlined in #82075 Fixes #82075 Co-authored-by: Manish Kausik H <hmamishkausik@gmail.com>	2024-07-08 14:01:32 +01:00
Matt Arsenault	7032076242	GlobalISel: Drop vector range metadata on bitcast lowering (#97279 ) If we are reinterpreting the type, the range metadata also needs to be converted. I believe the DAG has the same bug.	2024-07-01 15:26:09 +02:00
Matt Arsenault	2df2373eb8	DAG/GlobalISel: Set disjoint for or in copysign lowering (#97057 ) We masked out the sign bit from one value, and the non-sign bits from the other so there should be no common bits set. No idea how to test this on the DAG path, other than scraping the debug logs. A few targets hit this path with f16 values, but the resulting i16 ors get anyext promoted and lose the disjoint flag. In the fp128 case, PPC gets further and the or loses the flag somewhere else later. Adding a haveNoCommonBits assert shows this works though.	2024-06-28 23:03:39 +02:00
isuckatcs	937d79bc9d	[GlobalISel][AArch64][AMDGPU] Expand FPOWI into series of multiplication (#95217 ) SelectionDAG already converts FPOWI into a series of optimized multiplications, this patch introduces the same optimization into GlobalISel.	2024-06-28 09:57:50 +02:00
David Green	e887624aca	[AArch64][GlobalISel] Add fp128 and i128 fptosi/fptoui handling. (#95528 ) Any fp128 conversion needs to end up as a libcall, as will f32->i128 and f64->i128. f16 is a bit special, as the maximum range of the result fits in an i17, so it can be shrunk to an i64. Vectors with i128/fp128 types are scalarized.	2024-06-21 10:24:57 +01:00
Nikita Popov	f2f18459d4	Revert "Intrinsic: introduce minimumnum and maximumnum (#93841 )" As far as I can tell, this pull request was not approved, and did not go through an RFC on discourse. This reverts commit 89881480030f48f83af668175b70a9798edca2fb. This reverts commit 225d8fc8eb24fb797154c1ef6dcbe5ba033142da.	2024-06-21 08:34:04 +02:00
YunQiang Su	8988148003	Intrinsic: introduce minimumnum and maximumnum (#93841 ) Currently, on different platforms, the behavior of llvm.minnum is different if one operand is sNaN: When we compare sNaN vs NUM: ARM/AArch64/PowerPC: follow the IEEE754-2008's minNUM: return qNaN. RISC-V/Hexagon follow the IEEE754-2019's minimumNumber: return NUM. X86: Returns NUM but not the same as IEEE754-2019's minimumNumber, as +0.0 is not always greater than -0.0. MIPS/LoongArch/Generic: return NUM. LIBCALL: returns qNaN. So, let's introduce llvm.minimumnum/llvm.maximumnum, which always follow IEEE754-2019's minimumNumber/maximumNumber. Half-fix: #93033	2024-06-21 11:53:08 +08:00
Christudasan Devadasan	27bebc1161	[GISel] Unify multiple instances of getTypeForLLT (NFC) (#95577 ) Multiple static instances of this utility function have been found in different GlobalISel files. Unifying them by adding an instance in utils.cpp.	2024-06-15 18:11:32 +05:30
Farzon Lotfi	1d87433593	[x86] Add tan intrinsic part 4 (#90503 ) This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics. Which was discussed in this RFC: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 Much of this change was following how G_FSIN and G_FCOS were used. Changes: - `llvm/docs/GlobalISel/GenericOpcode.rst` - Document the `G_FTAN` opcode - `llvm/docs/LangRef.rst` - Document the tan intrinsic - `llvm/include/llvm/Analysis/VecFuncs.def` - Associate the tan intrinsic as a vector function similar to the tanf libcall. - `llvm/include/llvm/CodeGen/BasicTTIImpl.h` - Map the tan intrinsic to `ISD::FTAN` - `llvm/include/llvm/CodeGen/ISDOpcodes.h` - Define ISD opcodes for `FTAN` and `STRICT_FTAN` - `llvm/include/llvm/IR/Intrinsics.td` - Create the tan intrinsic - `llvm/include/llvm/IR/RuntimeLibcalls.def` - Define tan libcall mappings - `llvm/include/llvm/Target/GenericOpcodes.td` - Define the `G_FTAN` Opcode - `llvm/include/llvm/Support/TargetOpcodes.def` - Create a `G_FTAN` Opcode handler - `llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td` - Map `G_FTAN` to `ftan` - `llvm/include/llvm/Target/TargetSelectionDAG.td` - Define `ftan`, `strict_ftan`, and `any_ftan` and map them to the ISD opcodes for `FTAN` and `STRICT_FTAN` - `llvm/lib/Analysis/VectorUtils.cpp` - Associate the tan intrinsic as a vector intrinsic - `llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp` - Map the tan intrinsic to `G_FTAN` Opcode - `llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp` - Add `G_FTAN` to the list of floating point math operations also associate `G_FTAN` with the `TAN_F` runtime lib. - `llvm/lib/CodeGen/GlobalISel/Utils.cpp` - More floating point math operation common behaviors. - `llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp` - List the function expansion operations for `FTAN` and `STRICT_FTAN`. Also define both opcodes in `PromoteNode`. - `llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp` - More `FTAN` and `STRICT_FTAN` handling in the legalizer - `llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h` - Define `SoftenFloatRes_FTAN` and `ExpandFloatRes_FTAN`. - `llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp` - Define `FTAN` as a legal vector operation. - `llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp` - Define `FTAN` as a legal vector operation. - `llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp` - define tan as an intrinsic that doesn't return NaN. - `llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp` - Map `LibFunc_tan`, `LibFunc_tanf`, and `LibFunc_tanl` to `ISD::FTAN`. Map `Intrinsic::tan` to `ISD::FTAN` and add selection dag handling for `Intrinsic::tan`. - `llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp` - Define `ftan` and `strict_ftan` names for the equivalent ISD opcodes. - `llvm/lib/CodeGen/TargetLoweringBase.cpp` - Define a Tan128 libcall and ISD::FTAN as a target lowering action. - `llvm/lib/Target/X86/X86ISelLowering.cpp` - Add x86_64 lowering for tan intrinsic resolves https://github.com/llvm/llvm-project/issues/70082	2024-06-05 15:01:33 -04:00
Simon Pilgrim	4e251e7cad	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warning. NFC.	2024-05-29 17:57:34 +01:00
Yingwei Zheng	24ddce62c8	[GISel] Legalize bitreverse with types smaller than 8 bits (#92998 ) This patch adds support for lowering `bitreverse` with types smaller than 8 bits. It also fixes an existing assertion failure in `llvm::APInt::getSplat`: https://godbolt.org/z/7crs8xrcG The lowering logic is copied from SDAG: `2034f2fc87/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp (L9384-L9398)`	2024-05-29 21:42:08 +08:00
AtariDreams	d582958618	Revert "[Legalizer] Check full condition for UMIN and UMAX just like the code below does for SMIN and SMAX" (#93573 ) Reverts llvm/llvm-project#87932	2024-05-28 12:25:43 -04:00
AtariDreams	70bf139651	[Legalizer] Check full condition for UMIN and UMAX just like the code below does for SMIN and SMAX (#87932 )	2024-05-26 15:07:31 -04:00
Yingwei Zheng	821bcba00d	[GISel] Add narrowScalar/widenScalar support for `G_CONSTANT_FOLD_BARRIER` (#93031 ) Fixes an error that llc fails to legalize `G_CONSTANT_FOLD_BARRIER` with i16/i128: https://godbolt.org/z/f9n6xM3sv	2024-05-22 23:35:37 +08:00
David Green	34de2151e2	[AArch64][GlobalISel] Improve legalization of G_PTR_ADD (#91763 ) The testing we have for vector ptradd was a bit lacking. In adding tests this patch found a couple of issues mostly with the way v3 vectors of ptrs were sometimes legalized via i64, and with non-i64 additions. It does not attempt to fix the issue with mergevalues from returning vector ptrs.	2024-05-13 21:58:41 +01:00
Matt Arsenault	401658cb4b	AMDGPU: Fix vector handling of fptrunc_round	2024-04-24 12:42:55 +02:00
David Green	8d49ce1764	[GlobalISel][AArch64] Add LLRINT support (#88702 ) This hooks up G_INTRINSIC_LLRINT instructions, very similar to the lrint nodes that already exist. On AArch64 they are treated the same as lrint with the default return types.	2024-04-17 18:38:24 +01:00
Dávid Ferenc Szabó	2347020e4c	[GlobalISel] Fix fewerElementsVectorPhi to insert after G_PHIs (#87927 ) Currently the inserted mergelike instructions will be inserted at the location of the G_PHI. Seems like the behaviour was correct before, but the rework done in https://reviews.llvm.org/D114198 forgot to include the part which makes sure the instructions will be inserted after all the G_PHIs.	2024-04-15 11:01:55 +02:00
David Green	28d28d58be	[AArch64][GlobalISel] Extend scalar lrint legalization. (#88360 ) This extends the legalization of lrint, adding libcall support for fp128. The old vector legal types were removed as they were not being properly handled (vector lrint is a fairly new concept as far as I understand). They can be re-added properly in a followup.	2024-04-15 09:41:08 +01:00
Michael Maitland	8aa3a77eaf	[RISCV][GISEL] Legalize G_ZEXT, G_SEXT, and G_ANYEXT, G_SPLAT_VECTOR, and G_ICMP for scalable vector types This patch legalizes G_ZEXT, G_SEXT, and G_ANYEXT. If the type is a legal mask type, then the instruction is legalized as the element-wise select, where the condition on the select is the mask typed source operand, and the true and false values are 1 or -1 (for zero/any-extension and sign extension) and zero. If the type is a legal integer or vector integer type, then the instruction is marked as legal. The legalization of the extends may introduce a G_SPLAT_VECTOR, which needs to be legalized in this patch for the extend test cases to pass. A G_SPLAT_VECTOR is legal if the vector type is a legal integer or floating point vector type and the source operand is sXLen type. This is because the SelectionDAG patterns only support sXLen typed ISD::SPLAT_VECTORS, and we'd like to reuse those patterns. A G_SPLAT_VECTOR is custom legalized if it has a legal s1 element vector type and s1 scalar operand. It is legalized to G_VMSET_VL or G_VMCLR_VL if the splat is all ones or all zeros, respectively. In the case of a non-constant mask splat, we legalize by promoting the scalar value to s8. In order to get the s8 element vector back into s1 vector, we use a G_ICMP. In order for the splat vector and extend tests to pass, we also need to legalize G_ICMP in this patch. A G_ICMP is legal if the destination type is a legal bool vector and the LHS and RHS are legal integer vector types.	2024-04-03 15:27:15 -07:00
Shilei Tian	3a106e5b2c	[GlobalISel] Fold G_ICMP if possible (#86357 ) This patch tries to fold `G_ICMP` if possible.	2024-03-29 15:59:50 -04:00
Wang Pengcheng	610b9e23c5	[SDAG] Use shifts if ISD::MUL is illegal when lowering ISD::CTPOP (#86505 ) We can avoid libcalls. Fixes #86205	2024-03-29 15:38:39 +08:00
Michael Maitland	54a9f0e441	[RISCV][GISEL] Legalize, regbankselect, and instruction-select G_VSCALE (#85967 ) G_VSCALE should be lowered using VLENB. If the type is not sXLen it should be lowered using a G_VSCALE on the narrow type and a G_MUL. regbank select and instruction select are straightforward so we really only need to add tests to show it works.	2024-03-26 20:17:22 -04:00
David Green	fbc247367a	[AArch64][GlobalISel] Legalization for small anyext/sext/zext (#86438 ) Similar to #85625, some of the codegen is still far from optimal but this helps fix quite a few fallback cases.	2024-03-26 09:48:06 +00:00
Michael Maitland	9056ce8804	Revert "[RISCV][GISEL] Legalize G_VSCALE" This reverts commit 47681506ded30fada68f180b5e80f740bc76abcd. It is not consistent with SelectionDAG.	2024-03-25 11:46:02 -07:00
Michael Maitland	47681506de	[RISCV][GISEL] Legalize G_VSCALE G_VSCALE should be lowered using VLENB.	2024-03-25 10:44:58 -07:00
Yingwei Zheng	6c1932ffd8	[LLVM] Pass APInt by const reference. NFC. (#86278 ) This patch adjusts argument passing for `APInt` to improve the compile-time. Compile-time improvement: https://llvm-compile-time-tracker.com/compare.php?from=d1f182c895728d89c5c3d198b133e212a5d9d4a3&to=32d6611af69bf4e76373f9bc7d9649650f760e48&stat=instructions:u	2024-03-23 14:57:35 +08:00
Madhur Amilkanthwar	7bb87d5338	[AArch64][GlobalISel] Take abs scalar codegen closer to SDAG (#84886 ) This patch improves codegen for the scalar (<128 bits) version of the llvm.abs intrinsic by using the existing non-XOR based lowering. This takes the generated code closer to SDAG. Codegen with GISel for > 128 bit types is not very good with this method, so it is not applied there.	2024-03-21 09:54:03 +05:30
Dhruv Chawla	843a978b6f	[GlobalISel] Add support to moreElementsVector for G_SEXT, G_ZEXT and G_ANYEXT (#85038 )	2024-03-18 07:46:17 +05:30
David Green	601e102bdb	[CodeGen] Use LocationSize for MMO getSize (#84751 ) This is part of #70452 that changes the type used for the external interface of MMO to LocationSize as opposed to uint64_t. This means the constructors take LocationSize, and convert ~UINT64_C(0) to LocationSize::beforeOrAfter(). The getSize methods return a LocationSize. This allows us to be more precise with unknown sizes, not accidentally treating them as unsigned values, and in the future should allow us to add proper scalable vector support but none of that is included in this patch. It should mostly be an NFC. Global ISel is still expected to use the underlying LLT as it needs, and are not expected to see unknown sizes for generic operations. Most of the changes are hopefully fairly mechanical, adding a lot of getValue() calls and protecting them with hasValue() where needed.	2024-03-17 18:15:56 +00:00
Jay Foad	fd3eaf76ba	[GISel] Enforce G_PTR_ADD RHS type matching index size for addr space (#84352 )	2024-03-09 09:07:22 +00:00
Michael Maitland	96049fcf4e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) Recommits llvm/llvm-project#80378 which was reverted in llvm/llvm-project#84330. The problem was that the change in llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir used 217 as an opcode instead of a regex.	2024-03-07 09:10:03 -08:00
Michael Maitland	552da24843	Revert "[GISEL] Add IRTranslation for shufflevector on scalable vector types" (#84330 ) Reverts llvm/llvm-project#80378 causing Buildbot failures that did not show up with check-llvm or CI.	2024-03-07 10:16:31 -05:00
Michael Maitland	2b8aaef09e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) This patch is stacked on https://github.com/llvm/llvm-project/pull/80372, https://github.com/llvm/llvm-project/pull/80307, and https://github.com/llvm/llvm-project/pull/80306. ShuffleVector on scalable vector types gets IRTranslate'd to G_SPLAT_VECTOR since a ShuffleVector that operates on scalable vectors is a splat vector where the value of the splat vector is the 0th element of the first operand, because the index mask operand is the zeroinitializer (undef and poison are treated as zeroinitializer here). This is analogous to what happens in SelectionDAG for ShuffleVector. `buildSplatVector` is renamed to `buildBuildVectorSplatVector`. I did not make this a separate patch because it would cause problems to revert that change without reverting this change too.	2024-03-07 09:50:29 -05:00
Tuan Chuong Goh	13a78fd1ac	[AArch64][GlobalISel] Re-commit Legalize G_SHUFFLE_VECTOR for Odd-Sized Vectors (#83038 ) Legalize smaller/larger than legal vectors with i8 and i16 element sizes. Vectors with elements smaller than i8 will get widened to i8 elements.	2024-03-04 15:03:55 +00:00
chuongg3	4a5ec3cec8	Revert "[AArch64][GlobalISel] Legalize G_SHUFFLE_VECTOR for Odd-Sized Vectors" (#83544 ) Reverts llvm/llvm-project#83038 due to failing build in Fuchsia build https://lab.llvm.org/staging/#/builders/187/builds/1695	2024-03-01 08:56:34 +00:00
chuongg3	a344db793a	[AArch64][GlobalISel] Legalize G_SHUFFLE_VECTOR for Odd-Sized Vectors (#83038 ) Legalize Smaller/Larger than legal vectors with i8 and i16 element sizes. Vectors with elements smaller than i8 will get widened to i8 elements.	2024-02-29 16:31:05 +00:00
Dhruv Chawla (work)	2c9b6c1b36	[AArch64][GlobalISel] Improve codegen for G_VECREDUCE_{SMIN,SMAX,UMIN,UMAX} for odd-sized vectors (#82740 ) i8 vectors do not have their sizes changed as I noticed regressions in some tests when that was done. This patch also adds support for most G_VECREDUCE_* operations to moreElementsVector in LegalizerHelper.cpp. The code for getting the "neutral" element is taken almost exactly as it is in SelectionDAG, with the exception that support for G_VECREDUCE_{FMAXIMUM,FMINIMUM} was not added. The code for SelectionDAG is located at SelectionDAG::getNeutralElement().	2024-02-27 15:57:46 +05:30
chuongg3	0fb3d4296f	[AArch64][GlobalISel] Refactor BITCAST Legalization (#80505 ) Ensure BITCAST is only legal for types with the same amount of bits. Enable BITCAST to work with non-legal vector types as well.	2024-02-21 13:24:45 +00:00
Owen Anderson	44b717df4d	[GlobalISel] Clamp out-of-range G_EXTRACT_VECTOR_ELT constant indices when converting them into loads. (#82460 ) This avoids turning a poison value into a segfault, and fixes https://github.com/llvm/llvm-project/issues/78383	2024-02-21 00:42:22 -05:00
David Green	3a77522387	[AArch64][GlobalISel] Improve and expand fcopysign lowering (#71283 ) This alters the lowering of G_COPYSIGN to support vector types. The general idea is that we just lower it to vector operations using and/or and a mask, which are now converted to a BIF/BIT/BSP. In the process the existing AArch64LegalizerInfo::legalizeFCopySign can be removed, relying on expanding the scalar versions to vector instead, which just needs a small adjustment to allow widening scalars to vectors.	2024-02-17 10:19:27 +00:00
David Green	47c65cf62d	[AArch64][GlobalISel] Fail legalization for unknown libcalls. (#81873 ) If, like powi on windows, the libcall is unavailable we should fall back to SDAG. Currently we try and generate a call to "".	2024-02-17 08:57:14 +00:00
Mikhail Gudim	35cfaeced4	[GlobalIsel] Lower integer constants to constant pool in `LegalizerHelper`. (#81957 ) Extend LegalizerHelper's API to lower integer constants to a load from constant pool. Previously, this lowering existed only for FP constants. Apply this change to RISCV.	2024-02-16 18:51:44 -05:00
Jay Foad	d57515bd10	[LLT] Add and use isPointerVector and isPointerOrPointerVector. NFC. (#81283 )	2024-02-13 08:21:35 +00:00
chuongg3	2c552d319a	[AArch64][GlobalISel] Legalize G_ABS for Larger/Smaller Vectors (#79117 ) Legalize G_ABS for larger/smaller width vectors with legal element sizes. Falls back for the smaller width vector tests because it is unable to legalize G_ANYEXT for smaller width vectors.	2024-01-28 20:21:38 +00:00
David Green	f297d0bc6d	[AArch64][GlobalISel] More FCmp legalization. (#78734 ) This fills out the fcmp handling to be more like the other instructions, adding better support for fp16 and some larger vectors. Select of f16 values is still not handled optimally in places as the select is only legal for s32 values, not s16. This would be correct for integer but not necessarily for fp. It is as if we need to do legalization -> regbankselect -> extra legalization -> selection.	2024-01-28 15:42:36 +00:00
Kai Nacke	f2d0bba874	[GISel] Lower scalar G_SELECT in LegalizerHelper (#79342 ) The LegalizerHelper only has support to lower G_SELECT with vector operands. The approach is the same for scalar arguments, which this PR adds.	2024-01-26 09:11:29 -05:00
chuongg3	bfef161a80	[AArch64][GlobalISel] Legalize Shifts for Smaller/Larger Vectors (#78750 ) Legalize shl/lshr/ashr for smaller/larger vector widths with legal element sizes. Smaller than legal vector types do not work at the moment, as this relies on G_ANYEXT working with smaller than legal vector types.	2024-01-22 14:08:26 +00:00
Thorsten Schütt	67dc6e9075	[GlobalIsel][AArch64] more legal icmps (#78239 ) In https://github.com/llvm/llvm-project/pull/78181 the godbolt (https://llvm.godbolt.org/z/vMsnxMf1v) crashed with GlobalIsel. LLVM ERROR: unable to legalize instruction: %90:_(<3 x s32>) = G_ICMP intpred(uge), %15:_(<3 x s32>), %0:_ (in function: vec3_i32)	2024-01-17 22:23:51 +01:00
chuongg3	fcfe1b6482	[GlobalISel] Refactor extractParts() (#75223 ) Moved extractParts() and extractVectorParts() from LegalizerHelper to Utils to be able to use it in different passes. extractParts() will also try to use unmerge when doing irregular splits where possible, falling back to extract elements when not.	2024-01-15 16:40:39 +00:00
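
The FPOWI commit above (937d79bc9d) describes expanding a power with an integer exponent into a series of multiplications. A minimal Python sketch of that idea, using square-and-multiply, might look like the following. It is not the LLVM implementation: the function name `powi_by_squaring` is hypothetical, and handling negative exponents by taking a reciprocal at the end is an assumption made for illustration.

```python
def powi_by_squaring(x: float, n: int) -> float:
    """Compute x**n for an integer n with O(log |n|) multiplications.

    Illustrative sketch only; the name and the reciprocal handling of
    negative exponents are assumptions, not taken from the LLVM source.
    """
    exp = abs(n)
    result = 1.0
    base = x
    while exp:
        # Multiply in the current power of x when the low bit is set.
        if exp & 1:
            result *= base
        exp >>= 1
        # Square the base only if more exponent bits remain.
        if exp:
            base *= base
    # Assumption: a negative exponent is handled as 1 / x**|n|.
    # This raises ZeroDivisionError when x is zero and n is negative.
    return 1.0 / result if n < 0 else result


if __name__ == "__main__":
    print(powi_by_squaring(2.0, 10))
    print(powi_by_squaring(2.0, -2))
```

For an exponent of 10 the loop performs four multiplications of real operands (plus one by the initial 1.0) instead of nine, which is the kind of saving the expansion is after.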


614 Commits
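
The copysign commits in the log (2df2373eb8 and 3a77522387) describe lowering copysign to and/or operations with a mask, and note that the two masked values share no set bits. A minimal Python sketch of that reasoning for 64-bit floats is below. The helper names (`f64_to_bits`, `bits_to_f64`, `copysign_via_masks`) are hypothetical, and the sketch only models the bit arithmetic, not any LLVM API.

```python
import struct

SIGN_MASK = 0x8000_0000_0000_0000  # only the sign bit of an IEEE-754 double
MAG_MASK = 0x7FFF_FFFF_FFFF_FFFF   # every bit except the sign bit


def f64_to_bits(x: float) -> int:
    """Reinterpret a double as its 64-bit integer pattern."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def bits_to_f64(bits: int) -> float:
    """Reinterpret a 64-bit integer pattern as a double."""
    return struct.unpack("<d", struct.pack("<Q", bits))[0]


def copysign_via_masks(mag: float, sign: float) -> float:
    """Return mag with the sign of sign, using only and/or on the bits."""
    mag_bits = f64_to_bits(mag) & MAG_MASK     # sign bit masked out
    sign_bit = f64_to_bits(sign) & SIGN_MASK   # non-sign bits masked out
    # The two operands cannot share a set bit, so the 'or' is disjoint.
    assert mag_bits & sign_bit == 0
    return bits_to_f64(mag_bits | sign_bit)


if __name__ == "__main__":
    print(copysign_via_masks(3.0, -1.0))
    print(copysign_via_masks(-2.5, 1.0))
```

Because the masks are complements of each other, the internal assertion holds for every input, which mirrors the reasoning the commit message gives for marking the or as disjoint.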