llvm-project

Author	SHA1	Message	Date
Dhruv Chawla	843a978b6f	[GlobalISel] Add support to moreElementsVector for G_SEXT, G_ZEXT and G_ANYEXT (#85038 )	2024-03-18 07:46:17 +05:30
David Green	601e102bdb	[CodeGen] Use LocationSize for MMO getSize (#84751 ) This is part of #70452 that changes the type used for the external interface of MMO to LocationSize as opposed to uint64_t. This means the constructors take LocationSize, and convert ~UINT64_C(0) to LocationSize::beforeOrAfter(). The getSize methods return a LocationSize. This allows us to be more precise with unknown sizes, not accidentally treating them as unsigned values, and in the future should allow us to add proper scalable vector support but none of that is included in this patch. It should mostly be an NFC. Global ISel is still expected to use the underlying LLT as it needs, and are not expected to see unknown sizes for generic operations. Most of the changes are hopefully fairly mechanical, adding a lot of getValue() calls and protecting them with hasValue() where needed.	2024-03-17 18:15:56 +00:00
Jay Foad	fd3eaf76ba	[GISel] Enforce G_PTR_ADD RHS type matching index size for addr space (#84352 )	2024-03-09 09:07:22 +00:00
Michael Maitland	96049fcf4e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) Recommits llvm/llvm-project#80378 which was reverted in llvm/llvm-project#84330. The problem was that the change in llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir used 217 as an opcode instead of a regex.	2024-03-07 09:10:03 -08:00
Michael Maitland	552da24843	Revert "[GISEL] Add IRTranslation for shufflevector on scalable vector types" (#84330 ) Reverts llvm/llvm-project#80378 causing Buildbot failures that did not show up with check-llvm or CI.	2024-03-07 10:16:31 -05:00
Michael Maitland	2b8aaef09e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) This patch is stacked on https://github.com/llvm/llvm-project/pull/80372, https://github.com/llvm/llvm-project/pull/80307, and https://github.com/llvm/llvm-project/pull/80306. ShuffleVector on scalable vector types gets IRTranslate'd to G_SPLAT_VECTOR since a ShuffleVector that has operates on scalable vectors is a splat vector where the value of the splat vector is the 0th element of the first operand, because the index mask operand is the zeroinitializer (undef and poison are treated as zeroinitializer here). This is analogous to what happens in SelectionDAG for ShuffleVector. `buildSplatVector` is renamed to`buildBuildVectorSplatVector`. I did not make this a separate patch because it would cause problems to revert that change without reverting this change too.	2024-03-07 09:50:29 -05:00
Tuan Chuong Goh	13a78fd1ac	[AArch64][GlobalISel] Re-commit Legalize G_SHUFFLE_VECTOR for Odd-Sized Vectors (#83038 ) Legalize smaller/larger than legal vectors with i8 and i16 element sizes. Vectors with elements smaller than i8 will get widened to i8 elements.	2024-03-04 15:03:55 +00:00
chuongg3	4a5ec3cec8	Revert "[AArch64][GlobalISel] Legalize G_SHUFFLE_VECTOR for Odd-Sized Vectors" (#83544 ) Reverts llvm/llvm-project#83038 due to failing build in Fuchsia build https://lab.llvm.org/staging/#/builders/187/builds/1695	2024-03-01 08:56:34 +00:00
chuongg3	a344db793a	[AArch64][GlobalISel] Legalize G_SHUFFLE_VECTOR for Odd-Sized Vectors (#83038 ) Legalize Smaller/Larger than legal vectors with i8 and i16 element sizes. Vectors with elements smaller than i8 will get widened to i8 elements.	2024-02-29 16:31:05 +00:00
Dhruv Chawla (work)	2c9b6c1b36	[AArch64][GlobalISel] Improve codegen for G_VECREDUCE_{SMIN,SMAX,UMIN,UMAX} for odd-sized vectors (#82740 ) i8 vectors do not have their sizes changed as I noticed regressions in some tests when that was done. This patch also adds support for most G_VECREDUCE_* operations to moreElementsVector in LegalizerHelper.cpp. The code for getting the "neutral" element is taken almost exactly as it is in SelectionDAG, with the exception that support for G_VECREDUCE_{FMAXIMUM,FMINIMUM} was not added. The code for SelectionDAG is located at SelectionDAG::getNeutralELement().	2024-02-27 15:57:46 +05:30
chuongg3	0fb3d4296f	[AArch64][GlobalISel] Refactor BITCAST Legalization (#80505 ) Ensure BITCAST is only legal for types with the same amount of bits. Enable BITCAST to work with non-legal vector types as well.	2024-02-21 13:24:45 +00:00
Owen Anderson	44b717df4d	[GlobalISel] Clamp out-of-range G_EXTRACT_VECTOR_ELT constant indices when converting them into loads. (#82460 ) This avoid turning a poison value into a segfault, and fixes https://github.com/llvm/llvm-project/issues/78383	2024-02-21 00:42:22 -05:00
David Green	3a77522387	[AArch64][GlobalISel] Improve and expand fcopysign lowering (#71283 ) This alters the lowering of G_COPYSIGN to support vector types. The general idea is that we just lower it to vector operations using and/or and a mask, which are now converted to a BIF/BIT/BSP. In the process the existing AArch64LegalizerInfo::legalizeFCopySign can be removed, replying on expanding the scalar versions to vector instead, which just needs a small adjustment to allow widening scalars to vectors.	2024-02-17 10:19:27 +00:00
David Green	47c65cf62d	[AArch64][GlobalISel] Fail legalization for unknown libcalls. (#81873 ) If, like powi on windows, the libcall is unavailable we should fall back to SDAG. Currently we try and generate a call to "".	2024-02-17 08:57:14 +00:00
Mikhail Gudim	35cfaeced4	[GlobalIsel] Lower integer constants to constant pool in `LegalizerHelper`. (#81957 ) Extend LegalizerHelper's API to lower integer constants to a load from constant pool. Previously, this lowering existed only for FP constants. Apply this change to RISCV.	2024-02-16 18:51:44 -05:00
Jay Foad	d57515bd10	[LLT] Add and use isPointerVector and isPointerOrPointerVector. NFC. (#81283 )	2024-02-13 08:21:35 +00:00
chuongg3	2c552d319a	[AArch64][GlobalISel] Legalize G_ABS for Larger/Smaller Vectors (#79117 ) Legalize G_ABS for larger/smaller width vectors with legal element sizes Fallsback for the smaller width vector tests because it is unable to legalize for G_ANYEXT smaller width vectors	2024-01-28 20:21:38 +00:00
David Green	f297d0bc6d	[AArch64][GlobalISel] More FCmp legalization. (#78734 ) This fills out the fcmp handling to be more like the other instructions, adding better support for fp16 and some larger vectors. Select of f16 values is still not handled optimally in places as the select is only legal for s32 values, not s16. This would be correct for integer but not necessarily for fp. It is as if we need to do legalization -> regbankselect -> extra legaliation -> selection.	2024-01-28 15:42:36 +00:00
Kai Nacke	f2d0bba874	[GISel] Lower scalar G_SELECT in LegalizerHelper (#79342 ) The LegalizerHelper only has support to lower G_SELECT with vector operands. The approach is the same for scalar arguments, which this PR adds.	2024-01-26 09:11:29 -05:00
chuongg3	bfef161a80	[AArch64][GlobalISel] Legalize Shifts for Smaller/Larger Vectors (#78750 ) Legalize shl/lshr/ashr for smaller/larger vector widths with legal element sizes Smaller than legal vector types does not work at the moment as it relies on G_ANYEXT to work with smaller than legal vector types	2024-01-22 14:08:26 +00:00
Thorsten Schütt	67dc6e9075	[GlobalIsel][AArch64] more legal icmps (#78239 ) In https://github.com/llvm/llvm-project/pull/78181 the godbolt (https://llvm.godbolt.org/z/vMsnxMf1v) crashed with GlobalIsel. LLVM ERROR: unable to legalize instruction: %90:_(<3 x s32>) = G_ICMP intpred(uge), %15:_(<3 x s32>), %0:_ (in function: vec3_i32)	2024-01-17 22:23:51 +01:00
chuongg3	fcfe1b6482	[GlobalISel] Refactor extractParts() (#75223 ) Moved extractParts() and extractVectorParts() from LegalizerHelper to Utils to be able to use it in different passes. extractParts() will also try to use unmerge when doing irregular splits where possible, falling back to extract elements when not.	2024-01-15 16:40:39 +00:00
Serge Pavlov	7fc7ef1434	[GlobalISel] Lowering of {get,set,reset}_fpenv (#75086 ) The intrinsics get_fpenv, set_fpenv and reset_fpenv in this change are implemented as calls to math library functions. Target specific lowering will be implemented later on.	2024-01-10 14:18:00 +07:00
David Green	77b124cc57	[AArch64][GlobalISel] Add legalization for G_VECREDUCE_SEQ_FADD. (#76238 ) And G_VECREDUCE_SEQ_FMUL at the same time. They require the elements of the vector operand to be accumulated in order, so just need to be scalarized. Some of the operands are not simplified as much as they can quite yet due to not canonicalizing constant operands post-legalization.	2024-01-05 08:11:44 +00:00
Thomas Preud'homme	ce61b0e9a4	Add out-of-line-atomics support to GlobalISel (#74588 ) This patch implement the GlobalISel counterpart to 4d7df43ffdb460dddb2877a886f75f45c3fee188.	2024-01-04 10:15:16 +00:00
David Green	5550e9c841	[GlobalISel][AArch64] Add libcall lowering for fpowi. (#67114 ) This adds legalization, notably libcall lowering for fpowi. It is a little different to other methods as the function takes both a float and integer register. Otherwise all vectors get scalarized and fp16 is promoted to fp32.	2024-01-04 07:26:23 +00:00
David Green	d659bd1635	[GlobalISel][AArch64] Tail call libcalls. (#74929 ) This tries to allow libcalls to be tail called, using a similar method to DAG where the type is checked to make sure they match, and if so the backend, through lowerCall checks that the tailcall is valid for all arguments.	2024-01-03 07:59:36 +00:00
David Green	5b5614c92f	[AArch64][GlobalISel] Add legalization for vecreduce.fmul (#73309 ) There are no native operations that we can use for floating point mul, so lower by splitting the vector into chunks multiple times. There is still a missing fold for fmul_indexed, that could help the gisel test cases a bit.	2024-01-03 07:49:20 +00:00
Michael Maitland	6f9cb9a75c	[RISCV][GISEL] Legalize G_VAARG through expansion. (#73065 ) G_VAARG can be expanded similiar to SelectionDAG::expandVAArg through LegalizerHelper::lower. This patch implements the lowering through this style of expansion. The expansion gets the head of the va_list by loading the pointer to va_list. Then, the head of the list is adjusted depending on argument alignment information. This gives a pointer to the element to be read out of the va_list. Next, the head of the va_list is bumped to the next element in the list. The new head of the list is stored back to the original pointer to the head of the va_list so that subsequent G_VAARG instructions get the next element in the list. Lastly, the element is loaded from the alignment adjusted pointer constructed earlier. This change is stacked on #73062.	2023-12-08 13:24:27 -05:00
Craig Topper	d605d9d7a1	[RISCV][GISel] Support G_ROTL/G_ROTR with Zbb. (#72825 )	2023-12-04 13:00:34 -08:00
Momchil Velikov	c1140d49ec	[AArch64] Stack probing for dynamic allocas in GlobalISel (#67123 ) Co-authored-by: Oliver Stannard <oliver.stannard@linaro.org>	2023-12-04 09:44:02 +00:00
David Green	295edaab13	[AArch64][GlobalISel] Better vecreduce.fadd lowering. (PR #73294 ) This changes the fadd legalization to handle fp16 types, and treats more types as legal so that the backend can produce the correct patterns. This is currently a missing identity fold for `fadd x -0.0 -> x`	2023-11-27 08:20:54 +00:00
Craig Topper	5d501b1091	[GISel][RISCV] Fix several boundary cases in narrow G_SEXT_INREG. (#72719 ) This fixes cases when SizeInBits is a multiple of the narrow size. If SizeBits is equal to NarrowTy size, the first block would create an illegal G_SEXT_INREG where the the extension size is equal to the type. I tried to turn it into G_TRUNC+G_SEXT, but that just turned back into G_SEXT_INREG causing an infinite loop. So punt to the splitting case. In the for loop we should copy when the part ends on SizeInBits. In that case there is no G_SEXT_INREG needed for partial. But we should note that register in PartialExtensionReg for the first full part to use. If the part starts on SizeInBits then we should do an AShr of PartialExtensionReg. We should only get to the G_SEXT_INREG case if the SizeInBits is in the middle of the part.	2023-11-24 08:39:38 -08:00
Min-Yih Hsu	7c3c8a1277	[RISCV][GISel] Add support for G_IS_FPCLASS in F and D extensions (#72000 ) Add legalizer, regbankselect, and isel supports for floating point version of G_IS_FPCLASS.	2023-11-22 16:43:20 -08:00
Sander de Smalen	81b7f115fb	[llvm][TypeSize] Fix addition/subtraction in TypeSize. (#72979 ) It seems TypeSize is currently broken in the sense that: TypeSize::Fixed(4) + TypeSize::Scalable(4) => TypeSize::Fixed(8) without failing its assert that explicitly tests for this case: assert(LHS.Scalable == RHS.Scalable && ...); The reason this fails is that `Scalable` is a static method of class TypeSize, and LHS and RHS are both objects of class TypeSize. So this is evaluating if the pointer to the function Scalable == the pointer to the function Scalable, which is always true because LHS and RHS have the same class. This patch fixes the issue by renaming `TypeSize::Scalable` -> `TypeSize::getScalable`, as well as `TypeSize::Fixed` to `TypeSize::getFixed`, so that it no longer clashes with the variable in FixedOrScalableQuantity. The new methods now also better match the coding standard, which specifies that: * Variable names should be nouns (as they represent state) * Function names should be verb phrases (as they represent actions)	2023-11-22 08:52:53 +00:00
Acim-Maravic	f3138524db	[AMDGPU] Generic lowering for rint and nearbyint (#69596 ) The are three different rounding intrinsics, that are brought down to same instruction. Co-authored-by: Acim Maravic <acim.maravic@amd.com>	2023-11-14 18:49:21 +01:00
Craig Topper	44e8bea400	[GISel][AArch64] Notify the Observer when CTTZ lowering changes the opcode to CTPOP. (#72008 )	2023-11-12 19:36:24 -08:00
David Green	10ce319320	[AArch64][GlobalISel] Expand handling for sitofp and uitofp (#71282 ) Similar to #70635, this expands the handling of integer to fp conversions. The code is very similar to the float->integer conversions with types handled oppositely. There are some extra unhandled cases which require more handling for ASR operations.	2023-11-10 13:41:13 +00:00
David Green	54574d3272	[AArch64][GlobalISel] Expand handling for fptosi and fptoui (#70635 ) Now that we have more types handled for zext/sext and trunc, it is possible to get more types working for the vector float to integer conversions. This patch adds fp16, widening and narrowing vector support to handle more types. The smaller types wil be expanded to the size of the larger element type. A couple of case require more awkward truncates to get working as they go from illegal to illegal types.	2023-11-04 11:47:05 +00:00
Craig Topper	3750558ee1	[RISCV][GISel] Legalize G_SMULO/G_UMULO (#67635 ) Update `LegalizerHelper::widenScalarMulo` to not create a mulo if we aren't going to use the overflow flag. This prevents needing to legalize the widened operation. This generates better code when we need to make a libcall for multiply.	2023-10-13 20:34:45 -07:00
chuongg3	d88d9834e9	[AArch64][GlobalISel] Support more types for TRUNC (#66927 ) G_TRUNC will get lowered into trunc(merge(trunc(unmerge), trunc(unmerge))) if the source is larger than 128 bits or the truncation is more than half of the current bit size. Now mirrors ZEXT/SEXT code more closely for vector types.	2023-10-11 16:05:25 +01:00
Serge Pavlov	462d5830da	[GlobalISel] Add support for *_fpmode intrinsics The change implements support of the intrinsics `get_fpmode`, `set_fpmode` and `reset_fpmode` in Global Instruction Selector. Now they are lowered into library function calls. Differential Revision: https://reviews.llvm.org/D158260	2023-10-09 21:14:07 +07:00
Matt Arsenault	1328a8534b	AMDGPU: Fix handling of -0 in round lowering (#65761 )	2023-09-19 09:14:17 +03:00
Allen	eaf23b2480	[GIsel][AArch64] Legalize <2 x i16> for G_INSERT_VECTOR_ELT (#65830 ) Widen the vector elements to 64 bits to make sure it legal instead by clamping the number of elements. Depend on D153394. Fixes https://github.com/llvm/llvm-project/issues/63826	2023-09-12 21:15:01 +08:00
Jay Foad	71ca53b6cf	[GlobalISel] Lower G_SHUFFLE_VECTOR with scalar result (#65275 )	2023-09-04 13:32:43 -04:00
Matt Arsenault	b14e83d1a4	IR: Add llvm.exp10 intrinsic We currently have log, log2, log10, exp and exp2 intrinsics. Add exp10 to fix this asymmetry. AMDGPU already has most of the code for f32 exp10 expansion implemented alongside exp, so the current implementation is duplicating nearly identical effort between the compiler and library which is inconvenient. https://reviews.llvm.org/D157871	2023-09-01 19:45:03 -04:00
David Green	58a2f839fd	[AArch64][GISel] Expand coverage of FDiv and move into place. This adds some more extensive test coverage for fdiv through global isel, switching the opcodes to use the more complete ActionDefinitions to handle more cases and moving it into the position of the existing code which is no longer needed.	2023-08-30 22:09:53 +01:00
David Green	ef0b8cf3f4	[AArch64][GISel] Expand coverage of FAdd and FSub. This adds some more extensive test coverage for fadd/fsub through global isel, switching the opcodes to use the more complete ActionDefinitions to handle more cases.	2023-08-23 09:51:06 +01:00
Tuan Chuong Goh	a40c984976	[AArch64][GlobalISel] Support more legal types for EXTEND Expand (s/z/any)ext instructions to be compatible with more types for GlobalISel. This patch mainly focuses on 64-bit and 128-bit vectors with element size of powers of 2. It also notably handles larger than legal vectors. Differential Revision: https://reviews.llvm.org/D157113	2023-08-21 09:51:17 +01:00
Craig Topper	c6dee6982f	[GlobalISel][Mips] Sync G_UADDE and G_USUBE legalization with LegalizeDAG. This modifies the G_UADDE legalizaton to a version that looks shorter on Mips and RISC-V when feeding the equivalent IR to SelectionDAG. This also removes the boolean select from G_USUBE. Comments taken from LegalizeDAG and tweaked. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D158232	2023-08-17 20:36:55 -07:00

1 2 3 4 5 ...

586 Commits