llvm-project

Author	SHA1	Message	Date
Michael Maitland	2f400a2fd7	[GISEL] Add G_VSCALE instruction (#84542 )	2024-03-12 20:22:49 -04:00
Michael Maitland	034cc2f5d0	[GISEL] Add G_INSERT_SUBVECTOR and G_EXTRACT_SUBVECTOR (#84538 ) G_INSERT and G_EXTRACT are not sufficient to use to represent both INSERT/EXTRACT on a subregister and INSERT/EXTRACT on a vector. We would like to be able to INSERT/EXTRACT on vectors in cases that INSERT/EXTRACT on vector subregisters are not sufficient, so we add these opcodes. I tried to do a patch where we treated G_EXTRACT as both G_EXTRACT_SUBVECTOR and G_EXTRACT_SUBREG, but ran into an infinite loop at this [point](`8b5b294ec2/llvm/lib/Target/RISCV/RISCVISelLowering.cpp (L9932)`) in the SDAG equivalent code.	2024-03-11 13:47:30 -04:00
Benjamin Kramer	b8b434b3e1	[GISEL] Silence unused variable warning. NFC	2024-03-07 20:46:10 +01:00
Michael Maitland	96049fcf4e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) Recommits llvm/llvm-project#80378 which was reverted in llvm/llvm-project#84330. The problem was that the change in llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir used 217 as an opcode instead of a regex.	2024-03-07 09:10:03 -08:00
Michael Maitland	552da24843	Revert "[GISEL] Add IRTranslation for shufflevector on scalable vector types" (#84330 ) Reverts llvm/llvm-project#80378 causing Buildbot failures that did not show up with check-llvm or CI.	2024-03-07 10:16:31 -05:00
Michael Maitland	2b8aaef09e	[GISEL] Add IRTranslation for shufflevector on scalable vector types (#80378 ) This patch is stacked on https://github.com/llvm/llvm-project/pull/80372, https://github.com/llvm/llvm-project/pull/80307, and https://github.com/llvm/llvm-project/pull/80306. ShuffleVector on scalable vector types gets IRTranslate'd to G_SPLAT_VECTOR since a ShuffleVector that has operates on scalable vectors is a splat vector where the value of the splat vector is the 0th element of the first operand, because the index mask operand is the zeroinitializer (undef and poison are treated as zeroinitializer here). This is analogous to what happens in SelectionDAG for ShuffleVector. `buildSplatVector` is renamed to`buildBuildVectorSplatVector`. I did not make this a separate patch because it would cause problems to revert that change without reverting this change too.	2024-03-07 09:50:29 -05:00
Michael Maitland	44a46a0b68	[RISCV][GISEL] Add IRTranslation for insertelement with scalable vector type (#80377 ) This patch is stacked on #80372, #80307, and #80306.	2024-02-19 15:30:48 -05:00
David Green	3a77522387	[AArch64][GlobalISel] Improve and expand fcopysign lowering (#71283 ) This alters the lowering of G_COPYSIGN to support vector types. The general idea is that we just lower it to vector operations using and/or and a mask, which are now converted to a BIF/BIT/BSP. In the process the existing AArch64LegalizerInfo::legalizeFCopySign can be removed, replying on expanding the scalar versions to vector instead, which just needs a small adjustment to allow widening scalars to vectors.	2024-02-17 10:19:27 +00:00
Jay Foad	d57515bd10	[LLT] Add and use isPointerVector and isPointerOrPointerVector. NFC. (#81283 )	2024-02-13 08:21:35 +00:00
Thorsten Schütt	26616c62d1	[GlobalIsel][NFC] Harden MachineIRBuilder (#75465 ) Protective measures against https://github.com/llvm/llvm-project/pull/74502	2023-12-14 14:04:57 +01:00
Jay Foad	35ebd92d3d	[GlobalISel] Add G_PREFETCH (#74863 )	2023-12-11 11:06:50 +00:00
Michael Maitland	a7bbcc4690	[RISCV][GISEL] Add support for lowerFormalArguments that contain scalable vector types (#70882 ) Scalable vector types from LLVM IR can be lowered to scalable vector types in MIR according to the RISCVAssignFn.	2023-11-14 13:15:41 -05:00
Mirko Brkusanin	72e3713009	[IRTranslator] Set NUW flag for inbounds gep and load/store offsets Patch by: Acim Maravic Differential Revision: https://reviews.llvm.org/D159515	2023-09-22 16:16:28 +02:00
Sameer Sahasrabuddhe	d9847cde48	[GlobalISel] convergent intrinsics Introduced the convergent equivalent of the existing G_INTRINSIC opcodes: - G_INTRINSIC_CONVERGENT - G_INTRINSIC_CONVERGENT_W_SIDE_EFFECTS Out of the targets that currently have some support for GlobalISel, the patch assumes that the convergent intrinsics only relevant to SPIRV and AMDGPU. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D154766	2023-07-31 12:15:39 +05:30
Niwin Anto	10b1f58cba	[AArch64][GlobalISel] IR translate support for a return instruction of type <1 x i8> or <1 x i16> when using GlobalISel. Code generation for return instruction of type <1 x i8> or <1 x i16> when using GlobalISel causes internal compiler crash Could not handle ret ty. Fixes: https://github.com/llvm/llvm-project/issues/58211 Differential Revision: https://reviews.llvm.org/D153300	2023-06-25 14:40:48 -07:00
Felipe de Azevedo Piovezan	79a1e32915	[GlobalISel] Improve stack slot tracking in dbg.values For IR like: ``` %alloca = alloca ... dbg.value(%alloca, !myvar, OP_deref(<other_ops>)) ``` GlobalISel lowers it to MIR: ``` %some_reg = G_FRAME_INDEX <stack_slot> DBG_VALUE %some_reg, !myvar, OP_deref(<other_ops>) ``` In other words, if the value of `!myvar` can be obtained by dereferencing an alloca, in MIR we say that the _location_ of a variable is obtained by dereferencing register %some_reg (plus some `<other_ops>`). We can instead remove the use of `%some_reg`: the location of `!myvar` _is_ `<stack_slot>` (plus some `<other_ops>`). This patch implements this transformation, which improves debug information handling in O0, as these registers hardly ever survive register allocation. A note about testing: similar to what was done in D76934 (f24e2e9eebde4b7a1d), this patch exposed a bug in the Builder class when using `-debug`, where we tried to print an incomplete instruction. The changes in `MachineIRBuilder.cpp` address that. Differential Revision: https://reviews.llvm.org/D147536	2023-04-05 08:21:00 -04:00
Chen Zheng	6ee2f770ef	[PowerPC][GISel] add support for fpconstant Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D133340	2023-02-14 02:39:22 +00:00
Diana Picus	2004ab422a	MachineIRBuilder: Add buildMergeValues. NFC Add a `buildMergeValues` method that unconditionally builds a G_MERGE_VALUES instruction, as opposed to `buildMergeLikeInstr` which may decide on a different opcode based on the input types. I haven't audited all the uses of `buildMergeLikeInstr` to see if they can be replaced with `buildMergeValues`, but I did find a couple of obvious ones where we check that we're merging scalars right before calling `buildMerge`. This is a follow-up suggested in https://reviews.llvm.org/D140964 Differential Revision: https://reviews.llvm.org/D141373	2023-01-13 09:32:58 +01:00
Diana Picus	f95a5fbe7c	MachineIRBuilder: Rename buildMerge. NFC `buildMerge` may build a G_MERGE_VALUES, G_BUILD_VECTOR or G_CONCAT_VECTORS. Rename it to `buildMergeLikeInstr`. This is a follow-up suggested in https://reviews.llvm.org/D140964 Differential Revision: https://reviews.llvm.org/D141372	2023-01-13 09:32:58 +01:00
Diana Picus	22924bd48d	[GlobalISel] Don't switch opcodes in MIRBuilder::buildInstr At the moment, `MachineIRBuilder::buildInstr` may build an instruction with a different opcode than the one passed in as parameter. This may cause confusion for its consumers, such as `CSEMIRBuilder`, which will memoize the instruction based on the new opcode, but will search through the memoized instructions based on the original one (resulting in missed CSE opportunities). This is all the more unpleasant since buildInstr is virtual and may call itself recursively both directly and via buildCast, so it's not always easy to follow what's going on. This patch simplifies the API of `MachineIRBuilder` so that the `buildInstr` method does the least surprising thing (i.e. builds an instruction with the specified opcode) and only the convenience `buildX` methods (`buildMerge` etc) are allowed freedom over which opcode to use. This can still be confusing (e.g. one might write a unit test using `buildBuildVectorTrunc` but instead get a plain `G_BUILD_VECTOR`), but at least it's explained in the comments. In practice, this boils down to 3 changes: * `buildInstr(G_MERGE_VALUES)` will no longer call itself with `G_BUILD_VECTOR` or `G_CONCAT_VECTORS`; this functionality is moved to `buildMerge` and replaced with an assert; * `buildInstr(G_BUILD_VECTOR_TRUNC)` will no longer call itself with `G_BUILD_VECTOR`; this functionality is moved to `buildBuildVectorTrunc` and replaced with an assert; * `buildInstr(G_MERGE_VALUES)` will no longer call `buildCast` and will instead assert if we're trying to merge a single value; no change is needed in `buildMerge` since it was already asserting more than one source operand. This change is NFC for users of the `buildX` methods, but users that call `buildInstr` with relaxed parameters will have to update their code (such instances will hopefully be easy to find thanks to the asserts). Differential Revision: https://reviews.llvm.org/D140964	2023-01-05 10:02:39 +01:00
Fangrui Song	67819a72c6	[CodeGen] llvm::Optional => std::optional	2022-12-13 09:06:36 +00:00
Kazu Hirata	998960ee1f	[CodeGen] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 20:36:08 -08:00
Marco Elver	31a548021b	[GlobalISel] Propagate PCSections metadata to MachineInstr Propagate (most) PC sections metadata to MachineInstr when GlobalISel is doing instruction selection. This change results in support for architectures using GlobalISel (such as -O0 with AArch64). Not all instructions may be supported yet, and requires further target-specific handling (such as done for AArch64 pseudo-atomics). Expanding supported instructions is planned on a case-by-case basis and new use cases for PC sections metadata. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130886	2022-09-07 11:36:02 +02:00
Felipe de Azevedo Piovezan	58526b2d2b	[GlobalISel] Handle nullptr constants in dbg.value Currently, the LLVM IR -> MIR translator fails to translate dbg.values whose first argument is a null pointer. However, in other portions of the code, such pointers are always lowered to the constant zero, for example see IRTranslator::Translate(Constant, Register). This patch addresses the limitation by following the same approach of lowering null pointers to zero. A prior test was checking that null pointers were always lowered to $noreg; this test is changed to check for zero, and the previous behavior is now checked by introducing a dbg.value whose first argument is the address of a global variable. Differential Revision: https://reviews.llvm.org/D130721	2022-07-28 14:58:14 -07:00
Felipe de Azevedo Piovezan	0ef6809c48	[GlobalISel][nfc] Remove unnecessary cast The getOperand method already returns a Constant when it is called on a ConstantExpression, as such the cast is not needed. To prevent a type mismatch between the different return statements of the lambda, the lambda return type is explicitly provided. Differential Revision: https://reviews.llvm.org/D130719	2022-07-28 14:55:07 -07:00
Adrian Prantl	719ab04acf	[GlobalISel] Handle IntToPtr constants in dbg.value Currently, the IR to MIR translator can only handle two kinds of constant inputs to dbg.values intrinsics: constant integers and constant floats. In particular, it cannot handle pointers created from IntToPtr ConstantExpression objects. This patch addresses the limitation above by replacing the IntToPtr with its input integer prior to converting the dbg.value input. Patch by Felipe Piovezan! Differential Revision: https://reviews.llvm.org/D130642	2022-07-27 13:42:07 -07:00
Kazu Hirata	9e6d1f4b5d	[CodeGen] Qualify auto variables in for loops (NFC)	2022-07-17 01:33:28 -07:00
Matt Arsenault	13ac4c3de9	GlobalISel: Add buildBoolExtInReg helper	2022-07-08 11:55:08 -04:00
Shilei Tian	1023ddaf77	[LLVM] Add the support for fmax and fmin in atomicrmw instruction This patch adds the support for `fmax` and `fmin` operations in `atomicrmw` instruction. For now (at least in this patch), the instruction will be expanded to CAS loop. There are already a couple of targets supporting the feature. I'll create another patch(es) to enable them accordingly. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D127041	2022-07-06 10:57:53 -04:00
Kazu Hirata	246e83e973	[GlobalISel] Remove buildSequence (NFC) The last use was removed on Jun 27, 2019 in commit 8138996128cd17d78d9d3e6ef7b49987565cb310.	2022-06-13 06:58:36 -07:00
Jay Foad	26e1ebd3ea	[GlobalISel] Change ConstantFoldVectorBinop to return vector of APInt Previously it built MIR for the results and returned a Register. This avoids building constants for earlier elements of the vector if later elements will fail to fold, and allows CSEMIRBuilder::buildInstr to avoid unconditionally building a copy from the result. Use a new helper function MachineIRBuilder::buildBuildVectorConstant to build a G_BUILD_VECTOR of G_CONSTANTs. Differential Revision: https://reviews.llvm.org/D117758	2022-05-13 09:33:07 +01:00
serge-sans-paille	ed98c1b376	Cleanup includes: DebugInfo & CodeGen Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121332	2022-03-12 17:26:40 +01:00
Matt Arsenault	99e8e17313	Reapply "Revert "GlobalISel: Add G_ASSERT_ALIGN hint instruction" This reverts commit a97e20a3a8a58be751f023e610758310d5664562.	2022-01-24 09:26:52 -05:00
James Y Knight	a97e20a3a8	Revert "GlobalISel: Add G_ASSERT_ALIGN hint instruction" This commit sometimes causes a crash when compiling a vtable thunk. E.g.: clang '--target=aarch64-grtev4-linux-gnu' -xc++ - -c -o /dev/null <<EOF struct a { virtual int f(); }; struct c { virtual int &g() const; }; struct d : a, c { int &g() const; }; int &d::g() const {} EOF Some follow-up commits have been reverted as well: Revert "IR: Make getRetAlign check callee function attributes" Revert "Fix MSVC "32-bit shift implicitly converted to 64 bits" warning. NFC." Revert "Fix MSVC "32-bit shift implicitly converted to 64 bits" warning. NFC." This reverts commit 4f414af6a77cdbd9b6303a7afa525cfb3f9d792a. This reverts commit a5507d2e253a2c94c3ca7794edf7385af8082b97. This reverts commit 3d2d208f6a0a421b23937c39b9d371183a5913a3. This reverts commit 07ddfa95e3b5ea8464e90545f592624221b854ae.	2022-01-14 04:50:07 +00:00
Matt Arsenault	07ddfa95e3	GlobalISel: Add G_ASSERT_ALIGN hint instruction Insert it for call return values only for now, which is the only case the DAG handles also.	2022-01-12 18:20:58 -05:00
Petar Avramovic	29f88b93fd	[GlobalISel] Rework more/fewer elements for vectors Artifact combiner is not able to access individual elements after using LCMTy style merge/unmerge, extract and insert to change vector number of elements (pad with undef or split to sub-vector instructions). Use unmerge to individual elements instead and then merge elements into requested types. Change argument lowering for vectors and moreElementsVector to use buildPadVectorWithUndefElements and buildDeleteTrailingVectorElements. FewerElementsVector had a few helpers that had different behavior, introduce new helper for most of the opcodes. FewerElementsVector helper is more flexible since it can create leftover instruction smaller then requested type (useful in case target wants to avoid pad with undef and use fewer registers). If target does not want leftover of different type it should call more elements first. Some helpers were performing more elements first to have split without leftover. Opcodes that used this helper use clampMaxNumElementsStrict (does more elements first) in LegalizerInfo to avoid test changes. Fixes failures caused by failing to combine artifacts created during more/fewer elements vector. Differential Revision: https://reviews.llvm.org/D114198	2021-12-23 14:30:02 +01:00
Simon Pilgrim	7c25a32840	Fix MSVC "signed/unsigned mismatch" comparison warning. NFCI.	2021-08-30 12:11:09 +01:00
Matt Arsenault	990278d026	CodeGen: Store LLT instead of uint64_t in MachineMemOperand GlobalISel is relying on regular MachineMemOperands to track all of the memory properties of accesses. Just the raw byte size is insufficent to disambiguate all situations. For example, if we need to split an unaligned extending load, we need to know the number of bits in the original source value and can't infer it from the result type. This is also a problem for extending vector loads. This does decrease the maximum representable size from the full uint64_t bytes to a maximum of 16-bits. No in tree testcases hit this, other than places using UINT64_MAX for unknown sizes. This may be an issue for G_MEMCPY and co., although they can just use unknown size for large static sizes. This also has potential for backend abuse by relying on the type when it really shouldn't be relevant after selection. This does not include the necessary MIR printer/parser changes to represent this.	2021-06-29 17:38:51 -04:00
Sander de Smalen	0e09d18c6a	Reland [GlobalISel] NFC: Have LLT::getSizeInBits/Bytes return a TypeSize. This patch relands https://reviews.llvm.org/D104454, but fixes some failing builds on Mac OS which apparently has a different definition for size_t, that caused 'ambiguous operator overload' for the implicit conversion of TypeSize to a scalar value. This reverts commit b732e6c9a8438e5204ac96c8ca76f9b11abf98ff.	2021-06-28 15:24:27 +01:00
Benjamin Kramer	6359842bc0	[GlobalISel] Silence unused variable warning in Release builds. NFC.	2021-05-25 10:55:29 +02:00
Amara Emerson	ff30436dc5	[GlobalISel] Fix MachineIRBuilder not using the DstOp argument for G_SHUFFLE_VECTOR.	2021-05-25 00:43:26 -07:00
Vang Thao	a3d273c9ff	[GlobalISel] Fix buildZExtInReg creating new register. Fix a bug where buildZExtInReg will create and use a new register instead of using the register from parameter DstOp Res. Reviewed By: arsenm, foad Differential Revision: https://reviews.llvm.org/D101871	2021-05-05 08:19:52 -07:00
Pushpinder Singh	d0e5422eb8	[GlobalISel][AMDGPU] Lower G_UMULO/G_SMULO Reviewed By: foad Differential Revision: https://reviews.llvm.org/D93963	2021-03-23 05:45:43 +00:00
Jessica Paquette	60aa646441	[GlobalISel] Add G_ASSERT_SEXT This adds a G_ASSERT_SEXT opcode, similar to G_ASSERT_ZEXT. This instruction signifies that an operation was already sign extended from a smaller type. This is useful for functions with sign-extended parameters. E.g. ``` define void @foo(i16 signext %x) { ... } ``` This adds verifier, regbankselect, and instruction selection support for G_ASSERT_SEXT equivalent to G_ASSERT_ZEXT. Differential Revision: https://reviews.llvm.org/D96890	2021-02-17 13:10:34 -08:00
Jessica Paquette	24261729a4	[GlobalISel] Add G_ASSERT_ZEXT This adds a generic opcode which communicates that a type has already been zero-extended from a narrower type. This is intended to be similar to AssertZext in SelectionDAG. For example, ``` %x_was_extended:_(s64) = G_ASSERT_ZEXT %x, 16 ``` Signifies that the top 48 bits of %x are known to be 0. This is useful in cases like this: ``` define i1 @zeroext_param(i8 zeroext %x) { %cmp = icmp ult i8 %x, -20 ret i1 %cmp } ``` In AArch64, `%x` must use a 32-bit register, which is then truncated to a 8-bit value. If we know that `%x` is already zero-ed out in the relevant high bits, we can avoid the truncate. Currently, in GISel, this looks like this: ``` _zeroext_param: and w8, w0, #0xff ; We don't actually need this! cmp w8, #236 cset w0, lo ret ``` While SDAG does not produce the truncation, since it knows that it's unnecessary: ``` _zeroext_param: cmp w0, #236 cset w0, lo ret ``` This patch - Adds G_ASSERT_ZEXT - Adds MIRBuilder support for it - Adds MachineVerifier support for it - Documents it It also puts G_ASSERT_ZEXT into its own class of "hint instruction." (There should be a G_ASSERT_SEXT in the future, maybe a G_ASSERT_ALIGN as well.) This allows us to skip over hints in the legalizer etc. These can then later be selected like COPY instructions or removed. Differential Revision: https://reviews.llvm.org/D95564	2021-01-28 13:58:37 -08:00
Kazu Hirata	cfeecdf7b6	[llvm] Use llvm::all_of (NFC)	2021-01-06 18:27:36 -08:00
Fangrui Song	d928dfc6f9	[GlobalISel] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds	2020-11-30 18:31:42 -08:00
Fangrui Song	36fe1a9dea	[GlobalISel] Fix -Wunused-variable	2020-11-30 18:25:54 -08:00
Amara Emerson	87ff156414	[AArch64][GlobalISel] Fix crash during legalization of a vector G_SELECT with scalar mask. The lowering of vector selects needs to first splat the scalar mask into a vector first. This was causing a crash when building oggenc in the test suite. Differential Revision: https://reviews.llvm.org/D91655	2020-11-30 16:37:49 -08:00
Amara Emerson	2ff14957e8	[GlobalISel] Implement bit-test switch table optimization. This is mostly a straight port from SelectionDAG. We re-use the actual bit-test analysis part from SwitchLoweringUtils, which was factored out earlier to support jump-tables. Differential Revision: https://reviews.llvm.org/D85233	2020-08-12 11:31:39 -07:00

1 2 3 4 5

206 Commits