llvm-project

Author	SHA1	Message	Date
Thorsten Schütt	8e3772744d	[GlobalISel][AArch64] Legalize G_INSERT_VECTOR_ELT for SVE (#114470 ) There are patterns for: * {nxv2s32, s32, s64}, * {nxv4s16, s16, s64}, * {nxv2s16, s16, s64}	2024-11-01 06:10:26 +01:00
Matt Arsenault	db5bcb24c2	GlobalISel: Fix combine duplicating atomic loads (#111730 ) The sext_inreg (load) combine was not deleting the old load instruction, and it would never be deleted if volatile or atomic.	2024-10-31 07:55:12 -07:00
Thorsten Schütt	6effab990c	Revert "[GlobalISel][AArch64] Legalize G_INSERT_VECTOR_ELT for SVE" (#114353 ) Reverts llvm/llvm-project#114310	2024-10-31 05:41:16 +01:00
Thorsten Schütt	6bf214b7c6	[GlobalISel][AArch64] Legalize G_INSERT_VECTOR_ELT for SVE (#114310 ) There are patterns for: * {nxv2s32, s32, s64}, * {nxv4s16, s16, s64}, * {nxv2s16, s16, s64}	2024-10-31 04:56:41 +01:00
Craig Topper	f0bae562dc	[GISel] Return const APInt & from getIConstantFromReg. NFC (#114320 ) This matches what the call to ConstantInt::getValue() returns. Let the caller make a copy if needed.	2024-10-30 19:15:51 -07:00
Thorsten Schütt	b3bb6f18bb	[GlobalISel] Import samesign flag (#114267 ) Credits: https://github.com/llvm/llvm-project/pull/111419 Fixes icmp-flags.mir First attempt: https://github.com/llvm/llvm-project/pull/113090 Revert: https://github.com/llvm/llvm-project/pull/114256	2024-10-30 19:56:25 +01:00
Thorsten Schütt	4b028773b2	Revert "[GlobalISel] Import samesign flag" (#114256 ) Reverts llvm/llvm-project#113090	2024-10-30 17:03:17 +01:00
Thorsten Schütt	72b115301d	[GlobalISel] Import samesign flag (#113090 ) Credits: https://github.com/llvm/llvm-project/pull/111419	2024-10-30 16:34:01 +01:00
Petar Avramovic	84b7bcfcac	GlobalISel/MachineIRBuilder: Construct DstOp with VRegAttrs (#113581 ) Allow construction of DstOp with VRegAttrs. Also allow construction with register class or bank and LLT. Intended to be used in lowering code for reg-bank-select where new registers need to have both register bank and LLT. Add support for new type of DstOp in CSEMIRBuilder.	2024-10-30 14:15:42 +01:00
Ellis Hoag	9cc5a4bf66	Remove llvm::shouldOptForSize() from Utils.h (#112630 ) Remove `llvm::shouldOptForSize()` from `Utils.h` since we can use `llvm::shouldOptimizeForSize()` from `SizeOpts.h` instead. Depends on https://github.com/llvm/llvm-project/pull/112626	2024-10-29 14:23:47 -05:00
Benjamin Maxwell	c3260c65e8	[IR] Add `llvm.sincos` intrinsic (#109825 ) This adds the `llvm.sincos` intrinsic, legalization, and lowering. The `llvm.sincos` intrinsic takes a floating-point value and returns both the sine and cosine (as a struct). ``` declare { float, float } @llvm.sincos.f32(float %Val) declare { double, double } @llvm.sincos.f64(double %Val) declare { x86_fp80, x86_fp80 } @llvm.sincos.f80(x86_fp80 %Val) declare { fp128, fp128 } @llvm.sincos.f128(fp128 %Val) declare { ppc_fp128, ppc_fp128 } @llvm.sincos.ppcf128(ppc_fp128 %Val) declare { <4 x float>, <4 x float> } @llvm.sincos.v4f32(<4 x float> %Val) ``` The lowering is built on top of the existing FSINCOS ISD node, with additional type legalization to allow for f16, f128, and vector values.	2024-10-29 10:52:20 +00:00
Ellis Hoag	6ab26eab4f	Check hasOptSize() in shouldOptimizeForSize() (#112626 )	2024-10-28 09:45:03 -07:00
Tex Riddell	c03d09ce3e	[aarch64] atan2 intrinsic lowering (p5) (#112611 ) This change is part of this proposal: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 - `VecFuncs.def`: define intrinsic to sleef/armpl mapping - `LegalizerHelper.cpp`: add missing fewerElementsVector handling for the new atan2 intrinsic - `AArch64ISelLowering.cpp`: Add arch64 specializations for lowering like neon instructions - `AArch64LegalizerInfo.cpp`: Legalize atan2. Part 5 for Implement the atan2 HLSL Function #70096.	2024-10-24 17:53:12 -07:00
Michael Maitland	6bac41496e	[RISCV][GISEL] Legalize G_INSERT_SUBVECTOR (#108859 ) This code is heavily based on the SelectionDAG lowerINSERT_SUBVECTOR code.	2024-10-21 08:49:13 -04:00
Thorsten Schütt	d8b17f2fb6	[GlobalISel] Combine G_UNMERGE_VALUES with anyext and build vector (#112370 ) G_UNMERGE_VALUES (G_ANYEXT (G_BUILD_VECTOR)) ag G_UNMERGE_VALUES llvm/test/CodeGen/AArch64/GlobalISel \| grep ANYEXT [ANYEXT] is build vector or shuffle vector Prior art: https://reviews.llvm.org/D87117 https://reviews.llvm.org/D87166 https://reviews.llvm.org/D87174 https://reviews.llvm.org/D87427 ; CHECK-NEXT: [[BUILD_VECTOR2:%[0-9]+]]:_(<8 x s8>) = G_BUILD_VECTOR [[C2]](s8), [[C2]](s8), [[C2]](s8), [[C2]](s8), [[DEF1]](s8), [[DEF1]](s8), [[DEF1]](s8), [[DEF1]](s8) ; CHECK-NEXT: [[ANYEXT1:%[0-9]+]]:_(<8 x s16>) = G_ANYEXT [[BUILD_VECTOR2]](<8 x s8>) ; CHECK-NEXT: [[UV10:%[0-9]+]]:_(<4 x s16>), [[UV11:%[0-9]+]]:_(<4 x s16>) = G_UNMERGE_VALUES [[ANYEXT1]](<8 x s16>) Test: llvm/test/CodeGen/AArch64/GlobalISel/combine-unmerge.mir	2024-10-19 09:41:43 +02:00
Keith Packard	44b020a381	[PowerPC][ISelLowering] Support -mstack-protector-guard=tls (#110928 ) Add support for using a thread-local variable with a specified offset for holding the stack guard canary value. This supports both 32- and 64- bit PowerPC targets. This mirrors changes from #108942 but targeting PowerPC instead of RISCV. Because both of these PRs modify the same driver functions, this series is stack on top of the RISC-V one. --------- Signed-off-by: Keith Packard <keithp@keithp.com>	2024-10-17 19:06:47 -07:00
Petar Avramovic	14d006c53c	AMDGPU/GlobalISel: Run redundant_and combine in RegBankCombiner (#112353 ) Combine is needed to clear redundant ANDs with 1 that will be created by reg-bank-select to clean-up high bits in register. Fix replaceRegWith from CombinerHelper: If copy had to be inserted, first create copy then delete MI. If MI is deleted first insert point is not valid.	2024-10-16 09:43:16 +02:00
David Green	04546a0dd6	[GlobalISel] Support vector G_UNMERGE_VALUES in computeKnownBits. (#112172 ) This adds computeKnownBits support for vector->vector G_UNMERGE_VALUES, grabbing the known bits with an adjusted DemandedElts mask.	2024-10-15 08:23:05 +01:00
David Green	828d72b263	[GlobalISel] Add an assert for the DemandedElts APInt size. (#112150 ) Similar to the other implementations in DAG/ValueTracking, this adds an assert that the size of the DemandedElts is what we expect it to be - the size of a fixed length vector or APInt(1,1) otherwise. The G_BUILDVECTOR is fixed as it was passing an original DemandedElts for the scalar operands.	2024-10-14 09:59:26 +01:00
duk	464a7ee79e	[CodeGen] Generalize trap emission after SP check fail (#109744 ) Generalize and improve some target-specific code that emits traps after stack protector failure in SelectionDAG & GlobalIsel.	2024-10-12 20:01:22 -04:00
Craig Topper	20e37f03c6	[GISel] Don't preserve NSW flag when converting G_MUL of INT_MIN to G_SHL. (#111230 ) mul and shl have different meanings for the nsw flag. We need to drop it when converting a multiply by the minimum negative value.	2024-10-05 10:27:09 -07:00
Stephen Tozer	d826b0c90f	[LLVM] Add HasFakeUses to MachineFunction (#110097 ) Following the addition of the llvm.fake.use intrinsic and corresponding MIR instruction, two further changes are planned: to add an -fextend-lifetimes flag to Clang that emits these intrinsics, and to have -Og enable this flag by default. Currently, some logic for handling fake uses is gated by the optdebug attribute, which is intended to be switched on by -fextend-lifetimes (and by extension -Og later on). However, the decision was made that a general optdebug attribute should be incompatible with other opt_ attributes (e.g. optsize, optnone), since they all express different intents for how to optimize the program. We would still like to allow -fextend-lifetimes with optsize however (i.e. -Os -fextend-lifetimes should be legal), since it may be a useful configuration and there is no technical reason to not allow it. This patch resolves this by tracking MachineFunctions that have fake uses, allowing us to run passes that interact with them and skip passes that clash with them.	2024-10-04 13:13:30 +01:00
Michael Maitland	f957d080e9	[RISCV][GISEL] Legalize G_EXTRACT_SUBVECTOR (#109426 ) This is heavily based on the SelectionDAG lowerEXTRACT_SUBVECTOR code.	2024-10-01 14:08:49 -04:00
Thorsten Schütt	53943de73a	[GlobalISel] Import extract/insert subvector (#110287 ) Test: AArch64/GlobalISel/irtranslator-subvector.ll Reference: https://llvm.org/docs/LangRef.html#llvm-vector-extract-intrinsic https://llvm.org/docs/LangRef.html#llvm-vector-insert-intrinsic	2024-09-30 22:12:06 +02:00
David Green	9f255d863f	[AArch64][GlobalISel] Lower fp16 abs and neg without fullfp16. (#110096 ) This changes the existing promote logic to lower, so that it can use normal integer operations. A minor change was needed to fneg lower code to handle vectors.	2024-09-27 07:43:58 +01:00
Tex Riddell	139688a699	[SPIRV] Add atan2 function lowering (p2) (#110037 ) This change is part of this proposal: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 - Add generic opcode for atan2 - Add SPIRV lowering for atan2 Part 2 for Implement the atan2 HLSL Function #70096.	2024-09-26 15:00:59 -07:00
Youngsuk Kim	e177dd6fbb	[llvm] Replace uses of Type::getPointerTo() (NFC) (#110163 ) Replace uses of `Type::getPointerTo()` which is to be removed. --------- Co-authored-by: Nikita Popov <github@npopov.com>	2024-09-26 16:38:50 -04:00
Evgenii Kudriashov	e9cb44090f	[X86][GlobalISel] Enable scalar versions of G_UITOFP and G_FPTOUI (#100079 ) Also add tests for G_SITOFP and G_FPTOSI	2024-09-25 16:15:36 +02:00
chuongg3	b0dc7b5b86	[AArch64][GlobalISel] Prefer to use Vector Truncate (#105692 ) Tries to combine scalarised truncates into vector truncate operations EXAMPLE: `%a(i32), %b(i32) = G_UNMERGE %src(<2 x i32>)` `%T_a(i16) = G_TRUNC %a(i32)` `%T_b(i16) = G_TRUNC %b(i32)` `%Imp(i16) = G_IMPLICIT_DEF(i16)` `%dst(v8i16) = G_MERGE_VALUES %T_a(i16), %T_b(i16), %Imp(i16), %Imp(i16)` ===> `%Imp(<2 x i32>) = G_IMPLICIT_DEF(<2 x i32>)` `%Mid(<4 x s16>) = G_CONCAT_VECTORS %src(<2 x i32>), %Imp(<2 x i32>)` `%dst(<4 x s16>) = G_TRUNC %Mid(<4 x s16>)`	2024-09-23 13:52:37 +01:00
Thorsten Schütt	ccfe7d4b20	[GlobalIsel] Cleanup G_EXTRACT_VECTOR_ELT combines (#109047 ) Reduce duplicated build vector patterns by exploiting variadic args. Make index parameter const to improve hit rate. Use `getIConstantFromReg` to retrieve immediate because they are not fallible anymore. Improve extraction from build vector and shuffle vector.	2024-09-19 20:36:33 +02:00
Jay Foad	e03f427196	[LLVM] Use {} instead of std::nullopt to initialize empty ArrayRef (#109133 ) It is almost always simpler to use {} instead of std::nullopt to initialize an empty ArrayRef. This patch changes all occurrences I could find in LLVM itself. In future the ArrayRef(std::nullopt_t) constructor could be deprecated or removed.	2024-09-19 16:16:38 +01:00
Craig Topper	d5d1417659	[RISCV][GISel] Use libcalls for rint, nearbyint, trunc, round, and roundeven intrinsics. (#108779 )	2024-09-18 12:07:44 -07:00
Craig Topper	292ee93a87	[CodeGen] Use Register in SwitchLoweringUtils. NFC (#109092 ) Use an empty Register() instead of -1U.	2024-09-18 09:43:21 -07:00
Michael Maitland	ee2add0683	[GISEL] Fix bugs and clarify spec of G_EXTRACT_SUBVECTOR (#108848 ) The implementation was missing the fact that `G_EXTRACT_SUBVECTOR` destination and source vector can be different types. Also fix a bug in the MIR builder for `G_EXTRACT_SUBVECTOR` to generate the correct opcode. Clarify the G_EXTRACT_SUBVECTOR specification.	2024-09-17 10:08:39 -04:00
Thorsten Schütt	acfa294b5e	[GlobalIsel] Canonicalize G_FCMP (#108891 ) As a side-effect, we start constant folding fcmps.	2024-09-17 09:42:04 +02:00
Thorsten Schütt	5c348f692a	[GlobalIsel] Canonicalize G_ICMP (#108755 ) As a side-effect, we start constant folding icmps. Split out from https://github.com/llvm/llvm-project/pull/105991.	2024-09-16 19:25:34 +02:00
David Green	feac761f37	[GlobalISel][AArch64] Add G_FPTOSI_SAT/G_FPTOUI_SAT (#96297 ) This is an implementation of the saturating fp to int conversions for GlobalISel. On AArch64 the converstion instrctions work this way, producing saturating results. LegalizerHelper::lowerFPTOINT_SAT is ported from SDAG. AArch64 has a lot of existing tests for fptosi_sat, covering a wide range of types. I have tried to make most of them work all at once, but a few fall back due to other missing features such as f128 handling for min/max.	2024-09-16 10:33:59 +01:00
Craig Topper	367c145e5f	[IRTranslator][RISCV] Support scalable vector zeroinitializer. (#108666 )	2024-09-14 15:46:18 -07:00
Craig Topper	947374c393	[IRTranslator] Simplify fixed vector ConstantAggregateZero handling. NFC (#108667 ) We don't need to loop through the elements, they're all the same zero. We can get the first element and create a splat build_vector.	2024-09-13 22:02:29 -07:00
Thorsten Schütt	ba4bcce5f5	[GlobalIsel] Combine trunc of binop (#107721 ) trunc (binop X, C) --> binop (trunc X, trunc C) --> binop (trunc X, C`) Try to narrow the width of math or bitwise logic instructions by pulling a truncate ahead of binary operators. Vx and Nx cores consider 32-bit and 64-bit basic arithmetic equal in costs.	2024-09-11 15:04:55 +02:00
Tobias Stadler	2d338bed00	[CodeGen] Refactor DeadMIElim isDead and GISel isTriviallyDead (#105956 ) Merge GlobalISel's isTriviallyDead and DeadMachineInstructionElim's isDead code and remove all unnecessary checks from the hot path by looping over the operands before doing any other checks. See #105950 for why DeadMIElim needs to remove LIFETIME markers even though they probably shouldn't generally be considered dead. x86 CTMark O3: -0.1% AArch64 GlobalISel CTMark O0: -0.6%, O2: -0.2%	2024-09-09 16:30:44 +02:00
Craig Topper	e6e857cdf9	[GISel] Use Function::getFunctionType() instead of getType() in some remarks. (#107651 ) getType() on a Function is always 'ptr'. We should use getFunctionType() so we get the function signature.	2024-09-06 19:59:44 -07:00
anjenner	4af249fe6e	Add usub_cond and usub_sat operations to atomicrmw (#105568 ) These both perform conditional subtraction, returning the minuend and zero respectively, if the difference is negative.	2024-09-06 16:19:20 +01:00
Him188	0748f4227c	[AArch64][GlobalISel] Legalize 128-bit types for FABS (#104753 ) This patch adds a common lower action for `G_FABS`, which generates `and x8, x8, #0x7fffffffffffffff` to reset the sign bit. The action does not support vectors since `G_AND` does not support fp128. This approach is different than what SDAG is doing. SDAG stores the value onto stack, clears the sign bit in the most significant byte, and loads the value back into register. This involves multiple memory ops and sounds slower.	2024-09-03 12:47:26 +01:00
Craig Topper	4ca817d051	[GlobalISel] Add bail outs for scalable vectors to some combines. (#106496 ) These combines call getNumElements() which isn't valid for scalable vectors.	2024-08-29 14:02:53 -07:00
Craig Topper	d5c292d8ef	[GISel][RISCV] Correctly handle scalable vector shuffles of pointer vectors in IRTranslator. (#106580 )	2024-08-29 12:35:50 -07:00
Stephen Tozer	3d08ade7bd	[ExtendLifetimes] Implement llvm.fake.use to extend variable lifetimes (#86149 ) This patch is part of a set of patches that add an `-fextend-lifetimes` flag to clang, which extends the lifetimes of local variables and parameters for improved debuggability. In addition to that flag, the patch series adds a pragma to selectively disable `-fextend-lifetimes`, and an `-fextend-this-ptr` flag which functions as `-fextend-lifetimes` for this pointers only. All changes and tests in these patches were written by Wolfgang Pieb (@wolfy1961), while Stephen Tozer (@SLTozer) has handled review and merging. The extend lifetimes flag is intended to eventually be set on by `-Og`, as discussed in the RFC here: https://discourse.llvm.org/t/rfc-redefine-og-o1-and-add-a-new-level-of-og/72850 This patch implements a new intrinsic instruction in LLVM, `llvm.fake.use` in IR and `FAKE_USE` in MIR, that takes a single operand and has no effect other than "using" its operand, to ensure that its operand remains live until after the fake use. This patch does not emit fake uses anywhere; the next patch in this sequence causes them to be emitted from the clang frontend, such that for each variable (or this) a fake.use operand is inserted at the end of that variable's scope, using that variable's value. This patch covers everything post-frontend, which is largely just the basic plumbing for a new intrinsic/instruction, along with a few steps to preserve the fake uses through optimizations (such as moving them ahead of a tail call or translating them through SROA). Co-authored-by: Stephen Tozer <stephen.tozer@sony.com>	2024-08-29 17:53:32 +01:00
Sergei Barannikov	4d7a0abae8	[DataLayout] Change return type of `getStackAlignment` to `MaybeAlign` (#105478 ) Currently, `getStackAlignment` asserts if the stack alignment wasn't specified. This makes it inconvenient to use and complicates testing. This change also makes `exceedsNaturalStackAlignment` method redundant.	2024-08-27 22:59:33 +03:00
chuongg3	d58bd21150	[GlobalISel] Look between instructions to be matched (#101675 ) When a pattern is matched in TableGen, a check is run called isObviouslySafeToFold(). One of the condition that it checks for is whether the instructions that are being matched are consecutive, so the instruction's insertion point does not change. This patch allows the movement of the insertion point of a load instruction if none of the intervening instructions are stores or have side-effects.	2024-08-27 16:56:40 +01:00
Daniel Sanders	0bf5846553	InstructionSelect: Use GISelChangeObserver instead of MachineFunction::Delegate (#105725 ) The main difference is that it's possible for multiple change observers to be installed at the same time whereas there can only be one MachineFunction delegate installed. This allows downstream targets to continue to use observers to recursively select. The target in question was selecting a gMIR instruction to a machine instruction plus some gMIR around it and relying on observers to ensure it correctly selected any gMIR it created before returning to the main loop.	2024-08-23 09:43:36 -07:00

1 2 3 4 5 ...

2459 Commits