llvm-project

Author	SHA1	Message	Date
Osman Yasar	d8d4096c0b	[GlobalISel] Rewrite binop_left_to_zero using MIR Patterns (#177924 ) Following 2d87319f06ef936233ba6aaa612da9586c427d68, this PR rewrites the `binop_left_to_zero` rule using MIR Patterns. The new pattern uses `GIReplaceReg` in the apply clause. According to [MIRPatterns.rst](`5b4a5cf51f/llvm/docs/GlobalISel/MIRPatterns.rst (L222)`), `GIReplaceReg` checks `canReplaceReg`, so the new apply pattern is equivalent to the old `matchOperandIsZero` implementation. Added tests for all the opcodes covered by this rule `(G_SHL, G_LSHR, G_ASHR, G_SDIV, G_UDIV, G_SREM, G_UREM, G_MUL)`.	2026-01-29 18:28:34 +00:00
Julian Pokrovsky	5b4811eddb	[GlobalIsel] Enabling more rules for fp constant folding (#177902 ) This PR extends GlobalISel to enable additional compile-time constant foldings. Resolves https://github.com/llvm/llvm-project/issues/86406	2026-01-26 11:55:15 +01:00
Stefan Weigl-Bosker	828261ebb8	[GISel] Add G_CTLS Opcode and combines, lower to cls(w) (#175069 ) Fixes https://github.com/llvm/llvm-project/issues/174369 - Added a G_CTLS opcode and some pattern matching. This is the GlobalISel equivalent to https://github.com/llvm/llvm-project/pull/173417 - Add legalization for aarch64 and riscv ``` // Folds (ctlz (xor x, (sra x, bitwidth-1))) -> (add (ctls x), 1). // Folds (ctlz (or (shl (xor x, (sra x, bitwidth-1)), 1), 1) -> (ctls x) (clang aarch64) ```	2026-01-16 13:22:18 -08:00
Nathan Corbyn	b7a20c1cc4	[GlobalISel] Don't permit G_MIN/G_MAX of pointer vectors (#168872 ) - Use `LLT::changeElementType()` instead of `LLT::changeElementSize()` in `LegalizerHelper::lowerMinMax()` to avoid a crash in the case that the destination type is a pointer vector; - Reject `G_MIN`/`G_MAX` of pointers and pointer vectors in `MachineVerifier`; - Don't combine `G_SELECT`+`G_ICMP` pairs into `G_MIN`/`G_MAX` generic instructions when the operands are pointers / pointer vectors. Fixes #166556	2025-12-17 09:03:41 +00:00
Ryan Cowan	58e6d02aa2	[AArch64][GlobalISel] Check unmergeSrc is a vector in matchCombineBuildUnmerge (#168692 ) This aims to fix the crash in #168495: my combine rule was missing a check that the source was in fact a vector. This then caused the legality check to fail in this example, as the concat was trying to concatenate a non-vector. I have also gated the bitcast of the concat to only work on non-scalable vectors, as the mutation calls `getNumElements`, which crashes when called on a scalable vector. Fixes #168495	2025-11-19 12:30:51 +00:00
Ryan Cowan	d65be16ab6	[AArch64][GlobalISel] Add combine for build_vector(unmerge, unmerge, undef, undef) (#165539 ) This PR adds a new combine to the `post-legalizer-combiner` pass. The new combine checks for vectors being unmerged and subsequently padded with `G_IMPLICIT_DEF` values by building a new vector. If such a case is found, the vector being unmerged is instead just concatenated with a `G_IMPLICIT_DEF` that is as wide as the vector being unmerged. This removes unnecessary `mov` instructions in a few places.	2025-11-17 15:55:40 +00:00
Kazu Hirata	31b8ba5670	[Analysis, CodeGen] Use ArrayRef instead of const ArrayRef (NFC) (#166026 ) This patch improves readability by using "ArrayRef<T>" instead of "const ArrayRef<T>" and "const ArrayRef<T> &" in function parameter types.	2025-11-01 23:20:19 -07:00
David Green	da15b8fc2e	[AArch64][GlobalISel] Add a constant funnel shift post-legalizer combine. (#151912 ) We want to be able to produce extr instructions post-legalization. They are legal for scalars, acting as a funnel shift with a constant shift amount. Unfortunately I'm not sure if there is a way currently to represent that in the legalization rules, but it might be useful for several operations - to be able to treat and test operands with constant operands as legal or not. This adds a change to the existing matchOrShiftToFunnelShift so that AArch64 can generate such instructions post-legalization providing that the operation is scalar and the shift amount is constant.	2025-10-29 07:47:41 +00:00
David Green	a1e59bdc17	[GlobalISel] Make scalar G_SHUFFLE_VECTOR illegal. (#140508 ) I'm not sure if this is the best way forward or not, but we keep running into issues from forgetting that shuffle_vectors can be scalar. (There is another example in the recently added known-bits code.) As a scalar-dst shuffle vector is just an extract, and a scalar-source shuffle vector is just a build vector, this patch makes scalar shuffle vectors illegal and adjusts the irbuilder to create the correct node as required. Most targets do this already through lowering or combines. Making scalar shuffles illegal simplifies gisel as a whole; it just requires transforms that create shuffles of new sizes to account for the scalar shuffle being illegal (mostly IRBuilder and LessElements).	2025-10-24 08:21:35 +01:00
Ryan Cowan	b298421e0c	[AArch64][GlobalISel] Add G_FPEXT(G_FCONSTANT) folding (#162829 )	2025-10-10 12:17:06 +00:00
Ryan Cowan	fab7bd48fc	Revert "[AArch64][GlobalISel] Add G_FPEXT(G_FCONSTANT) folding" (#162805 ) Reverts llvm/llvm-project#160902 as the tests need updating to prevent breaking the build. More specifically, the `AMDGPU` tests.	2025-10-10 10:46:30 +01:00
Ryan Cowan	66da680330	[AArch64][GlobalISel] Add G_FPEXT(G_FCONSTANT) folding (#160902 ) This change adds a new folding pattern, folding a G_FPEXT(G_FCONSTANT) to a G_FCONSTANT. To make this work on AArch64, the `G_FCONSTANT` should not be widened due to the `G_FCONSTANT` being converted to a `G_CONSTANT`. This should fix some other floating point combines when the `G_FCONSTANT` is widened due to being an fp16.	2025-10-10 15:32:32 +09:00
Mészáros Gergely	cc1ca591a4	[GlobalIsel] Add failure memory order to LegalityQuery (NFC) (#162284 ) The `cmpxchg` instruction has two memory orders, one for success and one for failure. Prior to this patch `LegalityQuery` only exposed a single memory order, that of the success case. This meant that it was not generally possible to legalize `cmpxchg` instructions based on their memory orders. Add a `FailureOrdering` field to `LegalityQuery::MemDesc`; it is only set for `cmpxchg` instructions, otherwise it is `NotAtomic`. I didn't rename `Ordering` to `SuccessOrdering` or similar to avoid breaking changes for out of tree targets. The new field does not increase `sizeof(MemDesc)`, it falls into previous padding bits due to alignment, so I'd expect there to be no performance impact for this change. Verified no breakage via check-llvm in build with AMDGPU, AArch64, and X86 targets enabled.	2025-10-09 09:05:04 +02:00
David Green	a29d7a1f04	[GlobalISel] fdiv to fmul transform (#144305 ) This is a port of the SDAG DAGCombiner::combineRepeatedFPDivisors combine that looks for multiple fdiv operations with the same divisor and converts them to a single reciprocal fdiv and multiple fmuls. It is currently a fairly faithful port, with some additions to make sure that the newly created fdiv dominates all new uses. Compared to the SDAG version it also drops some logic about splat uses which assumes no vector fdivs and some logic about x/sqrt(x) which does not yet apply to GISel.	2025-10-08 07:26:25 +01:00
jyli0116	619d36ff4f	[GISel] Combine shift + trunc + shift pattern (#155583 ) Folds shift(trunc(shift(...))) pattern into trunc(shift(...)) by combining the two shift instructions	2025-09-10 15:01:55 +01:00
jyli0116	dbadab96eb	[GlobalISel] Support saturated truncate (#150219 ) Implements combining and legalization of G_TRUNC_SSAT_S, G_TRUNC_SSAT_U, and G_TRUNC_USAT_U, which were previously added to SDAG with the below patterns: ``` truncate(smin(smax(x, C1), C2)) -> trunc_ssat_s(x) truncate(smax(smin(x, C2), C1)) -> trunc_ssat_s(x) truncate(smax(smin(x, C), 0)) -> trunc_ssat_u(x) truncate(smin(smax(x, 0), C)) -> trunc_ssat_u(x) truncate(umin(smax(x, 0), C)) -> trunc_ssat_u(x) truncate(umin(x, C)) -> trunc_usat_u(x) ```	2025-08-21 15:37:53 +01:00
Fabian Ritter	96775e9229	[GISel] Handle Flags in G_PTR_ADD Combines (#152495 ) So far, GlobalISel's G_PTR_ADD combines have ignored MIFlags like nuw, nusw, and inbounds. That was in many cases unnecessarily conservative and in others unsound, since reassociations re-used the existing G_PTR_ADD instructions without invalidating their flags. This patch aims to improve that. I've checked the transforms in this PR with Alive2 on corresponding middle-end IR constructs. A longer-term goal would be to encapsulate the logic that determines which GEP/ISD::PTRADD/G_PTR_ADD flags can be preserved in which case, since this occurs in similar forms in the middle end, the SelectionDAG combines, and the GlobalISel combines here. For SWDEV-516125.	2025-08-11 10:34:45 +02:00
paperchalice	ce86ff105b	[GlobalISel] Remove `UnsafeFPMath` references (#146319 ) This is the GlobalISel part to remove `UnsafeFPMath` flag in CodeGen pipeline.	2025-07-29 12:11:52 +08:00
jyli0116	fc5c5a934d	[GlobalISel] Allow expansion of srem by constant in prelegalizer (#148845 ) This patch allows srem by a constant to be expanded more efficiently to avoid the need for expensive sdiv instructions. This is the last part of the patches which fixes #118090	2025-07-17 14:43:58 +01:00
jyli0116	806028add1	[GlobaISel] Allow expanding of sdiv -> mul by constant (#146504 ) Allows the sdiv->mul by constant combine to expand in the general case. Previously this was only occurring in the exact case. This is part of the resolution to issue #118090	2025-07-14 15:01:12 +01:00
jyli0116	9c0743fbc5	[GlobalISel] Allow expansion of urem by constant in prelegalizer (#145914 ) This patch allows urem by a constant to be expanded more efficiently to avoid the need for expensive udiv instructions. This is part of the resolution to issue #118090	2025-07-02 13:46:36 +01:00
Daniel Man	045b827367	[GlobalISel] Use-Vector-Truncate Opt Needs Elt Type Check (#146003 ) In the pre-legalizer combiner, there exists a bug with UseVectorTruncate match-apply optimization. When the destinations' types do not match the vector element type of the G_UNMERGE_VALUES instruction, the resulting collapsed truncate does not preserve original functional behavior. This commit introduces a simple type check to ensure that the destination types match the vector element type.	2025-06-27 16:41:22 +09:00
Alan Li	ada2fbfe36	[GISel] Fix ShuffleVector assert (#139769 ) Fixes issue: https://github.com/llvm/llvm-project/issues/139752 When G_SHUFFLE_VECTOR has only 1 element then it is possible the vector is decayed into a scalar.	2025-05-20 21:25:31 -04:00
David Green	b9e32749d2	[GlobalISel] Clear nsw flags when converting sub to add. (#137288 ) As shown in https://alive2.llvm.org/ce/z/PVwcTL we need to clear the nsw flags too when converting a sub to an add if the constant is INT_MIN. Fixes #137254	2025-04-26 11:00:53 +01:00
Kazu Hirata	58774f1b1f	[CodeGen] Construct SmallVector with iterator ranges (NFC) (#136258 )	2025-04-18 10:26:48 -07:00
Alan Li	2795abb2f8	[GISel][AMDGPU] Expand ShuffleVector (#124527 ) This patch dismantles G_SHUFFLE_VECTOR before lowering. The original lowering would emit extract vector element ops. We found that by using unmerged values the build vector op combine could find ways to fold. Only enabled on AMDGPU. This resolves #123631	2025-04-09 17:51:24 -07:00
Krisztian Rugasi	382962b4a8	[GlobalISel] Fix dangling reference in CombinerHelper::matchCombineExtractedVectorLoad	2025-04-07 15:04:36 +02:00
Tim Gymnich	1d0005a69a	[GlobalISel][NFC] Rename GISelKnownBits to GISelValueTracking (#133466 ) - rename `GISelKnownBits` to `GISelValueTracking` to analyze more than just `KnownBits` in the future	2025-03-29 11:51:29 +01:00
Pierre van Houtryve	7dcea28bf9	[AMDGPU] Add identity_combines to RegBankCombiner (#131305 )	2025-03-17 10:11:28 +01:00
Shilei Tian	70fdd9f0a2	[GlobalISel] Check whether `G_CTLZ` is legal in `matchUMulHToLShr` (#126457 ) We need to check `G_CTLZ` because the combine uses `G_CTLZ` to get log base 2, and it is not always legal on a target. Fixes SWDEV-512440.	2025-02-10 00:11:09 -05:00
David Green	5803335539	[GlobalISel] Add brackets around || in assert. NFC	2025-02-02 07:23:17 +00:00
lialan	5d9c717597	[GISel] Fold shifts to constant result. (#123510 ) This resolves #123212	2025-01-21 05:10:45 -08:00
Min-Yih Hsu	3cac26f541	[GISel] Combine `(neg (min/max x, (neg x)))` into `(max/min x, (neg x))` (#120998 ) This is the GISel version of #120666. Also supports both unsigned and signed version of min & max.	2025-01-02 16:29:34 -08:00
Vikash Gupta	283806695a	[GlobalIsel] Add combine for select with constants (#121088 ) The SelectionDAG ISel supports both versions of the combine shown below: ``` select Cond, Pow2, 0 --> (zext Cond) << log2(Pow2) select Cond, 0, Pow2 --> (zext !Cond) << log2(Pow2) ``` GlobalISel for now only supports the first one, defined in its generic combinerHelper.cpp. This patch adds the missing second one.	2025-01-01 11:14:53 +05:30
Paul Bowen-Huggett	ee7ca0ddda	Make CombinerHelper methods const (#119529 ) There are a number of backends (specifically AArch64, AMDGPU, Mips, and RISCV) which contain a “TODO: make CombinerHelper methods const” comment. This PR does just that and makes all of the CombinerHelper methods const, removes the TODO comments and makes the associated instances const. This change makes some sense because the CombinerHelper class simply modifies the state of _other_ objects to which it holds pointers or references. Note that AMDGPU contains an identical comment for an instance of AMDGPUCombinerHelper (a subclass of CombinerHelper). I deliberately haven’t modified the methods of that class in order to limit the scope of the change. I’m happy to do so either now or as a follow-up.	2024-12-20 08:29:18 +07:00
Craig Topper	0d9fc17433	[GISel] Remove unused DataLayout operand from getApproximateEVTForLLT (#119833 )	2024-12-13 09:09:20 -08:00
David Green	94a77ebe24	[AArch64][GlobalISel] Guard against no operands in matchHoistLogicOpWithSameOpcodeHands In case both LeftHandInst and RightHandInst are IMPLICIT_DEF with no input operands, this patch protects against the post-legalizer-combiner matchHoistLogicOpWithSameOpcodeHands with no operands. The prelegalizercombiner-hoist-same-hands.mir test was cleaned up a little in the process, and has a post-legalizer run line added so that the implicit_defs do not get folded away.	2024-12-13 11:02:55 +00:00
abhishek-kaushik22	d20731ce6b	[CGData][GlobalIsel][Legalizer][DAG][MC][AsmParser][X86][AMX] Use `std::move` to avoid copy (#118068 )	2024-12-06 09:46:15 +08:00
Thorsten Schütt	f8d1905a24	[GlobalISel] Combine [S,U]SUBO (#116489 ) We import the llvm.ssub.with.overflow.* Intrinsics, but the Legalizer also builds them while legalizing other opcodes, see narrowScalarAddSub.	2024-11-18 22:39:23 +01:00
Konstantin Schwarz	0f0e2fe97b	[GlobalISel] Turn shuffle a, b, mask -> shuffle undef, b, mask iff mask does not reference a (#115377 )	2024-11-14 15:13:41 -08:00
Nikita Popov	63fb980d50	[IR] Add helper for comparing KnownBits with IR predicate (NFC) (#115878 ) Add `ICmpInst::compare()` overload accepting `KnownBits`, similar to the existing one accepting `APInt`. This is not directly part of KnownBits (or APInt) for layering reasons.	2024-11-12 17:41:08 +01:00
David Green	b242ae32f5	[AArch64][GlobalISel] Protect against undef first element in CombineShuffleConcat. In case the first element is undef, we need to look through to find a valid type for the inputs.	2024-11-11 19:37:51 +00:00
Craig Topper	17f3e00911	Recommit "[GISel][AArch64][AMDGPU][RISCV] Canonicalize (sub X, C) -> (add X, -C) (#114309 )" The increase in fallbacks that was previously reported was not caused by this change. Original description: This matches InstCombine and DAGCombine. RISC-V only has an ADDI instruction so without this we need additional patterns to do the conversion. Some of the AMDGPU tests look like possible regressions. Maybe some patterns from isel aren't imported.	2024-11-08 10:21:46 -08:00
Konstantin Schwarz	cbfe87c253	[GlobalISel] Remove references to rhs of shufflevector if rhs is undef (#115076 )	2024-11-06 16:36:13 -08:00
Craig Topper	cff2199e0f	Revert "[GISel][AArch64][AMDGPU][RISCV] Canonicalize (sub X, C) -> (add X, -C) (#114309 )" This reverts commit 999dfb2067eb75609b735944af876279025ac171. I received a report that this may have increased fallbacks on AArch64.	2024-11-06 10:45:23 -08:00
David Green	5dc9c39ac1	[GlobalISel] Check the correct register in sextload OneUse check. (#114763 ) This fixes a bug that started triggering after #111730, where we could remove a load with multiple uses. It looks like the match should be checking the other register in a one-use check. %SrcReg = load.. %DstReg = sign_extend_inreg %SrcReg	2024-11-05 10:18:07 +00:00
Craig Topper	999dfb2067	[GISel][AArch64][AMDGPU][RISCV] Canonicalize (sub X, C) -> (add X, -C) (#114309 ) This matches InstCombine and DAGCombine. RISC-V only has an ADDI instruction so without this we need additional patterns to do the conversion. Some of the AMDGPU tests look like possible regressions. Maybe some patterns from isel aren't imported.	2024-11-04 17:20:11 -08:00
Matt Arsenault	db5bcb24c2	GlobalISel: Fix combine duplicating atomic loads (#111730 ) The sext_inreg (load) combine was not deleting the old load instruction, and it would never be deleted if volatile or atomic.	2024-10-31 07:55:12 -07:00
Thorsten Schütt	d8b17f2fb6	[GlobalISel] Combine G_UNMERGE_VALUES with anyext and build vector (#112370 ) G_UNMERGE_VALUES (G_ANYEXT (G_BUILD_VECTOR)) ag G_UNMERGE_VALUES llvm/test/CodeGen/AArch64/GlobalISel | grep ANYEXT [ANYEXT] is build vector or shuffle vector Prior art: https://reviews.llvm.org/D87117 https://reviews.llvm.org/D87166 https://reviews.llvm.org/D87174 https://reviews.llvm.org/D87427 ; CHECK-NEXT: [[BUILD_VECTOR2:%[0-9]+]]:_(<8 x s8>) = G_BUILD_VECTOR [[C2]](s8), [[C2]](s8), [[C2]](s8), [[C2]](s8), [[DEF1]](s8), [[DEF1]](s8), [[DEF1]](s8), [[DEF1]](s8) ; CHECK-NEXT: [[ANYEXT1:%[0-9]+]]:_(<8 x s16>) = G_ANYEXT [[BUILD_VECTOR2]](<8 x s8>) ; CHECK-NEXT: [[UV10:%[0-9]+]]:_(<4 x s16>), [[UV11:%[0-9]+]]:_(<4 x s16>) = G_UNMERGE_VALUES [[ANYEXT1]](<8 x s16>) Test: llvm/test/CodeGen/AArch64/GlobalISel/combine-unmerge.mir	2024-10-19 09:41:43 +02:00
Petar Avramovic	14d006c53c	AMDGPU/GlobalISel: Run redundant_and combine in RegBankCombiner (#112353 ) The combine is needed to clear redundant ANDs with 1 that will be created by reg-bank-select to clean up high bits in a register. Fix replaceRegWith from CombinerHelper: if a copy has to be inserted, create the copy first and then delete MI. If MI is deleted first, the insert point is no longer valid.	2024-10-16 09:43:16 +02:00

438 Commits
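
Two entries in the log above state their folds as algebraic identities: the G_CTLS commit (`ctlz (xor x, (sra x, bitwidth-1))` -> `add (ctls x), 1`, and the shifted variant -> `ctls x`) and the select-with-constants commit (`select Cond, Pow2, 0` and `select Cond, 0, Pow2`). The following is a minimal sketch that models both on plain Python integers so the identities can be checked by hand. It assumes a 32-bit scalar, and every helper name in it is hypothetical; none of this is LLVM code or an LLVM API.

```python
# Illustrative sketch only: plain-Python models of two folds named in the log.
# All helper names are hypothetical and are not LLVM APIs.

BITS = 32  # assumed scalar width for the illustration


def mask(bits=BITS):
    """All-ones value of the given width."""
    return (1 << bits) - 1


def ctlz(value, bits=BITS):
    """Count leading zero bits of an unsigned `bits`-wide value."""
    return bits - (value & mask(bits)).bit_length()


def ctls(value, bits=BITS):
    """Count leading bits equal to the sign bit, excluding the sign bit itself."""
    u = value & mask(bits)
    sign = u >> (bits - 1)
    count = 0
    for i in range(bits - 2, -1, -1):
        if (u >> i) & 1 != sign:
            break
        count += 1
    return count


def sign_xor(value, bits=BITS):
    """Model (xor x, (sra x, bitwidth-1)) on a `bits`-wide value."""
    u = value & mask(bits)
    sign_fill = mask(bits) if u >> (bits - 1) else 0
    return u ^ sign_fill


def ctlz_of_sign_xor(value, bits=BITS):
    """Left-hand side of the first fold; expected to equal ctls(x) + 1."""
    return ctlz(sign_xor(value, bits), bits)


def ctlz_of_shifted_sign_xor(value, bits=BITS):
    """Left-hand side of the second fold; expected to equal ctls(x)."""
    shifted = (sign_xor(value, bits) << 1) & mask(bits)
    return ctlz(shifted | 1, bits)


def select(cond, if_true, if_false):
    """Model a scalar select on a boolean condition."""
    return if_true if cond else if_false


def zext_shift(cond, pow2):
    """Model (zext cond) << log2(pow2) for a power-of-two constant."""
    log2 = pow2.bit_length() - 1
    return int(bool(cond)) << log2
```

For example, `ctlz_of_sign_xor(x) == ctls(x) + 1` and `select(c, 0, 8) == zext_shift(not c, 8)` can be checked for any 32-bit `x` and boolean `c`. The sketch only shows why the rewrites preserve the value; it does not model the legality or type checks that the real combines perform.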