llvm-project

Author	SHA1	Message	Date
Vitaly Buka	d57892a2a1	Revert "[DAGCombiner] Add support for scalarising extracts of a vector setcc" (#118693 ) Reverts llvm/llvm-project#117566 Breaks libc++ tests with HWASAN https://lab.llvm.org/buildbot/#/builders/55/builds/3959	2024-12-04 12:36:46 -08:00
Sander de Smalen	048fc2bc10	[LiveIntervals] Ignore artificial regs when adding kill flags (#116963 ) If parts of a physical register for a given liverange, as assigned by the register allocator, can be used to store other values not represented by this liverange, then `LiveIntervals::addKillFlags` normally avoids adding a kill flag on the use of this register when the value's liverange ends. However, if all the other regunits are artificial, then we can still safely add the kill flag, since those parts of the register can never be accessed independently.	2024-12-04 20:25:31 +00:00
Craig Topper	4cf2cf18c9	[RISCV][GISel] Stop over promoting G_SITOFP/UITOFP libcalls on RV64. (#118597 ) When we have legal instructions we want to promote to sXLen and let isel pattern matching removing the and/sext_inreg. When using a libcall we want to use a 'si' libcall for small types instead of 'di'. To match the RV64 ABI, we need to sign extend `unsigned int` arguments. We reuse the shouldSignExtendTypeInLibCall hook from SelectionDAG.	2024-12-04 10:42:49 -08:00
Oliver Stannard	99b862efba	[DAGISel][ARM] Fix vector truncate combine for big-endian (#118101 ) This DAG combine was incorrect for big-endian targets, because it assumes that when a bitcast changes the lane width, the least-significant bits of the wider lanes are in the lower-numbered lanes of the smaller type, which is only true for little-endian.	2024-12-04 14:32:15 +00:00
Thorsten Schütt	148fdc519c	[GlobalISel] Add G_ABDS and G_ABDU instructions (#118122 ) The DAG has the same instructions: the signed and unsigned absolute difference of it's input. For AArch64, they map to uabd and sabd for Neon and SVE. The Neon and SVE instructions will require custom patterns. They are pseudo opcodes and are not imported by the IRTranslator. We need combines to create them. PowerPC, ARM, and AArch64 have native instructions. /// i.e trunc(abs(sext(Op0) - sext(Op1))) becomes abds(Op0, Op1) /// or trunc(abs(zext(Op0) - zext(Op1))) becomes abdu(Op0, Op1) For GlobalISel, we are going to write the combines in MIR patterns. see: llvm/test/CodeGen/AArch64/abd-combine.ll - [ ] combine into abd - [ ] legalize and add td patterns	2024-12-04 12:53:15 +01:00
David Sherwood	4675db5f39	[DAGCombiner] Add support for scalarising extracts of a vector setcc (#117566 ) For IR like this: %icmp = icmp ult <4 x i32> %a, splat (i32 5) %res = extractelement <4 x i1> %icmp, i32 1 where there is only one use of %icmp we can take a similar approach to what we already do for binary ops such add, sub, etc. and convert this into %ext = extractelement <4 x i32> %a, i32 1 %res = icmp ult i32 %ext, 5 For AArch64 targets at least the scalar boolean result will almost certainly need to be in a GPR anyway, since it will probably be used by branches for control flow. I've tried to reuse existing code in scalarizeExtractedBinop to also work for setcc. NOTE: The optimisations don't apply for tests such as extract_icmp_v4i32_splat_rhs in the file CodeGen/AArch64/extract-vector-cmp.ll because scalarizeExtractedBinOp only works if one of the input operands is a constant.	2024-12-04 10:26:51 +00:00
Sam Elliott	73731d6873	[llvm-tblgen] Increase Coverage Index Size (#118329 )	2024-12-04 09:19:13 +00:00
Akshat Oke	d9b4bdbff5	[CodeGen][NewPM] Port LiveDebugVariables to NPM (#115468 ) The existing analysis was already a pimpl wrapper. I have extracted legacy pass logic to a LDVImpl wrapper named `LiveDebugVariables` which is the analysis::Result now. This controls whether to activate the LDV (depending on `-live-debug-variables` and DIsubprogram) itself. The legacy and new analysis only construct the LiveDebugVariables. VirtRegRewriter will test this.	2024-12-04 14:31:34 +05:30
Simon Pilgrim	b1a48af56a	[DAG] SimplifyDemandedVectorElts - add handling for INT<->FP conversions (#117884 )	2024-12-04 07:37:01 +00:00
Craig Topper	b076fbb844	[TargetLowering] Use Type* instead of EVT in shouldSignExtendTypeInLibCall. (#118587 ) I want to use this function for GISel too so Type * is a better common interface. All of the callers already convert EVT to Type * as needed by calling lowering anyway.	2024-12-03 22:06:55 -08:00
Brandon Wu	109e4a147f	[RISCV] Handle zeroinitializer of vector tuple Type (#113995 ) It doesn't make sense to add a new generic ISD to handle riscv tuple type. Instead we use `SPLAT_VECTOR` for ISD and further lower to `VMV_V_X`. Note: If there's `visitSPLAT_VECTOR` in generic DAG combiner, it needs to skip riscv vector tuple type. Stack on https://github.com/llvm/llvm-project/pull/114329	2024-12-04 13:40:02 +08:00
Kyungwoo Lee	4f41862c5a	Reapply "[StructuralHash] Global Variable (#118412 )" This reverts commit 6a0d6fc2e92bcfb7cb01a4c6cdd751a9b4b4c159.	2024-12-03 21:33:03 -08:00
Fangrui Song	932c5249ff	[CodeGen] Fix warning after #115531	2024-12-03 21:12:06 -08:00
Craig Topper	caa8aa551b	[SelectionDAG] Rename CallOptions::IsSExt to IsSigned. NFC (#118574 ) This is eventually passed to shouldSignExtendTypeInLibCall which calls it IsSigned.	2024-12-03 18:25:44 -08:00
Kyungwoo Lee	6a0d6fc2e9	Revert "[StructuralHash] Global Variable (#118412 )" This reverts commit 1afb81dfaf902c1c42bd91fec1a7385e6e1529d3.	2024-12-03 17:19:30 -08:00
Shubham Sandeep Rastogi	259bdc0033	Revert "Reland "[NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 )" (#118546 )" This reverts commit 0c8928d456ac3ef23ed25bfc9e5d491dd7b62a11. Broke Bot: https://lab.llvm.org/buildbot/#/builders/76/builds/5008 error: undefined reference to `vtable for llvm::DroppedVariableStatsIR'	2024-12-03 16:50:53 -08:00
Matt Arsenault	e9dc6c5fbb	CodeGen: Don't assert when printing null GlobalAddress operands (#115531 )	2024-12-03 19:15:56 -05:00
Kyungwoo Lee	1afb81dfaf	[StructuralHash] Global Variable (#118412 ) This update enhances the implementation of structural hashing for global variables, using their initial contents. Private global variables or constants are often used for metadata, where their names are not unique. This can lead to the creation of different hash results although they could be merged by the linker as they are effectively identical. - Refine the hashing of GlobalVariables for strings or certain Objective-C metadata cases that have section names. This can be further extended to other scenarios. - Expose StructuralHash for GlobalVariable so that this API can be utilized by MachineStableHashing, which is also employed in the global function outliner. This change significantly improves size reduction by an additional 1% on the LLD binary when the global function outliner and merger are enabled together. As discussed in the RFC https://discourse.llvm.org/t/loh-conflicting-with-machineoutliner/83279/8?u=kyulee-com, if we disable or relocate the LOH pass, the size impact could increase to 4%.	2024-12-03 16:01:50 -08:00
Shubham Sandeep Rastogi	0c8928d456	Reland "[NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 )" (#118546 ) Removed the virtual destructor in the derived class DroppedVariableStatsIR	2024-12-03 14:13:06 -08:00
Shubham Sandeep Rastogi	80987ef4b6	Revert "Reland [NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 )" This reverts commit acf3b1aa932b2237c181686e52bc61584a80a3ff. Broke https://lab.llvm.org/buildbot/#/builders/76/builds/5002 tools/clang/lib/CodeGen/CMakeFiles/obj.clangCodeGen.dir/BackendUtil.cpp.o:(.toc+0x258): undefined reference to `vtable for llvm::DroppedVariableStatsIR'	2024-12-03 12:51:24 -08:00
Shubham Sandeep Rastogi	d8b5af4504	Revert "Reland "Add a pass to collect dropped var stats for MIR" (#117044 )" This reverts commit 249755cedb17ffa707253edcef1a388f807caa35. Broke https://lab.llvm.org/buildbot/#/builders/160/builds/9420 Note: This is test shard 99 of 154. [==========] Running 2 tests from 2 test suites. [----------] Global test environment set-up. [----------] 1 test from DroppedVariableStatsMIR [ RUN ] DroppedVariableStatsMIR.InlinedAt -- exit: -11	2024-12-03 12:50:13 -08:00
Shubham Sandeep Rastogi	249755cedb	Reland "Add a pass to collect dropped var stats for MIR" (#117044 ) Moved the MIR Test to the unittests/CodeGen folder This is patch is part of a stack of patches, and follows https://github.com/llvm/llvm-project/pull/117042 I moved the MIR test to the unittests/CodeGen folder I am trying to reland https://github.com/llvm/llvm-project/pull/115566	2024-12-03 12:37:30 -08:00
Shubham Sandeep Rastogi	acf3b1aa93	Reland [NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 ) Moved the IR unit test to the CodeGen folder to resolve linker errors: `error: undefined reference to 'vtable for llvm::DroppedVariableStatsIR'` This patch is trying to reland https://github.com/llvm/llvm-project/pull/115563	2024-12-03 10:39:40 -08:00
Matt Arsenault	50249263a1	MLRegAlloc: Do not use float to count number of used registers [NFC] (#118489 )	2024-12-03 11:40:30 -05:00
Nikita Popov	b2df007413	[FastISel] Support unreachable with NoTrapAfterNoReturn (#118296 ) Currently FastISel triggers a fallback if there is an unreachable terminator and the TrapUnreachable option is enabled (the ISD::TRAP selection does not actually work). Add handling for NoTrapAfterNoReturn, in which case we don't actually need to emit a trap. The test is just there to make sure there is no FastISel fallback (which is why I'm not testing the case without noreturn). We have other tests that check the actual unreachable codegen variations.	2024-12-03 12:54:26 +01:00
fengfeng	7907292daa	[DAG] Apply Disjoint flag. (#118045 ) or disjoint (or disjoint (x, c0), c1) --> or disjont x, or (c0, c1) Alive2: https://alive2.llvm.org/ce/z/3wPth5 --------- Signed-off-by: feng.feng <feng.feng@iluvatar.com>	2024-12-03 09:21:03 +08:00
Craig Topper	a15400d05d	[RISCV][GISel] Support f32/f64 ldexp. (#117941 ) The existing libcall lowering in LegalizerHelper.cpp did not account for one operand being integer. Reuse the G_FPOWI code to fix this.	2024-12-02 13:30:46 -08:00
Matt Arsenault	468fb5fc7e	RegisterCoalescer: Set undef on full register uses when coalescing implicit_def (#118321 ) Previously this would delete the IMPLICIT_DEF and not introduce the undef flag on the use operand. Fixes sub-issue found while reducing #109294	2024-12-02 14:43:04 -05:00
Craig Topper	bee33b5291	[RISCV][GISel] Support f32/f64 powi. (#117937 ) Need to force libcall legalization to treat the integer argument as signed so that it can be promoted to XLen in call lowering for RV64. Alternatively we could promote the operand before converting to libcall, but going through call lowering is closer to what SelectionDAG does.	2024-12-02 09:06:38 -08:00
Craig Topper	73186546f0	[LegalizeTypes][RISCV] Call setTypeListBeforeSoften from ExpandIntRes_FP_TO_XINT if the FP type needs to be softened (#118269 ) This avoids an unnecessary sext.w before the libcall.	2024-12-02 09:06:08 -08:00
Yingwei Zheng	6568ceb9fa	[CodeGenPrepare] Drop nsw flags in `optimizeLoadExt` (#118180 ) Alive2: https://alive2.llvm.org/ce/z/pMcD7q Closes https://github.com/llvm/llvm-project/issues/118172.	2024-12-01 11:25:31 +08:00
Florian Hahn	9a0f25158c	[SelectOpt] Support ADD and SUB with zext operands. (#115489 ) Extend the support for implicit selects in the form of OR with a ZExt operand to support ADD and SUB binops as well. They similarly can form implicit selects which can be profitable to convert back the branches. PR: https://github.com/llvm/llvm-project/pull/115489	2024-11-30 21:05:41 +00:00
antangelo	b9ac390cc7	[GISel] Add generic implementation for @llvm.expect.with.probability when optimizations are disabled (#117835 ) Handle @llvm.expect.with.probability in GlobalISel in the same way @llvm.expect is handled, passing the value through as-is. This can be encountered if the intrinsic is used without optimizations, which would otherwise transform it out. Fixes #115411 for GlobalISel	2024-11-29 22:30:13 -05:00
Simon Pilgrim	31b7d4333a	[DAG] Extend extract_element(bitcast(scalar_to_vector(X))) -> trunc(srl(X,C)) (#117900 ) When extracting a smaller integer from a scalar_to_vector source, we were limited to only folding/truncating the lowest bits of the scalar source. This patch extends the fold to handle extraction of any other element, by right shifting the source before truncation. Fixes a regression from #117884	2024-11-29 17:24:38 +00:00
Akshat Oke	b68340c835	[CodeGen][NewPM] Port SpillPlacement analysis to NPM (#116618 )	2024-11-29 16:55:40 +05:30
Matt Arsenault	26fd693b97	RegisterCoalescer: Fix creating full / empty subrange on undef subreg use (#117936 )	2024-11-28 11:12:19 -05:00
Sander de Smalen	243c979187	Reland "[RegisterCoalescer] Fix up subreg lanemasks after rematerializing. (#116191 )" This patch can now reland after 318c69de52b6 relanded #114827. This reverts commit 14a58a1390a72ba6c66606e58e86425dcb902763.	2024-11-28 09:56:11 +00:00
Sander de Smalen	61653f8e39	Reland "[InitUndef] handleSubReg should skip artificial subregs. (#116248 )" This patch can now reland after 318c69de52b6 relanded #114827. This reverts commit 1683f84d289348ba6879635c4161979204f75230.	2024-11-28 09:56:11 +00:00
Petar Avramovic	87503fa51c	Revert "AMDGPU/GlobalISel: Add stub custom regbankselect pass" (#113913 ) This reverts commit e9c49901a43f5b16c3df416460b7e4dbdd24ce03. Current AMDGPURegBankSelect does nothing different then RegBankSelect. Revert to using generic RegBankSelect in preparation for adding new regbankselect passes. New AMDGPURegBankSelect, that will use uniformity analysis for regbank select decisions, will not subclass RegBankSelect. Revert regression tests to use regbankselect since amdgpu-regbankselect will be used by new pass and behavior will be different.	2024-11-27 13:16:22 -05:00
Igor Kirillov	e874c8fc27	[SelectOpt] Refactor to prepare for support more select-like operations (#117582 ) * Enables conversion of several select-like instructions within one group * Any number of auxiliary instructions depending on the same condition can be in between select-like instructions * After splitting the basic block, move select-like instructions into the relevant basic blocks and optimise them * Make it easier to add support shift-base select-like instructions and also any mixture of zext/sext/not instructions	2024-11-27 11:35:59 +00:00
Pengcheng Wang	3618c9930f	[MISched] Use right boundary when trying latency heuristics (#116592 ) We may do bottom-up or bidirectional scheduling but previously we assume we are doing top-down scheduling, which may cause some issues.	2024-11-27 14:46:05 +08:00
Sergei Barannikov	61a23646c9	[SjLjEHPrepare] Configure call sites correctly (#117656 ) After 9fe78db4, the pass inserts `store volatile i32 -1, ptr %call_site` before all invoke instruction except the one in the entry block, which has the effect of bypassing landing pads on exceptions. When configuring the call site for a potentially throwing instruction check that it is not `InvokeInst` -- they are handled by earlier code.	2024-11-27 08:03:47 +03:00
antangelo	dd4844722d	[SelectionDAG] Add generic implementation for @llvm.expect.with.probability when optimizations are disabled (#117459 ) Handle \@llvm.expect.with.probability in SelectionDAGBuilder, FastISel, and IntrinsicLowering in the same way \@llvm.expect is handled, where the value is passed through as-is. This can be reached if the intrinsic is used without optimizations, where it would otherwise be properly transformed out. Fixes #115411 for SelectionDAG. A similar patch is likely needed for GlobalISel.	2024-11-26 20:22:25 -05:00
Jonas Paulsson	175e0dd422	[MachineLateInstrsCleanup] Minor fixing (NFC). (#117816 ) With cb57b7a7, MachineLateInstrsCleanup switched to using a map to keep track of kill flags to remedy compile time regressions seen with huge functions. It seems that the comment above clearKillsForDef() became stale with that commit, and also that one of the arguments to it became unused, both of which this patch fixes.	2024-11-27 01:41:42 +01:00
Craig Topper	43b6b78771	[RISCV][GISel] Use libcalls for f32/f64 G_FCMP without F/D extensions. (#117660 ) LegalizerHelp only supported f128 libcalls and incorrectly assumed that the destination register for the G_FCMP was s32.	2024-11-26 15:48:49 -08:00
Jeremy Morse	624e52b1e3	[DebugInfo] Handle trailing empty blocks when seeking prologue_end spot (#117320 ) The optimiser will produce empty blocks that are unconditionally executed according to the CFG -- while it may not be meaningful code, and won't get a prologue_end position, we need to not crash on this input. The fault comes from assuming that there's always a next block with some instructions in it, that will eventually produce some meaningful control flow to stop at -- in the given reproducer in issue #117206 this isn't true, because the function terminates with `unreachable`. Thus, I've refactored the "get next instruction logic" into a helper that'll step through all blocks and terminate if there aren't any more. Reproducer from aeubanks	2024-11-26 14:24:25 +00:00
Nikita Popov	3e1b55cafc	[SDAG] Don't allow implicit trunc in getConstant() (#117558 ) Assert that the passed value is a valid unsigned integer value for the specified type. For signed values getSignedConstant() / getSignedTargetConstant() should be used instead.	2024-11-26 10:36:00 +01:00
Craig Topper	bc282605df	[SelectionDAG] Require last operand of (STRICT_)FP_ROUND to be a TargetConstant. (#117639 ) Fix all the places I could find that did't do this. We were already mostly correct for FP_ROUND after 9a976f36615dbe15e76c12b22f711b2e597a8e51, but not STRICT_FP_ROUND.	2024-11-25 21:36:33 -08:00
Philip Reames	6657d4bd70	[TTI][RISCV] Unconditionally break critical edges to sink ADDI (#108889 ) This looks like a rather weird change, so let me explain why this isn't as unreasonable as it looks. Let's start with the problem it's solving. ``` define signext i32 @overlap_live_ranges(ptr %arg, i32 signext %arg1) { bb: %i = icmp eq i32 %arg1, 1 br i1 %i, label %bb2, label %bb5 bb2: ; preds = %bb %i3 = getelementptr inbounds nuw i8, ptr %arg, i64 4 %i4 = load i32, ptr %i3, align 4 br label %bb5 bb5: ; preds = %bb2, %bb %i6 = phi i32 [ %i4, %bb2 ], [ 13, %bb ] ret i32 %i6 } ``` Right now, we codegen this as: ``` li a3, 1 li a2, 13 bne a1, a3, .LBB0_2 lw a2, 4(a0) .LBB0_2: mv a0, a2 ret ``` In this example, we have two values which must be assigned to a0 per the ABI (%arg, and the return value). SelectionDAG ensures that all values used in a successor phi are defined before exit the predecessor block. This creates an ADDI to materialize the immediate in the entry block. Currently, this ADDI is not sunk into the tail block because we'd have to split a critical edges to do so. Note that if our immediate was anything large enough to require two instructions we would split this critical edge. Looking at other targets, we notice that they don't seem to have this problem. They perform the sinking, and tail duplication that we don't. Why? Well, it turns out for AArch64 that this is entirely an accident of the existance of the gpr32all register class. The immediate is materialized into the gpr32 class, and then copied into the gpr32all register class. The existance of that copy puts us right back into the two instruction case noted above. This change essentially just bypasses this emergent behavior aspect of the aarch64 behavior, and implements the same "always sink immediates" behavior for RISCV as well.	2024-11-25 18:59:31 -08:00
Craig Topper	ebcaa57715	[GISel] #undef macros when they are no longer needed. NFC (#117652 ) These macros are created inside a function. They should be undefined before the end of the function.	2024-11-25 18:00:03 -08:00

1 2 3 4 5 ...

36863 Commits