llvm-project

Author	SHA1	Message	Date
Mikael Holmen	6647f3cd01	[CombinerHelper] Fix gcc warning [NFC] Without the fix gcc complains with ../lib/CodeGen/GlobalISel/CombinerHelper.cpp:1652:52: warning: suggest parentheses around '&&' within '\|\|' [-Wparentheses] 1652 \| SrcDef->getOpcode() == TargetOpcode::G_OR && "Unexpected op"); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~	2023-05-10 09:08:06 +02:00
Craig Topper	7c5209d017	[LegalizeTypes] Simplify code for UndefinedBooleanContent in PromoteIntOp_VECREDUCE. We can treat UndefinedBooleanContent the same as ZeroOrOneBooleanContent. There's no reason to consider sign extending.	2023-05-09 23:16:27 -07:00
Craig Topper	c582146a49	[LegalizeTypes] Use ISD::isTrueWhenEqual to simplify code. NFC	2023-05-09 22:49:22 -07:00
Chen Zheng	fb45493562	[DebugLine] save one debug line entry for empty prologue Reland D147506 after fixing the failure in bot https://lab.llvm.org/buildbot/#/builders/247/builds/4125 Some debuggers like DBX on AIX assume the address in debug line entries is always incremental. But clang generates two entries (entry for file scope line and entry for prologue end) with same address if prologue is empty And if the prologue is empty, seems the first debug line entry for the function is unnecessary(i.e. removing the first entry won't impact the behavior in GDB on Linux), so I implement this for all debuggers. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D147506	2023-05-10 01:21:02 +00:00
Sami Tolvanen	9e869efc1b	[KCFI] Expand the KCFI term in comments (NFC)	2023-05-09 23:17:33 +00:00
Fangrui Song	bcaa0b26aa	PrologEpilogInserter: Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds	2023-05-09 13:23:58 -07:00
Aaron Ballman	945f6e65be	Wrap debug code with the LLVM_DEBUG macro; NFC While investigating a bug in Clang, I noticed that -Wframe-larger-than was emitting extra debug information along with the diagnostic. It turns out that 2e1e2f52f357768186ecfcc5ac53d5fa53d1b094 fixed an issue with the diagnostic, but accidentally left in some debug code that was exposed in all builds. So now we no longer emit things like: 8/4294967304 (0.00%) spills, 4294967296/4294967304 (100.00%) variables along with the diagnostic	2023-05-09 15:53:24 -04:00
Sami Tolvanen	e9569748de	[CodeGen][KCFI] Move cfi-type lowering to TargetLowering KCFI machine function passes transform indirect calls with a cfi-type attribute into architecture-specific type checks bundled together with the calls. Instead of having a separate pass for each architecture, add a generic machine function pass for KCFI and move the architecture-specific code that emits the actual check to TargetLowering. This avoids unnecessary duplication and makes it easier to add KCFI support to other architectures. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D149915	2023-05-09 18:38:54 +00:00
Zain Jaffal	5d3a884229	[IRGen] Change annotation metadata to support inserting tuple of strings into annotation metadata array. Annotation metadata supports adding singular annotation strings to annotation block. This patch adds the ability to insert a tuple of strings into the metadata array. The idea here is that each tuple of strings represents a piece of information that can be all related. It makes it easier to parse through related metadata information given it will be contained in one tuple. For example in remarks any pass that implements annotation remarks can have different type of remarks and pass additional information for each. The original behaviour of annotation remarks is preserved here and we can mix tuple annotations and single annotations for the same instruction. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D148328	2023-05-09 17:51:28 +03:00
Amara Emerson	e1472db58e	[GlobalISel] Implement commuting shl (add/or x, c1), c2 -> add/or (shl x, c2), c1 << c2 There's a target hook that's called in DAGCombiner that we stub here, I'll implement the equivalent override for AArch64 in a subsequent patch since it's used by different shift combine. This change by itself has minor code size improvements on arm64 -Os CTMark: Program size.__text outputg181ppyy output8av1cxfn diff consumer-typeset/consumer-typeset 410648.00 410648.00 0.0% tramp3d-v4/tramp3d-v4 364176.00 364176.00 0.0% kimwitu++/kc 449216.00 449212.00 -0.0% 7zip/7zip-benchmark 576128.00 576120.00 -0.0% sqlite3/sqlite3 285108.00 285100.00 -0.0% SPASS/SPASS 411720.00 411688.00 -0.0% ClamAV/clamscan 379868.00 379764.00 -0.0% Bullet/bullet 452064.00 451928.00 -0.0% mafft/pairlocalalign 246184.00 246108.00 -0.0% lencod/lencod 428524.00 428152.00 -0.1% Geomean difference -0.0% Differential Revision: https://reviews.llvm.org/D150086	2023-05-08 22:37:43 -07:00
Pierre Calixte	971d982bd4	Do not optimize debug locations across section boundaries Prevent optimization of DebugLoc across section boundaries, such optimization will yield incorrect source location if memory layout of sections does not strictly match the Asm file. Reviewed By: #debug-info, dblaikie, MaskRay Differential Revision: https://reviews.llvm.org/D149294	2023-05-09 00:12:59 +00:00
Alan Zhao	f4999d3535	Revert "[CodeGen][ShrinkWrap] Split restore point" This reverts commit 1ddfd1c8186735c62b642df05c505dc4907ffac4. The original commit causes a Chrome build assertion failure with ThinLTO: https://crbug.com/1443635	2023-05-08 16:27:59 -07:00
Jonas Paulsson	10f0158f00	[MachineLateInstrsCleanup] Bugfix for handling of kill flags. With cb57b7a7, the kill flags are now tracked during the forward search over the instructions and the call to findRegisterUseOperandIdx() should therefore only check for killing uses. As shown with the failing test CodeGen/Hexagon/vector-sint-to-fp.ll, it could otherwise be the case that an undef use after the instruction that killed the register will be inserted into MBBKills, and the kill flag will not be cleared.	2023-05-08 17:12:43 +02:00
Dhruv Chawla	1d21d2eb7f	[TargetLowering] Fix unnecessary call to `computeKnownBits` (NFCI) In the SimplifyDemandedBits function, there is a fallthrough to the default case in the case of ISD::ADD, ISD::MUL and ISD::SUB. This leads to a call to computeKnownBits which is unnecessary as the calls to SimplifyDemandedBits in the cases themselves handle the calculation of the known bits. This information is discarded through the Known2 variables. By keeping this information around and calling KnownBits::mul or KnownBits::computeForAddSub directly, the unnecessary computation can be avoided. For now, the NSW bit is not passed through to KnownBits as this is something that computeKnownBits does not handle either. This requires updating computeForAddCarry to handle the flag as well. Differential Revision: https://reviews.llvm.org/D150110	2023-05-08 16:14:01 +02:00
Akshay Khadse	5c7c3af1d0	Reapply [Coverity] Fix explicit null dereferences This change fixes static code analysis errors Reviewed By: skan Differential Revision: https://reviews.llvm.org/D149506	2023-05-08 21:19:40 +08:00
sgokhale	1ddfd1c818	[CodeGen][ShrinkWrap] Split restore point Try to reland D42600 Differential Revision: https://reviews.llvm.org/D42600	2023-05-08 13:21:07 +05:30
Jonas Paulsson	cb57b7a770	[MachineLateInstrsCleanup] Improve compile time for huge functions. It was discovered that this pass could be slow on huge functions, meaning 20% compile time instead of the usual ~0.5% (with a test case spending ~19 mins just in the backend). The problem related to the necessary clearing of earlier kill flags when a redundant instruction is removed. With this patch, the handling of kill flags is now done by maintaining a map instead of scanning backwards in the function. This remedies the compile time on the huge file fully. Reviewed By: vpykhtin, arsenm Differential Revision: https://reviews.llvm.org/D147532 Resolves https://github.com/llvm/llvm-project/issues/61397	2023-05-08 09:05:17 +02:00
David Green	b774f14841	[DAG] Calculate the number of sign bits for constant BUILD_VECTOR directly. For constant BUILD_VECTORs the operands need to be legal types. This can mean that when the number of sign bits is calculated it may look that the entire constant and inefficiently produce less sign bits than it could. For example i8 vectors could use i32 elements, for which 0x000000ff would be incorrectly limited to 1 sign bit as the original value has 24 sign bits. This makes it look at the constant directly, truncated to the correct type for the element so that it can correctly return 8. Differential Revision: https://reviews.llvm.org/D149956	2023-05-07 22:31:10 +01:00
Simon Pilgrim	b7116ba8b0	[DAG] computeOverflowForUnsignedAdd - use ConstantRange::unsignedAddMayOverflow as fallback Replaces the more specific uadd_ov case	2023-05-06 22:03:38 +01:00
Simon Pilgrim	b83aa8bc75	[DAG] computeOverflowForUnsignedAdd - use getMaxValue().ult(2) to detect 0/1 values. NFCI.	2023-05-06 19:46:34 +01:00
Simon Pilgrim	8f82d8ee76	[DAG] visitSUBSAT - fold subsat(x,y) -> sub(x,y) if it never overflows	2023-05-06 15:55:04 +01:00
Simon Pilgrim	08c1150d4c	[DAG] Add computeOverflowForSignedSub/computeOverflowForUnsignedSub/computeOverflowForSub Match the addition variants (although computeOverflowForUnsignedSub is really just a placeholder), and use this in DAGCombiner::visitSUBO	2023-05-06 15:55:04 +01:00
Jay Foad	3551e0f345	[RegisterCoalescer] Fix problem with IMPLICIT_DEF live-in to an invoke Give up on erasing an IMPLICIT_DEF if it might be live-in to a call instruction in a basic block with EH pad successors. This fixes a liveness bug that will be diagnosed by MachineVerifer when D149947 lands. Differential Revision: https://reviews.llvm.org/D149954	2023-05-06 15:16:54 +01:00
Simon Pilgrim	3fb067f7ba	[DAG] visitADDSAT - fold saddsat(x,y) -> add(x,y) if it never overflows Extend existing uaddsat(x,y) fold	2023-05-06 14:18:23 +01:00
Simon Pilgrim	489e728672	[DAG] computeOverflowForSignedAdd - fix typo in comment. NFC.	2023-05-06 14:18:22 +01:00
Simon Pilgrim	7395f6ae78	[DAG] Add computeOverflowForSignedAdd and computeOverflowForAdd wrapper Add basic computeOverflowForSignedAdd helper to recognise that sadd overflow can't occur if both operands have more that one sign bit. Add computeOverflowForAdd wrapper that calls computeOverflowForSignedAdd/computeOverflowForUnsignedAdd depending on the IsSigned argument, and use this in DAGCombiner::visitADDO	2023-05-06 13:33:14 +01:00
Simon Pilgrim	c7fce3f98b	[DAG] Rename computeOverflowKind -> computeOverflowForUnsignedAdd. NFC. Matches the naming convention for the equivalent ValueTracking helpers - further SelectionDAG computeOverflowFor*() helpers will be added soon.	2023-05-05 19:38:54 +01:00
Simon Pilgrim	051918c71e	[DAG] expandIntMINMAX - add umax(x,1) --> sub(x,cmpeq(x,0)) fold Move the fold from X86 to generic expansion (We also have several existing expansions that are missing freezes on repeated operands - I've added a TODO for now).	2023-05-05 19:27:52 +01:00
Simon Pilgrim	04e809ab90	[DAG] Add TargetLowering::expandABD and convert X86 lowering to use it Scalar widening cases are still custom lowered in the X86 backend - we still need to add promotion/legalization support to handle these	2023-05-05 15:13:23 +01:00
Luo, Yuanke	ae1ca47bb4	[Coverity] Big parameter passed by value.	2023-05-05 09:50:38 +08:00
Luo, Yuanke	b0fb98227c	[Coverity] Big parameter passed by value.	2023-05-05 09:15:22 +08:00
Craig Topper	fe9f557578	[DAGCombiner][RISCV] Enable reassociation for VP_FMA in visitFADDForFMACombine. Reviewed By: fakepaper56 Differential Revision: https://reviews.llvm.org/D149911	2023-05-04 17:20:58 -07:00
Ilya Kuklin	c395a84600	[MSP430] Get the DWARF pointer size from MCAsmInfo instead of DataLayout. This change will allow to put code pointers in DWARF info fields that are larger than actual pointer size, e.g. 16-bit pointers into 32-bit fields. The need for this came up while creating support for MSP430 in LLDB. MSP430-GCC already generates DWARF info with 32-bit fields, so this change is necessary for LLDB to maintain compatibility with both GCC and LLVM binaries. Moreover, right now in LLDB there is no support for having DWARF pointer size different from ELF header type, e.g. 16-bit DWARF info within ELF32, and it seems there is no such thing as ELF16. Since other mainline targets are made to have the same pointer size in both MCAsmInfo and DataLayout, there is no need to change anything there. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D148042	2023-05-04 12:37:30 -07:00
Felipe de Azevedo Piovezan	ae39de91b8	[MIRParser][nfc] Factor out code parsing debug MD nodes This commit splits a function that both parses MD nodes from YAML into DI{Expr,Loc,Variable} objects AND adds an entry to the MF variable table, so that each of those jobs is done separately. It will enable subsequent patches to reuse the MD node parsing code. Differential Revision: https://reviews.llvm.org/D149870	2023-05-04 14:17:08 -04:00
Yeting Kuo	287aa6c453	[DAGCombiner] Use generalized pattern match for visitFSUBForFMACombine. The patch makes visitFSUBForFMACombine serve vp.fsub too. It helps DAGCombiner to fuse vp.fsub and vp.fmul patterns to vp.fma. Reviewed By: luke Differential Revision: https://reviews.llvm.org/D149821	2023-05-04 22:02:32 +08:00
Luo, Yuanke	d9b92c4d55	[Coverity] Improper use of negtive value. The `Iteration` value may be -1 which would cause incorrect loop count when pass the value to buildSqrtNROneConst or buildSqrtNRTwoConst.	2023-05-04 21:11:49 +08:00
Evgenii Kudriashov	a82d27a9a6	[X86] Support llvm.{min,max}imum.f{16,32,64} Addresses https://github.com/llvm/llvm-project/issues/53353 Reviewed By: RKSimon, pengfei Differential Revision: https://reviews.llvm.org/D145634	2023-05-04 21:04:48 +08:00
NAKAMURA Takumi	342a3ce27e	Move LLT::dump()'s impl to LowLevelType.cpp Suggested by @jobnoorman https://reviews.llvm.org/D148767#4317848	2023-05-04 21:29:59 +09:00
Tom Weaver	1d8ab713ad	Revert "[DebugLine] save one debug line entry for empty prologue" This reverts commit b48a8233f5e230e46182bf5c523ceb6a04cec8f5. This change caused https://lab.llvm.org/buildbot/#/builders/247/builds/4125 to start failing, please address the failures before resubmitting.	2023-05-04 11:08:58 +01:00
Simon Pilgrim	3928589314	[DAG] computeKnownBits - remove old ashr TODO comment KnownBits::ashr now uses the minimum shift amount to try and extend the sign bit	2023-05-04 10:26:30 +01:00
Chen Zheng	b48a8233f5	[DebugLine] save one debug line entry for empty prologue Some debuggers like DBX on AIX assume the address in debug line entries is always incremental. But clang generates two entries (entry for file scope line and entry for prologue end) with same address if prologue is empty And if the prologue is empty, seems the first debug line entry for the function is unnecessary(i.e. removing the first entry won't impact the behavior in GDB on Linux), so I implement this for all debuggers. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D147506	2023-05-04 04:37:34 +00:00
Daniel Paoliello	e48826e016	Emit the correct flags for the PROC CodeView Debug Symbol The S_LPROC32_ID and S_GPROC32_ID CodeView Debug Symbols have a flags field which LLVM has had the values for (in the ProcSymFlags enum) but has never actually set. These flags are used by Microsoft-internal tooling that leverages debug information to do binary analysis. Modified LLVM to set the correct flags: - ProcSymFlags::HasOptimizedDebugInfo - always set, as this indicates that debug info is present for optimized builds (if debug info is not emitted for optimized builds, then LLVM won't emit a debug symbol at all). - ProcSymFlags::IsNoReturn and ProcSymFlags::IsNoInline - set if the function has the NoReturn or NoInline attributes respectively. - ProcSymFlags::HasFP - set if the function requires a frame pointer (per TargetFrameLowering::hasFP). Differential Revision: https://reviews.llvm.org/D148761	2023-05-03 18:20:16 -07:00
Mateja Marjanovic	cf76074a36	[AMDGPU][GlobalISel] Check exact width in get*ClassForBitWidth and widen if necessary Instead of checking if the given bitwidth is less or equal to a bitwidth of an existing RegClass, check if it has the exact same value. For LLVM vector types that don't have a corresponding Register Class, widen them during legalization. That goes for G_EXTRACT_VECTOR_ELT, G_INSERT_VECTOR_ELT and G_BUILD_VECTOR. Differential revision: https://reviews.llvm.org/D148096 Reviewers: foad, arsenm	2023-05-03 17:32:24 +02:00
Mateja Marjanovic	6175ec0bb6	Revert "[AMDGPU][GlobalISel] Widen the vector operand in G_BUILD/INSERT/EXTRACT_VECTOR" This reverts commit b25c7cafcbe1b52ea2d1ff5e5c2f13674b5f297d.	2023-05-03 17:28:01 +02:00
Mateja Marjanovic	b25c7cafcb	[AMDGPU][GlobalISel] Widen the vector operand in G_BUILD/INSERT/EXTRACT_VECTOR Widen the vector operand type in G_BUILD_VECTOR, G_INSERT_VECTOR_ELT, G_EXTRACT_VECTOR_ELT to the nearest larger RegClass.	2023-05-03 17:14:38 +02:00
Felipe de Azevedo Piovezan	a524f84780	[SelectionDAG][NFCI] Use common logic for identifying MMI vars After function argument lowering, but prior to instruction selection, dbg declares pointing to function arguments are lowered using special logic. Later, during instruction selection (both "fast" and regular ISel), this logic is "repeated" in order to identify which intrinsics have already been lowered. This is bad for two reasons: 1. The logic is not _really_ repeated, the code is different, which could lead to duplicate lowering of the intrinsic. 2. Even if the logic were repeated properly, this is still code duplication. This patch addresses these issues by storing all preprocessed dbg.declare intrinsics in a set inside FuncInfo; the set is queried upon instruction selection. Differential Revision: https://reviews.llvm.org/D149682	2023-05-03 10:58:31 -04:00
Florian Hahn	4e2b4f97a0	[ShrinkWrap] Use underlying object to rule out stack access. Allow shrink-wrapping past memory accesses that only access globals or function arguments. This patch uses getUnderlyingObject to try to identify the accessed object by a given memory operand. If it is a global or an argument, it does not access the stack of the current function and should not block shrink wrapping. Note that the caller's stack may get accessed when passing an argument via the stack, but not the stack of the current function. This addresses part of the TODO from D63152. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D149668	2023-05-03 09:28:07 +01:00
Shengchen Kan	3910a9fcb2	Revert part of D149033 b/c original code is correct This reverts part of D149033 and rG8f966cedea594d9a91e585e88a80a42c04049e6c. The added test case is kept to avoid future regression. Reviewed By: vzakhari, vdonaldson Differential Revision: https://reviews.llvm.org/D149639	2023-05-03 12:20:19 +08:00
Dávid Bolvanský	20831c3c23	[MachineInst] Switch NumOperands to 16bits Decrease NumOperands from 32 to 16bits (matches MCInstrDesc) so we can use saved bits to extend Flags (https://reviews.llvm.org/D118118). Reviewed By: barannikov88 Differential Revision: https://reviews.llvm.org/D149445	2023-05-02 22:48:22 +02:00
NAKAMURA Takumi	631bfdbee5	Switch `llvm/CodeGen/MachineValueType.h` to the generated one Prune `SupportTests/MVTTest` since it is no longer needed. Depends on D148769 Differential Revision: https://reviews.llvm.org/D148770	2023-05-03 00:13:20 +09:00

1 2 3 4 5 ...

33995 Commits