llvm-project

Author	SHA1	Message	Date
Josh Stone	87f57f459e	[RegAllocFast] Handle new debug values for spills These new debug values get inserted after the place where the spill happens, which means they won't be reached by the reverse traversal of basic block instructions. This would crash or fail assertions if they contained any virtual registers to be replaced. We can manually handle the new debug values right away to resolve this. Fixes https://github.com/llvm/llvm-project/issues/59172 Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D139590	2023-01-05 20:41:11 -08:00
Fangrui Song	2aedfdd9b8	[CodeGen] Default TargetOptions::RelaxELFRelocations to true MC and lld/ELF defaults were flipped in 2016. For Clang: CMake ENABLE_X86_RELAX_RELOCATIONS defaults to on in 2020. It makes sense for the TargetOptions default to be true now. R_X86_64_GOTPCRELX/R_X86_64_REX_GOTPCRELX require GNU ld newer than 2015-10 (subsumed by the current requirement of -fbinutils-version=). This should fix `rustc -Z plt=no` PIC relocatable files with GNU ld. (See https://github.com/rust-lang/rust/pull/106380)	2023-01-05 13:28:48 -08:00
Luke Drummond	108766fc7e	Fix typos I found one typo of "implemnt", then some more. s/implemnt/implement/g	2023-01-05 18:49:23 +00:00
Craig Topper	11e92bd61f	[SelectionDAG] Improve codegen for udiv by constant if any divisors are 1. If the divisor is 1, the magic algorithm does not return a correct result and we end up using a select to pick the numerator for those elements at the end. Therefore we can use undef for that element of the earlier operations when the divisor is 1. We sometimes get this through SimplifyDemandedVectorElts, but not always. Definitely seems like we don't if the NPQ fixup is used. Unfortunately, DAGCombiner is unable to fold srl X, <0, undef> to X so I had to add flags to avoid emitting the srl unless one of the shift amounts is non-zero. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D141022	2023-01-05 08:41:44 -08:00
OCHyams	eebfee8f9e	[DebugInfo][SelectionDAGISel] Do not drop all dbg.declares if one with empty metadata is found This error was introduced in 1d1de7467c32d52926ca56b9167a2c65c451ecfa (by me) about 1 month ago. Found while testing the D140901 patch stack. Reviewed By: jryans Differential Revision: https://reviews.llvm.org/D141052	2023-01-05 15:36:50 +00:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Diana Picus	6ee4f253b2	[GlobalISel] Add G_BUILD_VECTOR[_TRUNC] to CSE Add G_BUILD_VECTOR and G_BUILD_VECTOR_TRUNC to the list of opcodes in `shouldCSEOpc`. This simplifies the code generated for vector splats. Differential Revision: https://reviews.llvm.org/D140965	2023-01-05 10:15:31 +01:00
Diana Picus	22924bd48d	[GlobalISel] Don't switch opcodes in MIRBuilder::buildInstr At the moment, `MachineIRBuilder::buildInstr` may build an instruction with a different opcode than the one passed in as parameter. This may cause confusion for its consumers, such as `CSEMIRBuilder`, which will memoize the instruction based on the new opcode, but will search through the memoized instructions based on the original one (resulting in missed CSE opportunities). This is all the more unpleasant since buildInstr is virtual and may call itself recursively both directly and via buildCast, so it's not always easy to follow what's going on. This patch simplifies the API of `MachineIRBuilder` so that the `buildInstr` method does the least surprising thing (i.e. builds an instruction with the specified opcode) and only the convenience `buildX` methods (`buildMerge` etc) are allowed freedom over which opcode to use. This can still be confusing (e.g. one might write a unit test using `buildBuildVectorTrunc` but instead get a plain `G_BUILD_VECTOR`), but at least it's explained in the comments. In practice, this boils down to 3 changes: * `buildInstr(G_MERGE_VALUES)` will no longer call itself with `G_BUILD_VECTOR` or `G_CONCAT_VECTORS`; this functionality is moved to `buildMerge` and replaced with an assert; * `buildInstr(G_BUILD_VECTOR_TRUNC)` will no longer call itself with `G_BUILD_VECTOR`; this functionality is moved to `buildBuildVectorTrunc` and replaced with an assert; * `buildInstr(G_MERGE_VALUES)` will no longer call `buildCast` and will instead assert if we're trying to merge a single value; no change is needed in `buildMerge` since it was already asserting more than one source operand. This change is NFC for users of the `buildX` methods, but users that call `buildInstr` with relaxed parameters will have to update their code (such instances will hopefully be easy to find thanks to the asserts). Differential Revision: https://reviews.llvm.org/D140964	2023-01-05 10:02:39 +01:00
Craig Topper	f8751b8ee6	[TargetLowering] Remove stale FIXME. NFC This was implemented for scalars in D140750.	2023-01-04 18:40:42 -08:00
Craig Topper	3f749a5d9d	[Support][SelectionDAG][GlobalISel] Hoist PostShift adjustment for IsAdd into UnsignedDivideUsingMagic. Instead of doing the adjustment in 3 different places in the code base, do it inside UnsignedDivideUsingMagic::get. Differential Revision: https://reviews.llvm.org/D141014	2023-01-04 15:18:12 -08:00
Roman Lebedev	2a43a4478c	[NFCI][DAGCombiner] `foldExtendVectorInregToExtendOfSubvector()`: just build new VT Changing element type seems to not play well with non-simple types, even though we are operating on EVT's here.	2023-01-05 01:33:24 +03:00
Roman Lebedev	41005b7ab2	[DAGCombiner] Do try to combine `ISD::ANY_EXTEND_VECTOR_INREG` nodes These weren't previously getting combined at all here, only in target-specific combines.	2023-01-05 01:12:31 +03:00
Roman Lebedev	317a1adfe4	[DAGCombiner] Fold _EXTEND_INREG of one of CONCAT_VECTORS operands into _EXTEND of operand This appears to be the root problematic pattern for AArch64 regression in D140677. We already do this, and many more, as target-specific X86 combines, so this isn't causing much of an impact.	2023-01-05 01:12:31 +03:00
Roman Lebedev	846d06c707	[DAG] `tryToFoldExtendOfConstant()`: `sext undef` is not `undef` https://alive2.llvm.org/ce/z/cLGpWV, but https://alive2.llvm.org/ce/z/TGNH4P	2023-01-04 22:42:43 +03:00
Philip Reames	9560ac3a25	[MachineCombine] Reorganize code for readability and tracing [nfc]	2023-01-04 10:47:39 -08:00
Craig Topper	8bca60fb0a	[SelectionDAG][GlobalISel] Don't use UnsignedDivisionByConstantInfo for divisor of 1. The magic algorithm sets IsAdd indication for division by 1 that the caller had to ignore. I considered folding the ignore into UnsignedDivisionByConstantInfo, but we only allow 1 for vectors of mixed visiors. And really what we want to end up with is undef. Currently, we get to undef via DemandedElts optimizations using the select instruction. We could directly emit undef. Differential Revision: https://reviews.llvm.org/D140940	2023-01-04 10:01:15 -08:00
Jay Foad	6f7ff9b933	[MC] Consistently use MCInstrDesc::getImplicitUses and getImplicitDefs. NFC.	2023-01-04 13:16:12 +00:00
Yeting Kuo	1e9e1b9cf8	[VP][RISCV] Add vp.ctlz/cttz and RISC-V support. The patch also adds expandVPCTLZ and expandVPCTTZ to expand vp.ctlz/cttz nodes and the cost model of vp.ctlz/cttz. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140370	2023-01-04 15:15:01 +08:00
Craig Topper	84daed7fd4	[SelectionDAG][GlobalISel] Move even divisor optimization for division by constant into UnsignedDivideUsingMagic implementation. NFC I've added a bool to UnsignedDivideUsingMagic so we can continue testing it in the unit test with and without this optimization in the unit test. This is a step towards supporting "uncooperative" odd divisors. See https://ridiculousfish.com/blog/posts/labor-of-division-episode-iii.html Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D140924	2023-01-03 16:34:13 -08:00
Samuel Parker	615333bc09	[TypePromotion] NewPM support. Differential Revision: https://reviews.llvm.org/D140893	2023-01-03 15:09:29 +00:00
chenglin.bi	a0b470c984	[TypePromotion] Add truncate in ConvertTruncs when the original truncate type is not extend type If the src type is not extend type, after convert the truncate to and we need to truncate the and also to make sure the all user is legal. The old fix D137613 doesn't work when the truncate convert to and have the other users. So this time I try to add the truncate after and to avoid all these potential issues. Fix: #59554 Reviewed By: samparker Differential Revision: https://reviews.llvm.org/D140869	2023-01-03 18:13:20 +08:00
Roman Lebedev	4fc417ec37	[DAGCombiner] `convertBuildVecZextToBuildVecWithZeros()`: rework split factor calculation The original computation was both making assumptions that do not hold in practice, and being overly pessimistic. We should just check every possible split factor, and pick the best one. Fixes https://github.com/llvm/llvm-project/issues/59781	2023-01-02 18:34:35 +03:00
Roman Lebedev	1337821f11	[DAGCombiner][X86] Fold a CONCAT_VECTORS of SHUFFLE_VECTOR and it's operand into wider SHUFFLE_VECTOR This was showing as a source of many regressions with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.	2023-01-01 23:18:42 +03:00
Roman Lebedev	16facf1ca6	[DAGCombiner][TLI] Do not fuse bitcast to <1 x ?> into a load/store of a vector Single-element vectors are legalized by splitting, so the the memory operations would also get scalarized. While we do have some support to reconstruct scalarized loads, we clearly don't catch everything. The comment for the affected AArch64 store suggests that having two stores was the desired outcome in the first place. This was showing as a source of many regressions with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.	2022-12-31 03:49:43 +03:00
Roman Lebedev	603e849072	[NFC][TLI] Move `isLoadBitCastBeneficial()` implementation into source file ... so any change to it does not cause 700 source files to be recompiled.	2022-12-31 02:07:50 +03:00
Roman Lebedev	e4d25a9c23	[DAG] BUILD_VECTOR: absorb ZERO_EXTEND of a single first operand if all other ops are zeros This kind of pattern seems to come up as regressions with better ZERO_EXTEND_VECTOR_INREG recognition. For initial implementation, this is quite restricted to the minimal viable transform, otherwise there are too many regressions to be dealt with.	2022-12-31 00:58:11 +03:00
Vitaly Buka	6f3400e380	Revert "[CodeGen] Temporarily disable-lsr in HWASAN build" We can do the same with cmake on the bot. This reverts commit 8f70b848d339cabfaa8f1379d41dae11b9b75014.	2022-12-30 10:57:49 -08:00
Filipp Zhinkin	98265db84c	[ScheduleDAG] Support REQ_SEQUENCE unscheduling REG_SEQUENCE node requires special treatment during the unscheduling because the node is untyped and neither its class, nor cost could be retrieved the same way as for typed nodes. Related issue: https://github.com/llvm/llvm-project/issues/58911 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D138837	2022-12-30 15:17:11 +04:00
Vitaly Buka	8f70b848d3	[CodeGen] Temporarily disable-lsr in HWASAN build HWASAN exposes some non-determinism in the pass and triggers: ScalarEvolution.cpp:11540: bool llvm::ScalarEvolution::isLoopEntryGuardedByCond(const Loop , ICmpInst::Predicate, const SCEV , const SCEV *): Assertion `isAvailableAtLoopEntry(LHS, L) && "LHS is not available at Loop Entry"' failed. E.g. https://lab.llvm.org/buildbot/#/builders/236/builds/1629/steps/16/logs/stdio is broken after D137838. I tried to split D137838 into smaller patches and the one which reproduced was just a move of cpp from one dir to another. Maybe it has something do to with comparison of tagged pointeres and PtrSets used in pass. Issues is hard to reproduce, even slight changes in path, or preprocessing cpp file hide it.	2022-12-29 23:37:49 -08:00
Dmitry Borisenkov	0ec51a460a	DAG: Prevent store value forwarding to distinct addrspace load DAGCombiner replaces (load const_addr1) directly chained with (store (val, const_addr2)) with val if address space stripped const_addr1 == const_addr2. The patch fixes the issue by checking address spaces as well. However, it might makes sense to not to chain together side effects that belong to different address spaces in the first place and make SelectionDAG::root address space aware.	2022-12-29 18:19:55 -05:00
Roman Lebedev	248567a327	[DAGCombiner] Try to partition ISD::EXTRACT_VECTOR_ELT to accomodate it's ISD::BUILD_VECTOR users This mainly cleans up a few patterns that are legalized by scalarization from a wide-element vector, but then are further split apart to build a more narrow-sized-element vector. In particular this happens in some cases for illegal ISD::ZERO_EXTEND_VECTOR_INREG. Given a ISD::EXTRACT_VECTOR_ELT, which is a glorified bit sequence extract, recursively analyse all of it's users. and try to model themselves as bit sequence extractions. If all of them agree on the new, narrower element type, and all of them can be modelled as ISD::EXTRACT_VECTOR_ELT's of that new element type, do that, but only if unmodelled users are ISD::BUILD_VECTOR.	2022-12-30 01:15:53 +03:00
Craig Topper	8abd70081f	[TargetLowering] Teach BuildUDIV to take advantage of leading zeros in the dividend. If the dividend has leading zeros, we can use them to reduce the size of the multiplier and avoid the fixup cases. This patch is for scalars only, but we might be able to do this for vectors in a follow up. Differential Revision: https://reviews.llvm.org/D140750	2022-12-29 13:58:46 -08:00
Markus Böck	8f8313d533	[llvm][AsmPrinter][NFC] Cleanup `GCMetadataPrinters` field The field is currently `void`, which was originlly chosen in 2010 to not need to include `DenseMap`. Since then, `DenseMap` has been included in the header file anyways, so there is no more need to for the indirection via `void` and the cruft around it can be removed. Differential Revision: https://reviews.llvm.org/D140758	2022-12-29 20:47:45 +01:00
Roman Lebedev	c4f815d705	[DAGCombine] `combineShuffleToZeroExtendVectorInReg()`: widen shuffle elements before trying to match We might have sunk a bitcast into shuffle, and now it might be operating on more fine-grained elements than what we'd match, so we must not be dependent on whatever the granularity the shuffle happened to be in, but transform it into the one canonical for us - with widest elements.	2022-12-27 00:47:45 +03:00
Roman Lebedev	e26e7ed69a	[DAG] `combineShuffleToZeroExtendVectorInReg()`: try to match w/ commuted operands We don't have any reason to expect that the operand we will match is on any particular hand of the shuffle, so we should try both.	2022-12-26 22:54:03 +03:00
Roman Lebedev	62fc5f1640	[DAGCombiner] Add a most basic `combineShuffleToZeroExtendVectorInReg()` Sometimes we end up with a shuffles in DAG that would be better represented as a `ISD::ZERO_EXTEND_VECTOR_INREG`, and a failure to do so causes suboptimal codegen in a number of cases, especially when we will then cast vector to scalar. I acknowledge, the test changes here are rather underwhelming, but as with all of codegen, it's always a yak shawing, and this is the most stripped down version of the patch that shows some effect without having insurmountable amount of fallout to deal with. The next change resolves this regression. The transformation will be extended in follow-ups.	2022-12-26 22:54:03 +03:00
Danila Malyutin	821a59588b	[TwoAddressInstruction] Constrain RegClass when processing a statepoint This transformation could've triggered a verifier assert if RegA and RegB were of different reg classes. Fix this by constraining as the comment for replaceRegWith suggests. Differential Revision: https://reviews.llvm.org/D140672	2022-12-26 19:00:34 +03:00
Roman Lebedev	2f6aef52f2	[NFC][DAGCombiner] `canCombineShuffleToAnyExtendVectorInreg()`: take matcher as callback	2022-12-26 03:56:58 +03:00
Roman Lebedev	84ea72679e	[NFC][DAG] `canCombineShuffleToAnyExtendVectorInreg()`: check for legal op before matching Likewise as with legal types check, might as well not match if won't use.	2022-12-26 01:43:49 +03:00
Roman Lebedev	2999c45050	[NFC][DAGCombiner] Extract `canCombineShuffleToAnyVectorExtendInReg()` helper Adding zero-ext support isn't as straight-forward, and it's easier to to so in a new function, but this helper is useful there. This does not change any existing behaviour.	2022-12-26 01:04:47 +03:00
Roman Lebedev	6aa7359387	[NFC][DAG] `combineShuffleToVectorExtend()`: check that the type is legal first There is no point in doing any of the potentially-costly matching if we will inevitably give up anyway.	2022-12-26 01:03:59 +03:00
Stephen Tozer	c290a8b7a4	[DebugInfo] Fix: Variables that have no non-empty values being emitted when they have a DBG_VALUE_LIST This patch fixes a simple bug where `DbgValueHistoryMap::hasNonEmptyLocation` was incorrectly handling DBG_VALUE_LIST instructions, treating empty values as non-empty, causing empty variables to be emitted into DWARF. Reviewed By: Orlando Differential Revision: https://reviews.llvm.org/D133925	2022-12-25 13:28:27 -08:00
Vitaly Buka	83d4851436	Revert "[DebugInfo] Variables with only empty values emitting when one is variadic" Breaks HWASAN somehow. Fails at def915c39cc4e18b304c7a8c4761cc4531c3bc4b https://lab.llvm.org/buildbot/#/builders/236/builds/1547 Pass at def915c39cc4e18b304c7a8c4761cc4531c3bc4b^ https://lab.llvm.org/buildbot/#/builders/236/builds/1529 This reverts commit def915c39cc4e18b304c7a8c4761cc4531c3bc4b.	2022-12-23 21:57:53 -08:00
Roman Lebedev	03e848293e	[DAGCombiner] `visitFREEZE()`: fix cycle breaking Depending on the particular DAG, we might either create a `freeze`, or not. And only in the former case, the cycle would be formed. It would be nicer to have `ReplaceAllUsesOfValueWithIf()`, like we have in IR, but we don't have that. Fixes https://github.com/llvm/llvm-project/issues/59677	2022-12-23 18:16:22 +03:00
Roman Lebedev	d8f541efe7	[DAGCombiner] `visitFREEZE()`: fix handling of no maybe-poison ops The original code was confusing. It was stripping poison-generating flags, but the comments were saying that doing so was a TODO. If the poison-generating flags are present, then even if all operands are guaranteed not to be undef or poison, the whole operation may still produce undef or poison. We can still deal with that case, and we already do deal with it in fact, by also dropping those flags. Refs. https://github.com/llvm/llvm-project/issues/59676	2022-12-23 17:26:05 +03:00
Roman Lebedev	d7a63a0421	[DAGCombiner] `visitFREEZE()`: restore previous behaviour on no maybe-poison operands Lack of such operands implies that the op might be poison-producing due to it's flags. We seem to drop them already, but the comments are confusing. Fixes https://github.com/llvm/llvm-project/issues/59676	2022-12-23 17:26:05 +03:00
Roman Lebedev	6fea27662d	[DAGCombiner] `visitFREEZE()`: be less greedy with replacing other uses of undef	2022-12-23 02:26:36 +03:00
Roman Lebedev	f738ab9075	[DAGCombiner] `visitFREEZE()`: allow multiple maybe-poison operands for `BUILD_VECTOR`	2022-12-23 02:26:36 +03:00
Roman Lebedev	1234754bbc	[DAGCombine] `BUILD_VECTOR` can not create undef or poison	2022-12-23 02:26:36 +03:00
Roman Lebedev	114cc45a09	[NFC][DAGCombiner] `visitFREEZE()`: use early return	2022-12-23 02:26:36 +03:00

1 2 3 4 5 ...

33436 Commits