llvm-project

Author	SHA1	Message	Date
Noah Goldstein	7013638978	[DAG] Add support for `nneg` flag with `uitofp` Copy `nneg` flag when building `UINT_TO_FP` from `uitofp` and use `nneg` flag in the one place we transform `UINT_TO_FP` -> `SINT_TO_FP` if the operand is non-negative.	2024-04-09 23:06:55 -05:00
Qiu Chaofan	71eda17a06	[Legalizer] Soften EXTRACT_ELEMENT on ppcf128 (#77412 ) ppc_fp128 values are always split into two f64. Implement soften operation in soft-float mode to handle output f64 correctly.	2024-04-09 10:26:24 +08:00
David Green	9fd2e2c2fd	[DAG][AArch64] Support masked loads/stores with nontemporal flags (#87608 ) SVE has some non-temporal masked loads and stores. The metadata coming from the nodes is not copied to the MMO at the moment though, meaning it will generate a normal instruction. This patch ensures that the right flags are set if the instruction has non-temporal metadata.	2024-04-08 08:53:27 +01:00
Jay Foad	1b761205f2	[APInt] Add a simpler overload of multiplicativeInverse (#87610 ) The current APInt::multiplicativeInverse takes a modulus which can be any value, but all in-tree callers use a power of two. Moreover, most callers want to use two to the power of the width of an existing APInt, which is awkward because 2^N is not representable as an N-bit APInt. Add a new overload of multiplicativeInverse which implicitly uses 2^BitWidth as the modulus.	2024-04-04 16:11:06 +01:00
Piotr Sobczak	5b59ae423a	[DAG] Preserve NUW when reassociating (#87621 ) Similarly to the generic case below, preserve the NUW flag when reassociating adds with constants.	2024-04-04 16:47:25 +02:00
Simon Pilgrim	2d0087424f	[DAG] Remove extract_vector_elt(freeze(x)), idx -> freeze(extract_vector_elt(x), idx) fold (#87480 ) Reverse the fold with handling inside canCreateUndefOrPoison for cases where we know that the extract index is in bounds. This exposed a number or regressions, and required some initial freeze handling of SCALAR_TO_VECTOR, which will require us to properly improve demandedelts support to handle its undef upper elements. There is still one outstanding regression to be addressed in the future - how do we want to handle folds involving frozen loads? Fixes #86968	2024-04-04 11:10:55 +01:00
Simon Pilgrim	a9d963fdf8	[DAG] SoftenFloatResult - add clang-format off/on tags around switch statement. NFC. Stop clang-format from trying to put all the case on separate lines.	2024-04-04 11:02:02 +01:00
Luke Lau	3a7b5223a6	[DAGCombiner][RISCV] Handle truncating splats in isNeutralConstant (#87338 ) On RV64, we legalize zexts of i1s to (vselect m, (splat_vector i64 1), (splat_vector i64 0)), where the splat_vectors are implicitly truncating. When the vselect is used by a binop we want to pull the vselect out via foldSelectWithIdentityConstant. But because vectors with an element size < i64 will truncate, isNeutralConstant will return false. This patch handles truncating splats by getting the APInt value and truncating it. We almost don't need to do this since most of the neutral elements are either one/zero/all ones, but it will make a difference for smax and smin. I wasn't able to figure out a way to write the tests in terms of select, since we need the i1 zext legalization to create a truncating splat_vector. This supercedes #87236. Fixed vectors are unfortunately not handled by this patch (since they get legalized to _VL nodes), but they don't seem to appear in the wild.	2024-04-04 12:36:15 +08:00
Jay Foad	a6170d5b7e	[SelectionDAG] Dump convergencectrl_glue DAG node (#87487 )	2024-04-03 16:21:57 +01:00
Simon Pilgrim	39eedfded4	[DAG] visitADDLikeCommutative - convert (add x, shl(0 - y, n)) fold to SDPatternMatch. NFC.	2024-04-03 15:37:38 +01:00
aniplcc	d650fcd6bf	[DAG] SimplifyDemandedVectorElts - add ISD::AVGCEILS/AVGCEILU/AVGFLOORS/AVGFLOORU nodes (#86284 ) Fixes #84768	2024-04-03 15:00:50 +01:00
AinsleySnow	52b18430ae	[VP][DAGCombine] Use `simplifySelect` when combining vp.select. (#87342 ) Hi all, This patch is a follow-up of #79101. It migrates logic from `visitVSELECT` to `visitVP_SELECT` to simplify `vp.select`. With this patch we can do the following combinations: ``` vp.select undef, T, F --> T (if T is a constant), F otherwise vp.select <condition>, undef, F --> F vp.select <condition>, T, undef --> T vp.select false, T, F --> F vp.select <condition>, T, T --> T ``` I'm a total newbie to llvm and I'm sure there's room for improvements in this patch. Please let me know if you have any advice. Thank you in advance!	2024-04-03 07:45:50 -04:00
Prabhuk	212b1a84a6	[CallSiteInfo][NFC] CallSiteInfo -> CallSiteInfo.ArgRegPairs (#86842 ) CallSiteInfo is originally used only for argument - register pairs. Make it struct, in which we can store additional data for call sites. Also, the variables/methods used for CallSiteInfo are named for its original use case, e.g., CallFwdRegsInfo. Refactor these for the upcoming use, e.g. addCallArgsForwardingRegs() -> addCallSiteInfo(). An upcoming patch will add type ids for indirect calls to propogate them from middle-end to the back-end. The type ids will be then used to emit the call graph section. Original RFC: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151044.html Updated RFC: https://lists.llvm.org/pipermail/llvm-dev/2021-July/151739.html Differential Revision: https://reviews.llvm.org/D107109?id=362888 Co-authored-by: Necip Fazil Yildiran <necip@google.com>	2024-04-02 13:05:16 -07:00
Atousa Duprat	4aba595f09	[ADT] Add signed and unsigned mulh to APInt (#84719 ) Fixes #84207	2024-04-02 17:07:56 +01:00
Il-Capitano	0ef7437780	[SelectionDAG][Statepoint] Fix truncation of `gc.statepoint` ID argument (#85908 ) The ID argument of `gc.statepoint` gets incorrectly truncated to 32 bits during code generation. This is fixed by using `uint64_t` instead of `unsigned` for the `ID` member in `SelectionDAGBuilder::StatepointLoweringInfo`, and a `patchpoint` test case is extended to check for 64 bit ID generation in stackmaps.	2024-04-02 09:28:19 -04:00
Sizov Nikita	6654235594	[SelectionDAG] implement computeKnownBits for add AVG* instructions (#86754 ) knownBits calculation for AVGFLOORU / AVGFLOORS / AVGCEILU / AVGCEILS instructions Prerequisite for #76644	2024-04-02 10:39:49 +01:00
Vitaly Buka	c4df57da1d	[CodeGen] llvm.allow.{runtime,ubsan}.check() in FastISel Follow up to #86049. clang-armv8-quick build bot can trigger this branch.	2024-03-31 23:39:33 -07:00
Sameer Sahasrabuddhe	421557974a	[AMDGPU] Use glue for convergence tokens at call-like operations (#86766 ) The earlier implementation on AMDGPU used explicit token operands at SI_CALL and SI_CALL_ISEL. This is now replaced with CONVERGENCECTRL_GLUE operands, with the following effects: - The treatment of tokens at call-like operations is now consistent with the treatment at intrinsics. - Support for tail calls using implicit tokens at SI_TCRETURN "just works". - The extra parameter at call-like instructions is eliminated, thus restoring those instructions and their handling to the original state. The new glue node is placed after the existing glue node for the outgoing call parameters, which seems to not interfere with selection of the call-like nodes.	2024-04-01 10:51:13 +05:30
Vitaly Buka	20f56e1f8e	[CodeGen] Add default lowering for llvm.allow.{runtime,ubsan}.check() (#86049 ) RFC: https://discourse.llvm.org/t/rfc-add-llvm-experimental-hot-intrinsic-or-llvm-hot/77641	2024-03-31 22:19:33 -07:00
Wang Pengcheng	610b9e23c5	[SDAG] Use shifts if ISD::MUL is illegal when lowering ISD::CTPOP (#86505 ) We can avoid libcalls. Fixes #86205	2024-03-29 15:38:39 +08:00
Jonas Paulsson	94b5c118b3	[ISel] Move handling of atomic loads from SystemZ to DAGCombiner (NFC). (#86484 ) The folding of sign/zero extensions into an atomic load by specifying an extension type is not target specific, and therefore belongs in the DAGCombiner rather than in the SystemZ backend. - Handle atomic loads similarly to regular loads by adding AtomicLoadExtActions with set/get methods. - Move SystemZ extendAtomicLoad() to DagCombiner.cpp.	2024-03-28 16:14:35 +01:00
Luke Lau	856e815ca1	[DAGCombiner] Set disjoint flag in add->or and xor->or combines (#86925 ) We check DAG.haveNoCommonBitsSet so the operands will be known to be disjoint. I couldn't think of a codegen test case since most targets aren't checking hasDisjoint yet, apart from RISCV in the or_is_add pattern, but it also falls back to computeKnownBits.	2024-03-28 18:08:59 +08:00
Craig Topper	acab142751	[LegalizeDAG] Freeze index when converting insert_elt/insert_subvector to load/store on stack. We try clamp the index to be within the bounds of the stack object we create, but if we don't freeze it, poison can propagate into the clamp code. This can cause the access to leave the bounds of the stack object. We have other instances of this issue in type legalization and extract_elt/subvector, but posting this patch first for direction check. Fixes #86717	2024-03-27 13:01:23 -07:00
Craig Topper	1c965801c4	[LegalizeDAG] Merge PerformInsertVectorEltInMemory into ExpandInsertToVectorThroughStack. NFC (#86755 ) These functions are very similar. We can share them like we do for EXTRACT_VECTOR_ELT and EXTRACT_SUBVECTOR.	2024-03-27 09:39:35 -07:00
Simon Pilgrim	9247f3185c	[DAG] foldAddSubOfSignBit - reuse existing SDLoc instead of regenerating it. NFC.	2024-03-27 12:22:31 +00:00
Simon Pilgrim	51388fbab1	[DAG] visitSub - reuse existing SDLoc instead of regenerating it. NFC.	2024-03-27 12:22:30 +00:00
Craig Topper	09155ac290	[LegalizeDAG] Remove unneeded temporary SDValues from PerformInsertVectorEltInMemory. NFC There were 3 temporaries that just renamed the 3 well name arguments to the function to Tmp1-3. Looks like this was done when the code was extracted from elsewhere into a separate function 15 years ago.	2024-03-26 16:25:24 -07:00
Emil Pedersen	0e5c504d3d	[DebugInfo] [SelectionDAG] Fix handling of duplicate dbg values (#86598 ) Before this fix, a duplicate llvm.dbg.value intrinsic referring to an argument, after an alloca, would be generated with `$noreg`, losing debug information. Instead, we silently drop the second debug info, so it doesn't break the first one. rdar://125375717	2024-03-26 12:09:22 -07:00
Simon Pilgrim	1c9d5c25ae	[DAG] foldAddSubBoolOfMaskedVal - reuse existing SDLoc instead of regenerating it. NFC.	2024-03-26 18:33:30 +00:00
Il-Capitano	308ed0233a	[Intrinsics] Make `patchpoint.i64` generic on its return type (#85911 ) Currently patchpoints can only have two result types, `void` and `i64`. This limits the result to general purpose registers. This patch makes `patchpoint.i64` an overloadable intrinsic, allowing result values that can fit in a single register (e.g. integers, pointers, floats).	2024-03-26 19:08:52 +05:30
Simon Pilgrim	5fc619b5ee	[DAG] Update ISD::AVG folds to use hasOperation to allow Custom matching prior to legalization Fixes issue where AVX1 targets weren't matching 256-bit AVGCEILU cases.	2024-03-26 10:41:07 +00:00
Simon Pilgrim	c7198e0af3	[DAG] Fold insert_subvector(N0, extract_subvector(N0, N2), N2) --> N0 (#86487 ) Handle the case where we've ended up inserting back into the source vector we extracted the subvector from.	2024-03-26 10:03:42 +00:00
AtariDreams	f5a067bb90	[SelectionDAG]: Deduce KnownNeverZero from SMIN and SMAX (#85722 )	2024-03-25 10:35:28 +00:00
houndlord	9632e1515c	Match fixed width ISD::AVGFLOORS + ISD::AVGCEILS patterns (#86222 )	2024-03-24 15:33:16 +00:00
Owen Anderson	7c9b5228da	Only check assertions that were meant to apply to the normal case of non-splat vector SREM expansion when we aren't hitting the special case. (#86238 ) Fixes https://github.com/llvm/llvm-project/issues/84830 Introduced in https://github.com/llvm/llvm-project/pull/82706	2024-03-23 21:49:29 -05:00
Harvin Iriawan	57146daeaa	[CodeGen] Update for scalable MemoryType in MMO (#70452 ) Remove getSizeOrUnknown call when MachineMemOperand is created. For Scalable TypeSize, the MemoryType created becomes a scalable_vector. 2 MMOs that have scalable memory access can then use the updated BasicAA that understands scalable LocationSize. Original Patch by Harvin Iriawan Co-authored-by: David Green <david.green@arm.com>	2024-03-23 12:56:25 +00:00
Yingwei Zheng	6c1932ffd8	[LLVM] Pass APInt by const reference. NFC. (#86278 ) This patch adjusts argument passing for `APInt` to improve the compile-time. Compile-time improvement: https://llvm-compile-time-tracker.com/compare.php?from=d1f182c895728d89c5c3d198b133e212a5d9d4a3&to=32d6611af69bf4e76373f9bc7d9649650f760e48&stat=instructions:u	2024-03-23 14:57:35 +08:00
Simon Pilgrim	ceabaa7e7a	[DAG] Fix some missing formatting when I rewrote the SUB(MAX,MIN) -> ABD patterns. NFC.	2024-03-22 11:48:03 +00:00
XChy	cb4453dc69	[SelectionDAG] Prevent combination on inconsistent type in `combineCarryDiamond` (#84888 ) Fixes #84831 When matching carry pattern with `getAsCarry`, it may produce different type of carryout. This patch checks such case and does early exit. I'm new to DAG, any suggestion is appreciated.	2024-03-22 16:05:20 +05:30
Craig Topper	c67ed2f1e1	[SelectionDAG][RISCV] Use TypeSize version of ComputeValueVTs in TargetLowering::LowerCallTo. (#86166 ) This is needed to support non-intrinsic functions returning tuple types which are represented as structs with scalable vector types in IR. I suspect this may have been broken since https://reviews.llvm.org/D158115	2024-03-21 20:35:08 -07:00
Simon Pilgrim	6942927609	[DAG] combineConcatVectorOfScalars - stop always creating UNDEF nodes. NFC. Noticed in debug logs - most calls to visitVECTOR_SHUFFLE resulted into wasteful UNDEF node creations, despite almost never being used.	2024-03-21 16:37:48 +00:00
Simon Pilgrim	e4fa2e3562	[DAG] isGuaranteedNotToBeUndefOrPoisonForTargetNode - add fallback implementation (#86125 ) Allow targets to rely on TargetLowering::isGuaranteedNotToBeUndefOrPoisonForTargetNode to test nodes for canCreateUndefOrPoisonForTargetNode + all arguments are isGuaranteedNotToBeUndefOrPoison. Targets can still perform this themselves for specific special case nodes (e.g. target shuffles). Matches the fallback in SelectionDAG::isGuaranteedNotToBeUndefOrPoison	2024-03-21 15:11:59 +00:00
AtariDreams	7e72cafd68	[SelectionDAG] Add MaskedValueIsZero check to allow folding of zero extended variables we know are safe to extend (#85573 ) Add ones for every high bit that will cleared. This will allow us to evaluate variables that have their bits known to see if they have no risk of overflow despite the shift amount being greater than the difference between the two types.	2024-03-21 16:45:17 +05:30
Simon Pilgrim	23de3862dc	[DAG] visitSUB - use sd_match to match SUB(MAX,MIN) -> ABD pattern. NFC. Seriously simplifies the commutation matching logic.	2024-03-21 09:55:50 +00:00
Simon Pilgrim	11aa95f83b	[DAG] visitSUB - pull out repeated getScalarSizeInBits() calls. NFC.	2024-03-21 09:55:50 +00:00
Simon Pilgrim	7b5a5be2a7	[DAG] visitSUB/visitSUBO - move getAsNonOpaqueConstant into the if() where its used. NFC. Noticed while beginning some cleanup for moving to pattern matchers	2024-03-21 09:19:12 +00:00
Benjamin Kramer	5f5a64134b	Revert "[DAGCombiner] Simplifying `{si\|ui}tofp` when only signbit is needed" This reverts commit 353fbeb0a294d2c7cef6d88607fa0fd50ee81462. It crashes when it encounters an UINT_TO_FP. llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:1618 in SDValue llvm::SelectionDAG::getConstant(const ConstantInt &, const SDLoc &, EVT, bool, bool): VT.isInteger() && "Cannot create FP integer constant!"	2024-03-20 15:08:37 +01:00
Stephen Tozer	bdc77d1ecc	[RemoveDIs][NFC] Rename DPLabel->DbgLabelRecord (#85918 ) This patch renames DPLabel to DbgLabelRecord, in accordance with the ongoing DbgRecord rename. This rename was fairly trivial, since DPLabel isn't as widely used as DPValue and has no real conflicts in either its full or abbreviated name. As usual, the entire replacement was done automatically, with `s/DPLabel/DbgLabelRecord/` and `s/DPL/DLR/`.	2024-03-20 13:11:28 +00:00
Noah Goldstein	353fbeb0a2	[DAGCombiner] Simplifying `{si\|ui}tofp` when only signbit is needed If we only need the signbit `uitofp` simplified to 0, and `sitofp` simplifies to `bitcast`. Closes #85138	2024-03-19 17:17:35 -05:00
Simon Pilgrim	2377b9773d	[DAG] SimplifyShift - shift i1/vXi1 X, Y --> X (any non-zero shift amount is undefined). Alive2: https://alive2.llvm.org/ce/z/SdESbg Fixes #85681	2024-03-19 20:18:37 +00:00

1 2 3 4 5 ...

13414 Commits