llvm-project

Author	SHA1	Message	Date
Kazu Hirata	1e4e1ceeeb	[Target] Avoid repeated hash lookups (NFC) (#108677 )	2024-09-14 07:39:09 -07:00
David Green	637aa61732	[ARM] Fix VBICimm and VORRimm generation under Big endian. (#107813 ) This is a smaller follow on to #105519 that fixes VBICimm and VORRimm too. The logic behind lowering vector immediates under big endian Neon/MVE is to treat them in natural lane ordering (same as little endian), and VECTOR_REG_CAST them to the correct type (as opposed to creating the constants in big endian form and bitcasting them). This makes sure that is done when creating VORRIMM and VBICIMM.	2024-09-13 10:59:57 +01:00
Austin	3242e77841	[ARM][Codegen] Fix vector data miscompilation in arm32be (#105519 ) Fix #102418， resolved the issue of generating incorrect vrev during vectorization in big-endian scenarios	2024-09-07 14:09:29 +08:00
Nikita Popov	a7697c8655	[ARM] Do not assume alignment in vld1xN and vst1xN intrinsics (#106984 ) These intrinsics currently assume natural alignment. Instead, respect the alignment attribute on the intrinsic. Teach InstCombine to improve that alignment. If desired I could also adjust the clang frontend to add alignment annotations equivalent to the previous behavior, but I don't see any indication that such an assumption is correct in the ARM intrinsics docs. Fixes https://github.com/llvm/llvm-project/issues/59081.	2024-09-05 09:26:53 +02:00
Oliver Stannard	9cf68679c4	[ARM] Fix failure to register-allocate CMP_SWAP_64 pseudo-inst (#106721 ) This test case was failing to compile with a "ran out of registers during register allocation" error at -O0. This was because CMP_SWAP_64 has 3 operands which must be an even-odd register pair, and two other GPR operands. All of the def operands are also early-clobber, so registers can't be shared between uses and defs. Because the function has an over-aligned alloca it needs frame and base pointers, so r6 and r11 are both reserved. That leaves r0/r1, r2/r3, r4/r5 and r8/r9 as the only valid register pairs, and if the two individual GPR operands happen to get allocated to registers in different pairs then only 2 pairs will be available for the three GPRPair operands. To fix this, I've merged the two GPR operands into a single GPRPair operand. This means that the instruction now has 4 GPRPair operands, which can always be allocated without relying on luck. This does constrain register allocation a bit more, but this pseudo instruction is only used at -O0, so I don't think that's a problem.	2024-09-02 08:54:10 +01:00
Sergei Barannikov	4d7a0abae8	[DataLayout] Change return type of `getStackAlignment` to `MaybeAlign` (#105478 ) Currently, `getStackAlignment` asserts if the stack alignment wasn't specified. This makes it inconvenient to use and complicates testing. This change also makes `exceedsNaturalStackAlignment` method redundant.	2024-08-27 22:59:33 +03:00
Kiran	c50d11e6d9	Revert "[ARM] musttail fixes" committed by accident, see #104795 This reverts commit a2088a24dad31ebe44c93751db17307fdbe1f0e2.	2024-08-27 11:17:17 +01:00
Kiran	a2088a24da	[ARM] musttail fixes Backend: - Caller and callee arguments no longer have to match, just to take up the same space, as they can be changed before the call - Allowed tail calls if callee and callee both (or neither) use sret, wheras before it would be dissalowed if either used sret - Allowed tail calls if byval args are used - Added debug trace for IsEligibleForTailCallOptimisation Frontend (clang): - Do not generate extra alloca if sret is used with musttail, as the space for the sret is allocated already Change-Id: Ic7f246a7eca43c06874922d642d7dc44bdfc98ec	2024-08-27 10:44:06 +01:00
Craig Topper	4b0c0ec6b8	[CodeGen] Use MCRegister for CCState::AllocateReg and CCValAssign::getReg. NFC (#106032 )	2024-08-26 11:40:25 -07:00
David Green	b9a0276550	[ARM] Add VECTOR_REG_CAST identity fold. v16i8 VECTOR_REG_CAST (v16i8 Op) can use v16i8 Op directly, as the VECTOR_REG_CAST is a noop.	2024-08-24 21:21:27 +01:00
Craig Topper	efa859cd3e	[ARM] Use SelectonDAG::getSignedConstant.	2024-08-17 18:02:41 -07:00
Simon Pilgrim	11ba72e651	[KnownBits] Add KnownBits::add and KnownBits::sub helper wrappers. (#99468 )	2024-08-12 10:21:28 +01:00
Kazu Hirata	f4fb735840	[llvm] Construct SmallVector<SDValue> with ArrayRef (NFC) (#102578 )	2024-08-09 09:15:42 -07:00
Oliver Stannard	50a2b31800	[ARM] Be more precise about conditions for indirect tail-calls (#102451 ) This code was trying to predict the conditions in which an indirect tail call will have a free register to hold the target address, and falling back to a non-tail call if all non-callee-saved registers are used for arguments or return address authentication. However, it was only taking the number of arguments into account, not which registers they are allocated to, so floating-point arguments could cause this to give the wrong result, causing either a later error due to the lack of a free register, or a missed optimisation of not doing the tail call. The assignments of arguments to registers is available at this point in the code, so we can calculate exactly which registers will be available for the tail-call.	2024-08-09 08:50:21 +01:00
Sergei Barannikov	411d31ad69	Partially revert 92e18ffd803365c64910760ba20278f875d93681 (#101673 ) It is likely to cause stage2 build failures: https://lab.llvm.org/buildbot/#/builders/122/builds/389 https://lab.llvm.org/buildbot/#/builders/79/builds/552 I don't have an ARM machine to investigate, so I'm just reverting ARM changes to see if it helps make the bots green again.	2024-08-02 16:38:31 +03:00
Sergei Barannikov	92e18ffd80	[SDag][ARM][RISCV] Allow lowering CTPOP into a libcall (#99752 ) The main change is adding CTPOP to `RuntimeLibcalls.def` to allow targets to use LibCall action for CTPOP. DAG legalizers are changed accordingly.	2024-08-02 12:29:39 +03:00
Joseph Huber	615b7eeaa9	Reapply "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit 740161a9b98c9920dedf1852b5f1c94d0a683af5. I moved the `ISD` dependencies into the CodeGen portion of the handling, it's a little awkward but it's the easiest solution I can think of for now.	2024-07-20 09:29:31 -05:00
NAKAMURA Takumi	740161a9b9	Revert "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit c05126bdfc3b02daa37d11056fa43db1a6cdef69. (llvmorg-19-init-17714-gc05126bdfc3b) See #99610	2024-07-20 12:36:57 +09:00
Joseph Huber	c05126bdfc	[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 ) Summary: The LTO pass and LLD linker have logic in them that forces extraction and prevent internalization of needed runtime calls. However, these currently take all RTLibcalls into account, even if the target does not support them. The target opts-out of a libcall if it sets its name to nullptr. This patch pulls this logic out into a class in the header so that LTO / lld can use it to determine if a symbol actually needs to be kept. This is important for targets like AMDGPU that want to be able to use `lld` to perform the final link step, but does not want the overhead of uncalled functions. (This adds like a second to the link time trivially)	2024-07-16 06:22:09 -05:00
Kazu Hirata	5e22a53698	[Target] Use range-based for loops (NFC) (#98705 )	2024-07-13 17:40:51 -07:00
Joseph Huber	3f1a767572	[LLVM] Factor disabled Libcalls into the initializer (#98421 ) Summary: These Libcalls represent which functions are available to the backend. If a runtime call is not available, the target sets the the name to `nullptr`. Currently, this logic is spread around the various targets. This patch pulls all of the locations that disable libcalls into the intializer. This patch is effectively NFC. The motivation behind this patch is that currently the LTO handling uses the list of all runtime calls to determine which functions cannot be internalized and must be extracted from static libraries. We do not want this to happen for libcalls that are not emitted by the backend. A follow-up patch will move out this logic so the LTO pass can know which rtlib calls are actually used by the backend.	2024-07-11 12:59:25 -05:00
hstk30-hw	ef465bf8b1	[ARM] Fix arm32be softfp mode miscompilation for neon sdiv (#97883 ) Related issue: https://github.com/llvm/llvm-project/issues/97782	2024-07-08 14:18:38 +08:00
Nikita Popov	9df71d7673	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919 ) Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern.	2024-06-28 08:36:49 +02:00
Nikita Popov	2d209d964a	[IR] Add getDataLayout() helpers to BasicBlock and Instruction (#96902 ) This is a helper to avoid writing `getModule()->getDataLayout()`. I regularly try to use this method only to remember it doesn't exist... `getModule()->getDataLayout()` is also a common (the most common?) reason why code has to include the Module.h header.	2024-06-27 16:38:15 +02:00
Eli Friedman	39a0aa5876	[SelectionDAG] Lower llvm.ldexp.f32 to ldexp() on Windows. (#95301 ) This reduces codesize. As discussed in #92707.	2024-06-25 10:25:48 -07:00
Lucas Duarte Prates	78ff617d3f	[ARM] CMSE security mitigation on function arguments and returned values (#89944 ) The ABI mandates two things related to function calls: - Function arguments must be sign- or zero-extended to the register size by the caller. - Return values must be sign- or zero-extended to the register size by the callee. As consequence, callees can assume that function arguments have been extended and so can callers with regards to return values. Here lies the problem: Nonsecure code might deliberately ignore this mandate with the intent of attempting an exploit. It might try to pass values that lie outside the expected type's value range in order to trigger undefined behaviour, e.g. out of bounds access. With the mitigation implemented, Secure code always performs extension of values passed by Nonsecure code. This addresses the vulnerability described in CVE-2024-0151. Patches by Victor Campos. --------- Co-authored-by: Victor Campos <victor.campos@arm.com>	2024-06-20 10:22:01 +01:00
Sergei Barannikov	23c1b488fe	[ARM] Remove duplicate custom SDag node (NFCI) (#93419 ) ARMISD::SUBS is a duplicate of ARMISD::SUBC. The node was introduced in 5745b6ac. This patch replaces SUBS with SUBC and reverts changes in *.td files.	2024-06-15 12:08:32 +03:00
Farzon Lotfi	7ad12a7c04	[ARM] Add tan intrinsic lowering (#95439 ) - `ARMISelLowering.cpp` - Add f16 type and neon and mve vector support for tan	2024-06-14 10:35:50 -04:00
Jay Foad	d4a0154902	[llvm-project] Fix typo "seperate" (#95373 )	2024-06-13 20:20:27 +01:00
Simon Pilgrim	af3ffff34f	[DAG] Always allow folding XOR patterns to ABS pre-legalization (#94601 ) Removes residual ARM handling for vXi64 ABS nodes to prevent infinite loops.	2024-06-07 11:02:50 +01:00
Simon Pilgrim	c0b468523c	[ARM] Add NEON support for ISD::ABDS/ABDU nodes. (#94504 ) As noted on #94466, NEON has ABDS/ABDU instructions but only handles them via intrinsics, plus some VABDL custom patterns. This patch flags basic ABDS/ABDU for neon types as legal and updates all tablegen patterns to use abds/abdu instead. Fixes #94466	2024-06-07 10:18:45 +01:00
David Green	264b1b2486	[ARM] Convert vector fdiv+fcvt fixed-point combine to fmul. Instcombine will convert fdiv by a power-2 to fmul, this converts the PerformVDIVCombine that converts fdiv+fcvt to fixed-point fcvt to fmul+fcvt. The fdiv tests will look worse, but won't appear in practice (and should be improved again by #93882).	2024-06-03 09:31:36 +01:00
AtariDreams	1d3329c2e8	[Thumb] Resolve FIXME: Transform "(and (shl x, c2), c1)" into "(shl (and x, c1>>c2), c2)" (#82120 ) Transform "(and (shl x, c2), c1)" into "(shl (and x, c1>>c2), c2)" if "c1 >> c2" is a cheaper immediate than "c1" using HasLowerConstantMaterializationCost	2024-05-26 14:58:32 -04:00
Reid Kleckner	385faf9cde	[ARM/X86] Standardize the isEligibleForTailCallOptimization prototypes (#90688 ) Pass in CallLoweringInfo (CLI) instead of passing in the various fields directly. Also pass in CCState (CCInfo), which is computed in both the caller and the callee for a minor efficiency saving. There may also be a small correctness improvement for sibcalls with vectorcall, which has an odd way of recomputing argument locations. This is a step towards improving the handling of musttail on armv7, which we have numerous issues filed about in our tracker. I took inspiration for this from the RISCV tail call eligibility check, which uses a similar prototype.	2024-05-03 13:56:55 -07:00
Qiu Chaofan	4a8f2f2e1a	[Legalizer] Expand fmaximum and fminimum (#67301 ) According to langref, llvm.maximum/minimum has -0.0 < +0.0 semantics and propagates NaN. Expand the nodes on targets not supporting the operation, by adding extra check for NaN and using is_fpclass to check zero signs.	2024-04-29 15:09:54 +08:00
Xu Zhang	f6d431f208	[CodeGen] Make the parameter TRI required in some functions. (#85968 ) Fixes #82659 There are some functions, such as `findRegisterDefOperandIdx` and `findRegisterDefOperand`, that have too many default parameters. As a result, we have encountered some issues due to the lack of TRI parameters, as shown in issue #82411. Following @RKSimon 's suggestion, this patch refactors 9 functions, including `{reads, kills, defines, modifies}Register`, `registerDefIsDead`, and `findRegister{UseOperandIdx, UseOperand, DefOperandIdx, DefOperand}`, adjusting the order of the TRI parameter and making it required. In addition, all the places that call these functions have also been updated correctly to ensure no additional impact. After this, the caller of these functions should explicitly know whether to pass the `TargetRegisterInfo` or just a `nullptr`.	2024-04-24 14:24:14 +01:00
Prabhuk	212b1a84a6	[CallSiteInfo][NFC] CallSiteInfo -> CallSiteInfo.ArgRegPairs (#86842 ) CallSiteInfo is originally used only for argument - register pairs. Make it struct, in which we can store additional data for call sites. Also, the variables/methods used for CallSiteInfo are named for its original use case, e.g., CallFwdRegsInfo. Refactor these for the upcoming use, e.g. addCallArgsForwardingRegs() -> addCallSiteInfo(). An upcoming patch will add type ids for indirect calls to propogate them from middle-end to the back-end. The type ids will be then used to emit the call graph section. Original RFC: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151044.html Updated RFC: https://lists.llvm.org/pipermail/llvm-dev/2021-July/151739.html Differential Revision: https://reviews.llvm.org/D107109?id=362888 Co-authored-by: Necip Fazil Yildiran <necip@google.com>	2024-04-02 13:05:16 -07:00
Arthur Eubanks	94c988bcfd	[NFC] Remove unused parameter from shouldAssumeDSOLocal()	2024-03-11 19:48:17 +00:00
Noah Goldstein	61c06775c9	[KnownBits] Add API for `nuw` flag in `computeForAddSub`; NFC	2024-03-05 12:59:58 -06:00
Fangrui Song	21c83feca5	[ARM] Simplify shouldAssumeDSOLocal for ELF. NFC	2024-03-01 16:14:48 -08:00
Simon Pilgrim	b45de48be2	[MVE] Expand64BitShift - handle all constant shift amounts less than 32 (#81261 ) Expand64BitShift was always dropping to generic shift legalization if the shift amount type was larger than i64, even if the constant shift amount was actually very small. I've adjusted the constant bounds checks to work with APInt types so we can always perform the comparison. This results in the MVE long shift instructions being used more often, and it looks like this is preventing some additional combines from happening. This could be addressed in the future. This came about while I was trying to extend the DAGTypeLegalizer::ExpandShift* helpers and need to move to consistently using the legal shift amount types instead of reusing the shift amount type from the original wider shift.	2024-02-11 15:02:27 +00:00
Harald van Dijk	52864d9c7b	[ARM] Switch to soft promoting half types. (#80440 ) The traditional promotion is known to generate wrong code. Fixes #73805.	2024-02-02 21:40:40 +00:00
Kazu Hirata	ae46855f53	[Target] Use getConstantOperand (NFC)	2024-01-28 18:03:38 -08:00
Nico Weber	184ca39529	[llvm] Move CodeGenTypes library to its own directory (#79444 ) Finally addresses https://reviews.llvm.org/D148769#4311232 :) No behavior change.	2024-01-25 12:01:31 -05:00
David Green	2c49586e1b	[ARM] Fix MVEFloatOps check on creating VCVTN (#79291 ) In the past PerformSplittingToNarrowingStores handled both int and float ops, but since the introduction of MVETRUNC now only operates on float operations, creating VCVTN nodes. It should be guarded by hasMVEFloatOps to prevent a failure to select.	2024-01-25 08:12:51 +00:00
Kazu Hirata	7528cf5ef2	[Target] Use getConstantOperandVal (NFC)	2024-01-14 00:53:29 -08:00
Kazu Hirata	be76f1646f	[Target] Use getConstantOperandAPInt (NFC)	2024-01-10 21:06:01 -08:00
Alex Bradbury	197214e39b	[RFC][SelectionDAG] Add and use SDNode::getAsZExtVal() helper (#76710 ) This follows on from #76708, allowing `cast<ConstantSDNode>(N)->getZExtValue()` to be replaced with just `N->getAsZextVal();` Introduced via `git grep -l "cast<ConstantSDNode>\(.\).getZExtValue" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.*)\)->getZExtValue/\1->getAsZExtVal/'` and then using `git clang-format` on the result.	2024-01-09 12:25:17 +00:00
Alex Bradbury	a181b42565	[llvm][NFC] Use SDValue::getConstantOperandAPInt(i) where possible The helper function allows examples like `cast<ConstantSDNode>(Op.getOperand(0))->getAPIntValue();` to be changed to `Op.getConstantOperandAPInt(0);`. See #76708 for further context. Although there are far fewer opportunities for replacement, I used a similar git grep and sed combo as before, given I already had it to hand: `git grep -l "cast<ConstantSDNode>\(.->getOperand\(.\)\)->getAPIntValue\(\)" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.)->getOperand\((.)\)\)->getAPIntValue\(\)/\1->getConstantOperandAPInt(\2)/'` and `git grep -l "cast<ConstantSDNode>\(.\.getOperand\(.\)\)->getAPIntValue\(\)" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.)\.getOperand\((.)\)\)->getAPIntValue\(\)/\1.getConstantOperandAPInt(\2)/'`	2024-01-02 14:43:55 +00:00
Alex Bradbury	80aeb62211	[llvm][NFC] Use SDValue::getConstantOperandVal(i) where possible (#76708 ) This helper function shortens examples like `cast<ConstantSDNode>(Node->getOperand(1))->getZExtValue();` to `Node->getConstantOperandVal(1);`. Implemented with: `git grep -l "cast<ConstantSDNode>\(.->getOperand\(.\)\)->getZExtValue\(\)" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.)->getOperand\((.)\)\)->getZExtValue\(\)/\1->getConstantOperandVal(\2)/` and `git grep -l "cast<ConstantSDNode>\(.\.getOperand\(.\)\)->getZExtValue\(\)" \| xargs sed -E -i 's/cast<ConstantSDNode>\((.)\.getOperand\((.)\)\)->getZExtValue\(\)/\1.getConstantOperandVal(\2)/'`. With a couple of simple manual fixes needed. Result then processed by `git clang-format`.	2024-01-02 13:14:28 +00:00

1 2 3 4 5 ...

2286 Commits