llvm-project

Author	SHA1	Message	Date
Boyao Wang	697beb3f17	[TargetLowering] Change getOptimalMemOpType and findOptimalMemOpLowering to take LLVM Context (#147664 ) Add LLVM Context to getOptimalMemOpType and findOptimalMemOpLowering so that we can use EVT::getVectorVT to generate EVT types in getOptimalMemOpType. Related to [#146673](https://github.com/llvm/llvm-project/pull/146673).	2025-07-10 11:11:09 +08:00
Dominik Steenken	acdf1c7526	[DAG] Add generic expansion for ISD::FCANONICALIZE nodes (#142105 ) This PR takes the work previously done by @pawan-nirpal-031 on X86 in #106370, and makes it available in common code. This should enable all targets to use `__builtin_canonicalize` for all `f(16|32|64|128)` data types. Canonicalization is implemented here as multiplication by `1.0`, as suggested in [the docs](https://llvm.org/docs/LangRef.html#llvm-canonicalize-intrinsic).	2025-07-08 16:12:17 +01:00
Matt Arsenault	d8ef156379	DAG: Remove verifyReturnAddressArgumentIsConstant (#147240 ) The intrinsic argument is already marked with immarg so non-constant values are rejected by the IR verifier.	2025-07-07 16:28:47 +09:00
Kazu Hirata	f46c1d6bcc	[PowerPC] Fix a warning This patch fixes: llvm/lib/Target/PowerPC/PPCISelLowering.cpp:9588:16: error: unused variable 'NumOps' [-Werror,-Wunused-variable]	2025-07-04 07:53:29 -07:00
zhijian lin	45909ec469	[PowePC] using MTVSRBMI instruction instead of constant pool in power10+ (#144084 ) The MTVSRBMI instruction sets each byte of a VSR to 0x00 or 0xFF based on a bit mask. Using this instruction instead of the constant pool can reduce the asm code size and instruction count on Power10.	2025-07-04 10:07:03 -04:00
Jie Fu	25d52fbf96	[PowerPC] Prevent copying in loop variables (NFC) /data/llvm-project/llvm/lib/Target/PowerPC/PPCISelLowering.cpp:5769:19: error: loop variable '[Reg, N]' creates a copy from type 'std::pair<unsigned int, llvm::SDValue> const' [-Werror,-Wrange-loop-construct] for (const auto [Reg, N] : RegsToPass) ^ /data/llvm-project/llvm/lib/Target/PowerPC/PPCISelLowering.cpp:5769:8: note: use reference type 'std::pair<unsigned int, llvm::SDValue> const &' to prevent copying for (const auto [Reg, N] : RegsToPass) ^~~~~~~~~~~~~~~~~~~~~ & /data/llvm-project/llvm/lib/Target/PowerPC/PPCISelLowering.cpp:6193:19: error: loop variable '[Reg, N]' creates a copy from type 'std::pair<unsigned int, llvm::SDValue> const' [-Werror,-Wrange-loop-construct] for (const auto [Reg, N] : RegsToPass) { ^ /data/llvm-project/llvm/lib/Target/PowerPC/PPCISelLowering.cpp:6193:8: note: use reference type 'std::pair<unsigned int, llvm::SDValue> const &' to prevent copying for (const auto [Reg, N] : RegsToPass) { ^~~~~~~~~~~~~~~~~~~~~ & /data/llvm-project/llvm/lib/Target/PowerPC/PPCISelLowering.cpp:6806:19: error: loop variable '[Reg, N]' creates a copy from type 'std::pair<unsigned int, llvm::SDValue> const' [-Werror,-Wrange-loop-construct] for (const auto [Reg, N] : RegsToPass) { ^ /data/llvm-project/llvm/lib/Target/PowerPC/PPCISelLowering.cpp:6806:8: note: use reference type 'std::pair<unsigned int, llvm::SDValue> const &' to prevent copying for (const auto [Reg, N] : RegsToPass) { ^~~~~~~~~~~~~~~~~~~~~ & 3 errors generated.	2025-06-29 10:21:00 +08:00
Kazu Hirata	bad5a740e1	[PowerPC] Use range-based for loops (NFC) (#146221 )	2025-06-28 13:04:08 -07:00
Wael Yehia	735d721de4	[PowerPC] Fix handling of undefs in the PPC::isSplatShuffleMask query (#145149 ) Currently, the query assumes that a single undef byte implies the rest of the `EltSize - 1` bytes are undefs, but that's not always true. e.g. isSplatShuffleMask( <0,1,2,3,4,5,6,7,undef,undef,undef,undef,0,1,2,3>, 8) should return false. --------- Co-authored-by: Wael Yehia <wyehia@ca.ibm.com>	2025-06-23 13:22:33 -04:00
Matt Arsenault	48155f93dd	CodeGen: Emit error if getRegisterByName fails (#145194 ) This avoids using report_fatal_error and standardizes the error message in a subset of the error conditions.	2025-06-23 16:33:35 +09:00
Nikita Popov	7ea7ccd24d	[PowerPC][AIX] Specify pointer info and alignment for stack store (#144526 ) When lowering call arguments to stack, specify a stack MPI, as well as the stack alignment, instead of using the defaults (which would be an unknown location with ABI alignment). I believe the asm diffs are just changes in scheduling.	2025-06-18 10:50:17 +02:00
zhijian lin	85a9f2e148	[PowerPC] enable AtomicExpandImpl::expandAtomicCmpXchg for powerpc (#142395 ) In PowerPC, the AtomicCmpXchgInst is lowered to ISD::ATOMIC_CMP_SWAP_WITH_SUCCESS. However, this node does not handle the weak attribute of AtomicCmpXchgInst. As a result, when compiling C++ atomic_compare_exchange_weak_explicit, the generated assembly includes a "reservation lost" loop — i.e., it branches back and retries if the stwcx. (store-conditional) fails. This differs from GCC’s codegen, which does not include that loop for weak compare-exchange. Since PowerPC uses LL/SC-style atomic instructions, the patch enables AtomicExpandImpl::expandAtomicCmpXchg for PowerPC. With this, the weak attribute is properly respected, and the "reservation lost" loop is removed for weak operations. --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2025-06-13 09:14:48 -04:00
Matt Arsenault	55858527da	PowerPC: Move runtime libcall configuration to RuntimeLibcallsInfo (#142542 ) These should not be set in the TargetLowering constructor, RuntimeLibcalls needs to be accurate outside of codegen contexts.	2025-06-10 07:28:04 +09:00
RolandF77	5d6218d311	[PowerPC] extend smaller splats into bigger splats (with fix) (#142194 ) For pwr9, xxspltib is a byte splat with a range -128 to 127 - it can be used with a following vector extend sign to make splats of i16, i32, or i64 element size. For pwr8, vspltisw with a following vector extend sign can be used to make splats of i64 elements in the range -16 to 15. Add check for P8 to make sure the 64-bit vector ops are there.	2025-06-09 14:01:38 -04:00
Lei Huang	649020c680	[PowerPC] Change default for auto gen stxvp for cpu=future (#142826 ) For cpu=future, we want to auto generate stxvp instructions by default.	2025-06-09 12:34:50 -04:00
Hubert Tong	8f486254e4	Revert "[PowerPC] extend smaller splats into bigger splats (#141282 )" The subject commit causes the build to ICE on AIX: https://lab.llvm.org/buildbot/#/builders/64/builds/3890/steps/5/logs/stdio This reverts commit 7fa365843d9f99e75c38a6107e8511b324950e74.	2025-05-29 01:10:55 -04:00
RolandF77	7fa365843d	[PowerPC] extend smaller splats into bigger splats (#141282 ) For pwr9, xxspltib is a byte splat with a range -128 to 127 - it can be used with a following vector extend sign to make splats of i16, i32, or i64 element size. For pwr8, vspltisw with a following vector extend sign can be used to make splats of i64 elements in the range -16 to 15.	2025-05-28 10:11:28 -04:00
Kazu Hirata	9738373c0b	[PowerPC] Fix warnings This patch fixes: llvm/lib/Target/PowerPC/PPCISelLowering.cpp:11897:8: error: unused variable 'IsV2048i1' [-Werror,-Wunused-variable] llvm/lib/Target/PowerPC/PPCISelLowering.cpp:12035:8: error: unused variable 'IsV2048i1' [-Werror,-Wunused-variable]	2025-05-26 08:55:51 -07:00
Maryam Moghadas	a54300b32c	[PowerPC] Add load/store support for v2048i1 and DMF cryptography instructions (#136145 ) This commit adds support for loading and storing v2048i1 DMR pairs and introduces Dense Math Facility cryptography instructions: DMSHA2HASH, DMSHA3HASH, and DMXXSHAPAD, along with their corresponding intrinsics and tests.	2025-05-26 10:59:35 -04:00
RolandF77	bbca78fbcb	[PowerPC] vector shift word/double by element size - 1 use all ones (#139794 ) Vector shift word or double requires a shift amount vector of 31 or 63 which is too big for splat immediate and requires a multi-instruction sequence. However the PPC instructions only use 5 or 6 bits of the shift amount vector elements so an all ones mask, which we can generate efficiently, works.	2025-05-23 10:49:37 -04:00
RolandF77	99f0309669	[PowerPC] catch v2i64 shift left by 1 is add case (#138772 ) Catch missing case in PPC BE for v2i64 x << 1 and generate x + x.	2025-05-13 11:26:46 -04:00
zhijian lin	41647412c6	[PowerPC] Fix an LowerADDSUBO_CARRY error when converting carry bit for usubo_carry (#137809 ) In PowerPC, if a borrow occurs during a subtraction, the carry bit is zero (unset). The carry bit is set if no borrow occurs. For ISD::USUBO_CARRY, the nodes produce two results: the normal result of the addition or subtraction, and a boolean value that is 1 if and only if there is an outgoing carry or borrow. Therefore, we need to convert a 1 (which indicates a borrow in ISD::USUBO_CARRY) to 0 to match PowerPC's definition of borrow. Similarly, we need to convert a 0 (no borrow in ISD::USUBO_CARRY) to 1 for PowerPC. To perform this conversion, we use XOR 1 instead of XOR DAG.getAllOnesConstant(DL, CarryOp.getValueType()).	2025-04-30 10:39:09 -04:00
Craig Topper	e4d2ff5b01	[SelectionDAG][PowerPC] Remove setTruncatingStore from StoreSDNode. (#137667 ) Mutating a node after it has been created isn't a good idea. After e17f07c4debbe76f5ebcdeeda619e7438700e2ad, we have a version of setStore that can create a truncating indexed store. Use that instead of MorphNodeTo+setTruncatingStore in PowerPC. Unfortunately, if we return the newly created node, DAGCombiner will visit the node and change the constant. To prevent this, we use DCI.CombineTo and avoid adding the new node to the worklist.	2025-04-28 16:48:37 -07:00
RolandF77	a903c7b7f5	[PowerPC] Intrinsics and tests for dmr insert/extract (#135653 ) Add some intrinsics and LIT tests for PPC dmr insert/extract instructions.	2025-04-24 11:27:22 -04:00
Lei Huang	b518242156	[PowerPC] Fix instruction name for dmr insert (#134301 )	2025-04-04 15:56:30 -04:00
zhijian lin	1a540c3b8b	[PowerPC] Deprecate uses of ISD::ADDC/ISD::ADDE/ISD::SUBC/ISD::SUBE (#133155 ) ISD::ADDC, ISD::ADDE, ISD::SUBC and ISD::SUBE are being deprecated; use ISD::UADDO_CARRY and ISD::USUBO_CARRY instead. This patch lowers UADDO, UADDO_CARRY, USUBO and USUBO_CARRY.	2025-04-03 13:22:49 -04:00
Kazu Hirata	86c382514e	[Target] Construct SmallVector with ArrayRef (NFC) (#134019 )	2025-04-01 21:59:19 -07:00
Rahul Joshi	74b7abf154	[IRBuilder] Add new overload for CreateIntrinsic (#131942 ) Add a new `CreateIntrinsic` overload with no `Types`, useful for creating calls to non-overloaded intrinsics that don't need additional mangling.	2025-03-31 08:10:34 -07:00
Lei Huang	ade22fc1d9	[PowerPC] Support conversion between f16 and f128 (#130158 ) Enables conversion between f16 and f128. Expanding on pre-Power9 targets and using HW instructions on Power9. Fixes https://github.com/llvm/llvm-project/issues/92866 Commandeer of: https://github.com/llvm/llvm-project/pull/97677 --------- Co-authored-by: esmeyi <esme.yi@ibm.com>	2025-03-19 10:19:57 -04:00
RolandF77	a73e591f33	[PowerPC] custom lower v1024i1 load/store (#126969 ) Support moving PPC dense math register values to and from storage with LLVM IR load/store.	2025-02-28 10:25:07 -05:00
David Tenty	aa9e519b24	Revert "[PowerPC] Deprecate uses of ISD::ADDC/ISD::ADDE/ISD::SUBC/ISD::SUBE (#116984 )" This reverts commit 7763119c6eb0976e4836f81c9876c49a36d46d73 (leaving the modifications from 03cb46d248b08).	2025-02-19 09:44:39 -05:00
Nikita Popov	03cb46d248	[CodeGen] Use getSignedConstant() in more places (#127501 ) Use getSignedConstant() in a few more places, based on a search of `\bgetConstant(-`. Most of these were fine as-is (e.g. because they work on 64-bits), but I think it's better to use getSignedConstant() consistently for negative numbers.	2025-02-18 09:29:25 +01:00
Craig Topper	256145b4b0	[PowerPC] Use getSignedTargetConstant in SelectOptimalAddrMode. (#127305 ) Fixes #127298.	2025-02-15 14:13:32 -08:00
zhijian lin	7763119c6e	[PowerPC] Deprecate uses of ISD::ADDC/ISD::ADDE/ISD::SUBC/ISD::SUBE (#116984 ) ISD::ADDC, ISD::ADDE, ISD::SUBC and ISD::SUBE are being deprecated; use ISD::UADDO_CARRY and ISD::USUBO_CARRY instead. This patch lowers UADDO, UADDO_CARRY, USUBO and USUBO_CARRY.	2025-02-13 09:09:17 -05:00
Craig Topper	7fff2527f8	[PowerPC] Use SelectionDAG::makeEquivalentMemoryOrdering(). NFC (#124889 )	2025-01-29 09:45:00 -08:00
yingopq	754ed95b66	[Mips] Fix compiler crash when returning fp128 after calling a function returning { i8, i128 } (#117525 ) Fixes https://github.com/llvm/llvm-project/issues/96432.	2025-01-20 16:47:40 +08:00
Craig Topper	e6b2495545	[SelectionDAG] Split SDNode::use_iterator into user_iterator and use_iterator. (#120531 ) SDNode::use_iterator now returns an SDUse& when dereferenced. SDNode::user_iterator returns SDNode*. SDNode::use_begin/use_end/uses work on use_iterator. SDNode::user_begin/user_end/users work on user_iterator. We can now write range based for loops using SDUse& and SDNode::uses(). I've converted many of these in this patch. I didn't update loops that have additional variables updated in their for statement. Some loops use SDNode::use_iterator::getOperandNo() which also prevents using range based for loops. I plan to move this into SDUse in a follow up patch.	2024-12-19 08:35:32 -08:00
Craig Topper	bd261ecc5a	[SelectionDAG] Add SDNode::user_begin() and use it in some places (#120509 ) Most of these are just places that want the first user and aren't iterating over the whole list. While there I changed some use_size() == 1 to hasOneUse() which is more efficient. This is part of an effort to rename use_iterator to user_iterator and provide a use_iterator that dereferences to SDUse&. This patch helps reduce the diff on later patches.	2024-12-18 22:13:04 -08:00
Craig Topper	104ad9258a	[SelectionDAG] Rename SDNode::uses() to users(). (#120499 ) This function is most often used in range based loops or algorithms where the iterator is implicitly dereferenced. The dereference returns an SDNode * of the user rather than SDUse * so users() is a better name. I've long been annoyed that we can't write a range based loop over SDUse when we need getOperandNo. I plan to rename use_iterator to user_iterator and add a use_iterator that returns SDUse& on dereference. This will make it more like IR.	2024-12-18 20:09:33 -08:00
Stefan Pintilie	67eb05b292	[PowerPC] Add special handling for arguments that are smaller than pointer size. (#119003 ) When arguments are passed in memory instead of registers we currently load the entire pointer size even though the argument may be smaller. For example, if the pointer size is i32 then we use a load word even if the argument is only an i8. This patch zeros / extends the bits that are not required to ensure that we are getting the correct value even if the load is larger.	2024-12-12 09:43:53 -05:00
Sergei Barannikov	e55c167777	[TargetLowering] Return Align from getByValTypeAlignment (NFC) (#119233 )	2024-12-09 23:39:19 +03:00
Maryam Moghadas	68e75eebec	[PPC] Custom lower ssubo for i64 (#118711 ) This is a follow-up patch to improve the codegen for ssubo node for i64 in 64-bit mode by custom lowering.	2024-12-05 17:22:44 -05:00
zhijian lin	6b5c67bd16	[PowerPC][Backend] using signed extend value instead of zero extend value for isIntS34Immediate() (#118703 ) The patch fixes the issue https://github.com/llvm/llvm-project/issues/118695	2024-12-05 09:08:18 -05:00
Craig Topper	b076fbb844	[TargetLowering] Use Type* instead of EVT in shouldSignExtendTypeInLibCall. (#118587 ) I want to use this function for GISel too so Type * is a better common interface. All of the callers already convert EVT to Type * as needed by calling lowering anyway.	2024-12-03 22:06:55 -08:00
Maryam Moghadas	dab4121a55	[PowerPC] Add custom lowering for ssubo (#111748 ) (#115875 ) This patch is to improve the codegen for ssubo node for i32 by custom lowering.	2024-11-28 13:55:53 -05:00
RolandF77	a475180498	[PowerPC] Use setbc for values from vector compare conditions (#114858 ) For P10 use the setbc instruction to get int values from vector compare summary condition results.	2024-11-27 12:47:10 -05:00
Nikita Popov	5322415f92	[PowerPC] Use getSignedConstant() in SelectOptimalAddrMode() All of these immediates are signed, as the surrounding comments indicate. This fixes an assertion failure in CodeGen/Generic/dag-combine-ossfuzz-crash.ll when run with a powerpc-aix triple.	2024-11-26 14:34:30 +01:00
Craig Topper	bc282605df	[SelectionDAG] Require last operand of (STRICT_)FP_ROUND to be a TargetConstant. (#117639 ) Fix all the places I could find that didn't do this. We were already mostly correct for FP_ROUND after 9a976f36615dbe15e76c12b22f711b2e597a8e51, but not STRICT_FP_ROUND.	2024-11-25 21:36:33 -08:00
Nikita Popov	157d847ba7	[PowerPC] Use getSignedConstant() where necessary (#117177 ) This is to prevent assertion failures when we disable implicit truncation in getConstant(). getCanonicalConstSplat() works with a mix of unsigned and signed values, so I explicitly truncate the APInt there.	2024-11-22 09:40:19 +01:00
Kazu Hirata	f71cb9dbb7	[PowerPC] Remove unused includes (NFC) (#116163 ) Identified with misc-include-cleaner.	2024-11-14 07:55:18 -08:00
Lei Huang	f895fc9550	[NFC][PowerPC] Add getScalarIntVT to return MVT based on arch (#115203 ) Add `getScalarIntVT()` to return the scalar int VT based on whether the arch is 32-bit or 64-bit.	2024-11-11 12:25:14 -05:00
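
The carry conversion described in the message for commit 41647412c6 (usubo_carry) can be modelled in plain C++. This is only an illustrative sketch of the two flag conventions, not the code in PPCISelLowering.cpp; the names `SubResult`, `usubo`, `borrowToPPCCarry`, `xorAllOnes` and `checkConventions` are hypothetical. It suggests why XOR with 1 maps one convention onto the other, while XOR with an all-ones value does not produce a 0/1 flag.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of an unsigned subtract that also reports a borrow.
// Convention taken from the commit message for ISD::USUBO_CARRY:
// borrow == 1 if and only if a borrow occurred.
struct SubResult {
  uint32_t value;
  uint32_t borrow;
};

constexpr SubResult usubo(uint32_t a, uint32_t b) {
  return SubResult{a - b, a < b ? 1u : 0u};
}

// PowerPC convention from the commit message: the carry bit is set
// when no borrow occurs, so the 0/1 flag has to be inverted.
constexpr uint32_t borrowToPPCCarry(uint32_t borrow) { return borrow ^ 1u; }

// XOR with all-ones flips every bit, so a 0/1 flag does not stay 0/1.
constexpr uint32_t xorAllOnes(uint32_t flag) { return flag ^ ~0u; }

// Small runtime check of both conventions.
inline void checkConventions() {
  assert(borrowToPPCCarry(usubo(1u, 2u).borrow) == 0u); // borrow: carry clear
  assert(borrowToPPCCarry(usubo(2u, 1u).borrow) == 1u); // no borrow: carry set
  assert(xorAllOnes(1u) == 0xFFFFFFFEu);                // not a valid 0/1 flag
}
```

Calling `checkConventions()` from a test or from `main` exercises both cases; the functions are `constexpr`, so the same checks could also be written as `static_assert`s.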


1929 Commits