llvm-project

Author	SHA1	Message	Date
Craig Topper	e6b2495545	[SelectionDAG] Split SDNode::use_iterator into user_iterator and use_iterator. (#120531 ) SDNode::use_iterator now returns an SDUse& when dereferenced. SDNode::user_iterator returns SDNode*. SDNode::use_begin/use_end/uses work on use_iterator. SDNode::user_begin/user_end/users work on user_iterator. We can now write range based for loops using SDUse& and SDNode::uses(). I've converted many of these in this patch. I didn't update loops that have additional variables updated in their for statement. Some loops use SDNode::use_iterator::getOperandNo() which also prevents using range based for loops. I plan to move this into SDUse in a follow up patch.	2024-12-19 08:35:32 -08:00
Craig Topper	bd261ecc5a	[SelectionDAG] Add SDNode::user_begin() and use it in some places (#120509 ) Most of these are just places that want the first user and aren't iterating over the whole list. While there I changed some use_size() == 1 to hasOneUse() which is more efficient. This is part of an effort to rename use_iterator to user_iterator and provide a use_iterator that dereferences to SDUse&. This patch helps reduce the diff on later patches.	2024-12-18 22:13:04 -08:00
Craig Topper	104ad9258a	[SelectionDAG] Rename SDNode::uses() to users(). (#120499 ) This function is most often used in range based loops or algorithms where the iterator is implicitly dereferenced. The dereference returns an SDNode * of the user rather than SDUse * so users() is a better name. I've long beeen annoyed that we can't write a range based loop over SDUse when we need getOperandNo. I plan to rename use_iterator to user_iterator and add a use_iterator that returns SDUse& on dereference. This will make it more like IR.	2024-12-18 20:09:33 -08:00
Stefan Pintilie	67eb05b292	[PowerPC] Add special handling for arguments that are smaller than pointer size. (#119003 ) When arguments are passed in memory instead of registers we currently load the entire pointer size even though the argument may be smaller. For exmaple if the pointer size if i32 then we use a load word even if the argument is only an i8. This patch zeros / extends the bits that are not required to ensure that we are getting the correct value even if the load is larger.	2024-12-12 09:43:53 -05:00
Sergei Barannikov	e55c167777	[TargetLowering] Return Align from getByValTypeAlignment (NFC) (#119233 )	2024-12-09 23:39:19 +03:00
Maryam Moghadas	68e75eebec	[PPC] Custom lower ssubo for i64 (#118711 ) This is a follow-up patch to improve the codegen for ssubo node for i64 in 64-bit mode by custom lowering.	2024-12-05 17:22:44 -05:00
zhijian lin	6b5c67bd16	[PowerPC][Backend] using signed extend value instead of zero extend value for isIntS34Immediate() (#118703 ) The patch fix the issue https://github.com/llvm/llvm-project/issues/118695	2024-12-05 09:08:18 -05:00
Craig Topper	b076fbb844	[TargetLowering] Use Type* instead of EVT in shouldSignExtendTypeInLibCall. (#118587 ) I want to use this function for GISel too so Type * is a better common interface. All of the callers already convert EVT to Type * as needed by calling lowering anyway.	2024-12-03 22:06:55 -08:00
Maryam Moghadas	dab4121a55	[PowerPC] Add custom lowering for ssubo (#111748 ) (#115875 ) This patch is to improve the codegen for ssubo node for i32 by custom lowering.	2024-11-28 13:55:53 -05:00
RolandF77	a475180498	[PowerPC] Use setbc for values from vector compare conditions (#114858 ) For P10 use the setbc instruction to get int values from vector compare summary condition results.	2024-11-27 12:47:10 -05:00
Nikita Popov	5322415f92	[PowerPC] Use getSignedConstant() in SelectOptimalAddrMode() All of these immediates are signed, as the surrounding comments indicate. This fixes an assertion failure in CodeGen/Generic/dag-combine-ossfuzz-crash.ll when run with a powerpc-aix triple.	2024-11-26 14:34:30 +01:00
Craig Topper	bc282605df	[SelectionDAG] Require last operand of (STRICT_)FP_ROUND to be a TargetConstant. (#117639 ) Fix all the places I could find that did't do this. We were already mostly correct for FP_ROUND after 9a976f36615dbe15e76c12b22f711b2e597a8e51, but not STRICT_FP_ROUND.	2024-11-25 21:36:33 -08:00
Nikita Popov	157d847ba7	[PowerPC] Use getSignedConstant() where necessary (#117177 ) This is to prevent assertion failures when we disable implicit truncation in getConstant(). getCanonicalConstSplat() works with a mix of unsigned and signed values, so I explicitly truncate the APInt there.	2024-11-22 09:40:19 +01:00
Kazu Hirata	f71cb9dbb7	[PowerPC] Remove unused includes (NFC) (#116163 ) Identified with misc-include-cleaner.	2024-11-14 07:55:18 -08:00
Lei Huang	f895fc9550	[NFC][PowerPC] Add getScalarIntVT to return MVT based on arch (#115203 ) Add `getScalarIntVT()` to return scalar int VT based on if arch is 32 or 64bit.	2024-11-11 12:25:14 -05:00
Amy Kwan	4981f8cb72	[PowerPC] Fix vector_shuffle combines when inputs are scalar_to_vector of differing types. (#80784 ) This patch fixes the combines for vector_shuffles when either or both of its left and right hand side inputs are scalar_to_vector nodes. Previously, when both left and right side inputs are scalar_to_vector nodes, the current combine could not handle this situation, as the shuffle mask was updated incorrectly. To temporarily solve this solution, this combine was simply disabled and not performed. Now, not only does this patch aim to resolve the previous issue of the incorrect shuffle mask adjustments respectively, but it also updates any test cases that are affected by this change. Patch migrated from https://reviews.llvm.org/D130487.	2024-11-11 10:53:51 -05:00
zhijian lin	2cd32132db	[PowerPC] Utilize getReservedRegs to find asm clobberable registers. (#107863 ) This patch utilizes getReservedRegs() to find asm clobberable registers. And to make the result of getReservedRegs() accurate, this patch implements the todo, which is to make r2 allocatable on AIX for some leaf functions.	2024-11-04 12:57:26 -05:00
Maryam Moghadas	c7c5042e3c	Revert "[PowerPC] Add custom lowering for ssubo (#111748 )" (#114672 ) This reverts commit 8a0cb9ac869334fd6c6bd6aad8408623a7ccd7f6. Reverting due to PPC bootstrap bot failure.	2024-11-02 10:48:36 -04:00
Maryam Moghadas	8a0cb9ac86	[PowerPC] Add custom lowering for ssubo (#111748 ) This patch is to improve the codegen for ssubo node for i32 in 64-bit mode by custom lowering.	2024-10-29 15:43:05 -04:00
Lei Huang	522f34cfff	[PowerPC] Expand global named register support (#113482 ) Enable all valid registers for intrinsics that read from and write to global named registers.	2024-10-24 10:05:18 -04:00
Lei Huang	a19f05b9ec	Revert "[PowerPC] Expand global named register support" (#113457 ) Reverts llvm/llvm-project#112603	2024-10-23 09:36:28 -04:00
Lei Huang	06d192925d	[PowerPC] Expand global named register support (#112603 ) Enable all valid registers for intrinsics that read from and write to global named registers.	2024-10-22 14:34:24 -04:00
RolandF77	fc59f2cc0f	[PowerPC] special case small int constant for custom scalar_to_vector (#109850 ) Special case small int constant in the PPC custom lowering of scalar_to_vector.	2024-10-21 12:19:07 -04:00
Zaara Syeda	c5ca1b8626	[PPC] Add custom lowering for uaddo (#110137 ) Improve the codegen for uaddo node for i64 in 64-bit mode and i32 in 32-bit mode by custom lowering.	2024-10-21 11:13:16 -04:00
Keith Packard	44b020a381	[PowerPC][ISelLowering] Support -mstack-protector-guard=tls (#110928 ) Add support for using a thread-local variable with a specified offset for holding the stack guard canary value. This supports both 32- and 64- bit PowerPC targets. This mirrors changes from #108942 but targeting PowerPC instead of RISCV. Because both of these PRs modify the same driver functions, this series is stack on top of the RISC-V one. --------- Signed-off-by: Keith Packard <keithp@keithp.com>	2024-10-17 19:06:47 -07:00
Jay Foad	85c17e4092	[LLVM] Make more use of IRBuilder::CreateIntrinsic. NFC. (#112706 ) Convert many instances of: Fn = Intrinsic::getOrInsertDeclaration(...); CreateCall(Fn, ...) to the equivalent CreateIntrinsic call.	2024-10-17 16:20:43 +01:00
Jay Foad	d9c95efb6c	[LLVM] Make more use of IRBuilder::CreateIntrinsic. NFC. (#112546 ) Convert almost every instance of: CreateCall(Intrinsic::getOrInsertDeclaration(...), ...) to the equivalent CreateIntrinsic call.	2024-10-16 15:43:30 +01:00
Rahul Joshi	fa789dffb1	[NFC] Rename `Intrinsic::getDeclaration` to `getOrInsertDeclaration` (#111752 ) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).	2024-10-11 05:26:03 -07:00
Zaara Syeda	22067a8eb4	[PowerPC] Fix assert exposed by PR 95931 in LowerBITCAST (#108062 ) Hit Assertion failed: Num < NumOperands && "Invalid child # of SDNode!" Fix by checking opcode and value type before calling getOperand.	2024-09-10 14:14:01 -04:00
Qiu Chaofan	06c331163e	[PowerPC] Implement llvm.set.rounding intrinsic (#67302 )	2024-09-10 14:30:31 +08:00
anjenner	4af249fe6e	Add usub_cond and usub_sat operations to atomicrmw (#105568 ) These both perform conditional subtraction, returning the minuend and zero respectively, if the difference is negative.	2024-09-06 16:19:20 +01:00
RolandF77	26ba186bd0	[PowerPC] Improve pwr7 codegen for v4i8 load (#104507 ) There are no partial vector loads on pwr7 so current v4i8 codegen is an int load then store to vector sized temp and re-load as vector. Try to use lfiwax to load 32 bits into an FP reg and take advantage of VSX FP and vector reg sharing to move the result to the right vector position.	2024-09-04 12:55:27 -04:00
Matt Arsenault	911b96058a	PPC: Custom lower ppcf128 is_fpclass if is_fpclass is custom (#105540 ) Unfortunately expandIS_FPCLASS is called directly in SelectionDAGBuilder depending on whether IS_FPCLASS is custom or not. This helps avoid ppc test regressions in a future patch where the custom lowering would be bypassed.	2024-08-29 14:01:54 +04:00
RolandF77	89bbcbe285	[PowerPC] fix legalization crash (#105563 ) If v2i64 scalar_to_vector is made custom, llc can crash in certain legalization cases where v2i64 vectors are injected, even if they weren't otherwise present. The code generated would be fine, but that operation is not handled in ReplaceNodeResults. Add handling.	2024-08-28 11:22:23 -04:00
Craig Topper	4b0c0ec6b8	[CodeGen] Use MCRegister for CCState::AllocateReg and CCValAssign::getReg. NFC (#106032 )	2024-08-26 11:40:25 -07:00
Craig Topper	e994494a59	[PowerPC] Use MathExtras helpers to simplify code. NFC (#104691 )	2024-08-17 23:12:52 -07:00
Kazu Hirata	3080c80671	[PowerPC] Use range-based for loops (NFC) (#104410 )	2024-08-15 17:58:46 -07:00
RolandF77	8b6e9de3dd	[PowerPC] improve P10 store forwarding on P7 scalar to vector (#102330 ) Try to make P7 code with scalar to vector operations that use store/re-load to run smoother on P10 by supplying enough store width to cover the load and allow hardware store forwarding.	2024-08-12 12:30:06 -04:00
Kazu Hirata	f4fb735840	[llvm] Construct SmallVector<SDValue> with ArrayRef (NFC) (#102578 )	2024-08-09 09:15:42 -07:00
Tim Gymnich	408d82d352	[PowerPC] Respect endianness when bitcasting to fp128 (#95931 ) Fixes #92246 Match the behaviour of `bitcast v2i64 (BUILD_PAIR %lo %hi)` when encountering `bitcast fp128 (BUILD_PAIR %lo $hi)`. by inserting a missing swap of the arguments based on endianness. ### Current behaviour: fp128 bitcast fp128 (BUILD_PAIR %lo $hi) => BUILD_FP128 %lo %hi BUILD_FP128 %lo %hi => MTVSRDD %hi %lo v2i64 bitcast v2i64 (BUILD_PAIR %lo %hi) => BUILD_VECTOR %hi %lo BUILD_VECTOR %hi %lo => MTVSRDD %lo %hi	2024-08-08 08:51:04 +08:00
Qiu Chaofan	20957d2091	[AIX] Add -msave-reg-params to save arguments to stack (#97524 ) In PowerPC ABI, a few initial arguments are passed through registers, but their places in parameter save area are reserved, arguments passed by memory goes after the reserved location. For debugging purpose, we may want to save copy of the pass-by-reg arguments into correct places on stack. The new option achieves by adding new function level attribute and make argument lowering part aware of it.	2024-07-24 20:58:37 +08:00
azhan92	1df4d866cc	[PowerPC] Add support for -mcpu=pwr11 / -mtune=pwr11 (#99511 ) This PR adds support for -mcpu=pwr11/power11 and -mtune=pwr11/power11 in clang and llvm.	2024-07-23 09:49:41 -04:00
Joseph Huber	615b7eeaa9	Reapply "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit 740161a9b98c9920dedf1852b5f1c94d0a683af5. I moved the `ISD` dependencies into the CodeGen portion of the handling, it's a little awkward but it's the easiest solution I can think of for now.	2024-07-20 09:29:31 -05:00
NAKAMURA Takumi	740161a9b9	Revert "[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 )" This reverts commit c05126bdfc3b02daa37d11056fa43db1a6cdef69. (llvmorg-19-init-17714-gc05126bdfc3b) See #99610	2024-07-20 12:36:57 +09:00
Matt Arsenault	0f0cfcff2c	CodeGen: Avoid some references to MachineFunction's getMMI (#99652 ) MachineFunction's probably should not include a backreference to the owning MachineModuleInfo. Most of these references were used just to query the MCContext, which MachineFunction already directly stores. Other contexts are using it to query the LLVMContext, which can already be accessed through the IR function reference.	2024-07-19 22:09:05 +04:00
Amara Emerson	f270a4dd66	[AArch64] Don't tail call memset if it would convert to a bzero. (#98969 ) Well, not quite that simple. We can tc memset since it returns the first argument but bzero doesn't do that and therefore we can end up miscompiling. This patch also refactors the logic out of isInTailCallPosition() into the callers. As a result memcpy and memmove are also modified to do the same thing for consistency. rdar://131419786	2024-07-17 01:31:52 -07:00
Joseph Huber	c05126bdfc	[LLVM][LTO] Factor out RTLib calls and allow them to be dropped (#98512 ) Summary: The LTO pass and LLD linker have logic in them that forces extraction and prevent internalization of needed runtime calls. However, these currently take all RTLibcalls into account, even if the target does not support them. The target opts-out of a libcall if it sets its name to nullptr. This patch pulls this logic out into a class in the header so that LTO / lld can use it to determine if a symbol actually needs to be kept. This is important for targets like AMDGPU that want to be able to use `lld` to perform the final link step, but does not want the overhead of uncalled functions. (This adds like a second to the link time trivially)	2024-07-16 06:22:09 -05:00
Joseph Huber	3f1a767572	[LLVM] Factor disabled Libcalls into the initializer (#98421 ) Summary: These Libcalls represent which functions are available to the backend. If a runtime call is not available, the target sets the the name to `nullptr`. Currently, this logic is spread around the various targets. This patch pulls all of the locations that disable libcalls into the intializer. This patch is effectively NFC. The motivation behind this patch is that currently the LTO handling uses the list of all runtime calls to determine which functions cannot be internalized and must be extracted from static libraries. We do not want this to happen for libcalls that are not emitted by the backend. A follow-up patch will move out this logic so the LTO pass can know which rtlib calls are actually used by the backend.	2024-07-11 12:59:25 -05:00
Nikita Popov	9df71d7673	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919 ) Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern.	2024-06-28 08:36:49 +02:00
Kazu Hirata	0a5292ebbb	[PowerPC] Remove extraneous ArrayRef (NFC) (#96092 ) ArrayRef can be implicitly constructed from a C array while inferring its size.	2024-06-19 13:38:46 -07:00

1 2 3 4 5 ...

1894 Commits