llvm-project

Author	SHA1	Message	Date
anoopkg6	7e85b790b0	[SystemZ] Fix linux s390x main can't bootstrap itself on SanitizerSpecialCaseList.cpp #168088 (#168779 ) This test has long call chain in recursion. Search tree can be pruned early by swapping CC test and recursive simplifyAssumingCCVal. Fixes: https://github.com/llvm/llvm-project/issues/168088 Co-authored-by: anoopkg6 <anoopkg6@github.com>	2025-11-20 00:07:43 +01:00
Matt Arsenault	a757c4e74e	CodeGen: Add subtarget to TargetLoweringBase constructor (#168620 ) Currently LibcallLoweringInfo is defined inside of TargetLowering, which is owned by the subtarget. Pass in the subtarget so we can construct LibcallLoweringInfo with the subtarget. This is a temporary step that should be revertable in the future, after LibcallLoweringInfo is moved out of TargetLowering.	2025-11-19 19:18:13 +00:00
Sergei Barannikov	320c18a066	[SystemZ] TableGen-erate node descriptions (#168113 ) This allows SDNodes to be validated against their expected type profiles and reduces the number of changes required to add a new node. There is only one node that is missing a description -- `GET_CCMASK`, others were successfully imported. Part of #119709. Pull Request: https://github.com/llvm/llvm-project/pull/168113	2025-11-17 23:03:45 +03:00
Kazu Hirata	c04e57d133	[llvm] Use StringRef::contains (NFC) (#165397 ) Identified with readability-container-contains	2025-10-28 16:15:08 -07:00
anoopkg6	242c716c68	Fix Linux kernel build failure for SytemZ. (#165274 ) Linux kernel build fails for SystemZ as output of INLINEASM was GR32Bit general-purpose register instead of SystemZ::CC. --------- Co-authored-by: anoopkg6 <anoopkg6@github.com> Co-authored-by: Ulrich Weigand <ulrich.weigand@de.ibm.com>	2025-10-27 18:22:01 +01:00
anoopkg6	6712e20c52	Add support for flag output operand "=@cc" for SystemZ. (#125970 ) Added Support for flag output operand "=@cc", inline assembly constraint for SystemZ. - Clang now accepts "=@cc" assembly operands, and sets 2-bits condition code for output operand for SyatemZ. - Clang currently emits an assertion that flag output operands are boolean values, i.e. in the range [0, 2). Generalize this mechanism to allow targets to specify arbitrary range assertions for any inline assembly output operand. This will be used to assert that SystemZ two-bit condition-code values are in the range [0, 4). - SystemZ backend lowers "@cc" targets by using ipm sequence to extract condition code from PSW. - DAGCombine tries to optimize lowered ipm sequence by combining CCReg and computing effective CCMask and CCValid in combineCCMask for select_ccmask and br_ccmask. - Cost computation is done for merging conditionals for branch instruction in SelectionDAG, as split may cause branches conditions evaluation goes across basic block and difficult to combine. --------- Co-authored-by: anoopkg6 <anoopkg6@github.com> Co-authored-by: Ulrich Weigand <ulrich.weigand@de.ibm.com>	2025-10-14 11:53:42 +02:00
Folkert de Vries	8a9e3333dd	s390x: optimize 128-bit fshl and fshr by high values (#154919 ) Turn a funnel shift by N in the range `121..128` into a funnel shift in the opposite direction by `128 - N`. Because there are dedicated instructions for funnel shifts by values smaller than 8, this emits fewer instructions. This additional rule is useful because LLVM appears to canonicalize `fshr` into `fshl`, meaning that the rules for `fshr` on values less than 8 would not match on organic input.	2025-08-27 09:31:49 +02:00
Folkert de Vries	558657298a	s390x: pattern match saturated truncation (#155377 ) Simplify min/max instruction matching by making the related SelectionDAG operations legal. Add patterns to match (signed and unsigned) saturated truncation based on open-coded min/max patterns. Fixes https://github.com/llvm/llvm-project/issues/153655	2025-08-26 17:19:58 +02:00
Nikita Popov	9d37e80d3c	[SystemZ] Remove custom CCState pre-analysis (#154091 ) The calling convention lowering now has access to OrigTy, so use that to detect short vectors.	2025-08-19 09:28:09 +02:00
Nikita Popov	01bc742185	[CodeGen] Give ArgListEntry a proper constructor (NFC) (#153817 ) This ensures that the required fields are set, and also makes the construction more convenient.	2025-08-15 18:06:07 +02:00
sujianIBM	fc12fc635b	[SystemZ] Fix code in widening vector multiplication (#150836 ) Commit cdc7864 has an error which would wrongly fold widening multiplications into an even/odd widening operation. This PR fixes it and adds tests to check scenarios which should not be folded into an even/odd widening operation are actually not.	2025-07-31 13:18:23 -04:00
Boyao Wang	697beb3f17	[TargetLowering] Change getOptimalMemOpType and findOptimalMemOpLowering to take LLVM Context (#147664 ) Add LLVM Context to getOptimalMemOpType and findOptimalMemOpLowering. So that we can use EVT::getVectorVT to generate EVT type in getOptimalMemOpType. Related to [#146673](https://github.com/llvm/llvm-project/pull/146673).	2025-07-10 11:11:09 +08:00
MangalaPG	dd54b8e462	Clang-Tidy issues in fixed in file SystemZISelLowering.cpp (#147251 ) Corrected variable names corrections according to the clang-tidy standards. --------- Signed-off-by: MangalaPG <mangala.P.G@ibm.com>	2025-07-09 20:26:42 +02:00
Matt Arsenault	d8ef156379	DAG: Remove verifyReturnAddressArgumentIsConstant (#147240 ) The intrinsic argument is already marked with immarg so non-constant values are rejected by the IR verifier.	2025-07-07 16:28:47 +09:00
Jie Fu	842f4f711d	[Target] Prevent copying in loop variables (NFC) /data/llvm-project/llvm/lib/Target/SystemZ/SystemZISelLowering.cpp:2390:19: error: loop variable '[Reg, N]' creates a copy from type 'std::pair<unsigned int, llvm::SDValue> const' [-Werror,-Wrange-loop-construct] for (const auto [Reg, N] : RegsToPass) { ^ /data/llvm-project/llvm/lib/Target/SystemZ/SystemZISelLowering.cpp:2390:8: note: use reference type 'std::pair<unsigned int, llvm::SDValue> const &' to prevent copying for (const auto [Reg, N] : RegsToPass) { ^~~~~~~~~~~~~~~~~~~~~ & /data/llvm-project/llvm/lib/Target/SystemZ/SystemZISelLowering.cpp:2402:19: error: loop variable '[Reg, N]' creates a copy from type 'std::pair<unsigned int, llvm::SDValue> const' [-Werror,-Wrange-loop-construct] for (const auto [Reg, N] : RegsToPass) ^ /data/llvm-project/llvm/lib/Target/SystemZ/SystemZISelLowering.cpp:2402:8: note: use reference type 'std::pair<unsigned int, llvm::SDValue> const &' to prevent copying for (const auto [Reg, N] : RegsToPass) ^~~~~~~~~~~~~~~~~~~~~ & 2 errors generated.	2025-06-29 14:24:41 +08:00
Kazu Hirata	9cf251d9d8	[Target] Use range-based for loops (NFC) (#146253 )	2025-06-28 20:41:39 -07:00
Matt Arsenault	48155f93dd	CodeGen: Emit error if getRegisterByName fails (#145194 ) This avoids using report_fatal_error and standardizes the error message in a subset of the error conditions.	2025-06-23 16:33:35 +09:00
Iris Shi	24d730b380	Reland "[SelectionDAG] Make `(a & x) \| (~a & y) -> (a & (x ^ y)) ^ y` available for all targets" (#143651 )	2025-06-11 15:56:37 +08:00
Iris Shi	8c890eaa3f	Revert "[SelectionDAG] Make `(a & x) \| (~a & y) -> (a & (x ^ y)) ^ y` available for all targets" (#143648 )	2025-06-11 10:19:12 +08:00
Iris Shi	bfb48363b0	[SelectionDAG] Make `(a & x) \| (~a & y) -> (a & (x ^ y)) ^ y` available for all targets (#137641 )	2025-06-09 17:57:15 +08:00
Matt Arsenault	0a3e9aa336	SystemZ: Move runtime libcall setting out of TargetLowering (#142622 ) RuntimeLibcallInfo needs to be correct outside of codegen contexts.	2025-06-04 06:21:46 +09:00
Rahul Joshi	52c2e45c11	[NFC][CodeGen] Adopt MachineFunctionProperties convenience accessors (#141101 )	2025-05-23 08:30:29 -07:00
Craig Topper	dcd62f3674	[SelectionDAG] Rename MemSDNode::getOriginalAlign to getBaseAlign. NFC (#139930 ) This matches the underlying function in MachineMemOperand and how it is printed when BaseAlign differs from Align.	2025-05-16 09:37:02 -07:00
Jonas Paulsson	94a14f9f0d	[SystemZ] Add DAGCombine for FCOPYSIGN to remove rounding. (#136131 ) Add a DAGCombine for FCOPYSIGN that removes the rounding which is never needed as the sign bit is already in the correct place. This helps in particular the rounding to f16 case which needs a libcall. Also remove the roundings for other FP VTs and simplify the CPSDR patterns correspondingly. fp-copysign-03.ll test updated, now also covering the other FP VT combinations.	2025-04-24 11:05:51 +02:00
Jonas Paulsson	1ec22fae7e	[SystemZ] Handle f16 load positive/negative/complement without libcalls. (#136286 ) This can be done directly with the (64-bit) target instruction as only the sign bit is changed.	2025-04-24 10:49:40 +02:00
Craig Topper	f6178cdad0	[SelectionDAG] Pass LoadExtType when ATOMIC_LOAD is created. (#136653 ) Rename one signature of getAtomic to getAtomicLoad and pass LoadExtType. Previously we had to set the extension type after the node was created, but we don't usually modify SDNodes once they are created. It's possible the node already existed and has been CSEd. If that happens, modifying the node may affect the other users. It's therefore safer to add the extension type at creation so that it is part of the CSE information. I don't know of any failures related to the current implementation. I only noticed that it doesn't match how we usually do things.	2025-04-22 09:11:46 -07:00
Kazu Hirata	8a00efd26d	[SystemZ] Fix warnings This patch fixes: llvm/lib/Target/SystemZ/SystemZISelLowering.cpp:6916:7: error: unused variable 'RegVT' [-Werror,-Wunused-variable] llvm/lib/Target/SystemZ/SystemZInstrInfo.cpp:1265:30: error: unused variable 'RC' [-Werror,-Wunused-variable]	2025-04-16 11:25:55 -07:00
Jonas Paulsson	6d03f51f0c	[SystemZ] Add support for 16-bit floating point. (#109164 ) - _Float16 is now accepted by Clang. - The half IR type is fully handled by the backend. - These values are passed in FP registers and converted to/from float around each operation. - Compiler-rt conversion functions are now built for s390x including the missing extendhfdf2 which was added. Fixes #50374	2025-04-16 20:02:56 +02:00
Ulrich Weigand	80267f8148	Support z17 processor name and scheduler description (#135254 ) The recently announced IBM z17 processor implements the architecture already supported as "arch15" in LLVM. This patch adds support for "z17" as an alternate architecture name for arch15. This patch also add the scheduler description for the z17 processor, provided by Jonas Paulsson.	2025-04-11 00:20:58 +02:00
Jonas Paulsson	b13373db25	[SystemZ] Use hasAddressTaken() with verifyNarrowIntegerArgs (NFC). (#131039 ) Use hasAddressTaken() in SystemZ instead of doing this computation in isFullyInternal(), and make sure to only do this once per Function.	2025-03-21 19:07:46 +01:00
Ulrich Weigand	f4ea1055ad	[SystemZ] Implement i128 funnel shifts These can be handled via the VECTOR SHIFT LEFT/RIGHT DOUBLE family of instructions, depending on architecture level. Fixes: https://github.com/llvm/llvm-project/issues/129955	2025-03-15 18:28:44 +01:00
Ulrich Weigand	4155cc0fb3	[SystemZ] Recognize carry/borrow computation Generate code using the VECTOR ADD COMPUTE CARRY and VECTOR SUBTRACT COMPUTE BORROW INDICATION instructions to implement open-coded IR with those semantics. Handles integer vector types as well as i128. Fixes: https://github.com/llvm/llvm-project/issues/129608	2025-03-15 18:28:44 +01:00
Ulrich Weigand	4a4987be36	[SystemZ] Optimize vector zero/sign extensions Generate more efficient code for zero or sign extensions where the source is a subvector generated via SHUFFLE_VECTOR. Specifically, recognize patterns corresponding to (series of) VECTOR UNPACK instructions, or the VECTOR SIGN EXTEND TO DOUBLEWORD instruction. As a special case, also handle zero or sign extensions of a vector element to i128. Fixes: https://github.com/llvm/llvm-project/issues/129576 Fixes: https://github.com/llvm/llvm-project/issues/129899	2025-03-15 18:28:44 +01:00
Ulrich Weigand	cdc7864986	[SystemZ] Optimize widening and high-word vector multiplication Detect (non-intrinsic) IR patterns corresponding to the semantics of the various widening and high-word multiplication instructions. Specifically, this is done by: - Recognizing even/odd widening multiplication patterns in DAGCombine - Recognizing widening multiply-and-add on top during ISel - Implementing the standard MULHS/MUHLU IR opcodes - Detecting high-word multiply-and-add (which common code does not) Depending on architecture level, this can support all integer vector types as well as the scalar i128 type. Fixes: https://github.com/llvm/llvm-project/issues/129705	2025-03-15 18:28:44 +01:00
Ulrich Weigand	7af3d3929e	[SystemZ] Optimize vector comparison reductions Generate efficient code using the condition code set by the VECTOR (FP) COMPARE family of instructions to implement vector comparison reductions, e.g. as resulting from __builtin_reduce_and/or of some vector comparsion. Fixes: https://github.com/llvm/llvm-project/issues/129434	2025-03-15 18:28:44 +01:00
Jonas Paulsson	378739f182	[SystemZ] Move disabling of arg verification to before isFullyInternal(). (#130693 ) It has found to be quite a slowdown to traverse the users of a function from each call site when it is called many (~70k) times. This patch fixes this for now as long as this verification is disabled by default, but there is still a need to eventually cache the results to avoid recomputation. Fixes #130541	2025-03-12 18:33:12 +01:00
Ulrich Weigand	adacbf68eb	[SystemZ] Add codegen support for llvm.roundeven This is straightforward as we already had all the necessary instructions, they simply were not wired up. Also allows implementing the vec_round intrinsic via the standard llvm.roundeven IR instead of a platform intrinsic now.	2025-02-14 00:10:37 +01:00
Kazu Hirata	5a056f91be	[SystemZ] Avoid repeated hash lookups (NFC) (#126005 ) Co-authored-by: Nikita Popov <github@npopov.com>	2025-02-06 16:22:31 -08:00
Ulrich Weigand	6d5697f7cb	[SystemZ] Fix ICE with i128->i64 uaddo carry chain We can only optimize a uaddo_carry via specialized instruction if the carry was produced by another uaddo(_carry) instruction; there is already a check for that. However, i128 uaddo(_carry) use a completely different mechanism; they indicate carry in a vector register instead of the CC flag. Thus, we must also check that we don't mix those two - that check has been missing. Fixes: https://github.com/llvm/llvm-project/issues/124001	2025-01-23 19:15:11 +01:00
Ulrich Weigand	8424bf207e	[SystemZ] Add support for new cpu architecture - arch15 This patch adds support for the next-generation arch15 CPU architecture to the SystemZ backend. This includes: - Basic support for the new processor and its features. - Detection of arch15 as host processor. - Assembler/disassembler support for new instructions. - Exploitation of new instructions for code generation. - New vector (signed\|unsigned\|bool) __int128 data types. - New LLVM intrinsics for certain new instructions. - Support for low-level builtins mapped to new LLVM intrinsics. - New high-level intrinsics in vecintrin.h. - Indicate support by defining __VEC__ == 10305. Note: No currently available Z system supports the arch15 architecture. Once new systems become available, the official system name will be added as supported -march name.	2025-01-20 19:30:21 +01:00
yingopq	754ed95b66	[Mips] Fix compiler crash when returning fp128 after calling a functi… (#117525 ) …on returning { i8, i128 } Fixes https://github.com/llvm/llvm-project/issues/96432.	2025-01-20 16:47:40 +08:00
Craig Topper	e6b2495545	[SelectionDAG] Split SDNode::use_iterator into user_iterator and use_iterator. (#120531 ) SDNode::use_iterator now returns an SDUse& when dereferenced. SDNode::user_iterator returns SDNode*. SDNode::use_begin/use_end/uses work on use_iterator. SDNode::user_begin/user_end/users work on user_iterator. We can now write range based for loops using SDUse& and SDNode::uses(). I've converted many of these in this patch. I didn't update loops that have additional variables updated in their for statement. Some loops use SDNode::use_iterator::getOperandNo() which also prevents using range based for loops. I plan to move this into SDUse in a follow up patch.	2024-12-19 08:35:32 -08:00
Craig Topper	bd261ecc5a	[SelectionDAG] Add SDNode::user_begin() and use it in some places (#120509 ) Most of these are just places that want the first user and aren't iterating over the whole list. While there I changed some use_size() == 1 to hasOneUse() which is more efficient. This is part of an effort to rename use_iterator to user_iterator and provide a use_iterator that dereferences to SDUse&. This patch helps reduce the diff on later patches.	2024-12-18 22:13:04 -08:00
Craig Topper	104ad9258a	[SelectionDAG] Rename SDNode::uses() to users(). (#120499 ) This function is most often used in range based loops or algorithms where the iterator is implicitly dereferenced. The dereference returns an SDNode * of the user rather than SDUse * so users() is a better name. I've long beeen annoyed that we can't write a range based loop over SDUse when we need getOperandNo. I plan to rename use_iterator to user_iterator and add a use_iterator that returns SDUse& on dereference. This will make it more like IR.	2024-12-18 20:09:33 -08:00
anoopkg6	dc04d414df	SystemZ: Add support for __builtin_setjmp and __builtin_longjmp. (#119257 ) This pr includes fixes for original pr##116642. Implementation for __builtin_setjmp and __builtin_longjmp for SystemZ..	2024-12-10 19:50:51 +01:00
Ulrich Weigand	8787bc72a6	Revert "[SystemZ] Add support for __builtin_setjmp and __builtin_longjmp (#116642 )" This reverts commit 030bbc92a705758f1131fb29cab5be6d6a27dd1f.	2024-12-07 00:55:54 +01:00
Ulrich Weigand	9f430bd415	Revert "[SystemZ] Fix a warning" This reverts commit 3c47e63723b1aa9e76f30fc8d1acef9caf4ea783.	2024-12-07 00:55:41 +01:00
Kazu Hirata	3c47e63723	[SystemZ] Fix a warning This patch fixes: llvm/lib/Target/SystemZ/SystemZISelLowering.cpp:953:30: error: unused variable 'TRI' [-Werror,-Wunused-variable]	2024-12-06 14:52:22 -08:00
anoopkg6	030bbc92a7	[SystemZ] Add support for __builtin_setjmp and __builtin_longjmp (#116642 ) Implementation for __builtin_setjmp and __builtin_longjmp for SystemZ.	2024-12-06 23:33:33 +01:00
Craig Topper	b076fbb844	[TargetLowering] Use Type* instead of EVT in shouldSignExtendTypeInLibCall. (#118587 ) I want to use this function for GISel too so Type * is a better common interface. All of the callers already convert EVT to Type * as needed by calling lowering anyway.	2024-12-03 22:06:55 -08:00

1 2 3 4 5 ...

633 Commits