llvm-project

Author	SHA1	Message	Date
Yusra Syeda	0768253c20	[SystemZ][z/OS] Add exception handling for XPLINK (#74638 ) Adds emitting the exception table and the EH registers for XPLINK. --------- Co-authored-by: Yusra Syeda <yusra.syeda@ibm.com>	2023-12-19 13:58:33 -05:00
Ulrich Weigand	59f7f35a90	[SystemZ] ABI support for single-element vector types Support passing and returning values of single-element vector types (i.e. <1 x i128> and <1 x fp128>). Now that i128 is a legal type, supporting these types can be done simply by providing a getRegisterTypeForCallingConv implementation that handles them. Fixes https://github.com/llvm/llvm-project/issues/61291	2023-12-15 19:31:00 +01:00
Ulrich Weigand	a65ccc1b9f	[SystemZ] Support i128 as legal type in VRs (#74625 ) On processors supporting vector registers and SIMD instructions, enable i128 as legal type in VRs. This allows many operations to be implemented via native instructions directly in VRs (including add, subtract, logical operations and shifts). For a few other operations (e.g. multiply and divide, as well as atomic operations), we need to move the i128 value back to a GPR pair to use the corresponding instruction there. Overall, this is still beneficial. The patch includes the following LLVM changes: - Enable i128 as legal type - Set up legal operations (in SystemZInstrVector.td) - Custom expansion for i128 add/subtract with carry - Custom expansion for i128 comparisons and selects - Support for moving i128 to/from GPR pairs when required - Handle 128-bit integer constant values everywhere - Use i128 as intrinsic operand type where appropriate - Updated and new test cases In addition, clang builtins are updated to reflect the intrinsic operand type changes (which also improves compatibility with GCC).	2023-12-15 12:55:15 +01:00
Jonas Paulsson	435ba72afd	[SystemZ] Simplify handling of AtomicRMW instructions. (#74789 ) Let the AtomicExpand pass do more of the job of expanding AtomicRMWInst:s in order to simplify the handling in the backend. The only cases that the backend needs to handle itself are those of subword size (8/16 bits) and those directly corresponding to a target instruction.	2023-12-08 17:19:17 +01:00
Jonas Paulsson	c568927f3e	[SystemZ] Properly support 16 byte atomic int/fp types and ops. (#73134 ) - Clang FE now has MaxAtomicPromoteWidth / MaxAtomicInlineWidth set to 128, and now produces IR instead of calls to __atomic instrinsics for 16 bytes as well. - Atomic __int128 (and long double) variables are now aligned to 16 bytes by default (like gcc 14). - AtomicExpand pass now expands 16 byte operations as well. - tests for __atomic builtins for all integer widths, and __atomic_is_lock_free with friends. - TODO: AtomicExpand pass handles with this patch expansion of i128 atomicrmw:s. As a next step smaller integer types should also be possible to handle this way instead of by the backend.	2023-12-05 17:17:21 +01:00
Ulrich Weigand	c61eb44005	[SystemZ] Implement vector rotate in terms of funnel shift Clang currently implements a set of vector rotate builtins (__builtin_s390_verll) in terms of platform-specific LLVM intrinsics. To simplify the IR (and allow for common code optimizations if applicable), this patch removes those LLVM intrinsics and implements the builtins in terms of the platform-independent funnel shift intrinsics instead. Also, fix the prototype of the __builtin_s390_verll builtins for full compatibility with GCC.	2023-12-04 16:52:00 +01:00
Ilya Leoshkevich	d79fff0abb	[SystemZ] Add backchain target-feature (#71668 ) GCC supports building individual functions with backchain using the __attribute__((target("backchain"))) syntax, and Clang should too. Clang translates this into the "target-features"="+backchain" attribute, and the -mbackchain command-line option into the "backchain" attribute. The backend currently checks only the latter; furthermore, the backchain target feature is not defined. Handle backchain like soft-float. Define a target feature, convert function attribute into it in getSubtargetImpl(), and check for target feature instead of function attribute everywhere. Add an end-to-end test to the Clang testsuite.	2023-11-08 15:05:47 +01:00
Ilya Leoshkevich	03934e70ef	[SystemZ] Enable AtomicExpand pass (#70398 ) The upcoming OpenMP support for SystemZ requires handling of IR insns like `atomicrmw fadd`. Normally atomic float operations are expanded by Clang and such insns do not occur, but OpenMP generates them directly. Other architectures handle this using the AtomicExpand pass, which SystemZ did not need so far. Enable it. Currently AtomicExpand treats atomic load and stores of floats pessimistically: it casts them to integers, which SystemZ does not need, since the floating point load and store instructions are already atomic. However, the way Clang currently expands them is pessimistic as well, so this change does not make things worse. Optimizing operations on atomic floats can be a separate change in the future. This change does not create any differences the Linux kernel build.	2023-10-31 09:51:06 +01:00
Ilya Leoshkevich	8e810dc7d9	[SystemZ] Support builtin_{frame,return}_address() with non-zero argument (#69405 ) When the code is built with -mbackchain, it is possible to retrieve the caller's frame and return addresses. GCC already can do this, add this support to Clang as well. Use RISCVTargetLowering and GCC's s390_return_addr_rtx() as inspiration. Add tests based on what GCC is emitting.	2023-10-18 19:05:31 +02:00
Yusra Syeda	6cf41ada44	[SystemZ][z/OS] Add vararg support to z/OS (#68834 ) This PR adds vararg support to z/OS and updates the call-zos-vararg.ll lit test. Co-authored-by: Yusra Syeda <yusra.syeda@ibm.com>	2023-10-12 12:42:55 +02:00
Yusra Syeda	5c4d35d8cf	[SystemZ][z/OS] Update lowerCall (#68259 ) This PR moves some calculation out of `LowerCall` and into `SystemZXPLINKFrameLowering::processFunctionBeforeFrameFinalized`. We need to make this change because LowerCall isn't invoked for functions that don't have function calls, and it is required for some tooling to work correctly. A function that does not make any calls is required to allocate 32 bytes for the parameter area required by the ABI. However, we allocate 64 bytes because this additional space is utilized by certain tools, like the debugger. Co-authored-by: Yusra Syeda <yusra.syeda@ibm.com>	2023-10-05 10:32:57 +03:00
Kai Nacke	42de2b7e99	[SystemZ/z/OS] Add library names for intrinsics (#68114 ) On z/OS, many library functions have a non-standard name. This change initializes the table of runtime function which results from lowering intrinsics to library calls.	2023-10-03 18:53:52 +03:00
Nick Desaulniers	330fa7d2a4	[TargetLowering] Deduplicate choosing InlineAsm constraint between ISels (#67057 ) Given a list of constraints for InlineAsm (ex. "imr") I'm looking to modify the order in which they are chosen. Before doing so, I noticed a fair amount of logic is duplicated between SelectionDAGISel and GlobalISel for this. That is because SelectionDAGISel is also trying to lower immediates during selection. If we detangle these concerns into: 1. choose the preferred constraint 2. attempt to lower that constraint Then we can slide down the list of constraints until we find one that can be lowered. That allows the implementation to be shared between instruction selection frameworks. This makes it so that later I might only need to adjust the priority of constraints in one place, and have both selectors behave the same.	2023-09-25 08:53:03 -07:00
Matt Arsenault	ad9d13d535	SelectionDAG: Swap operands of atomic_store Irritatingly, atomic_store had operands in the opposite order from regular store. This made it difficult to share patterns between regular and atomic stores. There was a previous incomplete attempt to move atomic_store into the regular StoreSDNode which would be better. I think it was a mistake for all atomicrmw to swap the operand order, so maybe it's better to take this one step further. https://reviews.llvm.org/D123143	2023-08-31 17:30:10 -04:00
Yusra Syeda	163aad6bcb	[SystemZ][z/OS] z/OS ADA codegen and emission This patch adds support for the ADA (associated data area), doing the following: -Creates the ADA table to handle displacements -Emits the ADA section in the SystemZAsmPrinter -Lowers the ADA_ENTRY node into the appropriate load instruction Differential Revision: https://reviews.llvm.org/D153788	2023-07-05 13:21:52 -04:00
Yusra Syeda	1bfdc534aa	Revert "[SystemZ][z/OS] This patch adds support for the ADA (associated data area), doing the following:" This reverts commit 9df0f66af5462e23216eae31aedbd4d2f459cc3d.	2023-06-28 11:18:12 -04:00
Yusra Syeda	9df0f66af5	[SystemZ][z/OS] This patch adds support for the ADA (associated data area), doing the following: - Creates the ADA table to handle displacements - Emits the ADA section in the SystemZAsmPrinter - Lowers the ADA_ENTRY node into the appropriate load instruction Differential Revision: https://reviews.llvm.org/D153788	2023-06-28 10:13:10 -04:00
Neumann Hon	8a7a2da18f	[SystemZ][z/OS] Correct value of length/4 of params field in PPA1. The Length/4 of Params field in the PPA1 ought to be the length of the parameters for the current function. Currently we are storing the length of the parameter area in the current function's stack frame, which represents the length of the params of the longest callee in the current function. Differential Revision: https://reviews.llvm.org/D152920 Reviewed By: uweigand	2023-06-14 13:37:46 -04:00
Neumann Hon	049324ac5e	Revert "[SystemZ][z/OS] Correct value of length/4 of params field in PPA1." This reverts commit e0f7b0e0f704dc3759925602e474b9e669270fcb.	2023-06-14 13:34:16 -04:00
Neumann Hon	e0f7b0e0f7	[SystemZ][z/OS] Correct value of length/4 of params field in PPA1. The Length/4 of Params field in the PPA1 ought to be the length of the parameters for the current function. Currently we are storing the length of the parameter area in the current function's stack frame, which represents the length of the params of the longest callee in the current function. Differential revision: https://reviews.llvm.org/D119049 Reviewed By: uweigand	2023-06-14 13:20:45 -04:00
Sergei Barannikov	01a7967447	[CodeGen] Replace CCState's getNextStackOffset with getStackSize (NFC) The term "next stack offset" is misleading because the next argument is not necessarily allocated at this offset due to alignment constrains. It also does not make much sense when allocating arguments at negative offsets (introduced in a follow-up patch), because the returned offset would be past the end of the next argument. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D149566	2023-05-17 21:51:45 +03:00
Jay Foad	d8229e2f14	[KnownBits] Define and use intersectWith and unionWith Define intersectWith and unionWith as two complementary ways of combining KnownBits. The names are chosen for consistency with ConstantRange. Deprecate commonBits as a synonym for intersectWith. Differential Revision: https://reviews.llvm.org/D150443	2023-05-16 09:23:51 +01:00
Craig Topper	9ad9380fbc	[LegalizeVectorOps][AArch64][RISCV][X86] Use OpVT for ISD::SETCC in LegalizeVectorOps. Previously, LegalizeVectorOps used the result VT while LegalizeDAG used the operand VT. This patch makes them both use the operand VT. This also makes it consistent with how the default cost model works. I've hacked the AArch64 cost model to maintain old behavior for some f16 vectors. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D149572	2023-05-13 23:33:00 -07:00
Zequan Wu	3977b77a6b	[CodeGen] Fix nomerge attribute not working in tail calls. In D79537, `nomerge` was made to only apply to non-tail calls. This fixes it by also applying it to tail calls. For ARM, I only made the new MI to inherit the flag under `TCRETURNdi` and `TCRETURNri`, because that's the place tail calls got replaced. Not sure if there's any other place needed. Fixes #61545. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D146749	2023-05-10 14:25:11 -04:00
Jonas Paulsson	655f0fc4b9	Reapply "[SystemZ] Bugfix in expansion of memmem operations." The new test case showed that the NoPHIs flag needs to be cleared. Original commit message: [SystemZ] Bugfix in expansion of memmem operations. Since NC, OC, and XC clobber CC, the EXRL_Pseudo targeting these must also be marked to do so. Original patch by uweigand. Reviewed by: uweigand Differential Revision: https://reviews.llvm.org/D150251 Fixes: https://github.com/llvm/llvm-project/issues/62572	2023-05-10 12:40:57 +02:00
Jonas Paulsson	dfa42a69b8	Revert "[SystemZ] Bugfix in expansion of memmem operations." Sorry - mir test fails with expensive checks on build bot. Seems to relate to the fact that there are no PHIs in the .mir input, but after they are created the verifyer reports "Found PHI instruction with NoPHIs property set". This reverts commit 00454a17f361d677d5423905c888daca1a80661a.	2023-05-10 11:34:55 +02:00
Jonas Paulsson	00454a17f3	[SystemZ] Bugfix in expansion of memmem operations. Since NC, OC, and XC clobber CC, the EXRL_Pseudo targeting these must also be marked to do so. Original patch by uweigand. Reviewed by: uweigand Differential Revision: https://reviews.llvm.org/D150251 Fixes: https://github.com/llvm/llvm-project/issues/62572	2023-05-10 11:05:13 +02:00
Sergei Barannikov	e744e51b12	[SelectionDAG] Rename ADDCARRY/SUBCARRY to UADDO_CARRY/USUBO_CARRY (NFC) This will make them consistent with other overflow-aware nodes. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D148196	2023-04-29 21:59:58 +03:00
Craig Topper	219ff07f72	[Targets] Rename Flag->Glue. NFC Long long ago Glue was called Flag, and it was never completely renamed.	2023-04-02 19:28:51 -07:00
Simon Pilgrim	8153b92d9b	[DAG] Add SelectionDAG::SplitScalar helper Similar to the existing SelectionDAG::SplitVector helper, this helper creates the EXTRACT_ELEMENT nodes for the LO/HI halves of the scalar source. Differential Revision: https://reviews.llvm.org/D147264	2023-03-31 18:35:40 +01:00
Jonas Paulsson	b4b4950f7f	[SystemZ] Allow fp/int casting with inline assembly operands. Support bitcasting between int/fp/vector values and 'r'/'f'/'v' inline assembly operands. This is intended to match GCCs beahvior. Reviewed By: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D146059	2023-03-24 19:57:25 +01:00
Kazu Hirata	7ada7bbee1	[Target] Use *{Set,Map}::contains (NFC)	2023-03-14 18:06:55 -07:00
Jonas Paulsson	f8803919ad	[SystemZ] Clear NW flags on an ISD::SUB when reused as comparison. The SystemZ backend will try to reuse an existing subtraction of two values whenever they are to be compared for equality. This depends on the SystemZ subtraction instruction setting the condition code, which can also signal overflow. A later pass will remove the compare and reuse the CC from the subtraction directly. However, if that subtraction has the NSW flag set it will not include the overflow bit in the updated CC user. That was a bug which can lead to wrong results, as shown by a csmith program. Fixes: https://github.com/llvm/llvm-project/issues/61268 Reviewed By: nikic, uweigand Differential Revision: https://reviews.llvm.org/D145811	2023-03-14 19:46:41 +01:00
Jonas Paulsson	a5b4291911	[SystemZ] NFC minor cleanup Don't create local pointers to Subtarget in places in SystemZTargetLowering as the class already has a Subtarget member. Review: Ulrich Weigand	2023-03-13 18:55:24 +01:00
Jonas Paulsson	0ece2050da	[SystemZ] Implement isGuaranteedNotToBeUndefOrPoisonForTargetNode(). Returning true from this method for PCREL_WRAPPER and PCREL_OFFSET avoids problems when a PCREL_OFFSET node ends up with a freeze operand, which is not handled or expected by the backend. Fixes #60107 Reviewed By: uweigand, RKSimon Differential Revision: https://reviews.llvm.org/D142971	2023-02-01 13:28:18 +01:00
Kazu Hirata	54048ce863	[SystemZ] Use llvm::bit_floor (NFC)	2023-01-28 15:04:16 -08:00
Kazu Hirata	e078201835	[Target] Use llvm::count{l,r}_{zero,one} (NFC)	2023-01-28 09:23:07 -08:00
Kazu Hirata	7510b7ec4a	[SystemZ] Use llvm::countl_zero and llvm::countr_zero (NFC) isVectorConstantLegal calls findFirstSet and findLastSet, but we don't rely on their ability to return std::numeric_limits<T>::max() on input 0. This patch replaces those calls with calls to llvm::countl_zero and llvm::countr_zero. Due to an off-by-one error in the original code, the value of Upper could change at bit N, where N is the index of the highest set bit in SplatBitsZ, but the difference doesn't matter at the end. Without this patch, Upper could have bit N set. With this patch, Upper never has bit N set. Either way, both calls to tryValue have this bit set because the argument is ORed with SplatBitsZ.	2023-01-25 22:05:06 -08:00
Kazu Hirata	e599094038	[llvm] Use llvm::bit_ceil (NFC) In both of these cases, the arguments to Log2_32_Ceil are known to be nonzero.	2023-01-22 14:05:14 -08:00
Tulio Magno Quites Machado Filho	1136cf1721	[SystemZ] Implement lowering of GET_ROUNDING Add support for _FLT_ROUNDS_ in SystemZ. Patch by Tulio Magno Quites Machado Filho. Reviewed By: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D140988	2023-01-18 14:41:19 -06:00
Guillaume Chatelet	8fd5558b29	[NFC] Use TypeSize::geFixedValue() instead of TypeSize::getFixedSize() This change is one of a series to implement the discussion from https://reviews.llvm.org/D141134.	2023-01-11 16:49:38 +00:00
Philip Reames	037636e695	[SDAG] Introduce a common MEMBARRIER node [nfc] We have multiple targets which have defined custom instructions and sdag nodes to represent a compiler memory barrier. This patch consolidates the sdag node definition into common code. This is a companion to D92842, but a bit different in focus. This change consolidates the existing sdag node definitions; that patch skipped defining a sdag node by instead going straight to a target node. That patch is also not NFC - as being so is quite hard for commoning up the instruction definitions. I started with two backends to ensure the new common code was reusable while not having a massive diff. Once this lands, I'll submit a series of NFCs for backends where the changes are obvious, or reviews if more discussion is needed. Differential Revision: https://reviews.llvm.org/D141317	2023-01-09 15:20:08 -08:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Matt Arsenault	c16a58b36c	Attributes: Add function getter to parse integer string attributes The most common case for string attributes parses them as integers. We don't have a convenient way to do this, and as a result we have inconsistent missing attribute and invalid attribute handling scattered around. We also have inconsistent radix usage to getAsInteger; some places use the default 0 and others use base 10. Update a few of the uses, but there are quite a lot of these.	2022-12-14 13:12:35 -05:00
Krzysztof Parzyszek	864aaa21b4	TargetLowering: convert Optional to std::optional	2022-12-01 16:19:10 -08:00
Jonas Paulsson	ca51529487	[SystemZ] Extend combineGET_CCMASK() to handle a truncated SELECT_CCMASK. In cases where the SELECT_CCMASK has an additional user of the carry, a truncated SELECT_CCMASK may result as the input to the GET_CCMASK, which need to be recognized. Fixes https://github.com/llvm/llvm-project/issues/59054 Reviewed By: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D138324	2022-11-23 09:53:07 -05:00
Stanislav Mekhanoshin	bcaf31ec3f	[AMDGPU] Allow finer grain control of an unaligned access speed A target can return if a misaligned access is 'fast' as defined by the target or not. In reality there can be different levels of 'fast' and 'slow'. This patch changes the boolean 'Fast' argument of the allowsMisalignedMemoryAccesses family of functions to an unsigned representing its speed. A target can still define it as it wants and the direct translation of the current code uses 0 and 1 for current false and true. This makes the change an NFC. Subsequent patch will start using an actual value of speed in the load/store vectorizer to compare if a vectorized access going to be not just fast, but not slower than before. Differential Revision: https://reviews.llvm.org/D124217	2022-11-17 09:23:53 -08:00
Josh Stone	4dcfb09e40	[NFC][CodeGen] Use const MF in TargetLowering stack probe functions This makes them callable from places like canUseAsPrologue. Differential Revision: https://reviews.llvm.org/D134492	2022-09-23 09:30:32 -07:00
Sergei Barannikov	c6acb4eb0f	[SDAG] Add `getCALLSEQ_END` overload taking `uint64_t`s All in-tree targets pass pointer-sized ConstantSDNodes to the method. This overload reduced amount of boilerplate code a bit. This also makes getCALLSEQ_END consistent with getCALLSEQ_START, which already takes uint64_ts.	2022-09-15 14:02:12 -04:00
Kazu Hirata	8feb60756c	[llvm] Use range-based for loops (NFC)	2022-08-28 23:28:58 -07:00

1 2 3 4 5 ...

537 Commits