llvm-project

Author	SHA1	Message	Date
Alan Zhao	f4999d3535	Revert "[CodeGen][ShrinkWrap] Split restore point" This reverts commit 1ddfd1c8186735c62b642df05c505dc4907ffac4. The original commit causes a Chrome build assertion failure with ThinLTO: https://crbug.com/1443635	2023-05-08 16:27:59 -07:00
Stefan Pintilie	be95b4dec2	[PowerPC] Look through OR, AND, XOR instructions when checking a clear. This patch adds the additional step of looking through AND, OR, XOR instructions when we check the number of leading zeros. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D149223	2023-05-08 14:25:20 -04:00
sgokhale	1ddfd1c818	[CodeGen][ShrinkWrap] Split restore point Try to reland D42600 Differential Revision: https://reviews.llvm.org/D42600	2023-05-08 13:21:07 +05:30
sgokhale	7cba800104	[CodeGen] Autogen tests as prerequisite for D42600 Autogenerating tests as suggested in D42600	2023-05-08 12:25:51 +05:30
Florian Hahn	4e2b4f97a0	[ShrinkWrap] Use underlying object to rule out stack access. Allow shrink-wrapping past memory accesses that only access globals or function arguments. This patch uses getUnderlyingObject to try to identify the accessed object by a given memory operand. If it is a global or an argument, it does not access the stack of the current function and should not block shrink wrapping. Note that the caller's stack may get accessed when passing an argument via the stack, but not the stack of the current function. This addresses part of the TODO from D63152. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D149668	2023-05-03 09:28:07 +01:00
Nikita Popov	0659000ff7	[LICM] Don't duplicate instructions just because they're free D37076 makes LICM duplicate instructions into exit blocks if the instruction is free. For GEPs, the motivation appears to be that this allows the GEP to be folded into addressing modes, while non-foldable users outside the loop might prevent this. TBH I don't think LICM is the place to do this (why doesn't CGP apply this heuristic itself?) but at least I understand the motivation. However, the transform is also applied to all other "free" instructions, which are just that (removed during lowering and not "folded" in some way). For such instructions, this transform seems somewhere between useless, counter-productive (undoing CSE/GVN) and actively incorrect. For example, this transform can duplicate freeze instructions, which is illegal. This patch limits the transform to just foldable GEPs, though we might want to drop it from LICM entirely as a followup. This is a small compile-time improvement, because querying TTI cost model for every single instruction is expensive. Differential Revision: https://reviews.llvm.org/D149136	2023-04-28 14:31:23 +02:00
ManuelJBrito	8b56da5e9f	[IR] Change shufflevector undef mask to poison With this patch an undefined mask in a shufflevector will be printed as poison. This change is done to support the new shufflevector semantics for undefined mask elements. Differential Revision: https://reviews.llvm.org/D149210	2023-04-27 14:41:10 +01:00
Fangrui Song	398d68f624	[PPCMIPeephole] Fix incorrect compare elimination D38236 moves a redundant compare instruction from the loop body to the preheader. It has a bug: when `MBB1 == &MBB2`, there may be only one compare instruction in the loop. The code will lift the compare instruction to the preheader, failing to account for the change of the compare result in a tail call, leading to a miscompile. Suppress the compare elimination to fix https://github.com/llvm/llvm-project/issues/62294 Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D149030	2023-04-24 10:02:06 -07:00
Maryam Moghadas	e5de760c31	[PowerPC] Add a new test for vperm with a swapped vector operand and a constant pool This patch adds a new test that includes a vperm instruction with xxswapd as its vector operand on little-endian Power8. The test demonstrates the constant pool for the mask operand, which is intended to indicate the optimization of vperm and the modification of the constant pool in subsequent patches. Reviewed By: amyk Differential Revision: https://reviews.llvm.org/D148942	2023-04-24 15:41:23 +00:00
Stefan Pintilie	1162a38685	[NFC][PowerPC] Added a test case to show extra clear instructions. Added a number of functions that have a clear instruction that is not actually required. This test is added first and then a patch will be added later in order to remove the unnecessary instructions.	2023-04-24 09:47:44 -04:00
David Tenty	8d2e9fc855	[PowerPC] Add function pointer alignment to DataLayout The alignment of function pointers was added to the Datalayout by D57335 but currently is unset for the Power target. This will cause us to compute a conservative minimum alignment of one if places like Value::getPointerAlignment. This patch implements the function pointer alignment in the Datalayout for the Power backend and Power targets in clang, so we can query the value for a particular Power target. We come up with the correct value one of two ways: - If the target uses function descriptor objects (i.e. ELFv1 & AIX ABIs), then a function pointer points to the descriptor, so use the alignment we would emit the descriptor with. - If the target doesn't use function descriptor objects (i.e. ELFv2), a function pointer points to the global entry point, so use the minimum alignment for code on Power (i.e. 4-bytes). Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D147016	2023-04-18 13:00:27 -04:00
Kai Luo	eee024bf1b	[PowerPC] Update `incr` after resetting the register in MI After performing signed extension, we update the register in MI. We should also update `incr` register which is tracking the register in `MI`. Fixes https://github.com/llvm/llvm-project/issues/61882. Reviewed By: #powerpc, shchenz Differential Revision: https://reviews.llvm.org/D147594	2023-04-14 17:36:30 +08:00
sgokhale	bb5befefc6	Revert "[CodeGen][ShrinkWrap] Split restore point" This reverts commit 5f0bccc3d1a74111458c71f009817c9995f4bf83. An issue has been reported here: https://github.com/ClangBuiltLinux/linux/issues/1833	2023-04-13 10:52:28 +05:30
Nikita Popov	b8917ac62a	[LICM] Reassociate GEPs to allow hoisting Reassociate gep (gep ptr, idx1), idx2 to gep (gep ptr, idx2), idx1 if this would make the inner GEP loop invariant and thus hoistable. This is intended to replace an InstCombine fold that does this (in `04f61fb73d/llvm/lib/Transforms/InstCombine/InstructionCombining.cpp (L2006)`). The problem with the InstCombine fold is that LoopInfo is an optional dependency, so it is not performed reliably. Differential Revision: https://reviews.llvm.org/D146813	2023-04-11 10:34:04 +02:00
sgokhale	5f0bccc3d1	[CodeGen][ShrinkWrap] Split restore point This patch splits a restore point to allow it to only post-dominate blocks reachable by use or def of CSRs(Callee Saved Registers)/FI(Frame Index). Benchmarking this on SPEC2017, this gives around 4% improvement on povray and no significant change for others. Co-authored-by: junbuml Differential Revision: https://reviews.llvm.org/D42600	2023-04-11 11:58:50 +05:30
Maryam Moghadas	6dbb2a717a	[PowerPC] Update pr61315.ll to address D146632 failure This patch is to update pr61315.ll what was needed as part of D146632 and caused build failures. Reviewed By: stefanp Differential Revision: https://reviews.llvm.org/D147675	2023-04-05 21:24:59 -05:00
Maryam Moghadas	cf0395f816	[PowerPC] Fix the xxperm swap requirements This patch is to fix the xxperm vector operand swap condition so that the single-use operand is in V2 to prevent copying, it also fixes the subtarget condition to exploit the xpperm. Reviewed By: stefanp Differential Revision: https://reviews.llvm.org/D146632	2023-04-05 20:13:40 -05:00
Kai Luo	4639653492	[PowerPC] Precommit test case for issue 61882. NFC.	2023-04-05 16:25:12 +08:00
Nikita Popov	ab232c9ddf	[PowerPC] Convert tests to opaque pointers (NFC)	2023-04-04 12:11:26 +02:00
Nikita Popov	39eb7ae9c9	[PowerPC] Name instructions in tests (NFC)	2023-04-04 12:08:03 +02:00
Zequan Wu	321d02cc6b	[NFC] Update CodeGen/*/nomerge.ll tests with utils/update_llc_test_checks.py. Precommit this patch for better diff view on D146749. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D147454	2023-04-03 19:52:39 -04:00
Stefan Pintilie	be15db8cf2	[PowerPC][NFC] Forgot to add requires asserts to ppc-TOC-stats.ll When I sumbitted the original patch I forgot to add that: GodeGen/PowerPC/ppc-TOC-stats.ll requires asserts. Added that now.	2023-04-03 19:45:27 -04:00
Stefan Pintilie	398effac36	[PowerPC] Add statistics to show the number of entries in the TOC. On Power PC some data is stored in the TOC. This pass adds statistics to show how many entries are emitted to the TOC and what types of entries those are. Reviewed By: amyk Differential Revision: https://reviews.llvm.org/D146325	2023-04-03 14:20:51 -04:00
Qiu Chaofan	5b8ea2d0e1	[PowerPC] Lower IS_FPCLASS by test data class instruction Power ISA 3.0 introduced new 'test data class' instructions, which accept flags for: NaN/Infinity/Zero/Denormal. This instruction can be used to implement custom lowering for llvm.is.fpclass, but some extra bits provided by the intrinsic are missing (normal and QNaN/SNaN). For those categories not natively supported, this patch uses a two-way or three-way combination to implement correct behavior. Reviewed By: sepavloff, shchenz Differential Revision: https://reviews.llvm.org/D140381	2023-04-03 11:37:17 +08:00
Qiongsi Wu	f624372ccb	[AIX][CodeGen] Renaming mroptr to xcoff-mroptr This patch renames the `mroptr` option to `mxcoff-roptr` to indicate in the option itself that it is xcoff specific. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D147161	2023-03-31 10:09:48 -04:00
Amy Kwan	3508f12335	[PowerPC][GISel] Add initial GlobalISel support for vector functions. This patch adds the initial support for vector functions and register banks within GlobalISel. With this patch, we are able to support simple functions that return vectors, and also functions that perform simple operations. This patch also: - Legalizes vector types for G_AND, G_OR, G_XOR, G_ADD, G_SUB, G_BITCAST, G_FADD, G_FSUB - Introduce initial support for bitcasting (that will need to be extended upon) - Add various different test cases to for test vector support within GlobalISel Differential Revision: https://reviews.llvm.org/D137785	2023-03-27 08:23:05 -05:00
Amy Kwan	6126356d82	[PowerPC] Implement 64-bit ELFv2 Calling Convention in TableGen (for integers/floats/vectors in registers) This patch partially implements the parameter passing rules outlined in the ELFv2 ABI within TableGen. Specifically, it implements the parameter assignment of integers, floats, and vectors within registers - where the GPR numbering will be "skipped" depending on the ordering of floats and vectors that appear within a parameter list. As we begin to adopt GlobalISel to the PowerPC backend, there is a need for a TableGen definition that encapsulates the ELFv2 parameter passing rules. Thus, this patch also changes the default calling convention that is returned within the ccAssignFnForCall() function used in our GlobalISel implementation, and also adds some additional testing of the calling convention that is implemented. Future patches that build on top of this initial TableGen definition will aim to add more of the ABI complexities, including support for additional types and also in-memory arguments. Differential Revision: https://reviews.llvm.org/D137504	2023-03-27 08:23:04 -05:00
Nemanja Ivanovic	e7c35d7100	[SelectionDAG] Correctly reduce BV to shuffle with zero on big endian This DAG combine is correct on little endian targets but is incorrect on big endian targets. Add big endian code to correct it. Differential revision: https://reviews.llvm.org/D146460	2023-03-24 10:57:17 -04:00
Qiongsi Wu	4f9929add5	[AIX][CodeGen] Storage Locations for Constant Pointers This patch adds an `llc` option `-mroptr` to specify storage locations for constant pointers on AIX. When the `-mroptr` option is specified, constant pointers, virtual function tables, and virtual type tables are placed in read-only storage. Otherwise, by default, pointers, virtual function tables, and virtual type tables are placed are placed in read/write storage. https://reviews.llvm.org/D144190 enables the `-mroptr` option for `clang`. Reviewed By: hubert.reinterpretcast, stephenpeckham, myhsu, MaskRay, serge-sans-paille Differential Revision: https://reviews.llvm.org/D144189	2023-03-23 09:44:47 -04:00
esmeyi	49dcd08c3d	[XCOFF] support the ref directive for object generation. Summary: A R_REF relocation as a non-relocating reference is required to prevent garbage collection (by the binder) of the ref symbol in object generation. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D144356	2023-03-23 05:09:47 -04:00
Ting Wang	f64dc9bc6e	[PowerPC][NFC] add const-nonsplat-array-init.ll When doing store constant vector/scalar, some duplicated values can be reused. Add test case and will show combiner can improve these. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D146500	2023-03-22 00:32:18 -04:00
Nemanja Ivanovic	6ee4ea8e2f	[PowerPC][NFC] Test needs to include constant pool values	2023-03-20 16:43:59 -05:00
Nemanja Ivanovic	da40f7e8b1	[PowerPC][NFC] Pre-commit a test case for upcoming patch	2023-03-20 15:42:07 -05:00
Nikita Popov	687b5b9a0c	[SCEVExpander] Always use scevgep as name With opaque pointers the scevgep / uglygep distinction no longer makes sense -- GEPs are always emitted in offset-based representation.	2023-03-17 14:27:03 +01:00
zhijian	49bc3077cb	[AIX] unset bit "IsBackChainStored" of traceback table for leaf functions with no stack frame Summary: In function PPCAIXAsmPrinter::emitTracebackTable() ,the bit "IsBackChainStored" of traceback table always set true, it will cause aix debug tools "dbx" emit an error info "libdebug assertion "(framep->getGpr(STKP, &addr) == DB_SUCCESS && *nextStkpp == addr)" when debug a leaf functions with no stack frame. If a a leaf functions with no stack frame , the bit IsBackChainStored should be unset. Reviewers: ChenZheng Differential Revision: https://reviews.llvm.org/D146071	2023-03-16 15:26:12 -04:00
Nikita Popov	bbfb13a5ff	[ConstExpr] Remove select constant expression This removes the select constant expression, as part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179. Uses of this expressions have already been removed in advance, so this just removes related infrastructure and updates tests. Differential Revision: https://reviews.llvm.org/D145382	2023-03-16 10:32:08 +01:00
Simon Pilgrim	da570ef1b4	[DAG] Match select(icmp(x,y),sub(x,y),sub(y,x)) -> abd(x,y) patterns Pulled out of PowerPC, and added ABDS support as well (hence the additional v4i32 PPC matches) Differential Revision: https://reviews.llvm.org/D144789	2023-03-14 15:10:30 +00:00
Chen Zheng	a3b57bca97	[PowerPC] remove side effect for some cases for saturate instructions Fixes #60684 Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D145353	2023-03-13 21:37:56 -04:00
Chen Zheng	a81ba80eb0	add testcases for D145353; NFC	2023-03-13 21:37:49 -04:00
Chen Zheng	4f0ed16a46	Reland rGf35a09daebd0a90daa536432e62a2476f708150d and rG63854f91d3ee1056796a5ef27753648396cac6ec [DAGCombiner] handle more store value forwarding When lowering calls on target like PPC, some stack loads will be generated for by value parameters. Node CALLSEQ_START prevents such loads from being combined. Suggested by @RolandF, this patch removes the unnecessary loads for the byval parameter by extending ForwardStoreValueToDirectLoad Reviewed By: nemanjai, RolandF Differential Revision: https://reviews.llvm.org/D138899	2023-03-12 21:59:18 -04:00
Max Kazantsev	6b03ce374e	[LICM] Simplify (X < A && X < B) into (X < MIN(A, B)) if MIN(A, B) is loop-invariant We don't do this transform in InstCombine in general case for arbitrary values, because cost of AND and 2 ICMP's isn't higher than of MIN and ICMP. However, LICM also has a notion about the loop structure. This transform becomes profitable if `A` and `B` are loop-invariant and `X` is not: by doing this, we can compute min outside the loop. Differential Revision: https://reviews.llvm.org/D143726 Reviewed By: nikic	2023-03-10 17:36:52 +07:00
esmeyi	5541f47326	[PowerPC] Check if the latch block is in the value list for the PHI before get the incoming value. Summary: Fixes #60990. There is a crash reported during Running pass 'Prepare loop for ppc preferred instruction forms'. The crash occurs in 32bit PowerPC. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D145350	2023-03-08 02:19:35 -05:00
Nikita Popov	ddccc5ba44	[CodeGen] Always expand division larger than i128 Default MaxDivRemBitWidthSupported to 128, so that divisions larger than 128 bits are always expanded, without requiring additional configuration from the target. Note that this may still emit calls to __udivti3 on 32-bit targets, which likely don't have an implementation of that builtin. However, I believe this is sufficient to fix https://github.com/llvm/llvm-project/issues/60531, because Zig must already be defining those builtins. Differential Revision: https://reviews.llvm.org/D144871	2023-03-01 15:33:45 +01:00
Simon Pilgrim	448d896519	[PowerPC] Add coverage for select(icmp_sgt(x,y),sub(x,y),sub(y,x)) -> abds(x,y) patterns	2023-02-25 21:04:16 +00:00
Simon Pilgrim	ab76f5865f	[PPC] Fix abs(sub(x,y)) -> abs(x,y) tests As detailed on D142313, this fold should be restricted by sub nsw	2023-02-25 19:59:48 +00:00
Ting Wang	00ed95c3a2	[PowerPC][NFC] add const-splat-array-init.ll Add test case and will show combiner can improve these. Reviewed By: lkail Differential Revision: https://reviews.llvm.org/D144235	2023-02-21 20:24:12 -05:00
esmeyi	fd226142fc	[AIX] Lower some memory intrinsics to millicode functions on AIX Summary: Currently we lower MEMCPY/MEMMOVE/MEMSET/BZERO to the corresponding libc functions. And the libc functions call the millicode functions on AIX. We can lower these intrinsics directly to save one call layer. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D143997	2023-02-20 22:25:49 -05:00
Brad Smith	4b09cb2b16	[PowerPC] Correctly use ELFv2 ABI on all OS's that use the ELFv2 ABI Add a member function isPPC64ELFv2ABI() to determine what ABI is used on the 64-bit PowerPC big endian operating environment. Reviewed By: nemanjai, dim, pkubaj Differential Revision: https://reviews.llvm.org/D144321	2023-02-20 18:11:24 -05:00
Nick Desaulniers	39811e2e53	[llvm][test] enable/disable -verify-machineinstrs where possible for callbr I introduced new tests in commit 5cc1016a57b3 ("[llvm][SelectionDAGBuilder] codegen callbr.landingpad intrinsic") https://reviews.llvm.org/D140160 that fails expensive checks. Disable -verify-machineinstrs in those tests for now. Enable it in other tests for now, since MachineVerifier isn't on by default for assertion builds. Link: https://github.com/llvm/llvm-project/issues/60827	2023-02-16 20:28:18 -08:00
Nick Desaulniers	a3a84c9e25	[llvm] add CallBrPrepare pass to pipelines Capstone of https://discourse.llvm.org/t/rfc-syncing-asm-goto-with-outputs-with-gcc/65453/8 Clang changes are still necessary to enable the use of outputs along indirect edges of asm goto statements. Link: https://github.com/llvm/llvm-project/issues/53562 Reviewed By: void Differential Revision: https://reviews.llvm.org/D140180	2023-02-16 17:58:34 -08:00

1 2 3 4 5 ...

3602 Commits