llvm-project

Author	SHA1	Message	Date
Kazu Hirata	4501133d96	Ensure newlines at the end of files (NFC)	2022-12-16 23:36:51 -08:00
Christudasan Devadasan	b5efec4b27	[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot With D134950, targets get notified when a virtual register is created and/or cloned. Targets can do the needful with the delegate callback. AMDGPU propagates the virtual register flags maintained in the target file itself. They are useful to identify a certain type of machine operands while inserting spill stores and reloads. Since RegAllocFast spills the physical register itself, there is no way its virtual register can be mapped back to retrieve the flags. It can be solved by passing the virtual register as an additional argument. This argument has no use when the spill interfaces are called during the greedy allocator or even the PrologEpilogInserter and can pass a null register in such cases. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D138656	2022-12-17 11:55:34 +05:30
Christudasan Devadasan	ce02d5a539	[CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit It is needed to invoke the delegate methods effectively whenever a virtual register is cloned from an existing register of the same class. Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D138517	2022-12-17 11:54:33 +05:30
Christudasan Devadasan	2f23f5c0d5	[CodeGen] Use delegate to notify targets when virtual registers are created This will help targets to customize certain codegen decisions based on the virtual registers involved in special operations. This patch also extends the existing delegate in MRI to start supporting multicast. Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D134950	2022-12-17 11:53:34 +05:30
Fangrui Song	036e092282	[CodeGen] std::optional::value => operator*/operator-> value() has undesired exception checking semantics and calls __throw_bad_optional_access in libc++. Moreover, the API is unavailable without _LIBCPP_NO_EXCEPTIONS on older Mach-O platforms (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). This fixes LLVMMIRParser, LLVMGlobalISel, LLVMAsmPrinter, LLVMSelectionDAG.	2022-12-16 23:41:36 +00:00
Fangrui Song	51b685734b	[Transforms,CodeGen] std::optional::value => operator*/operator-> value() has undesired exception checking semantics and calls __throw_bad_optional_access in libc++. Moreover, the API is unavailable without _LIBCPP_NO_EXCEPTIONS on older Mach-O platforms (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS).	2022-12-16 23:21:27 +00:00
Sprite	a9f9f3dff4	Correct typos (NFC) Just found some typos while reading the llvm/circt project. compliment -> complement emitsd -> emits	2022-12-16 10:51:26 -08:00
Nemanja Ivanovic	cb3f415cd2	[PowerPC] Fix up memory ordering after combining BV to a load The combiner for BUILD_VECTOR that merges consecutive loads into a wide load had two issues: - It didn't check that the input loads all have the same input chain - It didn't update nodes that are chained to the original loads to be chained to the new load This caused issues with bootstrap when 3c4d2a03968ccf5889bacffe02d6fa2443b0260f was committed. This patch fixes the issue so it can unblock this commit. Differential revision: https://reviews.llvm.org/D140046	2022-12-16 08:57:36 -06:00
Fangrui Song	b1df3a2c0b	[Support] llvm::Optional => std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-16 08:49:10 +00:00
Craig Topper	c09edce1b3	[SelectionDAG] Give all the target specific subclasses of SelectionDAGISel their own pass ID. Previously we had a shared ID in SelectionDAGISel. AMDGPU has an initializePass function for its subclass of SelectionDAGISel. No other target does. This causes all target specific SelectionDAGISel passes to be known as "amdgpu-isel". I'm not sure what would happen if another target tried to implement an initializePass function too since the ID is already claimed. This patch gives all targets their own ID and passes it down to SelectionDAGISel constructor to MachineFunctionPass's constructor. Unfortunately, I think this causes most targets to lose print-before/after-all support for their SelectionDAGISel pass. And they probably no longer support start/stop-before/after. We can add initializePass functions to fix this as a follow up. NOTE: This was probably also broken if the AMDGPU target isn't compiled in. Step 1 to fixing PR59538. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D140161	2022-12-15 15:48:55 -08:00
Kazu Hirata	ee0c631716	[mlgo] Retire LLVM_HAVE_TF_API I've eliminated all uses of LLVM_HAVE_TF_API except a couple of them being removed in llvm/lib/CodeGen/CMakeLists.txt. This patch removes remaining definitions and uses of LLVM_HAVE_TF_API. Differential Revision: https://reviews.llvm.org/D140169	2022-12-15 14:40:16 -08:00
Vasileios Porpodas	32b38d248f	[NFC] Rename Instruction::insertAt() to Instruction::insertInto(), to be consistent with BasicBlock::insertInto() Differential Revision: https://reviews.llvm.org/D140085	2022-12-15 12:27:45 -08:00
Kevin Athey	ec7cffc579	Revert "Revert "[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with different lengths"" This reverts commit 192cc76e0be688106492989cd845ba786a7ae36d. Reverted Revert, as build was fixed while I was examining.	2022-12-15 11:19:24 -08:00
Kevin Athey	192cc76e0b	Revert "[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with different lengths" This reverts commit 4c52fb1a5ee20846627d16e38f5dec08c08f8884. Breaks sanitizer ubsan buildbot: https://lab.llvm.org/buildbot/#/builders/85/builds/12983	2022-12-15 11:15:55 -08:00
Ron Lieberman	38f1abef86	Revert "[SelectionDAG] Do not second-guess alignment for alloca" Breaks amdgpu buildbot https://lab.llvm.org/buildbot/#/builders/193 23491 This reverts commit ffedf47d8b793e07317f82f9c2a5f5425ebb71ad.	2022-12-15 10:55:18 -06:00
Nikita Popov	e253382cd3	[MRI] Print more debug info in clearVirtRegs() (NFC)	2022-12-15 16:42:56 +01:00
Andrew Savonichev	ffedf47d8b	[SelectionDAG] Do not second-guess alignment for alloca Alignment of an alloca in IR can be lower than the preferred alignment on purpose, but this override essentially treats the preferred alignment as the minimum alignment. The patch changes this behavior to always use the specified alignment. If alignment is not set explicitly in LLVM IR, it is set to DL.getPrefTypeAlign(Ty) in computeAllocaDefaultAlign. Tests are changed as well: explicit alignment is increased to match the preferred alignment if it changes output, or omitted when it is hard to determine the right value (e.g. for pointers, some structs, or weird types). Differential Revision: https://reviews.llvm.org/D135462	2022-12-15 18:18:12 +03:00
Benjamin Maxwell	3010f60381	Reland "[TargetLowering] Teach DemandedBits about VSCALE" Reland with a fixup to avoid converting APInts to int64_t which allowed for overflows (UB) with sufficiently high/low multiplier values. This allows DemandedBits to see the result of VSCALE will be at most VScaleMax * some compile-time constant. This relies on the vscale_range() attribute being present on the function, with a max set. (This is done by default when clang is targeting AArch64+SVE). Using this various redundant operations (zexts, sexts, ands, ors, etc) can be eliminated. Differential Revision: https://reviews.llvm.org/D138508	2022-12-15 13:50:02 +00:00
Juan Manuel MARTINEZ CAAMAÑO	4d852374b1	[DAGCombine] Fix always true condition in combineShiftToMULH Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D139550	2022-12-15 13:04:42 +01:00
Vladislav Dzhidzhoev	4c52fb1a5e	[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with different lengths Legalize G_SHUFFLE_VECTOR having destination vector length greater than source vector length by reshaping source vectors. Partial implementation of SelectionDAGBuilder::visitShuffleVector. Differential Revision: https://reviews.llvm.org/D132190	2022-12-15 15:03:34 +03:00
Benjamin Maxwell	20b29a59c5	Revert "[TargetLowering] Teach DemandedBits about VSCALE" This reverts commit c165b0553a96394b9bbf3984782703cdae99821d.	2022-12-15 11:29:34 +00:00
Kazu Hirata	6eb0b0a045	Don't include Optional.h These files no longer use llvm::Optional.	2022-12-14 21:16:22 -08:00
Michael Buch	c9861e5718	[llvm][DebugInfo] Backport DW_AT_default_value for template args Summary Starting with DWARFv5, DW_AT_default_value can be used to indicate that a template argument has a default value. With this patch LLVM will emit this attribute for earlier versions of DWARF, unless compiling with -gstrict-dwarf. Differential Revision: https://reviews.llvm.org/D139953	2022-12-14 22:31:46 +00:00
Hendrik Greving	ddf2f90a48	[EarlyIfConversion] Add target hook to allow for multiple ifcvt iterations. Adds a target hook canPredicatePredicatedInstr(const MachineInstr&) that assumes an instruction is already predicated and returns true if it can be predicated again, used by the early if-conversion pass in order to iterate multiple times on architectures supporting predicate logic. No test added since there is no upstream target that can take advantage. Differential Revision: https://reviews.llvm.org/D139981	2022-12-14 13:36:20 -08:00
Matt Arsenault	c16a58b36c	Attributes: Add function getter to parse integer string attributes The most common case for string attributes parses them as integers. We don't have a convenient way to do this, and as a result we have inconsistent missing attribute and invalid attribute handling scattered around. We also have inconsistent radix usage to getAsInteger; some places use the default 0 and others use base 10. Update a few of the uses, but there are quite a lot of these.	2022-12-14 13:12:35 -05:00
Benjamin Maxwell	c165b0553a	[TargetLowering] Teach DemandedBits about VSCALE This allows DemandedBits to see the result of VSCALE will be at most VScaleMax * some compile-time constant. This relies on the vscale_range() attribute being present on the function, with a max set. (This is done by default when clang is targeting AArch64+SVE). Using this various redundant operations (zexts, sexts, ands, ors, etc) can be eliminated. Differential Revision: https://reviews.llvm.org/D138508	2022-12-14 15:49:08 +00:00
Fangrui Song	d4b6fcb32e	[Analysis] llvm::Optional => std::optional	2022-12-14 07:32:24 +00:00
Yeting Kuo	ad68586a37	[VP][RISCV] Add vp.ctpop and RISC-V support. The patch also adds expandVPCTPOP in TargetLowering to expand VP_CTPOP nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139920	2022-12-14 09:47:44 +08:00
Bill Wendling	14d4cddc55	[X86] Don't zero out %eax if both %al and %ah are used The iterator over super and sub registers doesn't include both 8-bit registers in its list. So if both registers are used and only one of them is live on return, then we need to make sure that the other 8-bit register is also marked as live and not zeroed out. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D139679	2022-12-13 15:06:53 -08:00
Rahman Lavaee	96b6ee1bdc	Revert "[Propeller] Use Fixed MBB ID instead of volatile MachineBasicBlock::Number." This reverts commit 6015a045d768feab3bae9ad9c0c81e118df8b04a. Differential Revision: https://reviews.llvm.org/D139952	2022-12-13 11:13:57 -08:00
Pierre van Houtryve	678d8946ba	[AMDGPU] Add bf16 storage support - [Clang] Declare AMDGPU target as supporting BF16 for storage-only purposes on amdgcn - Add Sema & CodeGen tests cases. - Also add cases that D138651 would have covered as this patch replaces it. - [AMDGPU] Add BF16 storage-only support - Support legalization/dealing with bf16 operations in DAGIsel. - bf16 as a type remains illegal and is represented as i16 for storage purposes. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D139398	2022-12-13 10:34:26 -05:00
Phoebe Wang	7168e501e4	[NFC] Add checks for potential null returns	2022-12-13 22:30:31 +08:00
Fangrui Song	67819a72c6	[CodeGen] llvm::Optional => std::optional	2022-12-13 09:06:36 +00:00
David Blaikie	4790b74332	DebugInfo: Test DW_AT_prototyped and generalize it to handle C11 and C17	2022-12-12 22:34:49 +00:00
Vasileios Porpodas	06911ba6ea	[NFC] Cleanup: Replaces BB->getInstList().insert() with I->insertAt(). This is part of a series of cleanup patches towards making BasicBlock::getInstList() private. Differential Revision: https://reviews.llvm.org/D138877	2022-12-12 13:33:05 -08:00
Kazu Hirata	edc83a15b4	[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API in C++ code (NFC) We use LLVM_HAVE_TFLITE as the key to enable the mlgo work these days, and LLVM_HAVE_TF_API is defined whenever LLVM_HAVE_TF_API is defined. I'm posting this patch because it's purely mechanical. I'll post a follow-up patch to remove LLVM_HAVE_TF_API in non-C++ files, and that will not be as mechanical as this one. Differential Revision: https://reviews.llvm.org/D139863	2022-12-12 11:28:40 -08:00
Nicholas Guy	a3dc5b534a	[ARM][CodeGen] Add integer support for complex deinterleaving Differential Revision: https://reviews.llvm.org/D139628	2022-12-12 11:38:19 +00:00
David Green	fd716925ec	[DAGCombine] Fold Splat(bitcast(buildvector(x,..))) to splat(x) This adds a fold which teaches the backend to fold splat(bitcast(buildvector(x,..))) or splat(bitcast(scalar_to_vector(x))) to a single splat. This only handles lane 0 splats, which are only valid under LE, and needs to be a little careful with the types it creates for the new buildvector. Differential Revision: https://reviews.llvm.org/D139611	2022-12-12 08:35:43 +00:00
Nikita Popov	8005332835	[AA] Remove CFL AA passes The CFL Steens/Anders alias analysis passes are not enabled by default, and to the best of my knowledge have no pathway towards ever being enabled by default. The last significant interest in these passes seems to date back to 2016. Given the little maintenance these have seen in recent times, I also have very little confidence in the correctness of these passes. I don't think we should keep these in-tree. Differential Revision: https://reviews.llvm.org/D139703	2022-12-12 09:34:20 +01:00
jacquesguan	c2f199fa48	[DAGCombiner] Scalarize extend/truncate for splat vector. This revision scalarizes extend/truncate for splat vector. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D122875	2022-12-12 14:53:10 +08:00
Xiang1 Zhang	7557d94bd8	[NFC] Update comment for TRUNC followed by a masked store	2022-12-12 11:24:57 +08:00
Yeting Kuo	47b9da72e0	[VP][RISCV] Add vp.bitreverse and RISC-V support. The patch also added function expandVPBITREVERSE to expand ISD::VP_BITREVERSE nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139697	2022-12-12 10:58:44 +08:00
Xiang1 Zhang	9c88ccf9a9	[DAG] Stop combine for masked compressstore Reviewed By: WangPengfei Differential Revision: https://reviews.llvm.org/D139682	2022-12-12 10:40:20 +08:00
Roman Lebedev	7a2001509b	[StackProtector] Rewrite dominator tree update handling	2022-12-12 04:53:11 +03:00
Xiang1 Zhang	d656ae2809	Enhance stack protector Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D139254	2022-12-12 08:39:50 +08:00
Manuel Brito	45a892d012	Use poison instead of undef where it's used as a placeholder [NFC] Differential Revision: https://reviews.llvm.org/D139789	2022-12-11 17:18:00 +00:00
Kazu Hirata	3eebbaf0e2	[llvm] Use std::optional instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 17:09:01 -08:00
Kazu Hirata	f7dffc28b3	Don't include None.h (NFC) I've converted all known uses of None to std::nullopt, so we no longer need to include None.h. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 11:24:26 -08:00
Kazu Hirata	9c444f7021	[llvm] Use std::nullopt instead of None (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-09 18:32:32 -08:00
Ariel Burton	6b2829dd87	Allow epilogue_begin to be emitted when generating DWARF We identify epilogue code by looking for instructions tagged with FrameDestroy. A function may have more than one epilogue, e.g., because of early returns or code duplicated during optimization. We need only track the current block, and emit epilogue_begin at most once per block. We reduce the number of entries in the line table by combining epilogue_begin with other flags instead of emitting a separate entry just for epilogue_begin. Reviewed By: dblaikie, aprantl Differential Revision: https://reviews.llvm.org/D133376	2022-12-09 20:17:37 +00:00

33362 Commits