llvm-project

Author	SHA1	Message	Date
Fangrui Song	89ecb8001a	MapVector: add C++17-style try_emplace and insert_or_assign (#71969 ) Similar to https://wg21.link/n4279 For example, insert_or_assign can be used to simplify CodeGenModule::AddDeferredUnusedCoverageMapping in clang/lib/CodeGen/CodeGenModule.cpp	2023-11-12 20:45:37 -08:00
Carl Ritson	52b247b1d3	[PHIElimination] Handle subranges in LiveInterval updates (#69429 ) Add subrange tracking and handling for LiveIntervals during PHI elimination. This requires extending MachineBasicBlock::SplitCriticalEdge to also update subrange intervals.	2023-11-13 12:16:26 +09:00
Kazu Hirata	01702c3f7f	[llvm] Stop including llvm/ADT/SmallSet.h (NFC) Identified with clangd.	2023-11-11 12:32:15 -08:00
Kazu Hirata	1564c225ef	[llvm] Stop including llvm/ADT/SmallString.h (NFC) Identified with clangd.	2023-11-11 12:32:13 -08:00
Kazu Hirata	d4360e428f	[llvm] Stop including llvm/ADT/DenseMap.h (NFC) Ientified with clangd.	2023-11-11 10:07:19 -08:00
Kazu Hirata	ac4a272913	[llvm] Stop including llvm/ADT/DenseSet.h (NFC) Identified with clangd.	2023-11-11 09:48:29 -08:00
Kazu Hirata	197d6ac793	[llvm] Stop including llvm/ADT/PointerIntPair.h (NFC) Identified with clangd.	2023-11-11 08:55:22 -08:00
Youngsuk Kim	5c91b2886f	[clang] Replace uses of CreatePointerBitCastOrAddrSpaceCast (NFC) (#68277 ) With opaque pointers, `CreatePointerBitCastOrAddrSpaceCast` can be replaced with `CreateAddrSpaceCast`. Replace or remove uses of `CreatePointerBitCastOrAddrSpaceCast`. Opaque pointer cleanup effort.	2023-11-11 10:57:44 -05:00
Kazu Hirata	bafd35ca04	[llvm] Stop including llvm/ADT/SmallPtrSet.h (NFC) Identified with clangd.	2023-11-11 00:35:14 -08:00
Kazu Hirata	0d55ea25a6	[llvm] Stop including llvm/ADT/DenseMapInfo.h (NFC) Identified with clangd.	2023-11-11 00:13:29 -08:00
Kazu Hirata	c22fffcba4	[llvm] Stop including llvm/ADT/MapVector.h (NFC) Identified with clangd.	2023-11-10 23:56:20 -08:00
Kazu Hirata	8dad9f85a3	[llvm] Stop including llvm/ADT/StringMap.h (NFC) Identified with clangd.	2023-11-10 23:42:17 -08:00
Vidhush Singhal	754b93e466	[Attributor] New attribute to identify what byte ranges are alive for an allocation (#66148 ) Changes the size of allocations automatically. For now, implements the case when a single range from start of the allocation is alive and the allocation can be reduced.	2023-11-10 16:26:37 -08:00
Joseph Huber	237adfca4e	[OpenMP] Rework handling of global ctor/dtors in OpenMP (#71739 ) Summary: This patch reworks how we handle global constructors in OpenMP. Previously, we emitted individual kernels that were all registered and called individually. In order to provide more generic support, this patch moves all handling of this to the target backend and the runtime plugin. This has the benefit of supporting the GNU extensions for constructors an destructors, removing a class of failures related to shared library destruction order, and allows targets other than OpenMP to use the same support without needing to change the frontend. This is primarily done by calling kernels that the backend emits to iterate a list of ctor / dtor functions. For x64, this is automatic and we get it for free with the standard `dlopen` handling. For AMDGPU, we emit `amdgcn.device.init` and `amdgcn.device.fini` functions which handle everything atuomatically and simply need to be called. For NVPTX, a patch https://github.com/llvm/llvm-project/pull/71549 provides the kernels to call, but the runtime needs to set up the array manually by pulling out all the known constructor / destructor functions. One concession that this patch requires is the change that for GPU targets in OpenMP offloading we will use `llvm.global_dtors` instead of using `atexit`. This is because `atexit` is a separate runtime function that does not mesh well with the handling we're trying to do here. This should be equivalent in all cases except for cases where we would need to destruct manually such as: ``` struct S { ~S() { foo(); } }; void foo() { static S s; } ``` However this is broken in many other ways on the GPU, so it is not regressing any support, simply increasing the scope of what we can handle. This changes the handling of ctors / dtors. This patch now outputs a information message regarding the deprecation if the old format is used. This will be completely removed in a later release. Depends on: https://github.com/llvm/llvm-project/pull/71549	2023-11-10 14:53:53 -06:00
Hans Wennborg	96a0d714d5	Revert "ValueTracking: Identify implied fp classes by general fcmp (#66505 )" This causes asserts to fire: llvm/lib/Analysis/ValueTracking.cpp:4262: std::tuple<Value , FPClassTest, FPClassTest> llvm::fcmpImpliesClass(CmpInst::Predicate, const Function &, Value , const APFloat *, bool): Assertion `(RHSClass == fcPosNormal \|\| RHSClass == fcNegNormal \|\| RHSClass == fcPosSubnormal \|\| RHSClass == fcNegSubnormal) && "should have been recognized as an exact class test"' failed. See comments on the PR. > Previously we could recognize exact class tests performed by > an fcmp with special values (0s, infs and smallest normal). > Expand this to recognize the implied classes by a compare with a general > constant. e.g. fcmp ogt x, 1 implies positive and non-0. > > The API should be better merged with fcmpToClassTest but that > made the diff way bigger, will try to do that in a future > patch. This reverts commit dc3faf0ed0e3f1ea9e435a006167d9649f865da1.	2023-11-10 14:45:52 +01:00
Yingwei Zheng	650026897c	[RISCV][SDAG] Prefer ShortForwardBranch to lower sdiv by pow2 (#67364 ) This patch lowers `sdiv x, +/-2k` to `add + select + shift` when the short forward branch optimization is enabled. The latter inst seq performs faster than the seq generated by target-independent DAGCombiner. This algorithm is described in Hacker's Delight**. This patch also removes duplicate logic in the X86 and AArch64 backend. But we cannot do this for the PowerPC backend since it generates a special instruction `addze`.	2023-11-10 21:38:47 +08:00
Nikita Popov	192e7d3d52	[IRBuilder] Add IsNonNeg param to CreateZExt() (NFC)	2023-11-10 12:00:34 +01:00
Serge Pavlov	5b0f703918	Revert "[ARM][FPEnv] Lowering of fpenv intrinsics" This reverts commit d62f040418bd167d1ddd2b79c640a90c0c2ea353. Some cuda buildbots start failing.	2023-11-10 16:24:51 +07:00
Serge Pavlov	d62f040418	[ARM][FPEnv] Lowering of fpenv intrinsics The change implements lowering of `get_fpenv`, `set_fpenv` and `reset_fpenv`. Differential Revision: https://reviews.llvm.org/D81843	2023-11-10 16:06:33 +07:00
Matt Arsenault	dc3faf0ed0	ValueTracking: Identify implied fp classes by general fcmp (#66505 ) Previously we could recognize exact class tests performed by an fcmp with special values (0s, infs and smallest normal). Expand this to recognize the implied classes by a compare with a general constant. e.g. fcmp ogt x, 1 implies positive and non-0. The API should be better merged with fcmpToClassTest but that made the diff way bigger, will try to do that in a future patch.	2023-11-10 11:39:19 +09:00
Igor Kudrin	652ceaddc0	[YAMLParser] Unfold multi-line scalar values (#70898 ) Long scalar values can be split into multiple lines to improve readability. The rules are described in Section 6.5. "Line Folding", https://yaml.org/spec/1.2.2/#65-line-folding. In addition, for flow scalar styles, the Spec states that "All leading and trailing white space characters on each line are excluded from the content", https://yaml.org/spec/1.2.2/#73-flow-scalar-styles. The patch implements these unfolding rules for double-quoted, single-quoted, and plain scalars.	2023-11-10 07:19:24 +07:00
Jeremy Morse	10a9e7442c	[DebugInfo][RemoveDIs] Add conversion utilities for new-debug-info format This patch plumbs the command line --experimental-debuginfo-iterators flag in to the pass managers, so that modules can be converted to the new format, passes run, then converted back to the old format. That allows developers to test-out the new debuginfo representation across some part of LLVM with no further work, and from the command line. It also installs flag-catchers at the various points that bitcode and textual IR can egress from a process, and temporarily convert the module to dbg.value format when doing so. No tests alas as it's designed to be transparent. Differential Revision: https://reviews.llvm.org/D154372	2023-11-09 22:30:49 +00:00
Cyndy Ishida	d8a4011f5b	[TextAPI] Add error code for invalid input formats (#71824 )	2023-11-09 13:17:41 -08:00
Michael Maitland	bede0106d0	[CodeGen][LLT] Add isFixedVector and isScalableVector (#71713 ) The current isScalable function requires a user to call isVector before hand in order to avoid an assertion failure in the case that the LLT is not a vector. This patch addds helper functions that allow a user to query whether the LLT is fixed or scalable, not wanting an assertion failure in the case that the LLT was never a vector in the first place.	2023-11-09 14:31:38 -05:00
Mingming Liu	bb642f8b94	[NFC][InstrProf]Refactor readPGOFuncNameStrings (#71566 ) Refactor this function to take a callback for each decoded string, rename it and change it to a static function in cpp. Move its (sole) caller definition from header to cpp. - This is a split of patch https://github.com/llvm/llvm-project/pull/66825; to minimize the diff created in a big PR.	2023-11-09 10:47:44 -08:00
Fangrui Song	1df5ea29b4	[RISCV] Support R_RISCV_SET_ULEB128/R_RISCV_SUB_ULEB128 for .uleb128 directives For a label difference like `.uleb128 A-B`, MC folds A-B even if A and B are separated by a RISC-V linker-relaxable instruction. This incorrect behavior is currently abused by DWARF v5 .debug_loclists/.debug_rnglists (DW_LLE_offset_pair/DW_RLE_offset_pair entry kinds) implemented in Clang/LLVM (see https://github.com/ClangBuiltLinux/linux/issues/1719 for an instance). `96d6e190e9` defined R_RISCV_SET_ULEB128/R_RISCV_SUB_ULEB128. This patch generates such a pair of relocations to represent A-B that should not be folded. GNU assembler computes the directive size by ignoring shrinkable section content, therefore after linking the value of A-B cannot use more bytes than the reserved number (`final size of uleb128 value at offset ... exceeds available space`). We make the same assumption. ``` w1: call foo w2: .space 120 w3: .uleb128 w2-w1 # 1 byte, 0x08 .uleb128 w3-w1 # 2 bytes, 0x80 0x01 ``` We do not conservatively reserve 10 bytes (maximum size of an uleb128 for uint64_t) as that would pessimize DWARF v5 DW_LLE_offset_pair/DW_RLE_offset_pair, nullifying the benefits of introducing R_RISCV_SET_ULEB128/R_RISCV_SUB_ULEB128 relocations. The supported expressions are limited. For example, * non-subtraction `.uleb128 A` is not allowed * `.uleb128 A-B`: report an error unless A and B are both defined and in the same section The new cl::opt `-riscv-uleb128-reloc` can be used to suppress the relocations. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D157657	2023-11-09 09:27:32 -08:00
Cyndy Ishida	e17efa60b1	[llvm][TextAPI] Add new `not_for_dyld_shared_cache` attribute to file… (#71735 ) … format. Formats >= TBDv4 will now encode new attribute that the system static linker wil read when tbd files replace binary dylibs.	2023-11-09 09:22:16 -08:00
Wang Pengcheng	dfecfc7ed0	[ADT][NFC] Remove Bitset comments copied from FeatureBitset (#71778 )	2023-11-10 01:21:50 +08:00
Juergen Ributzka	6d1d7be133	Obsolete WebKit Calling Convention (#71567 ) The WebKit Calling Convention was created specifically for the WebKit FTL. FTL doesn't use LLVM anymore and therefore this calling convention is obsolete. This commit removes the WebKit CC, its associated tests, and documentation.	2023-11-09 09:08:41 -08:00
Andy Kaylor	2984156fd3	Make SmallVectorImpl destructor protected (#71746 ) Because the SmallVectorImpl destructor is not virtual, the destructor of derived classes will not be called if pointers to the SmallVectorImpl class are deleted directly. Making the SmallVectorImpl destructor protected will prevent this.	2023-11-09 08:52:47 -08:00
chuongg3	451bc3ec1d	[AArch64][GlobalISel] Legalize G_VECREDUCE_{MIN/MAX} (#69461 ) Legalizes G_VECREDUCE_{MIN/MAX} and selects instructions for vecreduce_{min/max}	2023-11-09 16:29:14 +00:00
Jeremy Morse	b002b38fd9	[DebugInfo][RemoveDIs] Add new behind-the-scenes plumbing for debug-info This is the "central" patch to the removing-debug-intrinsics project: it changes the instruction movement APIs (insert, move, splice) to interpret the "Head" bits we're attaching to BasicBlock::iterators, and updates debug-info records in the background to preserve the ordering of debug-info (which is in DPValue objects instead of dbg.values). The cost is the complexity of this patch, plus memory. The benefit is that LLVM developers can cease thinking about whether they're moving debug-info or not, because it'll happen behind the scenes. All that complexity appears in BasicBlock::spliceDebugInfo, see the diagram there for how we now manually shuffle debug-info around. Each potential splice configuration gets tested in the added unit tests. The rest of this patch applies the same reasoning in a variety of scenarios. When moveBefore (and it's siblings) are used to move instructions around, the caller has to indicate whether they intend for debug-info to move too (is it a "Preserving" call or not), and then the "Head" bits used to determine where debug-info moves to. Similar reasoning is needed for insertBefore. Differential Revision: https://reviews.llvm.org/D154353	2023-11-09 15:25:39 +00:00
Jeremy Morse	b90cba10f9	[NFC][RemoveDIs] Shuffle header inclusions for "new" debug-info BasicBlock.h and Instruction.h will eventually need to include DebugProgramInstruction.h so that debug-info attached to instructions can be enumerated and cloned. Originally including it made compiling clang much slower, I think I've pinned that down as being the inclusion of DebugInfoMetadata.h causing ~every LLVM translation unit to parse all the debug-info classes. This patch avoids that by shifting some functions into the cpp file rather than the header, and restores the inclusion of DebugProgramInstruction.h in BasicBlock.h so that the rest of the RemoveDIs functionality can land.	2023-11-09 13:19:05 +00:00
Eymen Ünay	f6d525f8d8	[JITLink][AArch32] Unittest for error paths of readAddend and applyFixup functionality (#69636 ) This test checks for error paths in relocation dependent functions of readAddend and applyFixup. It is useful to check these to avoid unexpected assert errors. Currently opcode errors are triggered in most of the cases in AArch32 but there might be further checks to look for in the future. Different backends can also implement a similar test.	2023-11-09 12:22:36 +03:00
Eymen Ünay	87081f1c18	[JITLink][AArch32] Add support for ELF::R_ARM_THM_MOV{W_PREL_NC,T_PREL} (#70364 ) Support for ELF::R_ARM_THM_MOVW_PREL_NC and ELF::R_ARM_THM_MOVT_PREL is added. Move instructions with PC-relative immediates can be handled in Thumb mode with this addition.	2023-11-09 11:51:02 +03:00
Chuanqi Xu	b7b5907b56	[Coroutines] Introduce [[clang::coro_only_destroy_when_complete]] (#71014 ) Close https://github.com/llvm/llvm-project/issues/56980. This patch tries to introduce a light-weight optimization attribute for coroutines which are guaranteed to only be destroyed after it reached the final suspend. The rationale behind the patch is simple. See the example: ```C++ A foo() { dtor d; co_await something(); dtor d1; co_await something(); dtor d2; co_return 43; } ``` Generally the generated .destroy function may be: ```C++ void foo.destroy(foo.Frame frame) { switch(frame->suspend_index()) { case 1: frame->d.~dtor(); break; case 2: frame->d.~dtor(); frame->d1.~dtor(); break; case 3: frame->d.~dtor(); frame->d1.~dtor(); frame->d2.~dtor(); break; default: // coroutine completed or haven't started break; } frame->promise.~promise_type(); delete frame; } ``` Since the compiler need to be ready for all the cases that the coroutine may be destroyed in a valid state. However, from the user's perspective, we can understand that certain coroutine types may only be destroyed after it reached to the final suspend point. And we need a method to teach the compiler about this. Then this is the patch. After the compiler recognized that the coroutines can only be destroyed after complete, it can optimize the above example to: ```C++ void foo.destroy(foo.Frame frame) { frame->promise.~promise_type(); delete frame; } ``` I spent a lot of time experimenting and experiencing this in the downstream. The numbers are really good. In a real-world coroutine-heavy workload, the size of the build dir (including .o files) reduces 14%. And the size of final libraries (excluding the .o files) reduces 8% in Debug mode and 1% in Release mode.	2023-11-09 14:42:07 +08:00
Alex Langford	d42b2ceb6c	Revert "[Pass][CodeGen] Add some necessary passes for codegen (#70903 )" This change broke building LLVM with Module support enabled, i.e. `LLVM_ENABLE_MODULES=ON`. This reverts commit f40da072ed51ba77bf46191b35a74208b1045042.	2023-11-08 15:59:44 -08:00
Jeremy Morse	f1b0a54451	Reapply 7d77bbef4ad92, adding new debug-info classes This reverts commit 957efa4ce4f0391147cec62746e997226ee2b836. Original commit message below -- in this follow up, I've shifted un-necessary inclusions of DebugProgramInstruction.h into being forward declarations (fixes clang-compile time I hope), and a memory leak in the DebugInfoTest.cpp IR unittests. I also tracked a compile-time regression in D154080, more explanation there, but the result of which is hiding some of the changes behind the EXPERIMENTAL_DEBUGINFO_ITERATORS compile-time flag. This is tested by the "new-debug-iterators" buildbot. [DebugInfo][RemoveDIs] Add prototype storage classes for "new" debug-info This patch adds a variety of classes needed to record variable location debug-info without using the existing intrinsic approach, see the rationale at [0]. The two added files and corresponding unit tests are the majority of the plumbing required for this, but at this point isn't accessible from the rest of LLVM as we need to stage it into the repo gently. An overview is that classes are added for recording variable information attached to Real (TM) instructions, in the form of DPValues and DPMarker objects. The metadata-uses of DPValues is plumbed into the metadata hierachy, and a field added to class Instruction, which are all stimulated in the unit tests. The next few patches in this series add utilities to convert to/from this new debug-info format and add instruction/block utilities to have debug-info automatically updated in the background when various operations occur. This patch was reviewed in Phab in D153990 and D154080, I've squashed them together into this commit as there are dependencies between the two patches, and there's little profit in landing them separately. [0] https://discourse.llvm.org/t/rfc-instruction-api-changes-needed-to-eliminate-debug-intrinsics-from-ir/68939	2023-11-08 16:42:35 +00:00
alexfh	067632e141	Revert "[DAGCombiner] Transform `(icmp eq/ne (and X,C0),(shift X,C1))` to use rotate or to getter constants." due to a miscompile (#71598 ) - Revert "[DAGCombiner] Transform `(icmp eq/ne (and X,C0),(shift X,C1))` to use rotate or to getter constants." - causes a miscompile, see `112e49b381 (commitcomment-131943923)` - Revert "[X86] Fix gcc warning about mix of enumeral and non-enumeral types. NFC", which fixes a compiler warning in the commit above	2023-11-08 15:07:12 +01:00
Mitch Phillips	a141a9fa97	Revert "[OpenMP] atomic compare fail : Parser & AST support" This reverts commit 086b65340cca2648a2a91a0a47d28c7d9bafd1e5. Reason: Broke under -Werror. More details in https://reviews.llvm.org/D123235	2023-11-08 11:20:17 +01:00
Jay Foad	d5f3b3b3b1	[RegScavenger] Simplify state tracking for backwards scavenging (#71202 ) Track the live register state immediately before, instead of after, MBBI. This makes it simple to track the state at the start or end of a basic block without a separate (and poorly named) Tracking flag. This changes the API of the backward(MachineBasicBlock::iterator I) method, which now recedes to the state just before, instead of just after, *I. Some clients are simplified by this change. There is one small functional change shown in the lit tests where multiple spilled registers all need to be reloaded before the same instruction. The reloads will now be inserted in the opposite order. This should not affect correctness.	2023-11-08 09:49:07 +00:00
Pierre van Houtryve	96e9786414	[TableGen][GlobalISel] Add MIFlags matching & rewriting (#71179 ) Also disables generation of MutateOpcode. It's almost never used in combiners anyway. If we really want to use it, it needs to be investigated & properly fixed (see TODO) Fixes #70780	2023-11-08 10:31:49 +01:00
Diana	9d4094ae80	[AMDGPU] Add llvm.amdgcn.set.inactive.chain.arg intrinsic (#71530 ) Add a new intrinsic, similar to llvm.amdgcn.set.inactive, but used only in functions with the `amdgpu_cs_chain` or `amdgpu_cs_chain_preserve` calling conventions. It allows setting the inactive lanes to those of a value received as a VGPR argument (whereas llvm.amdgcn.set.inactive usually takes a constant as the value of the inactive lanes). Differential Revision: https://reviews.llvm.org/D158604	2023-11-08 08:44:13 +01:00
paperchalice	f40da072ed	[Pass][CodeGen] Add some necessary passes for codegen (#70903 ) These passes are used in `TargetPassConfig.cpp`, so add them here. Part of #69879. @arsenm Thanks for reviwing.	2023-11-08 16:24:39 +09:00
Pierre van Houtryve	573fa770d0	[TableGen][GlobalISel] Add rule-wide type inference (#66377 ) The inference is trivial and leverages the MCOI OperandTypes encoded in CodeGenInstructions to infer types across patterns in a CombineRule. It's thus very limited and only supports CodeGenInstructions (but that's the main use case so it's fine). We only try to infer untyped operands in apply patterns when they're temp reg defs, or immediates. Inference always outputs a `GITypeOf<$x>` where $x is a named operand from a match pattern. This allows us to drop the `GITypeOf` in most cases without any errors.	2023-11-08 08:10:22 +01:00
Michael Spencer	fb07d9cc09	[clang][DepScan] Make OptimizeArgs a bit mask enum and enable by default (#71588 ) Make it easier to control which optimizations are enabled by making OptimizeArgs a bit masked enum. There's currently only one such optimization, but more will be added in followup commits.	2023-11-07 16:06:59 -08:00
Vladislav Dzhidzhoev	6beddd668a	Revert "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" This caused assert: llvm/llvm/lib/CodeGen/AsmPrinter/DwarfFile.cpp:110: void llvm::DwarfFile::addScopeVariable(LexicalScope , DbgVariable ): Assertion `Ret.second' failed. See comments https://reviews.llvm.org/D144006#4656350. This reverts commit 3b449bd46a11a55a40cbc0016a99b202fa05248e.	2023-11-08 00:29:24 +01:00
Sunil Kuravinakop	086b65340c	[OpenMP] atomic compare fail : Parser & AST support This is a support for " #pragma omp atomic compare fail ". It has Parser & AST support for now. Reviewed By: tianshilei1992, ABataev Differential Revision: https://reviews.llvm.org/D123235	2023-11-07 16:57:50 -06:00
Lang Hames	66a76759fe	[ORC] Remove an unused typedef.	2023-11-07 14:25:01 -08:00
Michael Maitland	ac4ff6168a	[CodeGen][MachineVerifier] Use TypeSize instead of unsigned for getRe… (#70881 ) …gSizeInBits This patch changes getRegSizeInBits to return a TypeSize instead of an unsigned in the case that a virtual register has a scalable LLT. In the case that register is physical, a Fixed TypeSize is returned. The MachineVerifier pass is updated to allow copies between fixed and scalable operands as long as the Src size will fit into the Dest size. This is a precommit which will be stacked on by a change to GISel to generate COPYs with a scalable destination but a fixed size source. This patch is stacked on https://github.com/llvm/llvm-project/pull/70893 for the ability to use scalable vector types in MIR tests.	2023-11-07 14:38:46 -05:00

1 2 3 4 5 ...

53189 Commits