llvm-project

Author	SHA1	Message	Date
Sterling-Augustine	23f1ba3ee4	Reapply "[NFC][DebugInfo][DWARF] Create new low-level dwarf library (#… (#145959) (#146112) Reapply "[NFC][DebugInfo][DWARF] Create new low-level dwarf library (#… (#145959) This reapplies cbf781f0bdf2f680abbe784faedeefd6f84c246e, with fixes for the shared-library build and the unconventional sanitizer-runtime build. Original Description: This is the culmination of a series of changes described in [1]. Although somewhat large by line count, it is almost entirely mechanical, creating a new library in DebugInfo/DWARF/LowLevel. This new library has very minimal dependencies, allowing it to be used from more places than the normal DebugInfo/DWARF library--in particular from MC. 1. https://discourse.llvm.org/t/rfc-debuginfo-dwarf-refactor-into-to-lower-and-higher-level-libraries/86665/2	2025-06-27 11:05:49 -07:00
Sterling-Augustine	5d03e7a204	Revert "[NFC][DebugInfo][DWARF] Create new low-level dwarf library (#… (#145959) …145081)" This reverts commit cbf781f0bdf2f680abbe784faedeefd6f84c246e. Breaks a couple of buildbots.	2025-06-26 13:09:20 -07:00
Sterling-Augustine	cbf781f0bd	[NFC][DebugInfo][DWARF] Create new low-level dwarf library (#145081) This is the culmination of a series of changes described in [1]. Although somewhat large by line count, it is almost entirely mechanical, creating a new library in DebugInfo/DWARF/LowLevel. This new library has very minimal dependencies, allowing it to be used from more places than the normal DebugInfo/DWARF library--in particular from MC. I am happy to put it in another location, or to structure it differently if that makes sense. Some have suggested in BinaryFormat, but it is not a great fit there. But if that makes more sense to the reviewers, I can do that. Another possibility would be to use pass-through headers to allow clients who don't care to depend only on DebugInfo/DWARF. This would be a much less invasive change, and perhaps easier for clients. But also a system that hides details. Either way, I'm open. 1. https://discourse.llvm.org/t/rfc-debuginfo-dwarf-refactor-into-to-lower-and-higher-level-libraries/86665/2	2025-06-26 11:23:46 -07:00
peremyach	246fa9a9f7	Revert gsymutil changes due to concurrency problems (#142829) We saw occasional segfaults while processing some binaries. The reason probably is that we may clear the DIE while we are reading its data from another thread which happens due to cross-unit references. --------- Co-authored-by: Arslan Khabutdinov <akhabutdinov@fb.com>	2025-06-04 11:34:12 -07:00
peremyach	beb6972cbb	fix llvm-gsymutil verification (#141751) Verification crashed here https://github.com/llvm/llvm-project/blob/main/llvm/lib/DebugInfo/DWARF/DWARFUnit.cpp#L519 The reason being that during verification to extract inline_info we recreate compile unit dies. Assert fails because we previously cleaned up just the DIEs but some other fields remained initialized. Co-authored-by: Arslan Khabutdinov <akhabutdinov@fb.com>	2025-05-29 14:02:22 -07:00
peremyach	d997b4f531	Reduce llvm-gsymutil memory usage (#140740) Same as https://github.com/llvm/llvm-project/pull/139907/ except there is now a special dovoidwork helper function. Previous approach with assert(f();return success;) failed tests for release builds, so I created a separate helper. Open to suggestions how to solve this more elegantly. Co-authored-by: Arslan Khabutdinov <akhabutdinov@fb.com>	2025-05-21 09:49:12 -07:00
peremyach	5ed0b3a2d7	Revert "Reduce llvm-gsymutil memory usage" (#140696) Reverts llvm/llvm-project#139907 as per discussion in https://github.com/llvm/llvm-project/issues/140545 due to tests becoming flaky	2025-05-20 15:31:39 +04:00
peremyach	ebb15353d2	Reduce llvm-gsymutil memory usage (#139907) For large binaries gsymutil ends up using too much memory. This diff adds DIE tree cleanup per compile unit to reduce memory usage. P. S. Not sure about formatting. Maybe it hasn't been run in a while, or I have misconfigured something. `$ git clang-format HEAD~1 clang-format did not modify any files $ clang-format --version clang-format version 21.0.0git (git@github.com:peremyach/llvm-project.git 8d945c8357e1bd9872a34f92620d4916bfd27482) ` Co-authored-by: Arslan Khabutdinov <akhabutdinov@fb.com>	2025-05-16 09:44:17 -07:00
Kamau Bridgeman	3386d24ff4	Revert "Reduce llvm-gsymutil memory usage" (#97603) Reverts llvm/llvm-project#91023 Build break found in clang-ppc64le-linux-multistage build no. 583.	2024-07-03 12:22:26 -04:00
Kevin Frei	60cd3eb880	Reduce llvm-gsymutil memory usage (#91023) llvm-gsymutil eats a lot of RAM. On some large binaries, it causes OOM's on smaller hardware, consuming well over 64GB of RAM. This change frees line tables once we're done with them, and frees DWARFUnits's DIE's when we finish processing each DU, though they may get reconstituted if there are references from other DU's during processing. Once the conversion is complete, all DIE's are freed. The reduction in peak memory usage from these changes showed between 7-12% in my tests. The double-checked locking around the creation & freeing of the data structures was tested on a 166 core system. I validated that it trivially malfunctioned without the locks (and with stupid reordering of the locks) and worked reliably with them. --------- Co-authored-by: Kevin Frei <freik@meta.com>	2024-07-02 10:14:26 -07:00
Alex Langford	1a8935ada7	[DebugInfo] Report errors when DWARFUnitHeader::applyIndexEntry fails (#89156) Motivation: LLDB is able to report errors about these scenarios whereas LLVM's DWARF parser only gives a boolean success/fail. I want to migrate LLDB to using LLVM's DWARFUnitHeader class, but I don't want to lose some of the error reporting, so I'm adding it to the LLVM class first.	2024-04-23 11:01:54 -07:00
Alex Langford	814a79aea6	[DebugInfo] Separate error generation from reporting in DWARFHeaderUnit::extract (#68242) Instead of reporting the error directly through the DWARFContext passed in as an argument, it would be more flexible to have extract return the error and allow the caller to react appropriately. This will be useful for using llvm's DWARFHeaderUnit from lldb which may report header extraction errors through a different mechanism.	2023-10-18 09:06:39 -07:00
Chen Zheng	efb11c4022	Support big endian in llvm-symbolizer's data location dwarf info parser (#67284) For now, data location expression is hard coded to little endian. We are going to support sanitizers on AIX which is big endian. Support big endian too in the data location expression parser of llvm-symbolizer.	2023-10-10 09:13:25 +08:00
Takuya Shimizu	01b88dd66d	[NFC] Remove unused variables declared in conditions D152495 makes clang warn on unused variables that are declared in conditions like `if (int var = init) {}` This patch is an NFC fix to suppress the new warning in llvm,clang,lld builds to pass CI in the above patch. Differential Revision: https://reviews.llvm.org/D158016	2023-08-30 10:05:06 +09:00
Alex Langford	c1b84d9b7c	[DebugInfo] Add error handling to DWARFDebugAbbrev::getAbbreviationDeclarationSet This gives us more meaningful information when `getAbbreviationDeclarationSet` fails. Right now only `verifyAbbrevSection` actually uses the error that it returns, but the other call sites could be rewritten to take advantage of the returned error. Differential Revision: https://reviews.llvm.org/D153459	2023-06-27 10:30:50 -07:00
David Blaikie	3c5e3b70a3	llvm-symbolizer: access the base address from the skeleton CU, not the split unit In Split DWARF, if the unit had a non-trivial base address (a real low_pc, rather than one with fixed value 0) then computing addresses needs to access that base address to add to any base address-relative values. But the code was trying to access the base address in the split unit, when it's actually in the skeleton unit. So delegate to the skeleton if it's available. Fixes #62941	2023-05-26 00:59:40 +00:00
David Blaikie	35fd37177b	llvm-symbolizer: Don't crash when referencing an invalid CU in a dwp file twice Previously we'd stash a null pointer in a sorted vector of CUs - the next time around, we'd try to do a binary search in that vector (sorting on a key inside the objects pointed to by the elements of the vector) which would deref null if we'd stashed a null in there previously. As a reasonable, but not ideal, workaround - don't stash any result in the vector - this means every query will produce a new warning (resulting in duplicate warnings) but better than a crash. Stashing null in the list could be workable if we also stashed the offset in a pair - but then all the clients would need to be fixed up (maybe using a filtering iterator) which seems like overkill for this uncommon error case.	2023-03-14 00:49:32 +00:00
Alexander Yermolovich	7fc7934023	[llvm][dwwarf] Change CU/TU index to 64-bit Changed contribution data structure to 64 bit. I added the 32bit and 64bit accessors to make it explicit where we use 32bit and where we use 64bit. Also to make sure we catch all the cases where this data structure is used. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D139379	2023-01-11 15:07:11 -08:00
Alexander Yermolovich	6a4a697e17	Revert "[llvm][dwwarf] Change CU/TU index to 64-bit" This reverts commit fa3fa4d0d42326005dfd5887bf047b86904d3be6.	2023-01-11 14:41:24 -08:00
Alexander Yermolovich	fa3fa4d0d4	[llvm][dwwarf] Change CU/TU index to 64-bit Changed contribution data structure to 64 bit. I added the 32bit and 64bit accessors to make it explicit where we use 32bit and where we use 64bit. Also to make sure we catch all the cases where this data structure is used. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D139379	2023-01-10 10:33:52 -08:00
Gregory Alfonso	d22f050e15	Remove redundant .c_str() and .get() calls Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D139485	2022-12-18 00:33:53 +00:00
Alexander Yermolovich	f2f8f70953	Revert "[llvm][dwwarf] Change CU/TU index to 64-bit" This reverts commit 5ebd28f3e56c00a739fda46c72c9e0f6528add87.	2022-12-07 13:14:23 -08:00
Alexander Yermolovich	5ebd28f3e5	[llvm][dwwarf] Change CU/TU index to 64-bit Summary: Changed contribution data structure to 64 bit. I added the 32bit and 64bit accessors to make it explicit where we use 32bit and where we use 64bit. Also to make sure we catch all the cases where this data structure is used.	2022-12-07 13:08:35 -08:00
Fangrui Song	89fab98e88	[DebugInfo] llvm::Optional => std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-05 00:09:22 +00:00
Kazu Hirata	110115993c	[DebugInfo] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:39 -08:00
Carlos Alberto Enciso	4f06d46f46	[llvm-debuginfo-analyzer] (08/09) - ELF Reader llvm-debuginfo-analyzer is a command line tool that processes debug info contained in a binary file and produces a debug information format agnostic “Logical View”, which is a high-level semantic representation of the debug info, independent of the low-level format. The code has been divided into the following patches: 1) Interval tree 2) Driver and documentation 3) Logical elements 4) Locations and ranges 5) Select elements 6) Warning and internal options 7) Compare elements 8) ELF Reader 9) CodeView Reader Full details: https://discourse.llvm.org/t/llvm-dev-rfc-llvm-dva-debug-information-visual-analyzer/62570 This patch: This is a high level summary of the changes in this patch. ELF Reader - Support for ELF/DWARF. LVBinaryReader, LVELFReader Reviewed By: psamolysov, probinson Differential Revision: https://reviews.llvm.org/D125783	2022-10-27 05:37:51 +01:00
Shubham Sandeep Rastogi	636de2bf34	Change isLittleEndian to follow llvm style and add an accessor Differential Revision: https://reviews.llvm.org/D134290	2022-09-20 17:00:47 -07:00
Kazu Hirata	ff2fe7b829	Use llvm::upper_bound (NFC)	2022-09-03 11:17:39 -07:00
Alexey Lapshin	ece341f598	[Debuginfo][DWARF][NFC] Add paired methods working with DWARFDebugInfoEntry. This review is extracted from D96035. DWARF Debuginfo classes have two representations for DIEs: DWARFDebugInfoEntry (short) and DWARFDie(extended). Depending on the task, it might be more convenient to use DWARFDebugInfoEntry or/and DWARFDie. DWARFUnit class already has methods working with DWARFDie and DWARFDebugInfoEntry. This patch adds more methods working with DWARFDebugInfoEntry to have paired functionality. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D126059	2022-07-29 16:40:17 +03:00
Kazu Hirata	129b531c9c	[llvm] Use value_or instead of getValueOr (NFC)	2022-06-18 23:07:11 -07:00
Hyoun Kyu Cho	6c12ae8163	Exposes interface to free up caching data structure in DWARFDebugLine and DWARFUnit for memory management This is minimum changes extracted from https://reviews.llvm.org/D78950. The old patch tried to add LRU eviction of caching data structure. Due to multiple layers of interfaces that users could be using, it was not clear where to put the functionality. While we work out on where to put that functionality, it'll be great to add this minimum interface change so that the user could implement their own memory management. More specifically: * Add a clearLineTable method for DWARFDebugLine which erases the given offset from the LineTableMap. * DWARFDebugContext adds the clearLineTableForUnit method that leverages clearLineTable to remove the object corresponding to a given compile unit, for memory management purposes. When it is referred to again, the line table object will be repopulated. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D90006	2022-05-24 03:23:24 +00:00
Mitch Phillips	cead4eceb0	[symbolizer] Parse DW_TAG_variable DIs to show line info for globals Currently, llvm-symbolizer doesn't like to parse .debug_info in order to show the line info for global variables. addr2line does this. In the future, I'm looking to migrate AddressSanitizer off of internal metadata over to using debuginfo, and this is predicated on being able to get the line info for global variables. This patch adds the requisite support for getting the line info from the .debug_info section for symbolizing global variables. This only happens when you ask for a global variable to be symbolized as data. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D123538	2022-05-23 13:30:22 -07:00
David Blaikie	727c590fe9	DebugInfo: Use hash-based unit lookup when available in dwp files Fix a test case that had a bogus (probably I hand crafted it at some point) index that didn't point to the right data in the process.	2022-04-27 21:18:14 +00:00
serge-sans-paille	290e482342	Cleanup LLVMDWARFDebugInfo As usual with that header cleanup series, some implicit dependencies now need to be explicit: llvm/DebugInfo/DWARF/DWARFContext.h no longer includes: - "llvm/DebugInfo/DWARF/DWARFAcceleratorTable.h" - "llvm/DebugInfo/DWARF/DWARFCompileUnit.h" - "llvm/DebugInfo/DWARF/DWARFDebugAbbrev.h" - "llvm/DebugInfo/DWARF/DWARFDebugAranges.h" - "llvm/DebugInfo/DWARF/DWARFDebugFrame.h" - "llvm/DebugInfo/DWARF/DWARFDebugLoc.h" - "llvm/DebugInfo/DWARF/DWARFDebugMacro.h" - "llvm/DebugInfo/DWARF/DWARFGdbIndex.h" - "llvm/DebugInfo/DWARF/DWARFSection.h" - "llvm/DebugInfo/DWARF/DWARFTypeUnit.h" - "llvm/DebugInfo/DWARF/DWARFUnitIndex.h" Plus llvm/Support/Errc.h not included by a bunch of llvm/DebugInfo/DWARF/DWARF*.h files Preprocessed lines to build llvm on my setup: after: 1065629059 before: 1066621848 Which is a great diff! Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119723	2022-02-15 09:16:03 +01:00
Sylvestre Ledru	f2c2e924e7	Fix a typo (occured => occurred) Reported: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1005195	2022-02-08 21:35:26 +01:00
Kazu Hirata	9c0a4227a9	Use Optional::getValueOr (NFC)	2021-12-24 20:57:40 -08:00
David Blaikie	92f2d02b4a	DebugInfo: Sink string form validation down from verifier to form parsing Avoid duplicating the string decoding - improve the error messages down in form parsing (& produce an Expected<const char> instead of Optional<const char> to communicate the extra error details)	2021-12-14 15:41:53 -08:00
Jack Anderson	d7733f8422	[DebugInfo] Expand ability to load 2-byte addresses in dwarf sections Some dwarf loaders in LLVM are hard-coded to only accept 4-byte and 8-byte address sizes. This patch generalizes acceptance into `DWARFContext::isAddressSizeSupported` and provides a common way to generate rejection errors. The MSP430 target has been given new tests to cover dwarf loading cases that previously failed due to 2-byte addresses. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D111953	2021-10-21 17:31:00 -07:00
Alexey Lapshin	0b8c50812b	[DWARF][NFC] add ParentIdx and SiblingIdx to DWARFDebugInfoEntry for faster navigation. This patch implements suggestion done while reviewing D102634. It adds two fields: ParentIdx and SiblingIdx. These fields allow fast navigation to die parent and die sibling. These fields are set at the moment when dies are loaded. dsymutil works 2% faster with this patch(run on clang binary). Differential Revision: https://reviews.llvm.org/D110363	2021-10-02 08:11:06 +03:00
Alexey Lapshin	3493540830	[DebugInfo][NFC] Erase capacity in DWARFUnit::clearDIEs(). DWARFUnit::clearDIEs() uses std::vector::shrink_to_fit() to make capacity of DieArray matched with its size(). The shrink_to_fit() is not binding request to make capacity match with size(). Thus the memory could still be reserved after DWARFUnit::clearDIEs() is called. This patch erases capacity when DWARFUnit::clearDIEs() is requested. So the memory occupied by dies would be freed. Differential Revision: https://reviews.llvm.org/D109499	2021-09-10 10:07:28 +03:00
Kazu Hirata	49d7b2beae	[DWARF] Remove parseListTableHeader (NFC) The last use was removed on Oct 4, 2020 in commit 6d0be74af5555f7bc56ac72cbd98ff270fd1291b.	2021-08-19 23:34:22 -07:00
David Blaikie	ea91749f01	DebugInfo: Use debug_rnglists.dwo for ranges in debug_info.dwo when parsing DWARFv5 This call would incorrectly overwrite (with the .debug_rnglists.dwo from the executable, if there was one) the rnglists section instead of the correct value (from the .debug_rnglists.dwo in the .dwo file) that's applied in DWARFUnit::tryExtractDIEsIfNeeded	2021-07-12 18:15:09 -07:00
David Blaikie	b447b9dce0	Reapply "llvm-symbolizer: Fix "start file" to work with Split DWARF" Originally committed as 04c203e310bd3fb58e16c936c0200d680100526e Reverted in 768510632c5ddbf9438693d9c7db1903e39295ad due to the test failing when encountering windows directory separators. Fix the path separator platform issue with a FileCheck pattern {{[/\\]}} Original commit message: A followup to the feature added in 69da27c7496ea373567ce5121e6fe8613846e7a5 that added the optional "start file name" to match "start line" - but this didn't work with Split DWARF because of the need for the decl file number resolution code to refer back to the skeleton unit to find its .debug_line contribution. So this patch adds the necessary infrastructure to track the skeleton unit corresponding to a split full unit for the purpose of this lookup.	2021-07-10 18:50:55 -07:00
Nico Weber	768510632c	Revert "llvm-symbolizer: Fix "start file" to work with Split DWARF" This reverts commit 04c203e310bd3fb58e16c936c0200d680100526e. Test fails on Windows.	2021-07-10 13:35:05 -04:00
David Blaikie	04c203e310	llvm-symbolizer: Fix "start file" to work with Split DWARF A followup to the feature added in 69da27c7496ea373567ce5121e6fe8613846e7a5 that added the optional "start file name" to match "start line" - but this didn't work with Split DWARF because of the need for the decl file number resolution code to refer back to the skeleton unit to find its .debug_line contribution. So this patch adds the necessary infrastructure to track the skeleton unit corresponding to a split full unit for the purpose of this lookup.	2021-07-09 18:31:32 -07:00
Jan Kratochvil	c19a28919f	llvm-dwarfdump: Print warnings on invalid DWARF llvm-dwarfdump was silent even when the format of DWARF was invalid and/or llvm-dwarfdump did not understand/support some of the constructs. This can be pretty confusing as llvm-dwarfdump is a tool for DWARF producers+consumers development. Review comments also by @dblaikie. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104271	2021-06-27 11:38:35 +02:00
Alexander Yermolovich	51504bc1d9	[DWARF] Check for AddrOffsetSectionBase to work with DWO Units. Context: https://lists.llvm.org/pipermail/llvm-dev/2021-February/148521.html A fix for llvm-symbolizer, and other tools like BOLT, that allows retrieving address when built with -gsplit-dwarf=single mode. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D96827	2021-03-15 14:46:09 -07:00
Alex Orlov	df6d0579e1	Fix a crash in DWARFUnit::getInlinedChainForAddress in case of unexpected DWARF information. In some cases a broken or invalid debug info could cause a crash in DWARFUnit::getInlinedChainForAddress during parsing a chain of in-lined functions. This patch fixes this issue. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D98119	2021-03-09 14:20:27 +04:00
Kazu Hirata	6de4865545	[llvm] Use hasSingleElement (NFC)	2021-01-20 21:35:55 -08:00
David Blaikie	9670a45c98	libDebugInfoDWARF: Don't try to parse loclist[.dwo] headers when parsing debug_info[.dwo] There's no way to know whether there's a loclist contribution to parse if there's no loclistx encoding - and if there is one, there's no need to walk back from the loclist_base (or, in the case of info.dwo/loclist.dwo - starting at 0 in the contribution) to parse the header, instead rely on the DWARF32/64 and address size in the CU that's already available. This would come up in split DWARF (non-split wouldn't try to read a loclist header in the absence of a loclist_base) when one unit had location lists and another does not (because the loclists.dwo section would be non-empty in that case - in the case where it's empty the parsing would silently skip). Simplify the testing a bit, rather than needing a whole dwp, etc - by creating a malformed loclists.dwo section (and use single file Split DWARF) that would trip up any attempt to parse it - but no attempt should be made.	2020-10-13 22:28:59 -07:00


199 Commits