llvm-project

Author	SHA1	Message	Date
Paul Walker	13f38c97d5	[LLVM][SelectionDAG] Align poison/undef binop folds with IR. (#149334 ) The "at construction" binop folds in SelectionDAG::getNode() has different behaviour when compared to the equivalent LLVM IR. This PR makes the behaviour consistent while also extending the coverage to include signed/unsigned max/min operations.	2025-07-30 11:20:30 +01:00
Tcc100	7daa1defd2	Reland "[CodeGen] Expose the extensibility of PassConfig to plugins (#139059 )" Add missing dependencies to unittest target Original patch broke BUILD_SHARED bots and required revert #147947	2025-07-10 15:26:48 +02:00
Jan Patrick Lehr	0481d2a161	Revert "[CodeGen] Expose the extensibility of PassConfig to plugins" (#147947 ) Reverts llvm/llvm-project#139059 This broke https://lab.llvm.org/buildbot/#/builders/10/builds/9125/steps/8/logs/stdio The bot does a SHARED_LIBS=ON build. I can reproduce locally with the CMake cache file in offload/cmake/caches/AMDGPUBot.cmake as the build config.	2025-07-10 14:00:55 +02:00
Tcc100	56a8655f4a	[CodeGen] Expose the extensibility of PassConfig to plugins (#139059 ) This PR exposes the backend pass config to plugins via a callback. Plugin authors can register a callback that is being triggered before the target backend adds their passes to the pipeline. In the callback they then get access to the `TargetMachine`, the `PassManager`, and the `TargetPassConfig`. This allows plugins to call `TargetPassConfig::insertPass`, which is honored in the subsequent `addPass` of the main backend. We implemented this using the legacy pass manager since backends still use it as the default.	2025-07-10 12:43:09 +02:00
Benjamin Maxwell	6a477f6577	[AArch64] TableGen-erate SDNode descriptions (#140472 ) This continues s-barannikov's work TableGen-erating SDNode descriptions. This takes the initial patch from #119709 and moves documentation and the rest of the AArch64ISD nodes to TableGen. Some issues were found by the generated SDNode verification added in this patch. These issues have been described and fixed in the following PRs: - #140706 - #140711 - #140713 - #140715 --------- Co-authored-by: Sergei Barannikov <barannikov88@gmail.com>	2025-05-28 12:02:58 +01:00
paperchalice	14836597f5	[GC] Use `MapVector` for `GCStrategyMap` (#132729 ) Use `MapVector` so `GCStrategyMap` can support forward and reverse iterator, which is required in `AsmPrinter`.	2025-05-14 17:51:18 +08:00
weiwei chen	1f72fa29ec	[X86Backend][M68KBackend] Make Ctx in X86MCInstLower (M68KInstLower) the same as AsmPrinter.OutContext (#133352 ) In `X86MCInstLower::LowerMachineOperand`, a new `MCSymbol` can be created in `GetSymbolFromOperand(MO)` where `MO.getType()` is `MachineOperand::MO_ExternalSymbol` ``` case MachineOperand::MO_ExternalSymbol: return LowerSymbolOperand(MO, GetSymbolFromOperand(MO)); ``` at `725a7b664b/llvm/lib/Target/X86/X86MCInstLower.cpp (L196)` However, this newly created symbol will not be marked properly with its `IsExternal` field since `Ctx.getOrCreateSymbol(Name)` doesn't know if the newly created `MCSymbol` is for `MachineOperand::MO_ExternalSymbol`. Looking at other backends, for example `Arch64MCInstLower` is doing for handling `MC_ExternalSymbol` `14c36db16f/llvm/lib/Target/AArch64/AArch64MCInstLower.cpp (L366-L367)` `14c36db16f/llvm/lib/Target/AArch64/AArch64MCInstLower.cpp (L145-L148)` It creates/gets the MCSymbol from `AsmPrinter.OutContext` instead of from `Ctx`. Moreover, `Ctx` for `AArch64MCLower` is the same as `AsmPrinter.OutContext`. `8e7d6baf0e/llvm/lib/Target/AArch64/AArch64AsmPrinter.cpp (L100)`. This applies to almost all the other backends except X86 and M68k. ``` $git grep "MCInstLowering(" lib/Target/AArch64/AArch64AsmPrinter.cpp💯 : AsmPrinter(TM, std::move(Streamer)), MCInstLowering(OutContext, this), lib/Target/AMDGPU/AMDGPUMCInstLower.cpp:223: AMDGPUMCInstLower MCInstLowering(OutContext, STI, this); lib/Target/AMDGPU/AMDGPUMCInstLower.cpp:257: AMDGPUMCInstLower MCInstLowering(OutContext, STI, this); lib/Target/AMDGPU/R600MCInstLower.cpp:52: R600MCInstLower MCInstLowering(OutContext, STI, this); lib/Target/ARC/ARCAsmPrinter.cpp:41: MCInstLowering(&OutContext, this) {} lib/Target/AVR/AVRAsmPrinter.cpp:196: AVRMCInstLower MCInstLowering(OutContext, this); lib/Target/BPF/BPFAsmPrinter.cpp:144: BPFMCInstLower MCInstLowering(OutContext, this); lib/Target/CSKY/CSKYAsmPrinter.cpp:41: : AsmPrinter(TM, std::move(Streamer)), MCInstLowering(OutContext, this) {} lib/Target/Lanai/LanaiAsmPrinter.cpp:147: LanaiMCInstLower MCInstLowering(OutContext, this); lib/Target/Lanai/LanaiAsmPrinter.cpp:184: LanaiMCInstLower MCInstLowering(OutContext, this); lib/Target/MSP430/MSP430AsmPrinter.cpp:149: MSP430MCInstLower MCInstLowering(OutContext, this); lib/Target/Mips/MipsAsmPrinter.h:126: : AsmPrinter(TM, std::move(Streamer)), MCInstLowering(this) {} lib/Target/WebAssembly/WebAssemblyAsmPrinter.cpp:695: WebAssemblyMCInstLower MCInstLowering(OutContext, this); lib/Target/X86/X86MCInstLower.cpp:2200: X86MCInstLower MCInstLowering(MF, this); ``` This patch makes `X86MCInstLower` and `M68KInstLower` to have their `Ctx` from `AsmPrinter.OutContext` instead of getting it from `MF.getContext()` to be consistent with all the other backends. I think since normal use case (probably anything other than our un-conventional case) only handles one llvm module all the way through in the codegen pipeline till the end of code emission (AsmPrint), `AsmPrinter.OutContext` is the same as MachineFunction's MCContext, so this change is an NFC. ---- This fixes an error while running the generated code in ORC JIT for our use case with [MCLinker](https://youtu.be/yuSBEXkjfEA?si=HjgjkxJ9hLfnSvBj&t=813) (see more details below): https://github.com/llvm/llvm-project/pull/133291#issuecomment-2759200983 We (Mojo) are trying to do a MC level linking so that we break llvm module into multiple submodules to compile and codegen in parallel (technically into .o files with symbol linkage type change), but instead of archive all of them into one `.a` file, we want to fix the symbol linkage type and still produce one .o file. The parallel codegen pipeline generates the codegen data structures in their own `MCContext` (which is `Ctx` here). So if function `f` and `g` got split into different submodules, they will have different `Ctx`. And when we try to create an external symbol with the same name for each of them with `Ctx.getOrCreate(SymName)`, we will get two different `MCSymbol` because `f` and `g`'s `MCContext` are different and they can't see each other. This is unfortunately not what we want for external symbols. Using `AsmPrinter.OutContext` helps, since it is shared, if we try to get or create the `MCSymbol` there, we'll be able to deduplicate.	2025-04-04 22:44:07 -04:00
Shubham Sandeep Rastogi	92f916faba	Add a pass to collect dropped var statistics for MIR (#126686 ) This patch attempts to reland https://github.com/llvm/llvm-project/pull/120780 while addressing the issues that caused the patch to be reverted. Namely: 1. The patch had included code from the llvm/Passes directory in the llvm/CodeGen directory. 2. The patch increased the backend compile time by 2% due to adding a very expensive include in MachineFunctionPass.h The patch has been re-structured so that there is no dependency between the llvm/Passes and llvm/CodeGen directory, by moving the base class, `class DroppedVariableStats` to the llvm/IR directory. The expensive include in MachineFunctionPass.h has been changed to contain forward declarations instead of other header includes which was pulling a ton of code into MachineFunctionPass.h and should resolve any issues when it comes to compile time increase.	2025-02-12 14:08:18 -08:00
NAKAMURA Takumi	d328d41061	Revert "Add a pass to collect dropped var stats for MIR (#120780 )" This reverts commit 3bf91ad2a9c75dd045961e45fdd830fd7b7a5455. (llvmorg-20-init-16123-g3bf91ad2a9c7) `llvm/CodeGen` should not depend on `llvm/Passes`.	2024-12-21 12:42:26 +09:00
Shubham Sandeep Rastogi	3bf91ad2a9	Add a pass to collect dropped var stats for MIR (#120780 ) This patch uses the DroppedVariableStats class to add dropped variable statistics for MIR passes. Reland 1c082c9cd12efaa67a32c5da89a328c458ed51c5	2024-12-20 10:08:54 -08:00
Shubham Sandeep Rastogi	e7e622f153	Revert "Move DroppedVariableStats to CodeGen lib (#120650 )" This reverts commit 4307198d51487cc16f98eebb2113caf4a1905914. Broke bot ppc64le-clang-multistage-test: undefined reference to `llvm::DroppedVariableStats::populateVarIDSetAndInlinedMap in In function `llvm::DroppedVariableStatsIR::visitEveryInstruction	2024-12-19 19:59:34 -08:00
Shubham Sandeep Rastogi	4307198d51	Move DroppedVariableStats to CodeGen lib (#120650 ) To get Dropped variable statistics for MIR, we need to move the base class DroppedVariableStats code to the CodeGen library because we cannot have CodeGen link against Passes. Also moved the code for the virtual functions to the header because clang/lib/CodeGen doesn't link against llvm/lib/CodeGen however it does link against Passes which contains the `class StandardInstrumentations` code but not the definition for the virtual functions leading to the error about not finding vtable for `class DroppedVariableStatsIR`	2024-12-19 18:09:14 -08:00
Shubham Sandeep Rastogi	077cc3deee	Revert "Move DroppedVariableStatsIRTest.cpp to CodeGen folder" This reverts commit 10ed7d94b52c21317a1e02ef1e2c3ff2b2d08301. Revert "Reland 2de78815604e9027efd93cac27c517bf732587d2 (#119650)" This reverts commit 0e80f9a1b51e0e068adeae1278d59cd7baacd5d8. This is because the clang-ppc64le-linux-multistage bot breaks with error undefined reference to `vtable for llvm::DroppedVariableStatsIR'	2024-12-11 23:10:14 -08:00
Shubham Sandeep Rastogi	10ed7d94b5	Move DroppedVariableStatsIRTest.cpp to CodeGen folder	2024-12-11 20:05:24 -08:00
Shubham Sandeep Rastogi	abc4183c73	Revert "Reland "[NFC] Move DroppedVariableStats to its own file and redesign it to be extensible (#118546 )" (#119048 )" This reverts commit 37606b4c22654ab66eee8f89448a117f3534f2f4. Broke the llvm-nvptx-nvidia-ubuntu bot with error: the vtable symbol may be undefined because the class is missing its key function	2024-12-06 23:19:39 -08:00
Shubham Sandeep Rastogi	37606b4c22	Reland "[NFC] Move DroppedVariableStats to its own file and redesign it to be extensible (#118546 )" (#119048 ) Move the virtual destructor definition to the cpp file and see if that gets rid of the undefined vtable error.	2024-12-06 23:13:30 -08:00
Shubham Sandeep Rastogi	259bdc0033	Revert "Reland "[NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 )" (#118546 )" This reverts commit 0c8928d456ac3ef23ed25bfc9e5d491dd7b62a11. Broke Bot: https://lab.llvm.org/buildbot/#/builders/76/builds/5008 error: undefined reference to `vtable for llvm::DroppedVariableStatsIR'	2024-12-03 16:50:53 -08:00
Shubham Sandeep Rastogi	0c8928d456	Reland "[NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 )" (#118546 ) Removed the virtual destructor in the derived class DroppedVariableStatsIR	2024-12-03 14:13:06 -08:00
Shubham Sandeep Rastogi	80987ef4b6	Revert "Reland [NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 )" This reverts commit acf3b1aa932b2237c181686e52bc61584a80a3ff. Broke https://lab.llvm.org/buildbot/#/builders/76/builds/5002 tools/clang/lib/CodeGen/CMakeFiles/obj.clangCodeGen.dir/BackendUtil.cpp.o:(.toc+0x258): undefined reference to `vtable for llvm::DroppedVariableStatsIR'	2024-12-03 12:51:24 -08:00
Shubham Sandeep Rastogi	d8b5af4504	Revert "Reland "Add a pass to collect dropped var stats for MIR" (#117044 )" This reverts commit 249755cedb17ffa707253edcef1a388f807caa35. Broke https://lab.llvm.org/buildbot/#/builders/160/builds/9420 Note: This is test shard 99 of 154. [==========] Running 2 tests from 2 test suites. [----------] Global test environment set-up. [----------] 1 test from DroppedVariableStatsMIR [ RUN ] DroppedVariableStatsMIR.InlinedAt -- exit: -11	2024-12-03 12:50:13 -08:00
Shubham Sandeep Rastogi	249755cedb	Reland "Add a pass to collect dropped var stats for MIR" (#117044 ) Moved the MIR Test to the unittests/CodeGen folder This is patch is part of a stack of patches, and follows https://github.com/llvm/llvm-project/pull/117042 I moved the MIR test to the unittests/CodeGen folder I am trying to reland https://github.com/llvm/llvm-project/pull/115566	2024-12-03 12:37:30 -08:00
Shubham Sandeep Rastogi	acf3b1aa93	Reland [NFC] Move DroppedVariableStats to its own file and redesign it to be extensible. (#117042 ) Moved the IR unit test to the CodeGen folder to resolve linker errors: `error: undefined reference to 'vtable for llvm::DroppedVariableStatsIR'` This patch is trying to reland https://github.com/llvm/llvm-project/pull/115563	2024-12-03 10:39:40 -08:00
paperchalice	c931ac5994	Reapply "[CodeGen] Introduce `MachineDomTreeUpdater`" (#96846 ) (#96851 ) This reverts commit 0f8849349ae3d3f2f537ad6ab233a586fb39d375. Resolve conflict in `MachinePostDominators.h` There is a conflict after merging #96378, resolved in #96852. Both PRs modified `MachinePostDominators.h` and triggered build failure.	2024-06-28 14:48:09 +08:00
paperchalice	0f8849349a	Revert "[CodeGen] Introduce `MachineDomTreeUpdater`" (#96846 ) Reverts llvm/llvm-project#95369 Many build bots failed	2024-06-27 12:31:24 +08:00
paperchalice	6ca387cbcb	[CodeGen] Introduce `MachineDomTreeUpdater` (#95369 ) This commit converts most of `DomTreeUpdater` into `GenericDomTreeUpdater` class template, so IR and MIR can reuse some codes. There are some differences between interfaces of `BasicBlock` and `MachineBasicBlock`, so subclasses still need to implement some functions, like `forceFlushDeletedBB`.	2024-06-27 12:25:18 +08:00
Min-Yih Hsu	5874874c24	[SelectionDAG] Introducing the SelectionDAG pattern matching framework (#78654 ) Akin to `llvm::PatternMatch` and `llvm::MIPatternMatch`, the `llvm::SDPatternMatch` introduced in this patch provides a DSL-alike framework to match SDValue / SDNode with a more succinct syntax.	2024-02-23 11:03:36 -08:00
paperchalice	7e50f006f7	[NewPM][CodeGen][llc] Add NPM support (#70922 ) Add new pass manager support to `llc`. Users can use `--passes=pass1,pass2...` to run mir passes, and use `--enable-new-pm` to run default codegen pipeline. This patch is taken from [D83612](https://reviews.llvm.org/D83612), the original author is @yuanfang-chen. --------- Co-authored-by: Yuanfang Chen <455423+yuanfang-chen@users.noreply.github.com>	2024-01-24 09:27:25 +08:00
paperchalice	ae1c1ed6af	[CodeGen] Allow `CodeGenPassBuilder` to add module pass after function pass (#77084 ) In fact, there are several backends, e.g. AArch64, AMDGPU etc. add module pass after function pass, this patch removes this constraint. This patch also adds a simple unit test for `CodeGenPassBuilder`.	2024-01-12 08:37:12 +08:00
Matt Arsenault	65b40f273f	RegAlloc: Rename MLRegalloc* files to use consistent captalization The other regalloc related files use RegAlloc, not Regalloc.	2023-09-03 09:00:27 -04:00
Francesco Petrogalli	aee34000f9	[MISched][rework] Introduce and use ResourceSegments. Re-landing the code that was reverted because of the buildbot failure in https://lab.llvm.org/buildbot#builders/9/builds/27319. Original commit message ====================== The class `ResourceSegments` is used to keep track of the intervals that represent resource usage of a list of instructions that are being scheduled by the machine scheduler. The collection is made of intervals that are closed on the left and open on the right (represented by the standard notation `[a, b)`). These collections of intervals can be extended by `add`ing new intervals accordingly while scheduling a basic block. Unit tests are added to verify the possible configurations of intervals, and the relative possibility of scheduling a new instruction in these configurations. Specifically, the methods `getFirstAvailableAtFromBottom` and `getFirstAvailableAtFromTop` are tested to make sure that both bottom-up and top-down scheduling work when tracking resource usage across the basic block with `ResourceSegments`. Note that the scheduler tracks resource usage with two methods: 1. counters (via `std::vector<unsigned> ReservedCycles;`); 2. intervals (via `std::map<unsigned, ResourceSegments> ReservedResourceSegments;`). This patch can be considered a NFC test for existing scheduling models because the tracking system that uses intervals is turned off by default (field `bit EnableIntervals = false;` in the tablegen class `SchedMachineModel`). Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D150312	2023-06-09 15:02:00 +02:00
Francesco Petrogalli	f1d1ca3d74	Revert "[MISched] Introduce and use ResourceSegments." Reverted because it produces the following builbot failure at https://lab.llvm.org/buildbot#builders/9/builds/27319: /b/ml-opt-rel-x86-64-b1/llvm-project/llvm/unittests/CodeGen/SchedBoundary.cpp: In member function ‘virtual void ResourceSegments_getFirstAvailableAtFromBottom_empty_Test::TestBody()’: /b/ml-opt-rel-x86-64-b1/llvm-project/llvm/unittests/CodeGen/SchedBoundary.cpp:395:31: error: call of overloaded ‘ResourceSegments(<brace-enclosed initializer list>)’ is ambiguous 395 \| auto X = ResourceSegments({}); \| ^ This reverts commit dc312f0331309692e8d6e06e93b3492b6a40989f.	2023-06-09 13:23:37 +02:00
Francesco Petrogalli	dc312f0331	[MISched] Introduce and use ResourceSegments. The class `ResourceSegments` is used to keep track of the intervals that represent resource usage of a list of instructions that are being scheduled by the machine scheduler. The collection is made of intervals that are closed on the left and open on the right (represented by the standard notation `[a, b)`). These collections of intervals can be extended by `add`ing new intervals accordingly while scheduling a basic block. Unit tests are added to verify the possible configurations of intervals, and the relative possibility of scheduling a new instruction in these configurations. Specifically, the methods `getFirstAvailableAtFromBottom` and `getFirstAvailableAtFromTop` are tested to make sure that both bottom-up and top-down scheduling work when tracking resource usage across the basic block with `ResourceSegments`. Note that the scheduler tracks resource usage with two methods: 1. counters (via `std::vector<unsigned> ReservedCycles;`); 2. intervals (via `std::map<unsigned, ResourceSegments> ReservedResourceSegments;`). This patch can be considered a NFC test for existing scheduling models because the tracking system that uses intervals is turned off by default (field `bit EnableIntervals = false;` in the tablegen class `SchedMachineModel`). Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D150312	2023-06-09 13:00:50 +02:00
Bjorn Pettersson	a23f984616	[CodeGen] Add unittest for findDebugLoc, rfindDebugLoc, findPrevDebugLoc and rfindPrevDebugLoc. NFC - Add some unittests for the findDebugLoc, rfindDebugLoc, findPrevDebugLoc and rfindPrevDebugLoc helpers in MachineBasicBlock. - Clean up code comments and code formatting related to the functions mentioned above. This was extracted as a pre-commit to D150577, adn some of the tests are commented out since they would crash/assert in a rather uncontrolled way.	2023-05-25 14:48:52 +02:00
Sergei Barannikov	da42b2846c	[CodeGen] Support allocating of arguments by decreasing offsets Previously, `CCState::AllocateStack` always allocated stack space by increasing offsets. For targets with stack growing up (away from zero) it is more convenient to allocate arguments by decreasing offsets, so that the first argument is at the top of the stack. This is important when calling a function with variable number of arguments: the callee does not know the size of the stack, but must be able to access "fixed" arguments. For that to work, the "fixed" arguments should have fixed offsets relative to the stack top, i.e. the variadic arguments area should be at the stack bottom (at lowest addresses). The in-tree target with stack growing up is AMDGPU, but it allocates arguments by increasing addresses. It does not support variadic arguments. A drive-by change is to promote stack size/offset to 64-bit integer. This is what MachineFrameInfo expects. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D149575	2023-05-17 21:51:52 +03:00
NAKAMURA Takumi	5d71ec6e44	Split out `CodeGenTypes` from `CodeGen` for LLT/MVT This reduces dependencies on `llvm-tblgen` so much. `CodeGenTypes` depends on `Support` at the moment. Be careful to append deps on this, since Targets' tablegens depend on this. Depends on D149024 Differential Revision: https://reviews.llvm.org/D148769	2023-05-03 00:13:20 +09:00
Archibald Elliott	f09cf34d00	[Support] Move TargetParsers to new component This is a fairly large changeset, but it can be broken into a few pieces: - `llvm/Support/TargetParser` are all moved from the LLVM Support component into a new LLVM Component called "TargetParser". This potentially enables using tablegen to maintain this information, as is shown in https://reviews.llvm.org/D137517. This cannot currently be done, as llvm-tblgen relies on LLVM's Support component. - This also moves two files from Support which use and depend on information in the TargetParser: - `llvm/Support/Host.{h,cpp}` which contains functions for inspecting the current Host machine for info about it, primarily to support getting the host triple, but also for `-mcpu=native` support in e.g. Clang. This is fairly tightly intertwined with the information in `X86TargetParser.h`, so keeping them in the same component makes sense. - `llvm/ADT/Triple.h` and `llvm/Support/Triple.cpp`, which contains the target triple parser and representation. This is very intertwined with the Arm target parser, because the arm architecture version appears in canonical triples on arm platforms. - I moved the relevant unittests to their own directory. And so, we end up with a single component that has all the information about the following, which to me seems like a unified component: - Triples that LLVM Knows about - Architecture names and CPUs that LLVM knows about - CPU detection logic for LLVM Given this, I have also moved `RISCVISAInfo.h` into this component, as it seems to me to be part of that same set of functionality. If you get link errors in your components after this patch, you likely need to add TargetParser into LLVM_LINK_COMPONENTS in CMake. Differential Revision: https://reviews.llvm.org/D137838	2022-12-20 11:05:50 +00:00
Aiden Grossman	e5e3dccd07	[mlgo] Add in-development instruction based features for regalloc advisor This patch adds in instruction based features to the regalloc advisor gated behind a flag so a user can decide at runtime whether or not they want to enable the feature. The features are only enabled when LLVM is compiled in MLGO develpment mode (LLVM_HAVE_TF_API) is set to true. To extract the instruction features, I'm taking a list of segments from each LiveInterval and noting the start and end SlotIndices. This list is then sorted based on the start SlotIndex and I iterate through each SlotIndex to grab instructions, making sure to check for overlaps. This results in a vector of opcodes and binary mapping matrix that maps live ranges to the opcodes of the instructions within that LR. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D131930	2022-09-17 19:54:45 +00:00
Alexey Lapshin	554aea52d7	[reland][Debuginfo][DWARF][NFC] Refactor DwarfStringPoolEntryRef. This review is extracted from D96035. This patch adds possibility to keep not only DwarfStringPoolEntry, but also pointer to it. The DwarfStringPoolEntryRef keeps reference to the string map entry. String map keeps string data and corresponding DwarfStringPoolEntry info. Not all string map entries may be included into the result, and then not all string entries should have DwarfStringPoolEntry info. Currently StringMap keeps DwarfStringPoolEntry for all entries. It leads to extra memory usage. This patch allows to keep DwarfStringPoolEntry info only for entries which really need it. [reland] : make msan happy. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D126883	2022-07-01 20:08:09 +03:00
Vitaly Buka	72cd6b6c83	Revert "[Debuginfo][DWARF][NFC] Refactor DwarfStringPoolEntryRef." Breaks msan bot, see D126883 This reverts commit 77df3be0dee415713cf5c79543f00532674f428b.	2022-06-29 17:53:42 -07:00
Alexey Lapshin	77df3be0de	[Debuginfo][DWARF][NFC] Refactor DwarfStringPoolEntryRef. This review is extracted from D96035. This patch adds possibility to keep not only DwarfStringPoolEntry, but also pointer to it. The DwarfStringPoolEntryRef keeps reference to the string map entry. String map keeps string data and corresponding DwarfStringPoolEntry info. Not all string map entries may be included into the result, and then not all string entries should have DwarfStringPoolEntry info. Currently StringMap keeps DwarfStringPoolEntry for all entries. It leads to extra memory usage. This patch allows to keep DwarfStringPoolEntry info only for entries which really need it. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D126883	2022-06-29 00:12:03 +03:00
Sebastian Neubauer	4a02562275	[AMDGPU] Lazily init pal metadata on first function Delay reading global metadata until the first function or the end of the file is emitted. That way, earlier module passes can set metadata that is emitted in the ELF. `emitStartOfAsmFile` gets called when the passes are initialized, which prevented earlier passes from changing the metadata. This fixes issues encountered after converting AMDGPUResourceUsageAnalysis to a Module pass in D117504. Differential Revision: https://reviews.llvm.org/D118492	2022-02-04 18:39:35 +01:00
Mircea Trofin	fa99cb64ff	[mlgo][regalloc] Add score calculation for training Add the calculation of a score, which will be used during ML training. The score qualifies the quality of a regalloc policy, and is independent of what we train (currently, just eviction), or the regalloc algo itself. We can then use scores to guide training (which happens offline), by formulating a reward based on score variation - the goal being lowering scores (currently, that reward is percentage reduction relative to Greedy's heuristic) Currently, we compute the score by factoring different instruction counts (loads, stores, etc) with the machine basic block frequency, regardless of the instructions' provenance - i.e. they could be due to the regalloc policy or be introduced previously. This is different from RAGreedy::reportStats, which accummulates the effects of the allocator alone. We explored this alternative but found (at least currently) that the more naive alternative introduced here produces better policies. We do intend to consolidate the two, however, as we are actively investigating improvements to our reward function, and will likely want to re-explore scoring just the effects of the allocator. In either case, we want to decouple score calculation from allocation algorighm, as we currently evaluate it after a few more passes after allocation (also, because score calculation should be reusable regardless of allocation algorithm). We intentionally accummulate counts independently because it facilitates per-block reporting, which we found useful for debugging - for instance, we can easily report the counts indepdently, and then cross-reference with perf counter measurements. Differential Revision: https://reviews.llvm.org/D115195	2021-12-07 09:00:27 -08:00
Jeremy Morse	838b4a533e	[DebugInfo][NFC] Move LiveDebugValues class to header This patch shifts the InstrRefBasedLDV class declaration to a header. Partially because it's already massive, but mostly so that I can start writing some unit tests for it. This patch also adds the boilerplate for said unit tests. Differential Revision: https://reviews.llvm.org/D110165	2021-10-12 16:07:26 +01:00
Michael Liao	b9c05aff20	[MIRPrinter] Add machine metadata support. - Distinct metadata needs generating in the codegen to attach correct AAInfo on the loads/stores after lowering, merging, and other relevant transformations. - This patch adds 'MachhineModuleSlotTracker' to help assign slot numbers to these newly generated unnamed metadata nodes. - To help 'MachhineModuleSlotTracker' track machine metadata, the original 'SlotTracker' is rebased from 'AbstractSlotTrackerStorage', which provides basic interfaces to create/retrive metadata slots. In addition, once LLVM IR is processsed, additional hooks are also introduced to help collect machine metadata and assign them slot numbers. - Finally, if there is any such machine metadata, 'MIRPrinter' outputs an additional 'machineMetadataNodes' field containing all the definition of those nodes. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D103205	2021-06-19 12:48:08 -04:00
Hsiangkai Wang	8d06a678a5	[SelectionDAG] Avoid aliasing analysis if the object size is unknown. If the size of memory access is unknown, do not use it to analysis. One example of unknown size memory access is to load/store scalable vector objects on the stack. Differential Revision: https://reviews.llvm.org/D91833	2020-11-25 06:13:37 +08:00
Mircea Trofin	6d193ba333	[NFC][regalloc] Unit test for AllocationOrder iteration. Added unittests. In the process, separated core construction - which just needs the hits, order, and 'HardHints' values - from construction from current register allocation state, to simplify testing. Differential Revision: https://reviews.llvm.org/D88455	2020-09-29 10:48:07 -07:00
Igor Kudrin	a8058c6f8d	[DebugInfo] Fix DIE value emitters to be compatible with DWARF64 (2/19). DW_FORM_sec_offset and DW_FORM_strp imply values of different sizes with DWARF32 and DWARF64. The patch fixes DIE value classes to use correct sizes when emitting their values. For DIELocList it ensures that the requested DWARF form matches the current DWARF format because that class uses a method that selects the size automatically. Differential Revision: https://reviews.llvm.org/D87009	2020-09-15 11:30:02 +07:00
Igor Kudrin	380e746bcc	[DebugInfo] Fix methods of AsmPrinter to emit values corresponding to the DWARF format (1/19). These methods are used to emit values which are 32-bit in DWARF32 and 64-bit in DWARF64. The patch fixes them so that they choose the length automatically, depending on the DWARF format set in the Context. Differential Revision: https://reviews.llvm.org/D87008	2020-09-15 11:29:48 +07:00
Yuanfang Chen	f5b5ccf2a6	Reland "Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager"" This relands commit 320eab2d558fde0b61437e9b9075bfd301c2c474. The test failed because it was looking for x86-linux target unconditionally. Now it gets the default target.	2020-08-07 16:40:49 -07:00
Yuanfang Chen	320eab2d55	Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager" This reverts commit 911565d1085d9447363fe8ad041817436c4998fe. Broke some non-Linux bots.	2020-08-07 11:59:58 -07:00

1 2

70 Commits