llvm-project

Author	SHA1	Message	Date
Nikita Popov	b74182edae	[llvm-reduce] Handle new flags in complexity score This has gotten out of sync with the actual flag reduction. It's not particularly important though, as it only seems to be used to determine whether to do another round of delta reduction.	2024-08-07 14:08:35 +02:00
Alexis Engelke	41491c7723	[CodeGen] Allocate RegAllocHints map lazily (#102186 ) This hint map is not required whenever a new register is added, in fact, at -O0, it is not used at all. Growing this map is quite expensive, as SmallVectors are not trivially copyable. Grow this map only when hints are actually added to avoid multiple grows and grows when no hints are added at all.	2024-08-07 07:56:32 +02:00
Matt Arsenault	63e1647827	CodeGen: Remove MachineModuleInfo reference from MachineFunction (#100357 ) This avoids another unserializable field. Move the DbgInfoAvailable field into the AsmPrinter, which is only really a cache/convenience bit for checking a direct IR module metadata check.	2024-07-26 13:10:08 +04:00
Matt Arsenault	9a25866402	CodeGen: Avoid using MachineFunction::getMMI in MachineModuleSlotTracker (#100310 )	2024-07-24 12:27:00 +04:00
Jay Foad	63a5dc4aed	[CodeGen] Do not pass MF into MachineRegisterInfo methods. NFC. (#84770 ) MachineRegisterInfo already knows the MF so there is no need to pass it in as an argument.	2024-03-11 15:35:05 +00:00
Nikita Popov	ea668144d9	[CodeGen] Split off PseudoSourceValueManager into separate header (NFC) (#73327 ) Most users of PseudoSourceValue.h only need PseudoSourceValue, not the PseudoSourceValueManager. However, this header pulls in some very expensive dependencies like ValueMap.h, which is only used for the manager. Split off the manager into a separate header and include it only where used.	2023-12-04 10:17:59 +01:00
Matt Arsenault	748f861bea	llvm-reduce: Handle cloning for MachineJumpTableInfo (#69086 )	2023-10-31 21:51:50 +09:00
Alex Richardson	a8f8613dec	Introduce and use codegen::createTargetMachineForTriple() This creates a TargetMachine with the default options (from the command line flags). This allows us to share a bit more code between tools. Differential Revision: https://reviews.llvm.org/D141057	2023-10-04 13:45:16 -07:00
Arthur Eubanks	0a1aa6cda2	[NFC][CodeGen] Change CodeGenOpt::Level/CodeGenFileType into enum classes (#66295 ) This will make it easy for callers to see issues with and fix up calls to createTargetMachine after a future change to the params of TargetMachine. This matches other nearby enums. For downstream users, this should be a fairly straightforward replacement, e.g. s/CodeGenOpt::Aggressive/CodeGenOptLevel::Aggressive or s/CGFT_/CodeGenFileType::	2023-09-14 14:10:14 -07:00
Jay Foad	2dcf051259	[CodeGen] Store call frame size in MachineBasicBlock Record the call frame size on entry to each basic block. This is usually zero except when a basic block has been split in the middle of a call sequence. This simplifies PEI::replaceFrameIndices which previously had to visit basic blocks in a specific order and had special handling for unreachable blocks. More importantly it paves the way for an equally simple implementation of a backwards version of replaceFrameIndices, which is required to fully convert PrologEpilogInserter to backwards register scavenging, which is preferred because it does not rely on accurate kill flags. Differential Revision: https://reviews.llvm.org/D156113	2023-07-27 10:32:00 +01:00
Oliver Stannard	aea8db8eb9	Revert "[CodeGen] Store SP adjustment in MachineBasicBlock. NFCI." This reverts commit 58d1eaa3b6ce4f7285c51f83faff7a3ac374c746.	2023-07-13 14:25:39 +01:00
Jay Foad	58d1eaa3b6	[CodeGen] Store SP adjustment in MachineBasicBlock. NFCI. Record the SP adjustment on entry to each basic block. This is almost always zero except on targets like ARM which can split a basic block in the middle of a call sequence. This simplifies PEI::replaceFrameIndices which previously had to visit basic blocks in a specific order and had special handling for unreachable blocks. More importantly it paves the way for an equally simple implementation of a backwards version of replaceFrameIndices, which is required to fully convert PrologEpilogInserter to backwards register scavenging, which is preferred because it does not rely on accurate kill flags. Differential Revision: https://reviews.llvm.org/D154281	2023-07-12 14:29:26 +01:00
Matt Arsenault	363d99db49	llvm-reduce: Fix not preserving uselistorder with bitcode Fix accidentally passing pointer to bool argument This was supposed to be writing bitcode with preserved uselistorder, but instead was only enabling it with LTO module summaries.	2023-06-30 10:26:32 -04:00
Shraiysh Vaishay	7021182d6b	[nfc][llvm] Replace pointer cast functions in PointerUnion by llvm casting functions. This patch replaces the uses of PointerUnion.is function by llvm::isa, PointerUnion.get function by llvm::cast, and PointerUnion.dyn_cast by llvm::dyn_cast_if_present. This is according to the FIXME in the definition of the class PointerUnion. This patch does not remove them as they are being used in other subprojects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D148449	2023-04-17 13:40:51 -05:00
wangpc	267708f9d5	[MachineOutliner] Add IsOutlined to MachineFunction We add a field `IsOutlined` to indicate whether a MachineFunction is outlined and set it true for outlined functions in MachineOutliner. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D146191	2023-04-10 10:57:29 +08:00
Archibald Elliott	d768bf994f	[NFC][TargetParser] Replace uses of llvm/Support/Host.h The forwarding header is left in place because of its use in `polly/lib/External/isl/interface/extract_interface.cc`, but I have added a GCC warning about the fact it is deprecated, because it is used in `isl` from where it is included by Polly.	2023-02-10 09:59:46 +00:00
Matt Arsenault	c5fa6b1610	llvm-reduce: Parse file from the opened buffer instead of the file If this wasn't bitcode this was opening a second MemoryBuffer.	2023-01-27 20:14:36 -04:00
Matt Arsenault	c0a10b2772	llvm-reduce: Use WithColor in another place Use more consistently capitalized/colorized/punctuated error messages.	2023-01-27 20:14:36 -04:00
Matt Arsenault	d78b4c44ab	llvm-reduce: Fix default handling of intermediate format Bitcode inputs should produce bitcode intermediates by default.	2023-01-20 23:21:13 -04:00
Matt Arsenault	592536a9ec	llvm-reduce: Reorganize some function locations Move things that are naturally methods of ReducerWorkItem to be methods of ReducerWorkItem in the same source file.	2023-01-20 23:21:13 -04:00
Matt Arsenault	333ffafb45	llvm-reduce: Trim includes and avoid using namespace in a header	2023-01-19 21:48:56 -04:00
Matt Arsenault	0782d97ff5	llvm-reduce: Account for initializer complexity	2023-01-19 21:35:27 -04:00
Matt Arsenault	a6000c143b	llvm-reduce: Account for aliases and ifuncs in IR complexity score	2023-01-19 21:35:27 -04:00
Jannik Silvanus	df1a74ac3c	[IR] Support importing modules with invalid data layouts. Use the existing mechanism to change the data layout using callbacks. Before this patch, we had a callback type DataLayoutCallbackTy that receives a single StringRef specifying the target triple, and optionally returns the data layout string to be used. Module loaders (both IR and BC) then apply the callback to potentially override the module's data layout, after first having imported and parsed the data layout from the file. We can't do the same to fix invalid data layouts, because the import will already fail, before the callback has a chance to fix it. Instead, module loaders now tentatively parse the data layout into a string, wait until the target triple has been parsed, apply the override callback to the imported string and only then parse the tentative string as a data layout. Moreover, add the old data layout string S as second argument to the callback, in addition to the already existing target triple argument. S is either the default data layout string in case none is specified, or the data layout string specified in the module, possibly after auto-upgrades (for the BitcodeReader). This allows callbacks to inspect the old data layout string, and fix it instead of setting a fixed data layout. Also allow to pass data layout override callbacks to lazy bitcode module loader functions. Differential Revision: https://reviews.llvm.org/D140985	2023-01-12 10:10:45 +01:00
Matt Arsenault	9c8b89f580	llvm-reduce: Refine missing argument behavior We required the test and input arguments for --print-delta-passes which is unhelpful. Also, start printing the help output if no arguments were supplied. It looks like there's more sophisticated ways to accomplish this with the opt library, but it was less work to manually emit these errors.	2023-01-03 16:01:36 -05:00
Krzysztof Parzyszek	110fe4f495	[IRReader] Convert Optional in DataLayoutCallbackTy to std::optional	2022-12-07 08:47:25 -08:00
Fangrui Song	89fae41ef1	[IR] llvm::Optional => std::optional Many llvm/IR/* files have been migrated by other contributors. This migrates most remaining files.	2022-12-05 04:13:11 +00:00
Fangrui Song	bac974278c	CodeGen/CommandFlags: Convert Optional to std::optional	2022-12-03 18:38:12 +00:00
Matt Arsenault	3c436ab0d4	llvm-reduce: Support emitting bitcode for final result Previously, this unconditionally emitted text IR. I ran into a bug that manifested in broken disassembly, so the desired output was the bitcode format. If the input format was binary bitcode, the requested output file ends in .bc, or an explicit -output-bitcode option was used, emit bitcode.	2022-10-31 20:35:08 -07:00
Matt Arsenault	c23ac22f0e	llvm-reduce: Don't write out IR to score IR complexity In a testcase I'm working on, the old write out and count IR lines was taking about 200-300ms per iteration. This drops it out of the profile. This doesn't account for everything, but it doesn't seem to matter. We should probably try to account for metadata and constantexpr tree depths.	2022-10-12 17:25:23 -07:00
Eli Friedman	cfd2c5ce58	Untangle the mess which is MachineBasicBlock::hasAddressTaken(). There are two different senses in which a block can be "address-taken". There can be a BlockAddress involved, which means we need to map the IR-level value to some specific block of machine code. Or there can be constructs inside a function which involve using the address of a basic block to implement certain kinds of control flow. Mixing these together causes a problem: if target-specific passes are marking random blocks "address-taken", if we have a BlockAddress, we can't actually tell which MachineBasicBlock corresponds to the BlockAddress. So split this into two separate bits: one for BlockAddress, and one for the machine-specific bits. Discovered while trying to sort out related stuff on D102817. Differential Revision: https://reviews.llvm.org/D124697	2022-08-16 16:15:44 -07:00
Matt Arsenault	0f9d9edd24	llvm-reduce: Add reduction for custom register masks I have a register allocator failure that only reproduces with IPRA enabled, and requires the specific regmask if I want to only run the one relevant pass. The printed custom regmask is enormous and I would like to reduce it. This reduces each individual bit in the mask, but it would probably be better to start at register units and clear all aliasing fields at a time. This would require stricter verification that all aliasing bits are set in regmasks (although I would prefer to switch regmasks to use register units in the first place).	2022-07-18 13:41:08 -04:00
Matthew Voss	6b3956e123	[llvm-reduce] Add support for LTO bitcode files Adds support for reading and writing LTO bitcode files. - Emit a summary if the original bitcode file had a summary - Use split LTO units if the original bitcode file used them. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D127168	2022-06-30 08:58:24 -07:00
Matt Arsenault	cc5a1b3dd9	llvm-reduce: Add cloning of target MachineFunctionInfo MIR support is totally unusable for AMDGPU without this, since the set of reserved registers is set from fields here. Add a clone method to MachineFunctionInfo. This is a subtle variant of the copy constructor that is required if there are any MIR constructs that use pointers. Specifically, at minimum fields that reference MachineBasicBlocks or the MachineFunction need to be adjusted to the values in the new function.	2022-06-07 10:14:48 -04:00
Matt Arsenault	56303223ac	llvm-reduce: Don't assert on functions which don't track liveness Use the query that doesn't assert if TracksLiveness isn't set, which needs to always be available. We also need to start printing liveins regardless of TracksLiveness.	2022-06-07 10:00:25 -04:00
Clemens Wasser	42c7f494d9	[tools] Forward declare classes & remove includes Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120208	2022-06-03 16:32:04 -07:00
Matt Arsenault	a0dcbe45bd	llvm-reduce: Add reduction pass to remove regalloc hints I'm a bit confused by what's actually stored for the allocation hints. The MIR parser only handles the "simple" case where there's a single hint. I don't really understand the assertion in clearSimpleHint, or under what circumstances there are multiple hint registers.	2022-06-01 09:15:41 -04:00
Matt Arsenault	35264e7179	llvm-reduce: Introduce new scoring mechanism for MIR reductions Many MIR reductions benefit from or require increasing the instruction count. For example, unlike in the IR, you may need to insert a new instruction to represent an undef. The current instruction reduction pass works around this by sticking implicit defs on whatever instruction happens to be first in the entry block block. Other strategies I've applied manually include breaking instructions with multiple defs into separate instructions, or breaking large register defs into multiple subregister defs. Make up a simple scoring system based on what I generally try to get rid of first when manually reducing. Counts implicit defs as free since reduction passes will be introducing them, although they probably should count for something. It also might make more sense to have a comparison the two functions, rather than having to compute a contextless number. This isn't particularly well tested since overall the MIR support isn't in a place where it is useful on the kinds of testcases I want to throw at it.	2022-05-01 18:24:04 -04:00
Matt Arsenault	717209763e	llvm-reduce: Fix incorrect cloning of MachineMemOperands There were two problems with directly copying the MMOs from the old function. The MMOs are owned by the function's Allocator, so need to be reallocated anyways (surprisingly I didn't notice breakage on this). Second, the PseudoSourceValues are also allocated per function and need to be reallocated.	2022-04-27 18:51:38 -04:00
Matt Arsenault	e39e9d339c	llvm-reduce: Fix crashing on file opening error for mir path	2022-04-27 18:15:12 -04:00
Matt Arsenault	7c2db66632	llvm-reduce: Support multiple MachineFunctions The current testcase I'm trying to reduce only reproduces with IPRA enabled and requires handling multiple functions. The only real difference vs. the IR is the extra indirect to look for the underlying MachineFunction, so treat the ReduceWorkItem as the module instead of the function. The ugliest piece of this is really the ugliness of MachineModuleInfo. It not only tracks actual module state, but has a number of transient fields used for isel and/or the asm printer. These shouldn't do any harm for the use here, though they should be separated out.	2022-04-27 18:11:59 -04:00
Matt Arsenault	1747a93b28	llvm-reduce: Try to parse triple/datalayout from module This saves needing to specify -mtriple on nearly every use for MIR reduction.	2022-04-27 17:47:46 -04:00
Matt Arsenault	18b9c46370	llvm-reduce: Fix not cloning MachineInstr flags	2022-04-27 17:29:18 -04:00
Matt Arsenault	7b57ef670c	llvm-reduce: Simplify virtual register cloning Just clone all the virtual registers instead of looking for def operands. This preserves the register values used, simplifying the rest of the code. This avoids needing to expose the register map to target code.	2022-04-26 13:17:13 -04:00
Matt Arsenault	a27b9ab391	llvm-reduce: Preserve frame index values when cloning function Previously the specific values used for fixed frame indexes was in reverse order in the cloned function from the original, and a map was used to adjust all frame indexes to the potentially new values. Insert the fixed objects in reverse to avoid this. This simplifies other code, since now we don't need to track down all frame indexes. This will allow targets that store frame indexes in MachineFunctionInfo to simply copy the values. Note this isn't directly observable in the test since the resulting MIR print/parse can shuffle the IDs around (in particular the final serialization implicitly strips out dead objects).	2022-04-26 13:17:13 -04:00
Matt Arsenault	debfb96be6	llvm-reduce: Fix cloning unset maxCallFrameSize This was promoting an unset max call frame size to a max call frame size of 0.	2022-04-22 18:28:45 -04:00
Matt Arsenault	53d88581f1	llvm-reduce: Clone properties of blocks getSuccProbability was private for some reason, saying to go through MachineBranchProbabilityInfo. There doesn't seem to be much point to that, as that wrapper directly calls this. Like other areas, some of these fields aren't handled by the MIR printer/parser so aren't tested.	2022-04-20 09:47:45 -04:00
Matt Arsenault	193fde7509	llvm-reduce: Clone some of the easy function properties Error on some of these other fields, since tracking down test cases for all of these at once is exhausting.	2022-04-15 20:31:07 -04:00
Matt Arsenault	f163106f39	llvm-reduce: Handle cloning MachineFrameInfo and stack objects This didn't work at all before, and would assert on any frame index. Also copy the other fields, which I believe should cover everything. There are a few that are untested since MIR serialization is apparently still missing them (isStatepointSpillSlot, ObjectSSPLayout, and ObjectSExt/ObjectZExt).	2022-04-14 21:25:06 -04:00
Matt Arsenault	e33b07f859	llvm-reduce: Inform MRI of used phys reg masks I'm not sure how to directly observe this invisible cache for a test.	2022-04-14 20:52:05 -04:00

1 2

59 Commits