llvm-project

Author	SHA1	Message	Date
Akshat Oke	44d0e9522a	[CodeGen][NewPM] Port TailDuplicate pass to NPM (#113293 )	2024-10-30 11:48:40 +05:30
Kyungwoo Lee	0dd9fdcf83	[StructuralHash] Support Differences (#112638 ) This computes a structural hash while allowing for selective ignoring of certain operands based on a custom function that is provided. Instead of a single hash value, it now returns FunctionHashInfo which includes a hash value, an instruction mapping, and a map to track the operand location and its corresponding hash value that is ignored. Depends on https://github.com/llvm/llvm-project/pull/112621. This is a patch for https://discourse.llvm.org/t/rfc-global-function-merging/82608.	2024-10-26 20:02:05 -07:00
Akshat Oke	c4c60c0db9	[CodeGen][NewPM] Port OptimizePHIs to NPM (#113433 )	2024-10-23 16:55:21 +05:30
Justin Fargnoli	8a12e0131f	Revert "[LLVM] Add IRNormalizer Pass" (#113392 ) Reverts llvm/llvm-project#68176 Introduced BuildBot failure: https://github.com/llvm/llvm-project/pull/68176#issuecomment-2428243474	2024-10-22 16:01:32 -07:00
Akshat Oke	4e32d7236b	[NewPM][CodeGen] Port LiveRegMatrix to NPM (#109938 )	2024-10-22 15:28:04 +05:30
Akshat Oke	93802815ab	[NewPM][CodeGen] Port VirtRegMap to NPM (#109936 )	2024-10-22 15:15:56 +05:30
Justin Fargnoli	1295d2e6da	[LLVM] Add IRNormalizer Pass (#68176 ) Add the llvm-canon tool. Description from the [original PR](https://reviews.llvm.org/D66029#change-wZv3yOpDdxIu): > Added a new llvm-canon tool which aims to transform LLVM Modules into a canonical form by reordering and renaming instructions while preserving the same semantics. This tool makes it easier to spot semantic differences while diffing two modules which have undergone different transformation passes. The current version of this tool can: - Reorder instructions within a function. - Rename instructions based on the operands. - Sort commutative operands. This code was originally written by @michalpaszkowski and [submitted to mainline LLVM](`14d358537f`). However, it was quickly [reverted](`335de55fa3`) to do BuildBot errors. Michal presented his version of the tool in [LLVM-Canon: Shooting for Clear Diffs](https://www.youtube.com/watch?v=c9WMijSOEUg). @AidanGoldfarb and I ported the code to the new pass manager, added more tests, and fixed some bugs related to PHI nodes that may have been the root cause of the BuildBot errors that caused the patch to be reverted. Additionally, we rewrote the implementation of instruction reordering to fix cases where the original algorithm would break use-def chains. Note that this is @AidanGoldfarb and I's first time submitting to LLVM. Please liberally critique the PR! CC @plotfi for initial review. --------- Co-authored-by: Aidan <aidan.goldfarb@mail.mcgill.ca>	2024-10-21 18:11:11 -07:00
Nicholas Guy	7be1dc0f32	[PassRegistry] Add complex deinterleaving pass to PassRegistry.def (#112874 ) Allow for the complex deinterleaving pass to be invoked via `opt --passes=complex-deinterleaving`	2024-10-18 13:08:24 +01:00
Christudasan Devadasan	488d3924dd	[CodeGen][NewPM] Port EarlyIfConversion pass to NPM. (#108508 )	2024-10-16 13:22:57 +05:30
Christudasan Devadasan	732b804e5f	[CodeGen][NewPM] Port machine trace metrics analysis to new pass manager. (#108507 )	2024-10-16 13:19:55 +05:30
Akshat Oke	cd6c2b80be	[NewPM][CodeGen] Port StackColoring to NPM (#111812 )	2024-10-14 19:23:34 +05:30
Matt Arsenault	1bc9b67bd8	Scalarizer: Replace cl::opts with pass parameters (#110645 ) Preserve the existing defaults (although load-store defaulting to false is a really bad one). Also migrate DirectX tests to new PM.	2024-10-02 14:45:26 +04:00
Akshat Oke	d2d78e584b	[NewPM][CodeGen] Port MachineLICM to NPM (#107376 )	2024-09-20 11:34:18 +05:30
Antonio Frighetto	2ae968a0d9	[Instrumentation] Move out to Utils (NFC) (#108532 ) Utility functions have been moved out to Utils. Minor opportunity to drop the header where not needed.	2024-09-15 21:07:40 -07:00
Yuxuan Chen	a416267a5f	[LLVM][Coroutines] Transform "coro_elide_safe" calls to switch ABI coroutines to the `noalloc` variant (#99285 ) This patch is episode three of the middle end implementation for the coroutine HALO improvement project published on discourse: https://discourse.llvm.org/t/language-extension-for-better-more-deterministic-halo-for-c-coroutines/80044 After we attribute the calls to some coroutines as "coro_elide_safe" in the C++ FE and creating a `noalloc` ramp function, we use a new middle end pass to move the call to coroutines to the noalloc variant. This pass should be run after CoroSplit. For each node we process in CoroSplit, we look for its callers and replace the attributed ones in presplit coroutines to the noalloc one. The transformed `noalloc` ramp function will also require a frame pointer to a block of memory it can use as an activation frame. We allocate this on the caller's frame with an alloca. Please note that we cannot safely transform such attributed calls in post-split coroutines due to memory lifetime reasons. The CoroSplit pass is responsible for creating the coroutine frame spills for all the allocas in the coroutine. Therefore it will be unsafe to create new allocas like this one in post-split coroutines. This happens relatively rarely because CGSCC performs the passes on the callees before the caller. However, if multiple coroutines coexist in one SCC, this situation does happen (and prevents us from having potentially unbound frame size due to recursion.) You can find episode 1: Clang FE of this patch series at https://github.com/llvm/llvm-project/pull/99282 Episode 2: CoroSplit at https://github.com/llvm/llvm-project/pull/99283	2024-09-08 23:09:40 -07:00
Mingming Liu	d4ddf06b0c	[NFCI]Remove EntryCount from FunctionSummary and clean up surrounding synthetic count passes. (#107471 ) The primary motivation is to remove `EntryCount` from `FunctionSummary`. This frees 8 bytes out of `sizeof(FunctionSummary)` (136 bytes as of `64498c5483`). While I'm at it, this PR clean up {SummaryBasedOptimizations, SyntheticCountsPropagation} since they were not used and there are no plans to further invest on them. With this patch, bitcode writer writes a placeholder 0 at the byte offset of `EntryCount` and bitcode reader can parse the function entry count at the correct byte offset. Added a TODO to stop writing `EntryCount` and bump bitcode version	2024-09-06 16:38:17 -07:00
Mircea Trofin	775c50709c	[ctx_prof] Flattened profile lowering pass (#107329 ) Pass to flatten and lower the contextual profile to profile (i.e. `MD_prof`) metadata. This is expected to be used after all IPO transformations have happened. Prior to lowering, the instrumentation is maintained during IPO and the contextual profile is kept in sync (see PRs #105469, #106154). Flattening (#104539) sums up all the counters belonging to all a function's context nodes. We first propagate counter values (from the flattened profile) using the same propagation algorithm as `PGOUseFunc::populateCounters`, then map the edge values to `branch_weights`. Functions. in the module that don't have an entry in the flattened profile are deemed cold, and any `MD_prof` metadata they may have is reset. The profile summary is also reset at this point. Issue [#89287](https://github.com/llvm/llvm-project/issues/89287)	2024-09-06 13:47:08 -07:00
vporpo	52dca6ffae	[SandboxVec] Boilerplate (#107431 ) This patch implements the new pass and registers it with the pass manager. For context, this is a vectorizer that operates on Sandbox IR, which is a transactional IR on top of LLVM IR.	2024-09-05 13:38:39 -07:00
Christudasan Devadasan	6c143a86cd	[CodeGen][NewPM] Port MachineCSE pass to new pass manager. (#106605 )	2024-09-04 18:54:07 +05:30
Nikita Popov	34b10e165d	[InstCombine] Remove optional LoopInfo dependency https://github.com/llvm/llvm-project/pull/106075 has removed the last dependency on LoopInfo in InstCombine, so don't fetch the analysis anymore and remove the use-loop-info pass option.	2024-09-02 10:25:45 +02:00
Shengchen Kan	87c86aa6b9	[X86,SimplifyCFG] Support hoisting load/store with conditional faulting (Part I) (#96878 ) This is simplifycfg part of https://github.com/llvm/llvm-project/pull/95515 In this PR, we support hoisting load/store with conditional faulting in `SimplifyCFGOpt::speculativelyExecuteBB` to eliminate conditional branches. This is for cases like ``` void test (int a, int b) { if (a) b = a; } ``` In the following patches, we will support the hoist in `SimplifyCFGOpt::hoistCommonCodeFromSuccessors`. That is for cases like ``` void test (int a, int c, int d) { if (a) c = a; else d = a; } ```	2024-08-29 10:42:44 +08:00
Philip Reames	27a62ec72a	[LSR] Split the -lsr-term-fold transformation into it's own pass (#104234 ) This transformation doesn't actually use any of the internal state of LSR and recomputes all information from SCEV. Splitting it out makes it easier to test. Note that long term I would like to write a version of this transform which is integrated with LSR's solver, but if that happens, we'll just delete the extra pass. Integration wise, I switched from using TTI to using a pass configuration variable. This seems slightly more idiomatic, and means we don't run the extra logic on any target other than RISCV.	2024-08-17 18:34:23 -07:00
Mircea Trofin	50c876a486	[nfc][ctx_prof] Remove the need for `PassBuilder` to know about `UseCtxProfile` (#104492 )	2024-08-15 13:16:55 -07:00
Justin Bogner	372ddcd1ba	[DXIL][Analysis] Boilerplate for DXILResourceAnalysis pass Broke this out into its own commit to make the next one easier to review. Pull Request: https://github.com/llvm/llvm-project/pull/100700	2024-08-15 00:11:45 +03:00
S. Bharadwaj Yadavalli	03e6675fc7	[DXIL][Analysis] Add DXILMetadataAnalysis pass (#102079 ) DXIL Metadata Analysis passes (one for legacy PM and one for new PM) that collect following DXIL module metadata information in a structure are added. 1. Shader Model version 2. DXIL version 3. Shader Stage Information collected using the legacy pass is verified by adding additional test commands to existing metadata test sources.	2024-08-12 13:51:09 -04:00
Matt Arsenault	f86da4cb7d	StructurizeCFG: Add SkipUniformRegions pass parameter to new PM version (#102812 ) Keep respecting the old cl::opt for now.	2024-08-12 15:13:15 +04:00
Chris Apple	8acf8852e9	[LLVM][rtsan] Add RealtimeSanitizer transform pass (#101232 ) Split from #100596. Introduce the RealtimeSanitizer transform, which inserts the rtsan_enter/exit functions at the appropriate places in an instrumented function.	2024-08-08 16:32:54 -07:00
Mircea Trofin	dbbf0762b6	[ctx_prof] CtxProfAnalysis (#102084 ) This is an immutable analysis that loads and makes the contextual profile available to other passes. This patch introduces the analysis and an analysis printer pass. Subsequent patches will introduce the APIs that IPO passes will call to modify the profile as result of their changes.	2024-08-07 14:39:48 -04:00
Tianqing Wang	3d494bfc7f	[SimplifyCFG] Increase budget for FoldTwoEntryPHINode() if the branch is unpredictable. (#98495 ) The `!unpredictable` metadata has been present for a long time, but it's usage in optimizations is still limited. This patch teaches `FoldTwoEntryPHINode()` to be more aggressive with an unpredictable branch to reduce mispredictions. A TTI interface `getBranchMispredictPenalty()` is added to distinguish between different hardwares to ensure we don't go too far for simpler cores. For simplicity, only a naive x86 implementation is included for the time being.	2024-07-23 07:47:21 +08:00
Christudasan Devadasan	15b41d207e	[CodeGen] change prototype of regalloc filter function (#93525 ) [CodeGen] Change the prototype of regalloc filter function Change the prototype of the filter function so that we can filter not just by RegClass. We need to implement more complicated filter based upon some other info associated with each register. Patch provided by: Gang Chen (gangc@amd.com)	2024-07-22 16:49:39 +05:30
paperchalice	1b873e565e	[CodeGen][NewPM] Port `phi-node-elimination` to new pass manager (#98867 ) - Add `PHIEliminationPass `. - Support new pass manager in `MachineBasicBlock:: SplitCriticalEdge `	2024-07-17 11:26:56 +08:00
paperchalice	01191874f9	[CodeGen] Port `two-address-instructions` to new pass manager (#98632 ) Add `TwoAddressInstructionPass`.	2024-07-15 15:11:06 +08:00
paperchalice	c09ed6a29e	[CodeGen][NewPM] Port `MachineVerifier` to new pass manager (#98628 ) - Add `MachineVerifierPass`. - Use complete `MachineVerifierPass` in `VerifyInstrumentation` if possible. `LiveStacksAnalysis` will be added in future, all other analyses are done.	2024-07-15 12:42:44 +08:00
paperchalice	fb128630a7	[CodeGen][NewPM] Add `MachineOptimizationRemarkEmitterAnalysis` (#98601 ) Add `MachineOptimizationRemarkEmitterAnalysis` the legacy version `MachineOptimizationRemarkEmitterPass` is already a wrapper.	2024-07-14 14:29:08 +08:00
paperchalice	099899961c	[CodeGen][NewPM] Port `machine-block-freq` to new pass manager (#98317 ) - Add `MachineBlockFrequencyAnalysis`. - Add `MachineBlockFrequencyPrinterPass`. - Use `MachineBlockFrequencyInfoWrapperPass` in legacy pass manager. - `LazyMachineBlockFrequencyInfo::print` is empty, drop it due to new pass manager migration.	2024-07-12 15:45:01 +08:00
paperchalice	abde52aa66	[CodeGen][NewPM] Port `LiveIntervals` to new pass manager (#98118 ) - Add `LiveIntervalsAnalysis`. - Add `LiveIntervalsPrinterPass`. - Use `LiveIntervalsWrapperPass` in legacy pass manager. - Use `std::unique_ptr` instead of raw pointer for `LICalc`, so destructor and default move constructor can handle it correctly. This would be the last analysis required by `PHIElimination`.	2024-07-10 19:34:48 +08:00
paperchalice	4010f894a1	[CodeGen][NewPM] Port `SlotIndexes` to new pass manager (#97941 ) - Add `SlotIndexesAnalysis`. - Add `SlotIndexesPrinterPass`. - Use `SlotIndexesWrapperPass` in legacy pass.	2024-07-09 12:09:11 +08:00
paperchalice	ac0b2814c3	[CodeGen][NewPM] Port `LiveVariables` to new pass manager (#97880 ) - Port `LiveVariables` to new pass manager. - Convert to `LiveVariablesWrapperPass` in legacy pass manager.	2024-07-09 10:50:43 +08:00
paperchalice	79d0de2ac3	[CodeGen][NewPM] Port `machine-loops` to new pass manager (#97793 ) - Add `MachineLoopAnalysis`. - Add `MachineLoopPrinterPass`. - Convert to `MachineLoopInfoWrapperPass` in legacy pass manager.	2024-07-09 09:11:18 +08:00
Alexander Shaposhnikov	1710679237	Reapply "[LLVM][Instrumentation] Add numerical sanitizer (#85916 )" This reverts commit 493c384a7d94cce1d18824a6b0e1f9ee20cdc681 and includes a fix for the build breakage.	2024-06-29 09:04:43 +00:00
Alexander Shaposhnikov	493c384a7d	Revert "[LLVM][Instrumentation] Add numerical sanitizer (#85916 )" This reverts commit 15ad7919f6dd18b5d7f5a22daad6a5c25ecb8793. The commit broke the build bot https://lab.llvm.org/buildbot/#/builders/11/builds/822.	2024-06-29 04:57:50 +00:00
Alexander Shaposhnikov	15ad7919f6	[LLVM][Instrumentation] Add numerical sanitizer (#85916 ) This PR introduces the numerical sanitizer originally proposed by Clement Courbet on https://reviews.llvm.org/D97854 (https://arxiv.org/abs/2102.12782). The main additions include: - Migration to LLVM opaque pointers - Migration to various updated APIs - Extended coverage for LLVM instructions/intrinsics - Code refactoring The tool is still very experimental, the coverage (e.g. for intrinsics / library functions) is incomplete. Link: https://discourse.llvm.org/t/rfc-revival-of-numerical-sanitizer/79601 --------- Co-authored-by: Fangrui Song <i@maskray.me>	2024-06-28 15:41:37 -07:00
paperchalice	d38b518e04	Reapply "[CodeGen][NewPM] Port machine-branch-prob to new pass manager" (#96858 ) (#96869 ) This reverts commit ab58b6d58edf6a7c8881044fc716ca435d7a0156. In `CodeGen/Generic/MachineBranchProb.ll`, `llc` crashed with dumped MIR when targeting PowerPC. Move test to `llc/new-pm`, which is X86 specific.	2024-06-28 10:59:23 +08:00
Nikita Popov	253a294b54	[PassManager] Add pretty stack frames (#96078 ) In NewPM pass managers, add a "pretty stack frame" that tells you which pass crashed while running which function. For example `opt -O3 -passes-ep-peephole=trigger-crash-function test.ll` will print something like this: ``` Stack dump: 0. Program arguments: build/bin/opt -S -O3 -passes-ep-peephole=trigger-crash-function test.ll 1. Running pass "function<eager-inv>(mem2reg,instcombine<max-iterations=1;no-use-loop-info;no-verify-fixpoint>,trigger-crash-function,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-sink-common-insts;speculate-blocks;simplify-cond-branch>)" on module "test.ll" 2. Running pass "trigger-crash-function" on function "fshl_concat_i8_i8" ``` While the crashing pass is usually evident from the stack trace, this also shows which function triggered the crash, as well as the pipeline string for the pass (including options). Similar functionality existed in the LegacyPM.	2024-06-27 12:16:30 +02:00
paperchalice	ab58b6d58e	Revert "[CodeGen][NewPM] Port machine-branch-prob to new pass manager" (#96858 ) Reverts llvm/llvm-project#96389 Some ppc bots failed.	2024-06-27 15:00:17 +08:00
paperchalice	73e46c2bb4	[CodeGen][NewPM] Port machine-branch-prob to new pass manager (#96389 ) Like IR version `print<branch-prob>`, there is also a `print<machine-branch-prob>`.	2024-06-27 14:04:51 +08:00
paperchalice	1dbc2aad68	[PassBuilder] Parse machine function analyses inside require/invalidate (#96634 ) Now we have several machine function analyses but forgot to support them in `parseMachinePass`.	2024-06-26 17:10:48 +08:00
paperchalice	8599629d39	[CodeGen][NewPM] Port machine post dominator tree analysis to new pass manager (#96378 ) Follows #95879.	2024-06-25 14:23:01 +08:00
Nikita Popov	5cd0ba30f5	Reapply [IR] Lazily initialize the class to pass name mapping (NFC) (#96321 ) (#96462 ) On MSVC the `this` uses inside `decltype` require a lambda capture. On clang they result in an unused capture warning instead. Add the capture and suppress the warning with `(void)this`. ----- Initializing this map is somewhat expensive (especially for O0), so we currently only do it if certain flags are used. I would like to make use of it for crash dumps (#96078), where we don't know in advance whether it will be needed or not. This patch changes the initialization to a lazy approach, where a callback is registered that does the actual initialization. The callbacks will be run the first time the pass name is requested. This way there is no compile-time impact if the mapping is not used.	2024-06-24 15:00:11 +02:00
Nikita Popov	e5a41f0afc	Revert "[IR] Lazily initialize the class to pass name mapping (NFC) (#96321 )" My attempt to fix the Windows build made things worse, revert entirely for now. This reverts commit e7137f2fed5cfee822ae3c4c6d39188adb59a16c. This reverts commit 6eaf204dbb0a6a81cddfd02f625c130f7bb1aae5. This reverts commit 957dc4366dd2ce9d5d2991c3ad76bbf438e9954e.	2024-06-24 10:32:03 +02:00

1 2 3 4 5 ...

861 Commits