llvm-project

Author	SHA1	Message	Date
Pedro Lobo	9865296343	[StructurizeCFG] Use `poison` instead of `undef` as placeholder [NFC] (#119137 )	2024-12-10 15:03:44 +00:00
Jay Foad	231e63d816	[StructurizeCFG] Refactor insertConditions. NFC. (#115476 ) This just makes it more obvious that having Parent as the single predecessor is a special case, instead of checking for it in the middle of a loop that finds the nearest common dominator of multiple predecessors.	2024-11-26 09:40:33 +00:00
Jay Foad	b535e4ecac	[StructurizeCFG] Remove one SSAUpdater::AddAvailableValue. NFCI. (#115472 )	2024-11-08 17:20:29 +00:00
Jay Foad	107af4a62e	[StructurizeCFG] Introduce struct PredInfo. NFC. (#115457 ) This just provides a neater encapsulation of the info about the predicate for an edge, rather than ValueWeightPair aka std::pair.	2024-11-08 14:26:29 +00:00
Kazu Hirata	94f9cbbe49	[Scalar] Remove unused includes (NFC) (#114645 ) Identified with misc-include-cleaner.	2024-11-02 08:32:26 -07:00
Ruiling, Song	54d31bde32	Reapply "StructurizeCFG: Optimize phi insertion during ssa reconstruction (#101301 )" (#114347 ) This reverts commit be40c723ce2b7bf2690d22039d74d21b2bd5b7cf.	2024-11-01 08:29:59 +08:00
Juan Manuel Martinez Caamaño	b40ff5ac2d	[AMDGPU][StructurizeCFG] Maintain branch MD_prof metadata (#109813 ) Currently `StructurizeCFG` drops branch_weight metadata . This metadata can be generated from user annotations in the source code like: ```cpp if (...) [[likely]] { } ```	2024-09-25 13:15:23 +02:00
Kazu Hirata	a2f659c134	[StructurizeCFG] Avoid repeated hash lookups (NFC) (#107797 )	2024-09-09 07:15:12 -07:00
Matt Arsenault	f86da4cb7d	StructurizeCFG: Add SkipUniformRegions pass parameter to new PM version (#102812 ) Keep respecting the old cl::opt for now.	2024-08-12 15:13:15 +04:00
Yaxun (Sam) Liu	be40c723ce	Revert "StructurizeCFG: Optimize phi insertion during ssa reconstruction (#101301 )" This reverts commit c62e2a2a4ed69d53a3c6ca5c24ee8d2504d6ba2b. Since it caused regression in HIP buildbot: https://lab.llvm.org/buildbot/#/builders/123/builds/3282	2024-08-08 11:59:39 -04:00
Ruiling, Song	c62e2a2a4e	StructurizeCFG: Optimize phi insertion during ssa reconstruction (#101301 ) After investigating more while-break cases, I think we should try to optimize the way we reconstruct phi nodes. Previously, we reconstruct each phi nodes separately, but this is not optimal. For example: ``` header: %v.1 = phi float [ %v, %entry ], [ %v.2, %latch ] br i1 %cc, label %if, label %latch if: %v.if = fadd float %v.1, 1.0 br i1 %cc2, label %latch, label %exit latch: %v.2 = phi float [ %v.if, %if ], [ %v.1, %header ] br i1 %cc3, label %exit, label %header exit: %v.3 = phi float [ %v.2, %latch ], [ %v.if, %if ] ``` For this case, we have different copies of value `v`, but there is at most one copy of value `v` alive at any program point shown above. The existing ssa reconstruction will use the incoming values from the old deleted phi. Below is a possible output after ssa reconstruction. ``` header: %v.1 = phi float [ %v, %entry ], [ %v.loop, %Flow1 ] br i1 %cc, label %if, label %flow if: %v.if = fadd float %v.1, 1.0 br label %flow flow: %v.exit.if = phi float [ %v.if, %if ], [ undef, %header ] %v.latch = phi float [ %v.if, %if ], [ %v.1, %header ] latch: br label %flow1 flow1: %v.loop = phi float [ %v.latch, %latch ], [ undef, %Flow ] %v.exit = phi float [ %v.latch, %latch ], [ %v.exit.if, %Flow ] exit: %v.3 = phi float [ %v.exit, %flow1 ] ``` If we look closely, in order to reconstruct `v.1` `v.2` `v.3`, we are having two simultaneous copies of `v` alive at `flow` and `flow1`. We highly depend on register coalescer to coalesce them together. But register coalescer may not always be able to coalesce them because of the complexity in the chain of phi. On the other side, now that we have only one copy of `v` alive at any program point before the transform, why not simplify the phi network as much as we can? Look at the incoming values of these PHIs: ``` header if latch v.1: -- -- v.2 v.2: v.1 v.if -- v.3: -- v.if v.2 ``` If we let them share the same incoming values for these three different incoming blocks, then we would have only one copy of alive `v` at any program point after ssa reconstruction. Something like: ``` header: %v.1 = phi float [ %v, %entry ], [ %v.2, %Flow1 ] br i1 %cc, label %if, label %flow if: %v.if = fadd float %v.1, 1.0 br label %flow flow: %v.2 = phi float [ %v.if, %if ], [ %v.1, %header ] latch: br label %flow1 flow1: ... exit: %v.3 = phi float [ %v.2, %flow1 ] ```	2024-08-08 14:47:49 +08:00
Nikita Popov	9df71d7673	[IR] Add getDataLayout() helpers to Function and GlobalValue (#96919 ) Similar to https://github.com/llvm/llvm-project/pull/96902, this adds `getDataLayout()` helpers to Function and GlobalValue, replacing the current `getParent()->getDataLayout()` pattern.	2024-06-28 08:36:49 +02:00
Ruiling, Song	ac24238002	[LowerSwitch] Don't let pass manager handle the dependency (#68662 ) Some passes has limitation that only support simple terminators: branch/unreachable/return. Right now, they ask the pass manager to add LowerSwitch pass to eliminate `switch`. Let's manage such kind of pass dependency by ourselves. Also add the assertion in the related passes.	2023-10-25 09:24:36 +08:00
Nuno Lopes	29d0b60430	[StructurizeCFG] Use poison instead of undef as placeholder [NFC] These are used to create branch instructions. The condition is patched later	2023-07-22 13:23:39 +01:00
pvanhout	e4ea2d5919	[StructurizeCFG] Correctly depend on UniformityAnalysis Small oversight in https://reviews.llvm.org/D145688 - the pass' dependency was not updated to reflect the change to UA. Also, change DivergenceAnalysis to UniformityAnalysis in a comment. That way, StructurizeCFG only refers to UA and not DA anymore.	2023-03-14 11:25:22 +01:00
pvanhout	240e2cba67	[StructurizeCFG] Use UniformityAnalysis instead of DivergenceAnalysis Depends on D145572 Reviewed By: foad, sameerds Differential Revision: https://reviews.llvm.org/D145688	2023-03-13 08:31:20 +01:00
Juan Manuel MARTINEZ CAAMAÑO	96ad51e3eb	[StructurizeCFG][DebugInfo] Avoid use-after-free Reviewed By: dstuttard Differential Revision: https://reviews.llvm.org/D137408	2022-11-04 13:39:49 +00:00
Juan Manuel MARTINEZ CAAMAÑO	256f8b06c6	[StructurizeCFG][DebugInfo] Maintain DILocations in the branches created by StructurizeCFG Make StructurizeCFG preserve the debug locations of the branch instructions it introduces. Differential Revision: https://reviews.llvm.org/D135967	2022-10-28 02:51:02 -05:00
Juan Manuel MARTINEZ CAAMAÑO	e9716c64ec	[StructurizeCFG] Remove imposible case and replace by assert In addition, replace outdated XFAIL test by a new one. Differential Revision: https://reviews.llvm.org/D134439	2022-09-29 08:27:49 +00:00
Ruiling Song	a5676a3a7e	StructurizeCFG: Set Undef for non-predecessors in setPhiValues() During structurization process, we may place non-predecessor blocks between the predecessors of a block in the structurized CFG. Take the typical while-break case as an example: ``` /---A(v=...) \| / \ ^ B C \| \ /\| \---L \| \ / E (r = phi (v:C)...) ``` After structurization, the CFG would be look like: ``` /---A \| \|\ \| \| C \| \|/ \| F1 ^ \|\ \| \| B \| \|/ \| F2 \| \|\ \| \| L \ \|/ \--F3 \| E ``` We can see that block B is placed between the predecessors(C/L) of E. During phi reconstruction, to achieve the same sematics as before, we are reconstructing the PHIs as: F1: v1 = phi (v:C), (undef:A) F3: r = phi (v1:F2), ... But this is also saying that `v1` would be live through B, which is not quite necessary. The idea in the change is to say the incoming value from B is Undef for the PHI in E. With this change, the reconstructed PHI would be: F1: v1 = phi (v:C), (undef:A) F2: v2 = phi (v1:F1), (undef:B) F3: r = phi (v2:F2), ... Reviewed by: sameerds Differential Revision: https://reviews.llvm.org/D132450	2022-09-26 09:54:47 +08:00
Ruiling Song	40e9284f3c	StructurizeCFG: prefer reduced number of live values The instruction simplification will try to simplify the affected phis. In some cases, this might extend the liveness of values. For example: BB0: \| \ \| BB1 \| / BB2:phi (BB0, v), (BB1, undef) The phi in BB2 will be simplified to v as v dominates BB2, but this is increasing the number of active values in BB1. By setting CanUseUndef to false, we will not simplify the phi in this way, this would help register pressure. This is mandatory for the later change to help reducing VGPR pressure for AMDGPU. Reviewed by: foad, sameerds Differential Revision: https://reviews.llvm.org/D132449	2022-09-26 09:54:47 +08:00
Kazu Hirata	6b1bc80188	[Scalar] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-08-20 21:18:25 -07:00
Kazu Hirata	e20d210eef	[llvm] Qualify auto (NFC) Identified with readability-qualified-auto.	2022-08-07 23:55:27 -07:00
Brendon Cahoon	c945d88d2b	Revert "[StructurizeCFG] Improve basic block ordering" This reverts commit f1b05a0a2bbbea160002be709f8a1c59de366761. Need to revert to due to issues identified with testing. The transformation is incorrect for blocks that contain convergent instructions.	2022-07-14 09:40:51 -05:00
Brendon Cahoon	f1b05a0a2b	[StructurizeCFG] Improve basic block ordering StructurizeCFG linearizes the successors of branching basic block by adding Flow blocks to record the true/false path for branches and back edges. This patch reduces the number of Phi values needed to capture the control flow path by improving the basic block ordering. Previously, StructurizeCFG adds loop exit blocks outside of the loop. StructurizeCFG sets a boolean value to indicate the path taken, and all exit block live values extend to after the loop. For loops with a large number of exits blocks, this creates a huge number of values that are maintained, which increases compilation time and register pressure. This is problem especially with ASAN, which adds early exits to blocks with unreachable instructions for each instrumented check in the loop. In specific cases, this patch reduces the number of values needed after the loop by moving the exit block into the loop. This is done for blocks that have a single predecessor and single successor by moving the block to appear just after the predecessor. Differential Revision: https://reviews.llvm.org/D123231	2022-06-22 16:10:41 -05:00
Simon Moll	b8c2781ff6	[NFC] format InstructionSimplify & lowerCaseFunctionNames Clang-format InstructionSimplify and convert all "FunctionName"s to "functionName". This patch does touch a lot of files but gets done with the cleanup of InstructionSimplify in one commit. This is the alternative to the less invasive clang-format only patch: D126783 Reviewed By: spatel, rengolin Differential Revision: https://reviews.llvm.org/D126889	2022-06-09 16:10:08 +02:00
serge-sans-paille	59630917d6	Cleanup includes: Transform/Scalar Estimated impact on preprocessor output line: before: 1062981579 after: 1062494547 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120817	2022-03-03 07:56:34 +01:00
Jay Foad	0e74d75a29	[StructurizeCFG] Fix boolean not bug D118623 added code to fold not-of-compare into a compare with the inverted predicate, if the compare had no other uses. This relies on accurate use lists in the IR but it was run before setPhiValues, when some phi inputs are still stored in a data structure on the side, instead of being real uses in the IR. The effect was that a phi that should be using the original compare result would now get an inverted result instead. Fix this by moving simplifyConditions after setPhiValues. Differential Revision: https://reviews.llvm.org/D120312	2022-02-22 17:36:20 +00:00
Jay Foad	d2e5d3512b	[StructurizeCFG] Clean up some boolean not instructions In some cases StructurizeCFG inserts i1 xor instructions to invert predicates. Add a quick loop to clean these up afterwards if we can get away with modifying an existing compare instruction instead. (StructurizeCFG is generally run late in the pipeline so instcombine does not clean them up for us.) Differential Revision: https://reviews.llvm.org/D118623	2022-02-01 09:35:37 +00:00
Kazu Hirata	5fc9e30985	[Scalar] Use range-based for loops (NFC)	2021-02-25 19:54:38 -08:00
Kazu Hirata	fb74e1e78a	[Transforms/Scalar] Use range-based for loops (NFC)	2021-02-04 21:18:05 -08:00
Fangrui Song	a5309438fe	static const char *const foo => const char foo[] By default, a non-template variable of non-volatile const-qualified type having namespace-scope has internal linkage, so no need for `static`.	2020-12-01 10:33:18 -08:00
Arthur Eubanks	baffd052b0	[StructurizeCFG][NewPM] Port -structurizecfg to NPM This doesn't support -structurizecfg-skip-uniform-regions since that would require porting LegacyDivergenceAnalysis. The NPM doesn't support adding a non-analysis pass as a dependency of another, so I had to add -lowerswitch to some tests or pin them to the legacy PM. This is the only RegionPass in tree, so I simply copied the logic for finding all Regions from the legacy PM's RGManager into StructurizeCFG::run(). Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D89026	2020-10-23 15:54:03 -07:00
Arthur Eubanks	f7aa1563eb	[LowerSwitch][NewPM] Port lowerswitch to NPM Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D87726	2020-09-15 18:18:31 -07:00
Simon Pilgrim	0128b9505c	Revert rG5dd566b7c7b78bd- "PassManager.h - remove unnecessary Function.h/Module.h includes. NFCI." This reverts commit 5dd566b7c7b78bd385418c72d63c79895be9ae97. Causing some buildbot failures that I'm not seeing on MSVC builds.	2020-07-24 13:02:33 +01:00
Simon Pilgrim	5dd566b7c7	PassManager.h - remove unnecessary Function.h/Module.h includes. NFCI. PassManager.h is one of the top headers in the ClangBuildAnalyzer frontend worst offenders list. This exposes a large number of implicit dependencies on various forward declarations/includes in other headers that need addressing.	2020-07-24 12:40:50 +01:00
Ehud Katz	8a84158e5b	[StructurizeCFG] Fix an incorrect comment, NFC.	2020-06-01 17:42:09 +03:00
Ehud Katz	85c3088049	[StructurizeCFG] Fix region nodes ordering This is a reimplementation of the `orderNodes` function, as the old implementation didn't take into account all cases. The new implementation uses SCCs instead of Loops to take account of irreducible loops. Fix PR41509 Differential Revision: https://reviews.llvm.org/D79037	2020-06-01 12:50:35 +03:00
Ehud Katz	c6c265527d	Revert "[StructurizeCFG] Fix region nodes ordering" This reverts commit 897d8ee5cd693e17f95a7e84194bca4c089a520b, due to causing an infinite loop when encountering a loop with a sub-region with an inner loop.	2020-05-14 17:56:39 +03:00
Ehud Katz	897d8ee5cd	[StructurizeCFG] Fix region nodes ordering This is a reimplementation of the `orderNodes` function, as the old implementation didn't take into account all cases. Fix PR41509 Differential Revision: https://reviews.llvm.org/D79037	2020-05-13 15:33:36 +03:00
Sameer Sahasrabuddhe	3cbbded68c	Introduce unify-loop-exits pass. For each natural loop with multiple exit blocks, this pass creates a new block N such that all exiting blocks now branch to N, and then control flow is redistributed to all the original exit blocks. The bulk of the tranformation is a new function introduced in BasicBlockUtils that an redirect control flow from a set of incoming blocks to a set of outgoing blocks via a common "hub". This is a useful workaround for a limitation in the structurizer which incorrectly orders blocks when processing a nest of loops. This pass bypasses that issue by ensuring that each natural loop is recognized as a separate region. Since the structurizer is a region pass, it no longer sees a nest of loops in a single region, and instead processes each "level" in the nesting as a separate region. The AMDGPU backend provides a new option to enable this pass before the structurizer, which may eventually be enabled by default. Reviewers: madhur13490, arsenm, nhaehnle Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D75865	2020-03-30 13:23:56 -04:00
Sameer Sahasrabuddhe	42febbab91	StructurizeCFG: simplify phi nodes when possible After structurization, some phi nodes can have a single incoming edge and can be simplified away. This change runs a simplify query on all phis that are either modified or added by the structurizer. This also moves some phis closer to their use as a side benefit. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D75500	2020-03-05 10:33:15 +05:30
Reid Kleckner	4c1a1d3cf9	Add missing includes needed to prune LLVMContext.h include, NFC These are a pre-requisite to removing #include "llvm/Support/Options.h" from LLVMContext.h: https://reviews.llvm.org/D70280	2019-11-14 15:23:15 -08:00
Reid Kleckner	05da2fe521	Sink all InitializePasses.h includes This file lists every pass in LLVM, and is included by Pass.h, which is very popular. Every time we add, remove, or rename a pass in LLVM, it caused lots of recompilation. I found this fact by looking at this table, which is sorted by the number of times a file was changed over the last 100,000 git commits multiplied by the number of object files that depend on it in the current checkout: recompiles touches affected_files header 342380 95 3604 llvm/include/llvm/ADT/STLExtras.h 314730 234 1345 llvm/include/llvm/InitializePasses.h 307036 118 2602 llvm/include/llvm/ADT/APInt.h 213049 59 3611 llvm/include/llvm/Support/MathExtras.h 170422 47 3626 llvm/include/llvm/Support/Compiler.h 162225 45 3605 llvm/include/llvm/ADT/Optional.h 158319 63 2513 llvm/include/llvm/ADT/Triple.h 140322 39 3598 llvm/include/llvm/ADT/StringRef.h 137647 59 2333 llvm/include/llvm/Support/Error.h 131619 73 1803 llvm/include/llvm/Support/FileSystem.h Before this change, touching InitializePasses.h would cause 1345 files to recompile. After this change, touching it only causes 550 compiles in an incremental rebuild. Reviewers: bkramer, asbirlea, bollu, jdoerfert Differential Revision: https://reviews.llvm.org/D70211	2019-11-13 16:34:37 -08:00
Tim Renouf	5a0794327a	[StructurizeCFG] Enable -structurizecfg-relaxed-uniform-regions by default D62198 introduced an option to relax the checks for hasOnlyUniformBranches. This commit turns the option on by default, for better code generation in some cases in AMDGPU. Differential Revision: https://reviews.llvm.org/D63198 Change-Id: I9cbff002a1e74d3b7eb96b4192dc8129936d537d llvm-svn: 368042	2019-08-06 14:30:19 +00:00
Whitney Tsang	15b7f5b72d	PHINode: introduce setIncomingValueForBlock() function, and use it. Summary: There is PHINode::getBasicBlockIndex() and PHINode::setIncomingValue() but no function to replace incoming value for a specified BasicBlock* predecessor. Clearly, there are a lot of places that could use that functionality. Reviewer: craig.topper, lebedev.ri, Meinersbur, kbarton, fhahn Reviewed By: Meinersbur, fhahn Subscribers: fhahn, hiraditya, zzheng, jsji, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D63338 llvm-svn: 363566	2019-06-17 14:38:56 +00:00
Neil Henning	119c31ad93	StructurizeCFG: Relax uniformity checks. This change relaxes the checks for hasOnlyUniformBranches such that our region is uniform if: 1. All conditional branches that are direct children are uniform. 2. And either: a. All sub-regions are uniform. b. There is one or less conditional branches among the direct children. Differential Revision: https://reviews.llvm.org/D62198 llvm-svn: 361610	2019-05-24 08:59:17 +00:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Nicolai Haehnle	0823050b9f	StructurizeCFG: Simplify inserted PHI nodes Summary: This improves subsequent divergence analysis in some cases. Change-Id: I5e95e7ec7fd3fa80d414d1a53a02fea23e3d67d3 Reviewers: arsenm, rampitec Subscribers: jvesely, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D53316 llvm-svn: 344697	2018-10-17 15:37:41 +00:00
Chandler Carruth	edb12a838a	[TI removal] Make variables declared as `TerminatorInst` and initialized by `getTerminator()` calls instead be declared as `Instruction`. This is the biggest remaining chunk of the usage of `getTerminator()` that insists on the narrow type and so is an easy batch of updates. Several files saw more extensive updates where this would cascade to requiring API updates within the file to use `Instruction` instead of `TerminatorInst`. All of these were trivial in nature (pervasively using `Instruction` instead just worked). llvm-svn: 344502	2018-10-15 10:04:59 +00:00

1 2 3

117 Commits