
This commit introduces support for outlining functions across modules using codegen data generated from previous codegen. The codegen data currently manages the outlined hash tree, which records outlining instances that occurred locally in the past. The machine outliner now operates in one of three modes: 1. CGDataMode::None: This is the default outliner mode that uses the suffix tree to identify (local) outlining candidates within a module. This mode is also used by (full)LTO to maintain optimal behavior with the combined module. 2. CGDataMode::Write (`-codegen-data-generate`): This mode is identical to the default mode, but it also publishes the stable hash sequences of instructions in the outlined functions into a local outlined hash tree. It then encodes this into the `__llvm_outline` section, which will be dead-stripped at link time. 3. CGDataMode::Read (`-codegen-data-use-path={.cgdata}`): This mode reads a codegen data file (.cgdata) and initializes a global outlined hash tree. This tree is used to generate global outlining candidates. Note that the codegen data file has been post-processed with the raw `__llvm_outline` sections from all native objects using the `llvm-cgdata` tool (or a linker, `LLD`, or a new ThinLTO pipeline later). This depends on https://github.com/llvm/llvm-project/pull/105398. After this PR, LLD (https://github.com/llvm/llvm-project/pull/90166) and Clang (https://github.com/llvm/llvm-project/pull/90304) will follow for each client side support. This is a patch for https://discourse.llvm.org/t/rfc-enhanced-machine-outliner-part-2-thinlto-nolto/78753.
41 lines
1.5 KiB
LLVM
41 lines
1.5 KiB
LLVM
; This test verifies the stable hash values for different global variables
|
|
; that have distinct names.
|
|
; We generate two different cgdata files from nearly identical outline instances,
|
|
; with the only difference being the last call target globals, @g vs @h.
|
|
|
|
; RUN: split-file %s %t
|
|
|
|
; RUN: llc -mtriple=arm64-apple-darwin -enable-machine-outliner -codegen-data-generate=true -filetype=obj %t/local-g.ll -o %t/local-g.o
|
|
; RUN: llvm-cgdata --merge %t/local-g.o -o %t/local-g.cgdata
|
|
; RUN: llvm-cgdata --convert %t/local-g.cgdata -o %t/local-g.cgtext
|
|
; RUN: llc -mtriple=arm64-apple-darwin -enable-machine-outliner -codegen-data-generate=true -filetype=obj %t/local-h.ll -o %t/local-h.o
|
|
; RUN: llvm-cgdata --merge %t/local-h.o -o %t/local-h.cgdata
|
|
; RUN: llvm-cgdata --convert %t/local-h.cgdata -o %t/local-h.cgtext
|
|
|
|
; We compare the trees which are only different at the terminal node's hash value.
|
|
; Here we simply count the different lines that have `Hash` string.
|
|
; RUN: not diff %t/local-g.cgtext %t/local-h.cgtext 2>&1 | grep Hash | wc -l | FileCheck %s
|
|
; CHECK: 2
|
|
|
|
;--- local-g.ll
|
|
declare i32 @g(i32, i32, i32)
|
|
define i32 @f1() minsize {
|
|
%1 = call i32 @g(i32 10, i32 1, i32 2);
|
|
ret i32 %1
|
|
}
|
|
define i32 @f2() minsize {
|
|
%1 = call i32 @g(i32 20, i32 1, i32 2);
|
|
ret i32 %1
|
|
}
|
|
|
|
;--- local-h.ll
|
|
declare i32 @h(i32, i32, i32)
|
|
define i32 @f1() minsize {
|
|
%1 = call i32 @h(i32 10, i32 1, i32 2);
|
|
ret i32 %1
|
|
}
|
|
define i32 @f2() minsize {
|
|
%1 = call i32 @h(i32 20, i32 1, i32 2);
|
|
ret i32 %1
|
|
}
|