llvm-project

History

Fangrui Song ad31a2dcad Change -fsanitize=function to place two words before the function entry

The current implementation of -fsanitize=function places two words (the prolog
signature and the RTTI proxy) at the function entry, which makes the feature
incompatible with Intel Indirect Branch Tracking (IBT) that needs an ENDBR instruction
at the function entry. To allow the combination, move the two words before the
function entry, similar to -fsanitize=kcfi.

Armv8.5 Branch Target Identification (BTI) has a similar requirement.

Note: for IBT and BTI, whether a function gets a marker instruction at the entry
generally cannot be assumed (it can be disabled by a function attribute or
stronger LTO optimizations).

It is extremely unlikely for two words preceding a function entry to be
inaccessible. One way to achieve this is by ensuring that a function is
aligned at a page boundary and making the preceding page unmapped or
unreadable. This is not reasonable for application or library code.
(Think: the first text section has crt* code not instrumented by
-fsanitize=function.)

We use 0xc105cafe for all targets. .long 0xc105cafe disassembles to invalid
instructions on all architectures I have tested, except Power where it is
`lfs 8, -13570(5)` (Load Floating-Point with a weird offset, unlikely to be used in real code).

---

For the removed function in AsmPrinter.cpp, remove an assert: `mdconst::extract`
already asserts non-nullness.

For compiler-rt/test/ubsan/TestCases/TypeCheck/Function/function.cpp,
when the function doesn't have prolog/epilog (-O1 and above), after moving the two words,
the address of the function equals the address of ret instruction,
so symbolizing the function will additionally get a non-zero column number.
Adjust the test to allow an optional column number.
```
  .long   3238382334
  .long   .L__llvm_rtti_proxy-_Z1fv
_Z1fv:   // symbolizing here retrieves the line table entry from the second .loc
  .file   0 ...
  .loc    0 1 0
  .cfi_startproc
  .loc    0 2 1 prologue_end
  retq
```

Reviewed By: peter.smith

Differential Revision: https://reviews.llvm.org/D148665

2023-05-19 07:50:29 -07:00

ABIInfo.h

[clang][CodeGen] Reformat ABIInfo.h (NFC)

2023-05-18 04:29:16 +03:00

Address.h

[CodeGen] Stop storing alignment information into pointers in Address

2023-02-24 10:33:10 -08:00

BackendUtil.cpp

Revert "[RFC][MC][MachO]Only emits compact-unwind format for "canonical" personality symbols. For the rest, use DWARFs."

2023-05-19 09:40:54 -04:00

CGAtomic.cpp

…

CGBlocks.cpp

…

CGBlocks.h

…

CGBuilder.h

[CodeGen] Add a flag to Address and Lvalue that is used to keep

2023-02-15 10:15:13 -08:00

CGBuiltin.cpp

Revert "[NVPTX/CUDA] added an optional src_size argument to __nvvm_cp_async*"

2023-05-18 11:45:06 -07:00

CGCall.cpp

LangRef: Add "dynamic" option to "denormal-fp-math"

2023-04-29 08:44:59 -04:00

CGCall.h

…

CGClass.cpp

[CodeGen] Add a flag to Address and Lvalue that is used to keep

2023-02-15 10:15:13 -08:00

CGCleanup.cpp

[Windows SEH] Fix ehcleanup crash for Windows -EHa

2023-04-12 14:44:11 +08:00

CGCleanup.h

…

CGCoroutine.cpp

[C++20] [Coroutines] Handle function-try-block in SemaCoroutine

2023-04-06 15:11:34 +08:00

CGCUDANV.cpp

[CUDA] Update cached kernel handle when the function instance changes.

2023-03-21 15:36:12 -07:00

CGCUDARuntime.cpp

…

CGCUDARuntime.h

…

CGCXX.cpp

…

CGCXXABI.cpp

…

CGCXXABI.h

[Clang][CodeGen] Fix this argument type for certain destructors

2023-02-28 16:43:03 -08:00

CGDebugInfo.cpp

[gcov] Simplify cc1 options and remove CodeGenOptions EmitCovNotes/EmitCovArcs

2023-05-17 16:09:12 -07:00

CGDebugInfo.h

[NFC][Clang][Coverity] Fix Static Code Analysis Concerns with copy without assign

2023-05-18 18:14:07 -07:00

CGDecl.cpp

Emit const globals with constexpr destructor as constant LLVM values

2023-03-16 11:02:27 +01:00

CGDeclCXX.cpp

[NFC] [C++20] [Modules] Rename ASTContext::getNamedModuleForCodeGen to ASTContext::getCurrentNamedModule

2023-05-16 11:24:35 +08:00

CGException.cpp

Fix assertion when try is used inside catch(...) block

2023-05-17 14:42:39 -07:00

CGExpr.cpp

Change -fsanitize=function to place two words before the function entry

2023-05-19 07:50:29 -07:00

CGExprAgg.cpp

[C2x] Implement support for empty brace initialization (WG14 N2900 and WG14 N3011)

2023-04-03 15:22:52 -04:00

CGExprComplex.cpp

…

CGExprConstant.cpp

[clang] Do not attempt to zero-extend _BitInt(1) when not required

2023-05-02 08:23:22 -04:00

CGExprCXX.cpp

Use APInt::getOneBitSet (NFC)

2023-04-10 18:19:17 -07:00

CGExprScalar.cpp

[CodeGen] Only consider innermost cast for !heapallocsite

2023-05-09 09:49:42 +02:00

CGGPUBuiltin.cpp

[NFC][CLANG] Fix coverity remarks about large copy by values

2023-04-28 12:17:10 -07:00

CGHLSLRuntime.cpp

Recommit: [NFC][IR] Make Module::getGlobalList() private

2023-02-14 15:12:51 -08:00

CGHLSLRuntime.h

…

CGLoopInfo.cpp

…

CGLoopInfo.h

…

CGNonTrivialStruct.cpp

[NFC][CLANG] Fix coverity remarks about large copy by values

2023-04-28 12:17:10 -07:00

CGObjC.cpp

…

CGObjCGNU.cpp

…

CGObjCMac.cpp

Recommit: [NFC][IR] Make Module::getGlobalList() private

2023-02-14 15:12:51 -08:00

CGObjCRuntime.cpp

…

CGObjCRuntime.h

…

CGOpenCLRuntime.cpp

[clang] Use *{Map,Set}::contains (NFC)

2023-03-15 18:06:34 -07:00

CGOpenCLRuntime.h

[Clang][SPIR-V] Emit target extension types for OpenCL types on SPIR-V.

2023-03-13 14:20:24 -04:00

CGOpenMPRuntime.cpp

[Clang][Flang][OpenMP] Add loadOffloadInfoMetadata and createOffloadEntriesAndInfoMetadata into OMPIRBuilder's finalize and initialize

2023-05-16 11:51:36 -05:00

CGOpenMPRuntime.h

[Clang][Flang][OpenMP] Add loadOffloadInfoMetadata and createOffloadEntriesAndInfoMetadata into OMPIRBuilder's finalize and initialize

2023-05-16 11:51:36 -05:00

CGOpenMPRuntimeGPU.cpp

AMDGPU: Add basic gfx942 target

2023-05-10 11:51:06 -04:00

CGOpenMPRuntimeGPU.h

[OpenMP] Prefix outlined and reduction func names with original func's name

2023-04-19 23:00:26 +03:00

CGRecordLayout.h

…

CGRecordLayoutBuilder.cpp

[clang][CodeGen] Use base subobject type layout for potentially-overlapping fields

2023-02-17 15:11:42 +01:00

CGStmt.cpp

[clang] Return std::string_view from TargetInfo::getClobbers()

2023-04-24 12:16:54 +03:00

CGStmtOpenMP.cpp

[OpenMP] Prefix outlined and reduction func names with original func's name

2023-04-19 23:00:26 +03:00

CGValue.h

[CodeGen] Add a flag to Address and Lvalue that is used to keep

2023-02-15 10:15:13 -08:00

CGVTables.cpp

[clang] Don't emit type tests for dllexport/import classes

2023-04-25 14:00:57 -07:00

CGVTables.h

…

CGVTT.cpp

…

CMakeLists.txt

Split out CodeGenTypes from CodeGen for LLT/MVT

2023-05-03 00:13:20 +09:00

CodeGenABITypes.cpp

…

CodeGenAction.cpp

Revert "[Demangle] make llvm::demangle take std::string_view rather than const std::string&"

2023-05-02 15:54:09 -07:00

CodeGenFunction.cpp

[IR] Adds Instruction::setNoSanitizeMetadata()

2023-05-19 19:18:57 +08:00

CodeGenFunction.h

[OpenMP][5.1] Fix parallel masked is ignored #59939

2023-04-03 20:33:55 +00:00

CodeGenModule.cpp

[gcov] Simplify cc1 options and remove CodeGenOptions EmitCovNotes/EmitCovArcs

2023-05-17 16:09:12 -07:00

CodeGenModule.h

LangRef: Add "dynamic" option to "denormal-fp-math"

2023-04-29 08:44:59 -04:00

CodeGenPGO.cpp

[profiling] Improve error message for raw profile header mismatches

2023-04-27 14:51:38 -07:00

CodeGenPGO.h

[clang][CodeGenPGO] Don't use an invalid index when region counts disagree

2023-05-10 22:53:53 -04:00

CodeGenTBAA.cpp

…

CodeGenTBAA.h

…

CodeGenTypeCache.h

…

CodeGenTypes.cpp

[AArch64] Add svboolx2_t and svboolx4_t tuple types

2023-03-14 10:16:51 +00:00

CodeGenTypes.h

…

ConstantEmitter.h

…

ConstantInitBuilder.cpp

…

CoverageMappingGen.cpp

[clang] Apply -fcoverage-prefix-map reverse order

2023-04-27 00:24:18 +00:00

CoverageMappingGen.h

[CodeGen] Remove unneeded CoveragePrefixMap. NFC

2023-04-25 15:21:15 -07:00

EHScopeStack.h

[NFC][Clang][Coverity] Fix Static Code Analysis Concerns with copy without assign

2023-05-18 18:14:07 -07:00

ItaniumCXXABI.cpp

[IR] Adds Instruction::setNoSanitizeMetadata()

2023-05-19 19:18:57 +08:00

MacroPPCallbacks.cpp

…

MacroPPCallbacks.h

…

MicrosoftCXXABI.cpp

[NFC][clang] Fix Coverity bugs with AUTO_CAUSES_COPY

2023-04-24 14:52:55 -07:00

ModuleBuilder.cpp

…

ObjectFilePCHContainerOperations.cpp

[clang][deps] Make clang-scan-deps write modules in raw format

2023-05-03 12:07:46 -07:00

PatternInit.cpp

…

PatternInit.h

…

README.txt

…

SanitizerMetadata.cpp

[IR] Adds Instruction::setNoSanitizeMetadata()

2023-05-19 19:18:57 +08:00

SanitizerMetadata.h

[IR] Adds Instruction::setNoSanitizeMetadata()

2023-05-19 19:18:57 +08:00

SwiftCallingConv.cpp

…

TargetInfo.cpp

Change -fsanitize=function to place two words before the function entry

2023-05-19 07:50:29 -07:00

TargetInfo.h

Change -fsanitize=function to place two words before the function entry

2023-05-19 07:50:29 -07:00

VarBypassDetector.cpp

…

VarBypassDetector.h

…

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//