llvm-project

History

[LoongArch] Impl TTI hooks for LoongArch to support LoopDataPrefetch pass (#118437 )

Inspired by https://reviews.llvm.org/D146600, this commit adds
some TTI hooks for LoongArch to make LoopDataPrefetch pass
really work. Including:

- `getCacheLineSize()`: 64 for loongarch64.
- `getPrefetchDistance()`: After testing SPEC CPU 2017, improvements
taken by prefetching are more obvious when set PrefetchDistance to
200(results shown blow), although different benchmarks fit for different
best choice.
- `enableWritePrefetching()`: store prefetch is supported by LoongArch,
so set WritePrefetching to true in default.
- `getMinPrefetchStride()` and `getMaxPrefetchIterationsAhead()` still
use default values: 1 and UINT_MAX, so not override them.

After this commit, the test added by https://reviews.llvm.org/D146600
can generate llvm.prefetch intrinsic IR correctly.

Results of spec2017rate benchmarks (testing date: ref, copies: 1):
- For all C/C++ benchmarks, compared to O3+novec/lsx/lasx, prefetch can
bring about -1.58%/0.31%/0.07% performance improvement for int
benchmarks and 3.26%/3.73%/3.78% improvement for floating point
benchmarks. (Only O3+novec+prefetch decreases when testing intrate.)
- But prefetch results in performance reduction almost for every Fortran
benchmark compiled by flang. While considering all C/C++/Fortran
benchmarks, prefetch performance will decrease about 1% ~ 5%.

FIXME: Keep `loongarch-enable-loop-data-prefetch` option default to
false for now due to the bad effect for Fortran.

2025-01-20 16:20:15 +08:00

ADCE

[llvm] Remove br i1 undef from some regression tests [NFC] (#115688 )

2024-11-12 08:41:27 +00:00

AddDiscriminators

…

AggressiveInstCombine

[AggressiveInstCombine] Use APInt and avoid truncation when folding loads

2024-12-04 10:20:14 +01:00

AlignmentFromAssumptions

[llvm] Remove br i1 undef from some regression tests [NFC] (#115688 )

2024-11-12 08:41:27 +00:00

ArgumentPromotion

[ArgPromotion] Use poison instead of undef as placeholder in deleted metadata [NFC]

2024-11-05 13:44:34 +00:00

AtomicExpand

[PowerPC] Update data layout aligment of i128 to 16 (#118004 )

2024-12-09 18:02:24 -05:00

Attributor

[LLVM][IR] Use splat syntax when printing ConstantExpr based splats. (#116856 )

2024-11-21 11:21:12 +00:00

BDCE

[llvm] Remove br i1 undef from some regression tests [NFC] (#115688 )

2024-11-12 08:41:27 +00:00

BlockExtractor

…

BranchFolding

…

CalledValuePropagation

…

CallSiteSplitting

[llvm] Remove br i1 undef from some regression tests [NFC] (#115688 )

2024-11-12 08:41:27 +00:00

CanonicalizeAliases

…

CanonicalizeFreezeInLoops

…

CodeExtractor

[llvm] Remove br i1 undef from some regression tests [NFC] (#115688 )

2024-11-12 08:41:27 +00:00

CodeGenPrepare

[AArch64] Improve codegen of vectorised early exit loops (#119534 )

2025-01-06 13:17:14 +00:00

ConstantHoisting

[LLVM][IR] Use splat syntax when printing ConstantExpr based splats. (#116856 )

2024-11-21 11:21:12 +00:00

ConstantMerge

…

ConstraintElimination

[ConstraintElim] Decompose sub nsw (#118219 )

2024-12-16 16:41:04 +08:00

Coroutines

[InstCombine] Infer nusw + nneg -> nuw for getelementptr (#111144 )

2024-12-05 14:36:40 +01:00

CorrelatedValuePropagation

[LVI] Learn value ranges from ctpop results (#121945 )

2025-01-15 09:53:31 +01:00

CrossDSOCFI

…

DCE

…

DeadArgElim

Revert "[Transforms][IPO] Add func suffix in ArgumentPromotion and DeadArgume… (#105742 )"

2024-09-19 03:54:13 -07:00

DeadStoreElimination

[MemoryLocation] Teach MemoryLocation about llvm.experimental.memset.pattern (#120421 )

2025-01-15 13:50:23 +00:00

DFAJumpThreading

[DFAJumpThreading] Don't bail early after encountering unpredictable values (#119774 )

2024-12-25 01:29:01 -08:00

DivRemPairs

[LLVM][IR] Use splat syntax when printing Constant[Data]Vector. (#112548 )

2024-11-06 11:53:33 +00:00

DXILUpgrade

…

EarlyCSE

[llvm-project] Fix typos mutli and mutliple. NFC. (#122880 )

2025-01-14 11:59:41 +00:00

EliminateAvailableExternally

[ctxprof] Move test serialization to yaml (#122545 )

2025-01-10 18:04:25 -08:00

EmbedBitcode

…

EntryExitInstrumenter

EntryExitInstrumenter: skip available_externally linkage

2025-01-03 09:25:08 -08:00

ExpandLargeDivRem/X86

…

ExpandLargeFpConvert/X86

…

ExpandMemCmp

[ExpandMemCmp][AArch64][PowerPC][RISCV][X86] Use llvm.ucmp instead of (sub (zext (icmp ugt)), (zext (icmp ult))). (#121530 )

2025-01-03 09:19:32 -08:00

ExpandVariadics

…

FixIrreducible

[llvm] Remove br i1 undef from some regression tests [NFC] (#115691 )

2024-11-11 12:56:31 +00:00

Float2Int

…

ForcedFunctionAttrs

…

FunctionAttrs

[FunctionAttrs] Handle zero writes in initializes inference.

2025-01-18 20:01:07 +00:00

FunctionImport

[LTO] Print conflicting operands between Src and Dest modules (#115104 )

2024-11-21 10:07:39 -08:00

FunctionSpecialization

[ConstantFolding] Infer getelementptr nuw flag (#119214 )

2024-12-09 16:44:05 +01:00

GCOVProfiling

[gcov,test] Update exit-block.ll now that exit block is always the second

2025-01-04 10:56:45 -08:00

GlobalDCE

…

GlobalMerge

Fix issues with GlobalMerge on Mach-O. (#110046 )

2024-09-27 12:19:11 -04:00

GlobalOpt

[FMV][GlobalOpt] Do not statically resolve non-FMV callers. (#123383 )

2025-01-17 20:33:11 +00:00

GlobalSplit

…

GuardWidening

[llvm] Remove br i1 undef from some regression tests [NFC] (#115817 )

2024-11-12 09:11:47 +00:00

GVN

[ConstantFolding] Infer getelementptr nuw flag (#119214 )

2024-12-09 16:44:05 +01:00

GVNHoist

[llvm] Remove br i1 undef from some regression tests [NFC] (#115817 )

2024-11-12 09:11:47 +00:00

GVNSink

[llvm] Remove br i1 undef from some regression tests [NFC] (#115817 )

2024-11-12 09:11:47 +00:00

HardwareLoops

…

HelloNew

…

HipStdPar

…

HotColdSplit

[Transforms][CodeExtraction] bug fix regions with stackrestore (#118564 )

2024-12-19 09:19:11 -07:00

IndirectBrExpand

…

IndVarSimplify

SCEV: regen some tests with UTC (#123050 )

2025-01-15 14:19:23 +00:00

InferAddressSpaces

Revert "[MachineLICM] Use RegisterClassInfo::getRegPressureSetLimit (#119826 )"

2025-01-10 09:05:06 +01:00

InferAlignment

…

InferFunctionAttrs

[TLI] Add support for reallocarray (#114818 )

2024-11-13 20:57:29 +00:00

Inline

[AArch64][SME] Disable inlining of callees with new ZT0 state (#121338 )

2025-01-06 12:02:28 +00:00

InstCombine

[InstCombine] fold unsigned predicates on srem result (#122520 )

2025-01-18 14:05:31 -08:00

InstNamer

…

InstSimplify

[NVPTX] Constant fold NVVM fmin and fmax (#121966 )

2025-01-16 14:38:51 +00:00

InterleavedAccess

[InterleavedAccessPass]: Ensure that dead nodes get erased only once (#122643 )

2025-01-14 09:34:27 +00:00

Internalize

…

IRCE

[IRCE] Relax profitability check (#104659 )

2024-12-12 17:11:07 +01:00

IRNormalizer

Reland "[LLVM] Add IRNormalizer Pass" (#113780 )

2024-11-14 09:56:22 -08:00

IROutliner

[llvm-project] Fix typo "propogate" (#114795 )

2024-11-04 15:33:19 +00:00

JumpTableToSwitch

…

JumpThreading

[Local] Only intersect alias.scope,noalias & parallel_loop if inst moves (#117716 )

2024-11-26 20:39:53 +00:00

KCFI

…

LCSSA

[llvm] Remove br i1 undef from some regression tests [NFC] (#116739 )

2024-11-19 08:12:25 +00:00

LICM

[Options] Use UseDerefAtPointSemantics cl::opt<bool>. (#123192 )

2025-01-16 14:07:03 +00:00

LoadStoreVectorizer

[LoadStoreVectorizer] Postprocess and merge equivalence classes (#121861 )

2025-01-07 17:17:26 -08:00

LoopBoundSplit

…

LoopDataPrefetch

[LoongArch] Impl TTI hooks for LoongArch to support LoopDataPrefetch pass (#118437 )

2025-01-20 16:20:15 +08:00

LoopDeletion

[llvm] Remove br i1 undef from some regression tests [NFC] (#116739 )

2024-11-19 08:12:25 +00:00

LoopDistribute

[LAA] Don't require Stride == 1/-1 for inbounds pointer AddRecs nowrap. (#113126 )

2024-11-05 22:45:56 +01:00

LoopFlatten

…

LoopFusion

…

LoopIdiom

[LLVM][IR] Use splat syntax when printing ConstantExpr based splats. (#116856 )

2024-11-21 11:21:12 +00:00

LoopInstSimplify

…

LoopInterchange

[loop-interchange] Move tests over to use remarks (#123053 )

2025-01-16 15:13:18 +00:00

LoopLoadElim

[llvm] Remove br i1 undef from some regression tests [NFC] (#117112 )

2024-11-21 08:06:56 +00:00

LoopPredication

[llvm] Remove br i1 undef from some regression tests [NFC] (#117112 )

2024-11-21 08:06:56 +00:00

LoopRotate

[LoopRotate] Use poison instead of undef as placeholder in debug info [NFC] (#119135 )

2024-12-10 15:06:48 +00:00

LoopSimplify

[llvm] Remove br i1 undef from some regression tests [NFC] (#117112 )

2024-11-21 08:06:56 +00:00

LoopSimplifyCFG

[llvm] Remove br i1 undef from some regression tests [NFC] (#117112 )

2024-11-21 08:06:56 +00:00

LoopStrengthReduce

[NVPTX] Switch front-ends and tests to ptx_kernel cc (#120806 )

2025-01-07 18:24:50 -08:00

LoopTransformWarning

…

LoopUnroll

[AArch64] Unroll some loops with early-continues on Apple Silicon. (#118499 )

2024-12-22 13:10:54 +00:00

LoopUnrollAndJam

…

LoopVectorize

[LV][EVL] Address post-commit comments for 9720be9. (NFC) (#123311 )

2025-01-20 14:20:40 +08:00

LoopVersioning

LoopVersioning: improve a test, regen with UTC (#122876 )

2025-01-14 12:06:22 +00:00

LoopVersioningLICM

[InstCombine] Infer nusw + nneg -> nuw for getelementptr (#111144 )

2024-12-05 14:36:40 +01:00

LowerAtomic

…

LowerConstantIntrinsics

[llvm] Bail out when meeting pointer with negative offset in approximated mode instead of … (#120424 )

2024-12-20 12:16:49 +00:00

LowerExpectIntrinsic

…

LowerGlobalDestructors

…

LowerGuardIntrinsic

…

LowerIFunc

…

LowerInvoke

…

LowerMatrixIntrinsics

[InstCombine] Move gep of phi fold into separate function

2024-12-05 15:20:56 +01:00

LowerSwitch

[llvm] Remove br i1 undef from some regression tests [NFC] (#117112 )

2024-11-21 08:06:56 +00:00

LowerTypeTests

[CFI][LowerTypeTests] Fix indirect call with alias (#113987 )

2024-10-31 13:29:07 -07:00

LowerWidenableCondition

…

MakeGuardsExplicit

…

Mem2Reg

[InstSimplify] Fix incorrect poison propagation when folding phi (#96631 )

2024-11-07 14:09:45 +01:00

MemCpyOpt

[ValueTracking] Return poison for zero-sized types (#122647 )

2025-01-16 10:05:30 +01:00

MemProfContextDisambiguation

[MemProf] Disable cloning of callsites in recursive cycles by default (#122354 )

2025-01-09 12:01:43 -08:00

MergedLoadStoreMotion

[llvm] Remove br i1 undef from some regression tests [NFC] (#117292 )

2024-11-26 20:50:54 +00:00

MergeFunc

[MergeFuncs] Handle ConstantRangeList attributes

2024-12-06 12:21:45 +01:00

MergeICmps

TargetLibraryInfo: Use pointer index size to determine getSizeTSize(). (#118747 )

2024-12-12 15:45:44 +13:00

MetaRenamer

…

MoveAutoInit

…

NameAnonGlobals

…

NaryReassociate

[test] Change llc -march= to -mtriple=

2024-12-15 13:08:02 -08:00

NewGVN

[ConstantFolding] Infer getelementptr nuw flag (#119214 )

2024-12-09 16:44:05 +01:00

ObjCARC

[llvm] Remove br i1 undef from some regression tests [NFC] (#118419 )

2024-12-03 20:54:36 +00:00

OpenMP

[OpenMP] Fix RPC client not being optimized out after changes

2024-11-27 15:56:23 -06:00

PartialInlining

…

PartiallyInlineLibCalls

…

PGOProfile

[memprof] Undrift MemProf profile even when some frames are missing (#120500 )

2024-12-20 15:40:08 -08:00

PhaseOrdering

[InstCombine,PhaseOrder] Add additional tests with align assumptions.

2025-01-17 14:05:54 +00:00

PlaceSafepoints

…

PreISelIntrinsicLowering

[LLVM][IR] Use splat syntax when printing ConstantExpr based splats. (#116856 )

2024-11-21 11:21:12 +00:00

Reassociate

[llvm] Remove br i1 undef from some regression tests [NFC] (#118419 )

2024-12-03 20:54:36 +00:00

Reg2Mem

…

RelLookupTableConverter/X86

…

RewriteStatepointsForGC

[InstCombine] Infer nusw + nneg -> nuw for getelementptr (#111144 )

2024-12-05 14:36:40 +01:00

SafeStack

SafeStack: Respect alloca addrspace (#112536 )

2024-11-04 17:51:30 -08:00

SampleProfile

[PseudoProbe] Fix cleanup for pseudo probe after annotation (#119660 )

2024-12-13 17:05:03 +08:00

SandboxVectorizer

[SandboxVec][Legality] Implement ShuffleMask (#123404 )

2025-01-17 15:48:24 -08:00

ScalarizeMaskedMemIntrin

[LLVM][IR] Use splat syntax when printing Constant[Data]Vector. (#112548 )

2024-11-06 11:53:33 +00:00

Scalarizer

[llvm] Remove br i1 undef from some regression tests [NFC] (#118419 )

2024-12-03 20:54:36 +00:00

SCCP

[ConstantRange] Estimate tighter lower (upper) bounds for masked binary and (or) (#120352 )

2024-12-31 18:40:17 -08:00

SeparateConstOffsetFromGEP

…

SimpleLoopUnswitch

[InstCombine] Infer nusw + nneg -> nuw for getelementptr (#111144 )

2024-12-05 14:36:40 +01:00

SimplifyCFG

[MemProf][PGO] Prevent dropping of profile metadata during optimization (#121359 )

2025-01-02 12:11:59 -08:00

Sink

[llvm] Remove br i1 undef from some regression tests [NFC] (#118419 )

2024-12-03 20:54:36 +00:00

SLPVectorizer

[SLP]Fix createInsertVector mask emission

2025-01-18 11:48:53 -08:00

SpeculativeExecution

[llvm] Remove br i1 undef from some regression tests [NFC] (#118419 )

2024-12-03 20:54:36 +00:00

SROA

[IR] Treat calls with byval ptrs as read-only (#122961 )

2025-01-15 10:25:55 -08:00

StraightLineStrengthReduce

[NVPTX] Switch front-ends and tests to ptx_kernel cc (#120806 )

2025-01-07 18:24:50 -08:00

StripDeadPrototypes

…

StripSymbols

…

StructurizeCFG

[llvm] Remove br i1 undef from some regression tests [NFC] (#118419 )

2024-12-03 20:54:36 +00:00

TailCallElim

…

ThinLTOBitcodeWriter

…

TypePromotion

…

UnifyFunctionExitNodes

…

UnifyLoopExits

…

Util

[RISCV][SLEEF]: Support SLEEF vector library for RISC-V target. (#114014 )

2024-11-26 12:25:54 +03:00

VectorCombine

IR: handle FP predicates in CmpPredicate::getMatching (#122924 )

2025-01-14 18:17:07 +00:00

WholeProgramDevirt

[WPD]Regard unreachable function as a possible devirtualizable target (#115668 )

2024-11-13 11:28:36 -08:00

lower-builtin-allow-check-remarks.ll

…

lower-builtin-allow-check.ll

…