llvm-project/test at 4cec4938c67b5dec64a2512806f84b3ddcd499f2 - llvm-project - shylie's gitea


Florian Hahn 078d214672

[TailDup] Delay aggressive computed-goto taildup to after RegAlloc. (#150911)

https://github.com/llvm/llvm-project/pull/114990 allowed more aggressive
tail duplication for computed-gotos in both pre- and post-regalloc tail
duplication.

In some cases, performing tail-duplication too early can lead to worse
results, especially if we duplicate blocks with a number of phi nodes.

This is causing a ~3% performance regression in some workloads using
Python 3.12.

This patch updates TailDup to delay aggressive tail-duplication for
computed gotos to after register allocation.

This means we can keep the non-duplicated version for a bit longer
throughout the backend, which should reduce compile time and allow a
number of optimizations and simplifications to trigger before
drastically expanding the CFG.

For the case in https://github.com/llvm/llvm-project/issues/106846, I
get the same performance with and without this patch on Skylake.

PR: https://github.com/llvm/llvm-project/pull/150911

2025-07-31 19:20:05 +01:00


[SCEV] Try to push the op into ZExt: A + zext (-A + B) -> zext (B) (#151227)

2025-07-30 21:10:57 +01:00

[LLVM][NVPTX] Upstream tanh intrinsic for libdevice (#149596)

2025-07-24 14:32:59 -07:00

…

…

…

[TailDup] Delay aggressive computed-goto taildup to after RegAlloc. (#150911)

2025-07-31 19:20:05 +01:00

[BranchFolding] Follow up #149999 crash fix

2025-07-29 09:09:58 +01:00

…

DWARFCFIChecker/X86

…

…

ExecutionEngine

…

…

[test][FileCheck] Prefix FileCheck test with %ProtectFileCheckOutput, per post-commit review feedback

2025-07-23 11:49:17 -07:00

…

Instrumentation

[msan] Approximately handle AVX Galois Field Affine Transformation (#150794)

2025-07-30 08:06:50 -07:00

…

…

…

…

MachineVerifier

…

[AMDGPU] Add v_cvt_pk_f16_f32 instruction for gfx1250 (#151469)

2025-07-31 10:45:06 -07:00

…

…

[PGO] Add llvm.loop.estimated_trip_count metadata (#148758)

2025-07-31 12:28:25 -04:00

SafepointIRVerifier

…

…

…

[TableGen] Implement getNamedOperandIdx with another table lookup. NFC. (#151116)

2025-07-30 14:03:26 +01:00

Reapply "[MemProf] Ensure all callsite clones are assigned a function clone" (#150856) (#151055)

2025-07-28 17:04:45 -07:00

[MemProf] Fix FileCheck prefix in the histogram test. (#150506)

2025-07-31 08:59:16 -07:00

[PGO] Add llvm.loop.estimated_trip_count metadata (#148758)

2025-07-31 12:28:25 -04:00

…

[PGO] Add llvm.loop.estimated_trip_count metadata (#148758)

2025-07-31 12:28:25 -04:00

…

.clang-format

…

CMakeLists.txt

[PGO] Drive profile validator from opt (#147418)

2025-07-26 16:14:00 +02:00

lit.cfg.py

…

lit.site.cfg.py.in

[PGO] Drive profile validator from opt (#147418)

2025-07-26 16:14:00 +02:00

TestRunner.sh

…