[AMDGPU] Enable scheduler mfma rewrite stage by default (#180751)

Performance testing showed that the cost of the many copies inserted
outside the loop is more than offset by better register allocation within
the loop as a result of the rewrite. This change also includes a minor
cleanup of the cost logic.

---------

Co-authored-by: Tony Linthicum <tlinthic@gmail.com>
Tony Linthicum 2026-02-23 09:34:17 -06:00 committed by GitHub
parent 329cd8985d
commit 3215645b8d

@@ -101,7 +101,7 @@ static cl::opt<bool> PrintMaxRPRegUsageAfterScheduler(
 static cl::opt<bool> DisableRewriteMFMAFormSchedStage(
     "amdgpu-disable-rewrite-mfma-form-sched-stage", cl::Hidden,
-    cl::desc("Disable rewrie mfma rewrite scheduling stage"), cl::init(true));
+    cl::desc("Disable rewrite mfma rewrite scheduling stage"), cl::init(false));
 const unsigned ScheduleMetrics::ScaleFactor = 100;
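
Because the option is phrased negatively ("disable"), flipping its default from `true` to `false` is what turns the stage on. The following is a minimal sketch of that behavior, not LLVM's `cl::opt` implementation: `BoolFlag`, `parseFlag`, and `rewriteStageRuns` are hypothetical names introduced only for illustration, and the parsing is deliberately simplified.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for a boolean command-line option with a default.
struct BoolFlag {
  std::string Name;
  bool Value; // starts out as the default, like cl::init(...)
};

// Simplified parsing (an assumption, not LLVM's parser): "-name" and
// "-name=true" set the flag, "-name=false" clears it. Returns the final value.
bool parseFlag(BoolFlag Flag, const std::vector<std::string> &Args) {
  const std::string Bare = "-" + Flag.Name;
  for (const std::string &Arg : Args) {
    if (Arg == Bare || Arg == Bare + "=true")
      Flag.Value = true;
    else if (Arg == Bare + "=false")
      Flag.Value = false;
  }
  return Flag.Value;
}

// The stage runs exactly when the "disable" flag ends up false.
bool rewriteStageRuns(bool DisableByDefault,
                      const std::vector<std::string> &Args) {
  BoolFlag Disable{"amdgpu-disable-rewrite-mfma-form-sched-stage",
                   DisableByDefault};
  return !parseFlag(Disable, Args);
}
```

With the new default, `rewriteStageRuns(false, {})` is true, so the stage runs unless the flag is given. In a real build, hidden `cl::opt` flags such as this one can typically be passed to `llc` directly or to `clang` through `-mllvm`, which is how the previous behavior could be restored.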