Sanjay Patel fff3e1dbaa [x86] enable fast sqrtss/sqrtps tuning for AMD Zen cores
As discussed in D118534, all of the recent AMD CPUs have
relatively fast (<14 cycle latency) "sqrtss" and "sqrtps"
instructions:
https://uops.info/table.html?search=sqrtps&cb_lat=on&cb_tp=on&cb_SNB=on&cb_SKL=on&cb_ZENp=on&cb_ZEN2=on&cb_ZEN3=on&cb_measurements=on&cb_avx=on&cb_sse=on

So we should set this tuning flag to alter codegen of plain
"sqrt(X)" expansion (as opposed to reciprocal-sqrt - there
is other test coverage for that pattern). The expansion is
both slower and less accurate than the hardware instruction.

Differential Revision: https://reviews.llvm.org/D119001
2022-02-04 13:59:20 -05:00
..
2021-04-13 09:48:13 +08:00
2021-12-20 12:55:32 -08:00
2021-09-17 15:48:55 +02:00
2021-07-27 12:09:25 +01:00
2021-08-16 13:13:56 +01:00
2022-01-17 19:02:44 +00:00
2022-01-26 12:32:49 +00:00
2021-05-24 12:09:02 -07:00