[Mesa-dev] [PATCH 0/2] codegen: add support for native SQRT for gm200+
Karol Herbst
kherbst at redhat.com
Sat Aug 4 02:52:04 UTC 2018
A significant perf increase is expected for shaders doing a lot of sqrt
operations.
shader-db changes for complete series:
total instructions in shared programs : 5472103 -> 5454825 (-0.32%)
total gprs used in shared programs : 647530 -> 647551 (0.00%)
total shared used in shared programs : 389120 -> 389120 (0.00%)
total local used in shared programs : 21064 -> 21064 (0.00%)
total bytes used in shared programs : 58459304 -> 58274448 (-0.32%)
local shared gpr inst bytes
helped 0 0 0 8526 8526
hurt 0 0 21 83 83
Karol Herbst (2):
gm200/ir: add native OP_SQRT support
gm200/ir: optimize rcp(sqrt) to rsq
.../drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp | 4 +++-
.../drivers/nouveau/codegen/nv50_ir_lowering_nvc0.cpp | 3 +++
.../drivers/nouveau/codegen/nv50_ir_peephole.cpp | 11 ++++++++++-
.../drivers/nouveau/codegen/nv50_ir_target_gm107.cpp | 8 +++++++-
.../drivers/nouveau/codegen/nv50_ir_target_nvc0.cpp | 1 +
5 files changed, 24 insertions(+), 3 deletions(-)
--
2.17.1
More information about the mesa-dev
mailing list