[Mesa-dev] [PATCH 0/3] Do float texture clamping with streaming sse2
Juha-Pekka Heikkila
juhapekka.heikkila at gmail.com
Wed Nov 12 04:50:13 PST 2014
I tested this with uploading 1024x1024 656 textures in a loop for 10 seconds.
With glTexImage2D on SNB I get 17% better performance, mobile IVB
(interestingly only) 0..1% better performance and BDW 3% better performance.
For all these tests Mesa was compiled with -O2 -march=native and no Piglit
regressions.
/Juha-Pekka
Juha-Pekka Heikkila (3):
configure.ac: Add detection for sse2 compilation support
mesa/main: Add sse2 streaming clamping
mesa/main: Clamp rgba with streamed sse
configure.ac | 7 +++
src/mesa/Makefile.am | 8 +++
src/mesa/main/pixeltransfer.c | 62 +++++++++++++------
src/mesa/main/sse2_clamping.c | 138 ++++++++++++++++++++++++++++++++++++++++++
src/mesa/main/sse2_clamping.h | 49 +++++++++++++++
5 files changed, 245 insertions(+), 19 deletions(-)
create mode 100644 src/mesa/main/sse2_clamping.c
create mode 100644 src/mesa/main/sse2_clamping.h
--
1.8.5.1
More information about the mesa-dev
mailing list