[Mesa-dev] [PATCH 0/3] Do float texture clamping with streaming sse2

Juha-Pekka Heikkila juhapekka.heikkila at gmail.com
Wed Nov 12 04:50:13 PST 2014


I tested this with uploading 1024x1024 656 textures in a loop for 10 seconds.
With glTexImage2D on SNB I get 17% better performance, mobile IVB
(interestingly only) 0..1% better performance and BDW 3% better performance.
For all these tests Mesa was compiled with -O2 -march=native and no Piglit
regressions.

/Juha-Pekka

Juha-Pekka Heikkila (3):
  configure.ac: Add detection for sse2 compilation support
  mesa/main: Add sse2 streaming clamping
  mesa/main: Clamp rgba with streamed sse

 configure.ac                  |   7 +++
 src/mesa/Makefile.am          |   8 +++
 src/mesa/main/pixeltransfer.c |  62 +++++++++++++------
 src/mesa/main/sse2_clamping.c | 138 ++++++++++++++++++++++++++++++++++++++++++
 src/mesa/main/sse2_clamping.h |  49 +++++++++++++++
 5 files changed, 245 insertions(+), 19 deletions(-)
 create mode 100644 src/mesa/main/sse2_clamping.c
 create mode 100644 src/mesa/main/sse2_clamping.h

-- 
1.8.5.1



More information about the mesa-dev mailing list