[Liboil] [patch] sse2 optimized sad8x8_u8_avg

Will Dyson will.dyson at gmail.com
Tue Jun 13 01:52:25 PDT 2006


Hi,

Here is an sse2 optimized version of sad8x8_u8_avg.

On an Athlon64 it measures about 2.6 times the speed of the ref implementation.
On a PentiumM laptop, it measures about 1.8 times the speed of the ref
implementation.

-- 
Will Dyson
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sad8x8avg_sse.c
Type: text/x-csrc
Size: 2663 bytes
Desc: not available
Url : http://lists.freedesktop.org/archives/liboil/attachments/20060613/f7ba603f/sad8x8avg_sse.c


More information about the Liboil mailing list