[Intel-gfx] GEM object write

Keith Packard keithp at keithp.com
Tue Mar 31 16:33:06 CEST 2009


On Tue, 2009-03-31 at 14:56 +0800, Ma, Ling wrote:
> Hi,
> 
> I did another test program based on original one,
> 
> The test result shows WB  is faster than WC - WC/WB is about 8369/4421.
> In this file I use movnti instruction to write in order to avoid  much clflush instruction.
>  may be we can do some optimization on it.

That's a good thought, but we've learned from the CPU architects that
non-temporal stores aren't guaranteed to bypass the cache, they just
avoid pulling memory into cache if it isn't already there. So, it's the
right instruction to use, you just have to combine that with clflush as
well.

-- 
keith.packard at intel.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20090331/896827af/attachment.sig>


More information about the Intel-gfx mailing list