[Intel-gfx] GEM object write
keithp at keithp.com
Tue Mar 31 16:33:06 CEST 2009
On Tue, 2009-03-31 at 14:56 +0800, Ma, Ling wrote:
> I did another test program based on original one,
> The test result shows WB is faster than WC - WC/WB is about 8369/4421.
> In this file I use movnti instruction to write in order to avoid much clflush instruction.
> may be we can do some optimization on it.
That's a good thought, but we've learned from the CPU architects that
non-temporal stores aren't guaranteed to bypass the cache, they just
avoid pulling memory into cache if it isn't already there. So, it's the
right instruction to use, you just have to combine that with clflush as
keith.packard at intel.com
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 189 bytes
Desc: This is a digitally signed message part
More information about the Intel-gfx