[Intel-gfx] [PATCH] drm/i915: Force synchronous TLB invalidations on the BLT ring for SNB+

Chris Wilson chris at chris-wilson.co.uk
Wed Nov 9 20:15:06 CET 2011


On Wed, 09 Nov 2011 10:57:00 -0800, Eric Anholt <eric at anholt.net> wrote:
> On Wed,  9 Nov 2011 17:32:26 +0000, Chris Wilson <chris at chris-wilson.co.uk> wrote:
> > We are advised that in order to workaround the TLB invalidation being
> > performed asynchronously with the command streamer that we need to use
> > a post-sync write along with the invalidations in the MI_FLUSH_DW so
> > that all operations are complete before executing the next command in
> > the ringbuffer.
> > 
> > References: https://bugs.freedesktop.org/show_bug.cgi?id=37990
> > Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
> > Cc: Jesse Barnes <jbarnes at virtuousgeek.org>
> 
> I find the "References:" to a bug that this is not confirmed to fix to
> be very misleading.

Because it references a bug I think is due to TLB invalidations as the
glitches look very similar to earlier TLB invalidation. Note the refer
to implied by references, which I have never used to mean fixed.

> > ---
> >  drivers/gpu/drm/i915/i915_reg.h         |   11 +++++++++--
> >  drivers/gpu/drm/i915/intel_ringbuffer.c |   13 +++++++++++--
> >  drivers/gpu/drm/i915/intel_ringbuffer.h |    1 +
> >  3 files changed, 21 insertions(+), 4 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/i915/i915_reg.h b/drivers/gpu/drm/i915/i915_reg.h
> > index cbf5f9f..bfda4b2 100644
> > --- a/drivers/gpu/drm/i915/i915_reg.h
> > +++ b/drivers/gpu/drm/i915/i915_reg.h
> > @@ -183,8 +183,15 @@
> >   */
> >  #define MI_LOAD_REGISTER_IMM(x)	MI_INSTR(0x22, 2*x-1)
> >  #define MI_FLUSH_DW		MI_INSTR(0x26, 1) /* for GEN6 */
> > -#define   MI_INVALIDATE_TLB	(1<<18)
> > -#define   MI_INVALIDATE_BSD	(1<<7)
> > +#define   FLUSH_DW_USE_INDEX		(1<<21)
> > +#define   MI_INVALIDATE_TLB		(1<<18)
> > +#define   FLUSH_DW_SYNC_GFDT		(1<<17)
> > +#define   FLUSH_DW_NO_WRITE		(0<<14)
> > +#define   FLUSH_DW_WRITE_QWORD		(1<<14)
> > +#define   FLUSH_DW_WRITE_TIMESTAMP	(3<<14)
> > +#define   MI_INVALIDATE_BSD		(1<<7)
> > +#define   FLUSH_DW_PPGTT		(0<<2)
> > +#define   FLUSH_DW_GTT			(1<<2)
> >  #define MI_BATCH_BUFFER		MI_INSTR(0x30, 1)
> >  #define   MI_BATCH_NON_SECURE	(1)
> >  #define   MI_BATCH_NON_SECURE_I965 (1<<8)
> > diff --git a/drivers/gpu/drm/i915/intel_ringbuffer.c b/drivers/gpu/drm/i915/intel_ringbuffer.c
> > index 3c30dba..a77858c 100644
> > --- a/drivers/gpu/drm/i915/intel_ringbuffer.c
> > +++ b/drivers/gpu/drm/i915/intel_ringbuffer.c
> > @@ -1431,10 +1431,19 @@ static int blt_ring_flush(struct intel_ring_buffer *ring,
> >  		return ret;
> >  
> >  	cmd = MI_FLUSH_DW;
> > -	if (invalidate & I915_GEM_DOMAIN_RENDER)
> > +	if (invalidate & I915_GEM_DOMAIN_RENDER) {
> >  		cmd |= MI_INVALIDATE_TLB;
> > +		/* In order to actually force the invalidations to take place
> > +		 * before proceeding, we need to employ a workaround of
> > +		 * enabling a post-sync write.
> > +		 */
> > +		cmd |= FLUSH_DW_USE_INDEX;
> > +		cmd |= FLUSH_DW_WRITE_TIMESTAMP;
> > +	}
> 
> The commit message says this is about TLB flushes, while the comment
> here talks about invalidations in general.  Which is true?

No the commit log doesn't say that.

> >  	intel_ring_emit(ring, cmd);
> > -	intel_ring_emit(ring, 0);
> > +	intel_ring_emit(ring,
> > +			I915_GEM_TIMESTAMP_INDEX << MI_STORE_DWORD_INDEX_SHIFT |
> > +			FLUSH_DW_GTT);
> >  	intel_ring_emit(ring, 0);
> 
> My docs have "This field specifies Bits 31:3 of the Address where the
> DWord or QWord will be stored.  Note that the address can only be QWord
> aligned, irrespective of data size." for MI_FLUSH_DW, but
> MI_STORE_DWORD_INDEX_SHIFT is 2 (which is correct for
> MI_STORE_DATA_IMM).

The shift is to convert the index into the HWS from being a dword index
into an offset, which itself is qword aligned.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre



More information about the Intel-gfx mailing list