[Intel-gfx] [PATCH] drm/i915: Prefer setting PTE cache age to 3

Ville Syrjälä ville.syrjala at linux.intel.com
Fri Nov 22 19:26:25 CET 2013


On Fri, Nov 22, 2013 at 09:53:23AM -0800, Jesse Barnes wrote:
> On Fri, 22 Nov 2013 10:37:53 +0000
> Chris Wilson <chris at chris-wilson.co.uk> wrote:
> 
> > We have conflicting benchmark data that suggest either age 0 or age 3 is
> > better. However, the earlier benchmark on which we based the switch to
> > age 0
> > 
> > (commit 0d8ff15e9a15f2b393e53337a107b7a1e5919b6d
> > Author: Ben Widawsky <benjamin.widawsky at intel.com>
> > Date:   Thu Jul 4 11:02:03 2013 -0700
> > 
> >     drm/i915/hsw: Set correct Haswell PTE encodings)
> > 
> > actually seems to prefer the default PTE encoding as age 3. Presumably,
> > this is in part due to the use of MOCS to override the PTE encodings
> > when appropriate.
> > 
> > Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=69870
> > Tested-by: mengmeng.meng at intel.com
> > Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
> > ---
> >  drivers/gpu/drm/i915/i915_gem_gtt.c | 6 ++++--
> >  1 file changed, 4 insertions(+), 2 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/i915/i915_gem_gtt.c b/drivers/gpu/drm/i915/i915_gem_gtt.c
> > index efb5dab61c81..c4a3f9ae0d43 100644
> > --- a/drivers/gpu/drm/i915/i915_gem_gtt.c
> > +++ b/drivers/gpu/drm/i915/i915_gem_gtt.c
> > @@ -57,7 +57,9 @@ typedef gen8_gtt_pte_t gen8_ppgtt_pde_t;
> >  #define HSW_WB_LLC_AGE3			HSW_CACHEABILITY_CONTROL(0x2)
> >  #define HSW_WB_LLC_AGE0			HSW_CACHEABILITY_CONTROL(0x3)
> >  #define HSW_WB_ELLC_LLC_AGE0		HSW_CACHEABILITY_CONTROL(0xb)
> > +#define HSW_WB_ELLC_LLC_AGE3		HSW_CACHEABILITY_CONTROL(0x8)
> >  #define HSW_WT_ELLC_LLC_AGE0		HSW_CACHEABILITY_CONTROL(0x6)
> > +#define HSW_WT_ELLC_LLC_AGE3		HSW_CACHEABILITY_CONTROL(0x7)
> >  
> >  #define GEN8_PTES_PER_PAGE		(PAGE_SIZE / sizeof(gen8_gtt_pte_t))
> >  #define GEN8_PDES_PER_PAGE		(PAGE_SIZE / sizeof(gen8_ppgtt_pde_t))
> > @@ -185,10 +187,10 @@ static gen6_gtt_pte_t iris_pte_encode(dma_addr_t addr,
> >  	case I915_CACHE_NONE:
> >  		break;
> >  	case I915_CACHE_WT:
> > -		pte |= HSW_WT_ELLC_LLC_AGE0;
> > +		pte |= HSW_WT_ELLC_LLC_AGE3;
> >  		break;
> >  	default:
> > -		pte |= HSW_WB_ELLC_LLC_AGE0;
> > +		pte |= HSW_WB_ELLC_LLC_AGE3;
> >  		break;
> >  	}
> >  
> 
> Yeah I guess as long as userspace can override the default with MOCS,
> this is fine.

IIRC MOCS can only set the age to 3. So this change does limit what
userspace might achieve w/ MOCS usage a little bit.

I guess it might be better to change userspace to set everything to
LLC(+eLLC) at age 3 via MOCS (except potential scanout buffers), and
leave the PTE default at age 0.

But then again we need to also remember that not everything has a MOCS
field. So I'm not sure there's any truly correct answer for this.

-- 
Ville Syrjälä
Intel OTC



More information about the Intel-gfx mailing list