[PATCH] drm/xe/xe2lpm: Add Wa_15015404425

Tue May 28 23:03:31 UTC 2024

On Fri, May 24, 2024 at 12:02:28PM +0530, Tejas Upadhyay wrote:
> Wa_15015404425 applies to xe2_lpm all steppings
> 
> Signed-off-by: Tejas Upadhyay <tejas.upadhyay at intel.com>
> ---
>  drivers/gpu/drm/xe/regs/xe_regs.h |  1 +
>  drivers/gpu/drm/xe/xe_mmio.c      | 18 ++++++++++++++++++
>  2 files changed, 19 insertions(+)
> 
> diff --git a/drivers/gpu/drm/xe/regs/xe_regs.h b/drivers/gpu/drm/xe/regs/xe_regs.h
> index 722fb6dbb72e..07c8a23bd7c2 100644
> --- a/drivers/gpu/drm/xe/regs/xe_regs.h
> +++ b/drivers/gpu/drm/xe/regs/xe_regs.h
> @@ -42,6 +42,7 @@
>  #define STOLEN_RESERVED				XE_REG(0x1082c0)
>  #define   WOPCM_SIZE_MASK			REG_GENMASK64(9, 7)
>  
> +#define MEDIA_DUMMY_REG				XE_REG(0x138000)

The workaround description recommended using 0x130030 as the target
offset.  Is there a reason you're using 0x138000 (which is a real
RP_STATE_CAP register that you'll probably be clobbering if you write to
it)?

>  #define MTL_RP_STATE_CAP			XE_REG(0x138000)
>  
>  #define MTL_GT_RPE_FREQUENCY			XE_REG(0x13800c)
> diff --git a/drivers/gpu/drm/xe/xe_mmio.c b/drivers/gpu/drm/xe/xe_mmio.c
> index ff7a7cf99530..e1dc87078387 100644
> --- a/drivers/gpu/drm/xe/xe_mmio.c
> +++ b/drivers/gpu/drm/xe/xe_mmio.c
> @@ -450,6 +450,21 @@ int xe_mmio_init(struct xe_device *xe)
>  	return devm_add_action_or_reset(xe->drm.dev, mmio_fini, xe);
>  }
>  
> +static void mmio_flush_pending_writes(struct xe_gt *gt)
> +{
> +	struct xe_device *xe = gt_to_xe(gt);
> +
> +	if (!(xe->info.platform == XE_LUNARLAKE))
> +		return;

This workaround is a bit strange; it's tagged as only applying to
Xe2_LPM, but the description indicates that it actually lies within the
SoC fabric.  Our usual graphics/media workaround database doesn't always
capture SoC workarounds very well...have you confirmed that this applies
to all SoC steppings (which is different from the graphics/media
steppings)?

> +
> +	if (xe_gt_is_media_type(gt) &&  MEDIA_VER(xe) == 20) {

The description also notes that we need to apply this workaround before
issuing a "read to Gfx GTTMMADR BAR."  So even though this is tagged as
Xe2_LPM (probably because they didn't have a good way to express SoC
details in our usual workaround database), it sounds like this
workaround is probably needed for any reads through the PCI BAR, not
just the ones targetting media offsets.  In fact from the description it
sounds like it would also probably apply to any reads of the GSM through
the GTTMMADR BAR as well, not just the registers.

> +		int itr = 4;
> +		/* 4 dummy writes */
> +		while (itr--)
> +			xe_mmio_write32(gt, MEDIA_DUMMY_REG, 0);
> +	}
> +}
> +
>  u8 xe_mmio_read8(struct xe_gt *gt, struct xe_reg reg)
>  {
>  	struct xe_tile *tile = gt_to_tile(gt);
> @@ -479,6 +494,9 @@ u32 xe_mmio_read32(struct xe_gt *gt, struct xe_reg reg)
>  	struct xe_tile *tile = gt_to_tile(gt);
>  	u32 addr = xe_mmio_adjusted_addr(gt, reg.addr);
>  
> +	/* Wa_15015404425 */
> +	mmio_flush_pending_writes(gt);

There aren't a whole lot of them, but technically 8-bit and 16-bit reads
probably need this workaround as well.

Matt

> +
>  	return readl((reg.ext ? tile->mmio_ext.regs : tile->mmio.regs) + addr);
>  }
>  
> -- 
> 2.25.1
> 

-- 
Matt Roper
Graphics Software Engineer
Linux GPU Platform Enablement
Intel Corporation