[Intel-gfx] [PATCH 4/4] drm/i915/selftest: Use forcewake to sanity check engine wa lists

Gustavo Sousa gustavo.sousa at intel.com
Wed Feb 8 12:51:59 UTC 2023


On Tue, Feb 07, 2023 at 07:37:58PM -0300, Gustavo Sousa wrote:
> On Wed, Feb 01, 2023 at 02:28:31PM -0800, Matt Roper wrote:
> > Although register information in the bspec includes a field that is
> > supposed to reflect a register's reset characteristics (i.e., whether a
> > register maintains its value through engine resets), it's been
> > discovered that this information is incorrect for some register ranges
> > (i.e., registers that are not affected by engine resets are tagged in a
> > way that indicates they would be).
> > 
> > We can sanity check workaround registers placed on the RCS/CCS engine
> > workaround lists (including those placed there via the
> > general_render_compute_wa_init() function) by comparing against the
> > forcewake table.  As far as we know, there's never a case where a
> > register that lives outside the RENDER powerwell will be reset by an
> > RCS/CCS engine reset.
> > 
> > Signed-off-by: Matt Roper <matthew.d.roper at intel.com>
> > ---
> >  .../gpu/drm/i915/gt/selftest_workarounds.c    | 52 +++++++++++++++++++
> >  1 file changed, 52 insertions(+)
> > 
> > diff --git a/drivers/gpu/drm/i915/gt/selftest_workarounds.c b/drivers/gpu/drm/i915/gt/selftest_workarounds.c
> > index 14a8b25b6204..1bc8febc5c1d 100644
> > --- a/drivers/gpu/drm/i915/gt/selftest_workarounds.c
> > +++ b/drivers/gpu/drm/i915/gt/selftest_workarounds.c
> > @@ -1362,12 +1362,64 @@ live_engine_reset_workarounds(void *arg)
> >  	return ret;
> >  }
> >  
> > +/*
> > + * The bspec's documentation for register reset behavior can be unreliable for
> > + * some MMIO ranges.  But in general we do not expect registers outside the
> > + * RENDER forcewake domain to be reset by RCS/CCS engine resets.  If we find
> > + * workaround registers on an RCS or CCS engine's list, it likely indicates
> 
> I think "workaround registers" is too general and makes the sentence a bit
> confusing. I believe you mean those registers mentioned in the previous
> sentence, right? Maybe s/workaround registers/said registers/?
> 
> > + * the register is misdocumented in the bspec and the workaround implementation
> > + * should be moved to the GT workaround list instead.
> > + */
> > +static int
> > +live_check_engine_workarounds_fw(void *arg)
> > +{
> > +	struct intel_gt *gt = arg;
> > +	struct intel_engine_cs *engine;
> > +	struct wa_lists *lists;
> > +	enum intel_engine_id id;
> > +	int ret = 0;
> > +
> > +	lists = kzalloc(sizeof(*lists), GFP_KERNEL);
> > +	if (!lists)
> > +		return -ENOMEM;
> > +
> > +	reference_lists_init(gt, lists);
> > +
> > +	for_each_engine(engine, gt, id) {
> > +		struct i915_wa_list *wal = &lists->engine[id].wa_list;
> > +		struct i915_wa *wa;
> > +		int i;
> > +
> > +		if (engine->class != RENDER_CLASS &&
> > +		    engine->class != COMPUTE_CLASS)
> > +			continue;
> > +
> > +		for (i = 0, wa = wal->list; i < wal->count; i++, wa++) {
> > +			enum forcewake_domains fw;
> > +
> > +			fw = intel_uncore_forcewake_for_reg(gt->uncore, wa->reg,
> > +							    FW_REG_READ | FW_REG_WRITE);
> > +			if ((fw & FORCEWAKE_RENDER) == 0) {
> > +				pr_err("%s: Register %#x not in RENDER forcewake domain!\n",
> > +				       engine->name, i915_mmio_reg_offset(wa->reg));
> 
> I think it is safer to use the correct member (wa->reg vs wa->mcr_reg) according
> to the value of wa->is_mcr. Coincidently the reg member for both types have the
> same offset in the struct, but I do not think we should rely on that.
> 
> One issue is that, unlike i915_mmio_reg_offset(),
> intel_uncore_forcewake_for_reg() is not implemented with generics and expects
> i915_reg_t. A workaround here would be "converting" the wa->mcr_reg (when
> wa->is_mcr holds) an i915_reg_t by copying the correct fields. While this is
> trivial since both types have only one field, I think the proper way
> (future-proof) of doing that is by having a dedicated function/macro to do the
> transformation.

Ah, we already have that: mcr_reg_cast()

:-)

So my suggestion is:

    i915_reg_t reg = wa->is_mcr ? mcr_reg_cast(wa->mcr_reg) : wa->reg;

Ans use reg as argument for both intel_uncore_forcewake_for_reg() and
i915_mmio_reg_offset().

> 
> Thinking about an alternative approach, do you think we could say that
> i915_mcr_reg_t will always have the same fields as i915_reg_t? In that case, we
> could tweak the type definition (or at least formalize via code documentation)
> to reflect that and then it would be okay to always use wa->reg here, as
> i915_mcr_reg_t would be thought as a subclass of i915_reg_t.
> 
> > +				ret = -EINVAL;
> > +			}
> > +		}
> > +	}
> > +
> > +	reference_lists_fini(gt, lists);
> > +	kfree(lists);
> > +
> > +	return ret;
> > +}
> > +
> >  int intel_workarounds_live_selftests(struct drm_i915_private *i915)
> >  {
> >  	static const struct i915_subtest tests[] = {
> >  		SUBTEST(live_dirty_whitelist),
> >  		SUBTEST(live_reset_whitelist),
> >  		SUBTEST(live_isolated_whitelist),
> > +		SUBTEST(live_check_engine_workarounds_fw),
> >  		SUBTEST(live_gpu_reset_workarounds),
> >  		SUBTEST(live_engine_reset_workarounds),
> >  	};
> > -- 
> > 2.39.1
> > 


More information about the dri-devel mailing list