[Intel-gfx] [PATCH] drm/i915/gt: Sanitize GT first
Tvrtko Ursulin
tvrtko.ursulin at linux.intel.com
Mon Apr 27 09:50:01 UTC 2020
On 27/04/2020 09:40, Chris Wilson wrote:
> We see that if the HW doesn't actually sleep, the HW may eat the poison
> we set in its write-only HWSP during sanitize:
>
> intel_gt_resume.part.8: 0000:00:02.0
> __gt_unpark: 0000:00:02.0
> gt_sanitize: 0000:00:02.0 force:yes
> process_csb: 0000:00:02.0 vcs0: cs-irq head=5, tail=90
> process_csb: 0000:00:02.0 vcs0: csb[0]: status=0x5a5a5a5a:0x5a5a5a5a
> assert_pending_valid: Nothing pending for promotion!
>
> The CS TAIL pointer should have been reset by reset_csb_pointers(), so
> in this case it is likely that we have read back from the CPU cache and
> so we must clflush our control over that page. In doing so, push the
> sanitisation to the start of the GT sequence so that our poisoning is
> assuredly before we start talking to the HW.
>
> Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
> ---
> drivers/gpu/drm/i915/gt/intel_gt_pm.c | 3 ++-
> drivers/gpu/drm/i915/gt/intel_lrc.c | 3 +++
> 2 files changed, 5 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/i915/gt/intel_gt_pm.c b/drivers/gpu/drm/i915/gt/intel_gt_pm.c
> index 4c4c74ef4f21..5097786f4375 100644
> --- a/drivers/gpu/drm/i915/gt/intel_gt_pm.c
> +++ b/drivers/gpu/drm/i915/gt/intel_gt_pm.c
> @@ -198,11 +198,12 @@ int intel_gt_resume(struct intel_gt *gt)
> * Only the kernel contexts should remain pinned over suspend,
> * allowing us to fixup the user contexts on their first pin.
> */
> + gt_sanitize(gt, true);
> +
> intel_gt_pm_get(gt);
gt_sanitize declares hw access so I thin it should still be after this
pm get.
Regards,
Tvrtko
>
> intel_uncore_forcewake_get(gt->uncore, FORCEWAKE_ALL);
> intel_rc6_sanitize(>->rc6);
> - gt_sanitize(gt, true);
> if (intel_gt_is_wedged(gt)) {
> err = -EIO;
> goto out_fw;
> diff --git a/drivers/gpu/drm/i915/gt/intel_lrc.c b/drivers/gpu/drm/i915/gt/intel_lrc.c
> index c8014c265ffb..cd6afa2cf5fd 100644
> --- a/drivers/gpu/drm/i915/gt/intel_lrc.c
> +++ b/drivers/gpu/drm/i915/gt/intel_lrc.c
> @@ -3931,6 +3931,9 @@ static void execlists_sanitize(struct intel_engine_cs *engine)
> * reset the value in the HWSP.
> */
> intel_timeline_reset_seqno(engine->kernel_context->timeline);
> +
> + /* And scrub the dirty cachelines for the HWSP */
> + clflush_cache_range(engine->status_page.addr, PAGE_SIZE);
> }
>
> static void enable_error_interrupt(struct intel_engine_cs *engine)
>
More information about the Intel-gfx
mailing list