[Intel-gfx] [PATCH 1/2] drm/i915/guc: Fix for potential false positives in GuC hang selftest

Daniele Ceraolo Spurio daniele.ceraolospurio at intel.com
Thu Nov 9 20:33:35 UTC 2023



On 11/6/2023 3:59 PM, John.C.Harrison at Intel.com wrote:
> From: John Harrison <John.C.Harrison at Intel.com>
>
> Noticed that the hangcheck selftest is submitting a non-preemptoble
> spinner. That means that even if the GuC does not die, the heartbeat
> will still kick in and trigger a reset. Which is rather defeating the
> purpose of the test - to verify that the heartbeat will kick in if the
> GuC itself has died. The test is deliberately killing the GuC, so it
> should never hit the case of a non-dead GuC. But it is not impossible
> that the kill might fail at some future point due to other driver
> re-work.
>
> So, make the spinner pre-emptible. That way the heartbeat can get
> through if the GuC is alive and context switching. Thus a reset only
> happens if the GuC dies. Thus, if the kill should stop working the
> test will now fail rather than claim to pass.
>
> Signed-off-by: John Harrison <John.C.Harrison at Intel.com>

Reviewed-by: Daniele Ceraolo Spurio <daniele.ceraolospurio at intel.com>

Daniele

> ---
>   drivers/gpu/drm/i915/gt/uc/selftest_guc_hangcheck.c | 2 +-
>   1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/i915/gt/uc/selftest_guc_hangcheck.c b/drivers/gpu/drm/i915/gt/uc/selftest_guc_hangcheck.c
> index 34b5d952e2bcb..26fdc392fce6c 100644
> --- a/drivers/gpu/drm/i915/gt/uc/selftest_guc_hangcheck.c
> +++ b/drivers/gpu/drm/i915/gt/uc/selftest_guc_hangcheck.c
> @@ -74,7 +74,7 @@ static int intel_hang_guc(void *arg)
>   		goto err;
>   	}
>   
> -	rq = igt_spinner_create_request(&spin, ce, MI_NOOP);
> +	rq = igt_spinner_create_request(&spin, ce, MI_ARB_CHECK);
>   	intel_context_put(ce);
>   	if (IS_ERR(rq)) {
>   		ret = PTR_ERR(rq);



More information about the dri-devel mailing list