[Intel-gfx] [PATCH] drm/i915/guc: Don't error on reset of banned context
John Harrison
john.c.harrison at intel.com
Fri Jan 7 00:35:48 UTC 2022
On 1/6/2022 16:31, John.C.Harrison at Intel.com wrote:
> From: John Harrison <John.C.Harrison at Intel.com>
>
> There is a race (already documented in the code) whereby a context can
> be (re-)queued for submission at the same time as it is being banned
> due to a hang and reset. That leads to a hang/reset report from GuC
> for a context which i915 thinks is already banned.
>
> While the race is indented to be fixed in a future GuC update, there
> is no actual harm beyond the wasted execution time of that new hang
> detection period. The context has already been banned for bad
> behaviour so a fresh hang is hardly surprising and certainly isn't
> going to be losing any work that wouldn't already have been lost if
> there was no race.
>
> So don't treat this situation as an error. The error message is seen
> by the CI system as something fatal and causes test failures. Instead,
> just print an informational so the user at least knows a context reset
> occurred (given that the error capture is being skipped).
>
> Signed-off-by: John Harrison <John.C.Harrison at Intel.com>
PS: Meant to add:
Fixes: 249af7242af72 ("drm/i915/guc: Report error on invalid reset
notification")
More information about the Intel-gfx
mailing list