[PATCH v4 1/2] drm/xe: Don't short circuit TDR on jobs not started

Lucas De Marchi lucas.demarchi at intel.com
Thu Oct 31 05:20:52 UTC 2024


On Fri, Oct 25, 2024 at 02:43:29PM -0700, Matthew Brost wrote:
>Short circuiting TDR on jobs not started is an optimization which is not
>required. On LNL we are facing an issue where jobs do not get scheduled
>by the GuC if it misses a GGTT page update. When this occurs let the TDR
>fire, toggle the scheduling which may get the job unstuck, and print a
>warning message. If the TDR fires twice on job that hasn't started,
>timeout the job.
>
>v2:
> - Add warning message (Paulo)
> - Add fixes tag (Paulo)
> - Timeout job which hasn't started after TDR firing twice
>v3:
> - Include local change
>v4:
> - Short circuit check_timeout on job not started
> - use warn level rather than notice (Paulo)
>
>Fixes: 7ddb9403dd74 ("drm/xe: Sample ctx timestamp to determine if jobs have timed out")
>Cc: stable at vger.kernel.org
>Cc: Paulo Zanoni <paulo.r.zanoni at intel.com>
>Signed-off-by: Matthew Brost <matthew.brost at intel.com>

Reviewed-by: Lucas De Marchi <lucas.demarchi at intel.com>

and applied. Thanks

Lucas De Marchi


More information about the Intel-xe mailing list