[PATCH] drm/xe: Resume TDR after GT reset

Nirmoy Das nirmoy.das at linux.intel.com
Thu Sep 26 17:25:22 UTC 2024


On 7/25/2024 1:59 AM, Matthew Brost wrote:
> Not starting the TDR after GT reset on exec queue which have been
> restarted can lead to jobs being able to be run forever. Fix this by
> restarting the TDR.
>
> Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
> Signed-off-by: Matthew Brost <matthew.brost at intel.com>

Reviewed-by: Nirmoy Das <nirmoy.das at intel.com>

> ---
>  drivers/gpu/drm/xe/xe_gpu_scheduler.c | 5 +++++
>  drivers/gpu/drm/xe/xe_gpu_scheduler.h | 2 ++
>  drivers/gpu/drm/xe/xe_guc_submit.c    | 1 +
>  3 files changed, 8 insertions(+)
>
> diff --git a/drivers/gpu/drm/xe/xe_gpu_scheduler.c b/drivers/gpu/drm/xe/xe_gpu_scheduler.c
> index e4ad1d6ce1d5..7f24e58cc992 100644
> --- a/drivers/gpu/drm/xe/xe_gpu_scheduler.c
> +++ b/drivers/gpu/drm/xe/xe_gpu_scheduler.c
> @@ -90,6 +90,11 @@ void xe_sched_submission_stop(struct xe_gpu_scheduler *sched)
>  	cancel_work_sync(&sched->work_process_msg);
>  }
>  
> +void xe_sched_submission_resume_tdr(struct xe_gpu_scheduler *sched)
> +{
> +	drm_sched_resume_timeout(&sched->base, sched->base.timeout);
> +}
> +
>  void xe_sched_add_msg(struct xe_gpu_scheduler *sched,
>  		      struct xe_sched_msg *msg)
>  {
> diff --git a/drivers/gpu/drm/xe/xe_gpu_scheduler.h b/drivers/gpu/drm/xe/xe_gpu_scheduler.h
> index 10c6bb9c9386..6aac7fe68673 100644
> --- a/drivers/gpu/drm/xe/xe_gpu_scheduler.h
> +++ b/drivers/gpu/drm/xe/xe_gpu_scheduler.h
> @@ -22,6 +22,8 @@ void xe_sched_fini(struct xe_gpu_scheduler *sched);
>  void xe_sched_submission_start(struct xe_gpu_scheduler *sched);
>  void xe_sched_submission_stop(struct xe_gpu_scheduler *sched);
>  
> +void xe_sched_submission_resume_tdr(struct xe_gpu_scheduler *sched);
> +
>  void xe_sched_add_msg(struct xe_gpu_scheduler *sched,
>  		      struct xe_sched_msg *msg);
>  
> diff --git a/drivers/gpu/drm/xe/xe_guc_submit.c b/drivers/gpu/drm/xe/xe_guc_submit.c
> index 460808507947..2327e11ae311 100644
> --- a/drivers/gpu/drm/xe/xe_guc_submit.c
> +++ b/drivers/gpu/drm/xe/xe_guc_submit.c
> @@ -1768,6 +1768,7 @@ static void guc_exec_queue_start(struct xe_exec_queue *q)
>  	}
>  
>  	xe_sched_submission_start(sched);
> +	xe_sched_submission_resume_tdr(sched);
>  }
>  
>  int xe_guc_submit_start(struct xe_guc *guc)


More information about the Intel-xe mailing list