[PATCH] drm/xe: Do not print engine reset message on a killed queue
John Harrison
john.c.harrison at intel.com
Thu May 8 23:03:56 UTC 2025
On 5/8/2025 12:09 PM, Matthew Brost wrote:
> When an app is ctrl-c (killed) any queues running on the GPU have their
> preemption timeout set to the minimum value and scheduling is disabled.
> If the queue has something active on the GPU it is very likely for the
> GuC will trigger an engine reset resulting in the engine reset message
> being printed when this is fully expected. Do not print the engine reset
> message on queues which have been killed.
>
> Reported-by: Paulo Zanoni <paulo.r.zanoni at intel.com>
> Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/4904
> Signed-off-by: Matthew Brost <matthew.brost at intel.com>
> ---
> drivers/gpu/drm/xe/xe_guc_submit.c | 5 +++--
> 1 file changed, 3 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/gpu/drm/xe/xe_guc_submit.c b/drivers/gpu/drm/xe/xe_guc_submit.c
> index 369be36f7dc5..efff462ddd75 100644
> --- a/drivers/gpu/drm/xe/xe_guc_submit.c
> +++ b/drivers/gpu/drm/xe/xe_guc_submit.c
> @@ -2005,8 +2005,9 @@ int xe_guc_exec_queue_reset_handler(struct xe_guc *guc, u32 *msg, u32 len)
> if (unlikely(!q))
> return -EPROTO;
>
> - xe_gt_info(gt, "Engine reset: engine_class=%s, logical_mask: 0x%x, guc_id=%d",
> - xe_hw_engine_class_to_str(q->class), q->logical_mask, guc_id);
> + if (!exec_queue_killed(q))
> + xe_gt_info(gt, "Engine reset: engine_class=%s, logical_mask: 0x%x, guc_id=%d",
> + xe_hw_engine_class_to_str(q->class), q->logical_mask, guc_id);
Maybe make it an xe_gt_dbg in the case of a killed queue? It is still
useful to see such messages when triaging CI failures to get an idea of
what is going on behind the scenes.
John.
>
> trace_xe_exec_queue_reset(q);
>
More information about the Intel-xe
mailing list