[PATCH 0/4] drm/xe/xe_drm_client: Add per drm client reset stats
André Almeida
andrealmeid at igalia.com
Tue Feb 18 19:21:15 UTC 2025
Hi Jonathan,
Em 14/02/2025 17:37, Jonathan Cavitt escreveu:
> Add additional information to drm client so it can report the last 50
> exec queues to have been banned on it, as well as the last pagefault
> seen when said exec queues were banned. Since we cannot reasonably
> associate a pagefault to a specific exec queue, we currently report the
> last seen pagefault on the associated hw engine instead.
>
> The last pagefault seen per exec queue is saved to the hw engine, and the
> pagefault is updated during the pagefault handling process in
> xe_gt_pagefault. The last seen pagefault is reset when the engine is
> reset because any future exec queue bans likely were not caused by said
> pagefault after the reset.
>
> Also add a tracker that counts the number of times the drm client has
> experienced an engine reset.
What's the use case for this? How will userspace consume this information?
>
> Signed-off-by: Jonathan Cavitt <jonathan.cavitt at intel.com>
>
> Jonathan Cavitt (4):
> drm/xe/xe_exec_queue: Add ID param to exec queue struct
> drm/xe/xe_gt_pagefault: Migrate pagefault struct to header
> FIXME: drm/xe/xe_drm_client: Add per drm client pagefault info
> drm/xe/xe_drm_client: Add per drm client reset stats
>
> drivers/gpu/drm/xe/xe_drm_client.c | 130 +++++++++++++++++++++++
> drivers/gpu/drm/xe/xe_drm_client.h | 38 +++++++
> drivers/gpu/drm/xe/xe_exec_queue.c | 8 ++
> drivers/gpu/drm/xe/xe_exec_queue_types.h | 2 +
> drivers/gpu/drm/xe/xe_gt_pagefault.c | 46 ++++----
> drivers/gpu/drm/xe/xe_gt_pagefault.h | 51 +++++++++
> drivers/gpu/drm/xe/xe_guc_submit.c | 19 ++++
> drivers/gpu/drm/xe/xe_hw_engine.c | 4 +
> drivers/gpu/drm/xe/xe_hw_engine_types.h | 8 ++
> 9 files changed, 279 insertions(+), 27 deletions(-)
>
More information about the Intel-xe
mailing list