[PATCH v2 2/2] drm/ttm: Add a device flag to propagate -ENOSPC on OOM
Christian König
christian.koenig at amd.com
Wed Oct 2 13:46:29 UTC 2024
Am 02.10.24 um 14:24 schrieb Thomas Hellström:
> Some graphics APIs differentiate between out-of-graphics-memory and
> out-of-host-memory (system memory). Add a device init flag to
> have -ENOSPC propagated from the resource managers instead of being
> converted to -ENOMEM, to aid driver stacks in determining what
> error code to return or whether corrective action can be taken at
> the driver level.
>
> Cc: Christian König <christian.koenig at amd.com>
> Cc: Matthew Brost <matthew.brost at intel.com>
> Signed-off-by: Thomas Hellström <thomas.hellstrom at linux.intel.com>
Independent of how we communicate flags to the TTM device init function
this looks like the right approach to me.
So feel free to add Reviewed-by: Christian König <christian.koenig at amd.com>.
Regards,
Christian.
> ---
> drivers/gpu/drm/ttm/ttm_bo.c | 2 +-
> drivers/gpu/drm/ttm/ttm_device.c | 1 +
> include/drm/ttm/ttm_device.h | 13 +++++++++++++
> 3 files changed, 15 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gpu/drm/ttm/ttm_bo.c
> index 320592435252..c4bec2ad301b 100644
> --- a/drivers/gpu/drm/ttm/ttm_bo.c
> +++ b/drivers/gpu/drm/ttm/ttm_bo.c
> @@ -835,7 +835,7 @@ int ttm_bo_validate(struct ttm_buffer_object *bo,
>
> /* For backward compatibility with userspace */
> if (ret == -ENOSPC)
> - return -ENOMEM;
> + return bo->bdev->propagate_enospc ? ret : -ENOMEM;
>
> /*
> * We might need to add a TTM.
> diff --git a/drivers/gpu/drm/ttm/ttm_device.c b/drivers/gpu/drm/ttm/ttm_device.c
> index 0c85d10e5e0b..aee9d52d745b 100644
> --- a/drivers/gpu/drm/ttm/ttm_device.c
> +++ b/drivers/gpu/drm/ttm/ttm_device.c
> @@ -203,6 +203,7 @@ int ttm_device_init(struct ttm_device *bdev, const struct ttm_device_funcs *func
> }
>
> bdev->funcs = funcs;
> + bdev->propagate_enospc = flags.propagate_enospc;
>
> ttm_sys_man_init(bdev);
>
> diff --git a/include/drm/ttm/ttm_device.h b/include/drm/ttm/ttm_device.h
> index 1534bd946c78..f9da78bbd925 100644
> --- a/include/drm/ttm/ttm_device.h
> +++ b/include/drm/ttm/ttm_device.h
> @@ -266,6 +266,13 @@ struct ttm_device {
> * @wq: Work queue structure for the delayed delete workqueue.
> */
> struct workqueue_struct *wq;
> +
> + /**
> + * @propagate_enospc: Whether -ENOSPC should be propagated to the caller after
> + * graphics memory allocation failure. If false, this will be converted to
> + * -ENOMEM, which is the default behaviour.
> + */
> + bool propagate_enospc;
> };
>
> int ttm_global_swapout(struct ttm_operation_ctx *ctx, gfp_t gfp_flags);
> @@ -295,6 +302,12 @@ struct ttm_device_init_flags {
> u32 use_dma_alloc : 1;
> /** @use_dma32: If we should use GFP_DMA32 for device memory allocations. */
> u32 use_dma32 : 1;
> + /**
> + * @propagate_enospc: Whether -ENOSPC should be propagated to the caller after
> + * graphics memory allocation failure. If false, this will be converted to
> + * -ENOMEM, which is the default behaviour.
> + */
> + u32 propagate_enospc : 1;
> };
>
> int ttm_device_init(struct ttm_device *bdev, const struct ttm_device_funcs *funcs,
More information about the amd-gfx
mailing list