[PATCH 1/1] drm/amdgpu: fix the memleak caused by fence not released

Christian König christian.koenig at amd.com
Wed Feb 12 13:08:17 UTC 2025


Am 12.02.25 um 12:30 schrieb Le Ma:
> On systems with CONFIG_SLUB_DEBUG enabled, the memleak like below
> will show up explicitly during driver unloading if created bo without
> drm_timeline object before.
>
>      BUG drm_sched_fence (Tainted: G           OE     ): Objects remaining in drm_sched_fence on __kmem_cache_shutdown()
>      -----------------------------------------------------------------------------
>      Call Trace:
>      <TASK>
>      dump_stack_lvl+0x4c/0x70
>      dump_stack+0x14/0x20
>      slab_err+0xb0/0xf0
>      ? srso_alias_return_thunk+0x5/0xfbef5
>      ? flush_work+0x12/0x20
>      ? srso_alias_return_thunk+0x5/0xfbef5
>      __kmem_cache_shutdown+0x163/0x2e0
>      kmem_cache_destroy+0x61/0x170
>      drm_sched_fence_slab_fini+0x19/0x900
>
> Thus call dma_fence_put properly to avoid the memleak.

Good catch, but there is even more broken here.

The call to amdgpu_gem_update_timeline_node() comes way to late, we need 
to make sure that this is called *before* doing the update. Probably 
best to just inline the functionality.

Then AMDGPU_VM_DELAY_UPDATE and is mutual exclusive to specifying the 
syncobj when the update is completed.

>
> Signed-off-by: Le Ma <le.ma at amd.com>
> ---
>   drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c | 5 ++++-
>   1 file changed, 4 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> index 8b67aae6c2fe..ad3c5fdf6970 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_gem.c
> @@ -125,8 +125,11 @@ amdgpu_gem_update_bo_mapping(struct drm_file *filp,
>   	struct amdgpu_vm *vm = &fpriv->vm;
>   	struct dma_fence *last_update;
>   
> -	if (!syncobj)
> +	if (!syncobj) {
> +		dma_fence_wait(fence, false);

Please drop that wait, this is just nonsense.

> +		dma_fence_put(fence);
>   		return;
> +	}

We might want to move all this into amdgpu_gem_va_update_vm().

Regards,
Christian.

>   
>   	/* Find the last update fence */
>   	switch (operation) {



More information about the amd-gfx mailing list