[PATCH] drm/amdgpu: fix slab-use-after-free in amdgpu_userq_mgr_fini+0x70c
Christian König
christian.koenig at amd.com
Mon Jun 23 15:30:26 UTC 2025
On 23.06.25 17:04, vitaly prosyak wrote:
>
> On 2025-06-23 10:34, Christian König wrote:
>> On 19.06.25 04:26, Alex Deucher wrote:
>>>> ---
>>>> drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 12 +++---------
>>>> drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c | 3 +++
>>>> 2 files changed, 6 insertions(+), 9 deletions(-)
>>>>
>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
>>>> index 7fd233f160bf..204178d949e1 100644
>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
>>>> @@ -2914,16 +2914,10 @@ static int amdgpu_pmops_runtime_idle(struct device *dev)
>>>>
>>>> static int amdgpu_drm_release(struct inode *inode, struct file *filp)
>>>> {
>>>> - struct drm_file *file_priv = filp->private_data;
>>>> - struct amdgpu_fpriv *fpriv = file_priv->driver_priv;
>>>> -
>>>> - if (fpriv) {
>>>> - fpriv->evf_mgr.fd_closing = true;
>>>> - amdgpu_eviction_fence_destroy(&fpriv->evf_mgr);
>>>> - amdgpu_userq_mgr_fini(&fpriv->userq_mgr);
>>>> - }
>>>> + int r;
>>>>
>>>> - return drm_release(inode, filp);
>>>> + r = drm_release(inode, filp);
>>>> + return r;
>>>> }
>>> You can just drop amdgpu_drm_release() altogether and just assign
>>> drm_release() as the callback directly.
>> Stop, that change here is just broken.
>>
>> We need to call amdgpu_userq_mgr_fini() before drm_release() since the later will release all BOs and so eventually also the ring buffer of the user queues.
> Yes, we observe that amdgpu_driver_postclose_kms is invoked before drm_release. Within amdgpu_driver_postclose_kms, we perform all the necessary cleanup steps, including:
>
> amdgpu_eviction_fence_destroy()
>
> amdgpu_userq_mgr_fini()
>
> amdgpu_ctx_mgr_fini()
>
> amdgpu_vm_fini()
>
> These functions handle eviction fences, user queue management, context management, and VM cleanup respectively
That won't work. See drm_file_free():
...
if (drm_core_check_feature(dev, DRIVER_GEM))
drm_gem_release(dev, file);
if (drm_is_primary_client(file))
drm_master_release(file);
if (dev->driver->postclose)
dev->driver->postclose(dev, file);
...
The GEM handles are released first and then postclose is called.
So postclose is to late to stop the queues from processing.
Regards,
Christian.
>
> Thanks, Vitaly
>
>>
>> Regards,
>> Christian.
>>
>>> Alex
>>>
>>>> long amdgpu_drm_ioctl(struct file *filp,
>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c
>>>> index d2ce7d86dbc8..195ed81d39ff 100644
>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c
>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c
>>>> @@ -1501,6 +1501,9 @@ void amdgpu_driver_postclose_kms(struct drm_device *dev,
>>>> amdgpu_vm_bo_del(adev, fpriv->prt_va);
>>>> amdgpu_bo_unreserve(pd);
>>>> }
>>>> + fpriv->evf_mgr.fd_closing = true;
>>>> + amdgpu_eviction_fence_destroy(&fpriv->evf_mgr);
>>>> + amdgpu_userq_mgr_fini(&fpriv->userq_mgr);
>>>>
>>>> amdgpu_ctx_mgr_fini(&fpriv->ctx_mgr);
>>>> amdgpu_vm_fini(adev, &fpriv->vm);
>>>> --
>>>> 2.34.1
>>>>
More information about the amd-gfx
mailing list