[PATCH v2] drm/amdgpu: Fix potential double free and null pointer dereference
Luben Tuikov
luben.tuikov at amd.com
Thu Nov 24 16:44:06 UTC 2022
Applied.
Regards,
Luben
On 2022-11-22 19:10, Luben Tuikov wrote:
> amdgpu_xgmi_hive_type does provide a release method which frees the allocated "hive",
> so we don't need a kfree() after a kobject_put().
>
> Reviewed-by: Luben Tuikov <luben.tuikov at amd.com>
>
> Regards,
> Luben
>
> On 2022-11-21 23:28, Liang He wrote:
>> In amdgpu_get_xgmi_hive(), we should not call kfree() after
>> kobject_put() as the PUT will call kfree().
>>
>> In amdgpu_device_ip_init(), we need to check the returned *hive*
>> which can be NULL before we dereference it.
>>
>> Signed-off-by: Liang He <windhl at 126.com>
>> ---
>> v1->v2: we need the extra GET to keep *hive* alive, it is
>> my fault to remove the GET in v1.
>>
>>
>> drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 5 +++++
>> drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c | 2 --
>> 2 files changed, 5 insertions(+), 2 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>> index f1e9663b4051..00976e15b698 100644
>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>> @@ -2462,6 +2462,11 @@ static int amdgpu_device_ip_init(struct amdgpu_device *adev)
>> if (!amdgpu_sriov_vf(adev)) {
>> struct amdgpu_hive_info *hive = amdgpu_get_xgmi_hive(adev);
>>
>> + if (WARN_ON(!hive)) {
>> + r = -ENOENT;
>> + goto init_failed;
>> + }
>> +
>> if (!hive->reset_domain ||
>> !amdgpu_reset_get_reset_domain(hive->reset_domain)) {
>> r = -ENOENT;
>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c
>> index 47159e9a0884..4b9e7b050ccd 100644
>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c
>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c
>> @@ -386,7 +386,6 @@ struct amdgpu_hive_info *amdgpu_get_xgmi_hive(struct amdgpu_device *adev)
>> if (ret) {
>> dev_err(adev->dev, "XGMI: failed initializing kobject for xgmi hive\n");
>> kobject_put(&hive->kobj);
>> - kfree(hive);
>> hive = NULL;
>> goto pro_end;
>> }
>> @@ -410,7 +409,6 @@ struct amdgpu_hive_info *amdgpu_get_xgmi_hive(struct amdgpu_device *adev)
>> dev_err(adev->dev, "XGMI: failed initializing reset domain for xgmi hive\n");
>> ret = -ENOMEM;
>> kobject_put(&hive->kobj);
>> - kfree(hive);
>> hive = NULL;
>> goto pro_end;
>> }
>
More information about the amd-gfx
mailing list