[PATCH v2] drm/amdgpu: Fix potential double free and null pointer dereference

Luben Tuikov luben.tuikov at amd.com
Thu Nov 24 16:44:06 UTC 2022


Applied.

Regards,
Luben

On 2022-11-22 19:10, Luben Tuikov wrote:
> amdgpu_xgmi_hive_type does provide a release method which frees the allocated "hive",
> so we don't need a kfree() after a kobject_put().
> 
> Reviewed-by: Luben Tuikov <luben.tuikov at amd.com>
> 
> Regards,
> Luben
> 
> On 2022-11-21 23:28, Liang He wrote:
>> In amdgpu_get_xgmi_hive(), we should not call kfree() after
>> kobject_put() as the PUT will call kfree().
>>
>> In amdgpu_device_ip_init(), we need to check the returned *hive*
>> which can be NULL before we dereference it.
>>
>> Signed-off-by: Liang He <windhl at 126.com>
>> ---
>>  v1->v2: we need the extra GET to keep *hive* alive, it is
>>  my fault to remove the GET in v1.
>>
>>
>>  drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 5 +++++
>>  drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c   | 2 --
>>  2 files changed, 5 insertions(+), 2 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>> index f1e9663b4051..00976e15b698 100644
>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
>> @@ -2462,6 +2462,11 @@ static int amdgpu_device_ip_init(struct amdgpu_device *adev)
>>  			if (!amdgpu_sriov_vf(adev)) {
>>  				struct amdgpu_hive_info *hive = amdgpu_get_xgmi_hive(adev);
>>  
>> +				if (WARN_ON(!hive)) {
>> +					r = -ENOENT;
>> +					goto init_failed;
>> +				}
>> +
>>  				if (!hive->reset_domain ||
>>  				    !amdgpu_reset_get_reset_domain(hive->reset_domain)) {
>>  					r = -ENOENT;
>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c
>> index 47159e9a0884..4b9e7b050ccd 100644
>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c
>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c
>> @@ -386,7 +386,6 @@ struct amdgpu_hive_info *amdgpu_get_xgmi_hive(struct amdgpu_device *adev)
>>  	if (ret) {
>>  		dev_err(adev->dev, "XGMI: failed initializing kobject for xgmi hive\n");
>>  		kobject_put(&hive->kobj);
>> -		kfree(hive);
>>  		hive = NULL;
>>  		goto pro_end;
>>  	}
>> @@ -410,7 +409,6 @@ struct amdgpu_hive_info *amdgpu_get_xgmi_hive(struct amdgpu_device *adev)
>>  				dev_err(adev->dev, "XGMI: failed initializing reset domain for xgmi hive\n");
>>  				ret = -ENOMEM;
>>  				kobject_put(&hive->kobj);
>> -				kfree(hive);
>>  				hive = NULL;
>>  				goto pro_end;
>>  			}
> 



More information about the amd-gfx mailing list