[PATCH] drm/xe: Fix UBSAN shift-out-of-bounds failure

Nirmoy Das nirmoy.das at linux.intel.com
Tue May 7 09:42:46 UTC 2024


On 5/7/2024 11:39 AM, Nirmoy Das wrote:
>
>
> On 5/7/2024 10:04 AM, Shuicheng Lin wrote:
>> Here is the failure stack:
>> [   12.988209] ------------[ cut here ]------------
>> [   12.988216] UBSAN: shift-out-of-bounds in ./include/linux/log2.h:57:13
>> [   12.988232] shift exponent 64 is too large for 64-bit type 'long unsigned int'
>> [   12.988235] CPU: 4 PID: 1310 Comm: gnome-shell Tainted: G     U             6.9.0-rc6+prerelease1158+ #19
>> [   12.988237] Hardware name: Intel Corporation Raptor Lake Client Platform/RPL-S ADP-S DDR5 UDIMM CRB, BIOS RPLSFWI1.R00.3301.A02.2208050712 08/05/2022
>> [   12.988239] Call Trace:
>> [   12.988240]  <TASK>
>> [   12.988242]  dump_stack_lvl+0xd7/0xf0
>> [   12.988248]  dump_stack+0x10/0x20
>> [   12.988250]  ubsan_epilogue+0x9/0x40
>> [   12.988253]  __ubsan_handle_shift_out_of_bounds+0x10e/0x170
>> [   12.988260]  dma_resv_reserve_fences.cold+0x2b/0x48
>> [   12.988262]  ? ww_mutex_lock_interruptible+0x3c/0x110
>> [   12.988267]  drm_exec_prepare_obj+0x45/0x60 [drm_exec]
>> [   12.988271]  ? vm_bind_ioctl_ops_execute+0x5b/0x740 [xe]
>> [   12.988345]  vm_bind_ioctl_ops_execute+0x78/0x740 [xe]
>>
>> It is caused by the value 0 of parameter num_fences in function drm_exec_prepare_obj.
>> And lead to in function __rounddown_pow_of_two, "0 - 1" causes the shift-out-of-bounds.
>> For the num_fences, it should be 1 at least.
>>
>> Cc: Matthew Brost<matthew.brost at intel.com>
>> Signed-off-by: Shuicheng Lin<shuicheng.lin at intel.com>
>> ---
>>   drivers/gpu/drm/xe/xe_vm.c | 4 ++--
>>   1 file changed, 2 insertions(+), 2 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/xe/xe_vm.c b/drivers/gpu/drm/xe/xe_vm.c
>> index d17192c8b7de..96cb4d9762a3 100644
>> --- a/drivers/gpu/drm/xe/xe_vm.c
>> +++ b/drivers/gpu/drm/xe/xe_vm.c
>> @@ -2692,7 +2692,7 @@ static int vma_lock_and_validate(struct drm_exec *exec, struct xe_vma *vma,
>>   
>>   	if (bo) {
>>   		if (!bo->vm)
>> -			err = drm_exec_prepare_obj(exec, &bo->ttm.base, 0);
>> +			err = drm_exec_prepare_obj(exec, &bo->ttm.base, 1);
>
> This needs to be fixed in drm_exec_prepare_obj() by checking 
> num_fences and not calling dma_resv_reserve_fences()
>
or just call drm_exec_lock_obj() here. ref: 
https://patchwork.freedesktop.org/patch/577487/

Nirmoy

>
> Regards,
>
> Nirmoy
>
>>   		if (!err && validate)
>>   			err = xe_bo_validate(bo, xe_vma_vm(vma), true);
>>   	}
>> @@ -2777,7 +2777,7 @@ static int vm_bind_ioctl_ops_lock_and_prep(struct drm_exec *exec,
>>   	struct xe_vma_op *op;
>>   	int err;
>>   
>> -	err = drm_exec_prepare_obj(exec, xe_vm_obj(vm), 0);
>> +	err = drm_exec_prepare_obj(exec, xe_vm_obj(vm), 1);
>>   	if (err)
>>   		return err;
>>   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-xe/attachments/20240507/765a0ffd/attachment-0001.htm>


More information about the Intel-xe mailing list