[Intel-gfx] [PATCH] drm: round_up the size to the alignment value

Arunpravin Paneer Selvam arunpravin.paneerselvam at amd.com
Wed Mar 30 20:44:55 UTC 2022



On 30/03/22 2:42 pm, Christian König wrote:
> Am 30.03.22 um 11:20 schrieb Arunpravin Paneer Selvam:
>>
>> On 30/03/22 2:37 pm, Christian König wrote:
>>> Am 30.03.22 um 11:04 schrieb Arunpravin Paneer Selvam:
>>>> Round up the size value to the min_page_size and trim the last block to
>>>> the required size.
>>>>
>>>> This solves a bug detected when size is not aligned with the min_page_size.
>>>> Unigine Heaven has allocation requests for example required pages are 257
>>>> and alignment request is 256. To allocate the left over 1 page, continues
>>>> the iteration to find the order value which is 0 and when it compares with
>>>> min_order = 8, triggers the BUG_ON(order < min_order). To avoid this issue
>>>> we round_up the size value to the min_page_size and trim the last block to
>>>> the computed required size value.
>>> Well, Matthew and you convinced me to *not* do it like this.
>>>
>>> Has that conclusion changed somehow?
>>>
>> Yes, now he is ok to handle rounding + trimming in drm buddy
> 
> Yeah, but I'm no longer :)
> 
> How do we then handle the detection of contiguous allocation?
> 
> As I said we can do that like:
> 1. alloc
> 2. check if we only have a single node
I think verifying the list is a single node would allow all the power of
2 requests(1 page, 2 pages, 4 pages etc..) single node and CONTIGUOUS
flag not enabled cases entering into the trim function and simply return
since the original size == roundup_pow_of_2 size.

can we handle all the situation (alignment rounding trimming +
contiguous trimming) in a single if condition like below,

if (cur_size != (pages << PAGE_SHIFT))

where cur_size = stores the size value before round_up(alignment
rounding up) or round_pow_of_2 (contiguous rounding up)

pages = stores the size value after round_up(alignment rounding up) or
round_pow_of_2 (contiguous rounding up)

if there is a difference b/w these 2 numbers, we enter the trim block -
- For a single node, we pass the original size (contiguous trimming)
- For multiple node, we fetch the last block and trim the computed size
(alignment rounding trimming)


> 3. trim
> 
> But if we include the trim here we can't do it any more.
> 
> Only alternative would then be to inspect each node and see if it 
> follows directly behind the predecessor.
> 

ok. Therefore, we handle both contiguous allocation trimming and
alignment rounding up trimming (only last block) in amdgpu and i915 driver.

And, in drm buddy we just have a check to return -EINVAL if size is not
aligned to min_page_size.

If yes to above statements, I included alignment rounding up trimming
(only last block) in the same place where currently we trim for the
contiguous allocation. I will send the patch for review.
> Regards,
> Christian.
> 
>>
>>> Regards,
>>> Christian.
>>>
>>>> Signed-off-by: Arunpravin Paneer Selvam <Arunpravin.PaneerSelvam at amd.com>
>>>> ---
>>>>    drivers/gpu/drm/drm_buddy.c | 31 +++++++++++++++++++++++++++++++
>>>>    1 file changed, 31 insertions(+)
>>>>
>>>> diff --git a/drivers/gpu/drm/drm_buddy.c b/drivers/gpu/drm/drm_buddy.c
>>>> index 72f52f293249..98d7ec359b08 100644
>>>> --- a/drivers/gpu/drm/drm_buddy.c
>>>> +++ b/drivers/gpu/drm/drm_buddy.c
>>>> @@ -641,6 +641,7 @@ int drm_buddy_alloc_blocks(struct drm_buddy *mm,
>>>>    	unsigned int min_order, order;
>>>>    	unsigned long pages;
>>>>    	LIST_HEAD(allocated);
>>>> +	u64 cur_size;
>>>>    	int err;
>>>>    
>>>>    	if (size < mm->chunk_size)
>>>> @@ -665,6 +666,11 @@ int drm_buddy_alloc_blocks(struct drm_buddy *mm,
>>>>    	if (start + size == end)
>>>>    		return __drm_buddy_alloc_range(mm, start, size, blocks);
>>>>    
>>>> +	cur_size = size;
>>>> +
>>>> +	if (!IS_ALIGNED(size, min_page_size))
>>>> +		size = round_up(size, min_page_size);
>>>> +
>>>>    	pages = size >> ilog2(mm->chunk_size);
>>>>    	order = fls(pages) - 1;
>>>>    	min_order = ilog2(min_page_size) - ilog2(mm->chunk_size);
>>>> @@ -702,6 +708,31 @@ int drm_buddy_alloc_blocks(struct drm_buddy *mm,
>>>>    			break;
>>>>    	} while (1);
>>>>    
>>>> +
>>>> +	/*
>>>> +	 * If size value rounded up to min_page_size, trim the last block
>>>> +	 * to the required size
>>>> +	 */
>>>> +	if (cur_size != size) {
>>>> +		struct drm_buddy_block *trim_block;
>>>> +		LIST_HEAD(trim_list);
>>>> +		u64 required_size;
>>>> +
>>>> +		trim_block = list_last_entry(&allocated, typeof(*trim_block), link);
>>>> +		list_move_tail(&trim_block->link, &trim_list);
>>>> +		/*
>>>> +		 * Compute the required_size value by subtracting the last block size
>>>> +		 * with (aligned size - original size)
>>>> +		 */
>>>> +		required_size = drm_buddy_block_size(mm, trim_block) - (size - cur_size);
>>>> +
>>>> +		drm_buddy_block_trim(mm,
>>>> +				     required_size,
>>>> +				     &trim_list);
>>>> +
>>>> +		list_splice_tail(&trim_list, &allocated);
>>>> +	}
>>>> +
>>>>    	list_splice_tail(&allocated, blocks);
>>>>    	return 0;
>>>>    
>>>>
>>>> base-commit: ec57376fba5abc0e571617ff88e2ade7970c2e4b
> 


More information about the Intel-gfx mailing list