[PATCH 6/6] drm/amdgpu: use more than 64KB fragment size if possible
Jay Cornwall
jay at jcornwall.me
Tue Aug 9 15:49:31 UTC 2016
On 2016-08-09 07:52, Christian König wrote:
> From: Christian König <christian.koenig at amd.com>
>
> We align to 64KB, but when userspace aligns even more we can easily use
> more.
>
> Signed-off-by: Christian König <christian.koenig at amd.com>
> ---
> drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c | 12 ++++++++----
> 1 file changed, 8 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
> b/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
> index e6c030b..88f4109 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
> @@ -817,13 +817,13 @@ static void amdgpu_vm_frag_ptes(struct
> amdgpu_pte_update_params *params,
> * allocation size to the fragment size.
> */
>
> - /* SI and newer are optimized for 64KB */
> - uint64_t frag_flags = AMDGPU_PTE_FRAG(AMDGPU_LOG2_PAGES_PER_FRAG);
> - uint64_t frag_align = 1 << AMDGPU_LOG2_PAGES_PER_FRAG;
> + const uint64_t frag_align = 1 << AMDGPU_LOG2_PAGES_PER_FRAG;
>
> uint64_t frag_start = ALIGN(start, frag_align);
> uint64_t frag_end = end & ~(frag_align - 1);
>
> + uint32_t frag;
> +
> /* system pages are non continuously */
> if (params->src || params->pages_addr || !(flags & AMDGPU_PTE_VALID)
> ||
> (frag_start >= frag_end)) {
> @@ -832,6 +832,10 @@ static void amdgpu_vm_frag_ptes(struct
> amdgpu_pte_update_params *params,
> return;
> }
>
> + /* use more than 64KB fragment size if possible */
> + frag = lower_32_bits(frag_start | frag_end);
> + frag = likely(frag) ? __ffs(frag) : 31;
> +
> /* handle the 4K area at the beginning */
> if (start != frag_start) {
> amdgpu_vm_update_ptes(params, vm, start, frag_start,
> @@ -841,7 +845,7 @@ static void amdgpu_vm_frag_ptes(struct
> amdgpu_pte_update_params *params,
>
> /* handle the area in the middle */
> amdgpu_vm_update_ptes(params, vm, frag_start, frag_end, dst,
> - flags | frag_flags);
> + flags | AMDGPU_PTE_FRAG(frag));
>
> /* handle the 4K area at the end */
> if (frag_end != end) {
Would this change not direct larger fragments away from the BigK TLB
partition?
My understanding was VM_L2_CNTL3.L2_CACHE_BIGK_FRAGMENT_SIZE is an exact
match and not a minimum size. I can't find any immediate documentation
on that topic to confirm.
--
Jay Cornwall
More information about the amd-gfx
mailing list