[PATCH] drm/amdgpu: add shared fdinfo stats

Christian König ckoenig.leichtzumerken at gmail.com
Thu Nov 30 13:13:24 UTC 2023


Am 28.11.23 um 18:52 schrieb Rob Clark:
> On Tue, Nov 28, 2023 at 6:28 AM Alex Deucher <alexdeucher at gmail.com> wrote:
>> On Tue, Nov 28, 2023 at 9:17 AM Christian König
>> <ckoenig.leichtzumerken at gmail.com> wrote:
>>> Am 17.11.23 um 20:56 schrieb Alex Deucher:
>>>> Add shared stats.  Useful for seeing shared memory.
>>>>
>>>> Signed-off-by: Alex Deucher <alexander.deucher at amd.com>
>>>> ---
>>>>    drivers/gpu/drm/amd/amdgpu/amdgpu_fdinfo.c |  4 ++++
>>>>    drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 11 +++++++++++
>>>>    drivers/gpu/drm/amd/amdgpu/amdgpu_object.h |  6 ++++++
>>>>    3 files changed, 21 insertions(+)
>>>>
>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_fdinfo.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_fdinfo.c
>>>> index 5706b282a0c7..c7df7fa3459f 100644
>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_fdinfo.c
>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_fdinfo.c
>>>> @@ -97,6 +97,10 @@ void amdgpu_show_fdinfo(struct drm_printer *p, struct drm_file *file)
>>>>                   stats.requested_visible_vram/1024UL);
>>>>        drm_printf(p, "amd-requested-gtt:\t%llu KiB\n",
>>>>                   stats.requested_gtt/1024UL);
>>>> +     drm_printf(p, "drm-shared-vram:\t%llu KiB\n", stats.vram_shared/1024UL);
>>>> +     drm_printf(p, "drm-shared-gtt:\t%llu KiB\n", stats.gtt_shared/1024UL);
>>>> +     drm_printf(p, "drm-shared-cpu:\t%llu KiB\n", stats.cpu_shared/1024UL);
>>>> +
>>>>        for (hw_ip = 0; hw_ip < AMDGPU_HW_IP_NUM; ++hw_ip) {
>>>>                if (!usage[hw_ip])
>>>>                        continue;
>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>>> index d79b4ca1ecfc..c24f7b2c04c1 100644
>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
>>>> @@ -1287,25 +1287,36 @@ void amdgpu_bo_get_memory(struct amdgpu_bo *bo,
>>>>                          struct amdgpu_mem_stats *stats)
>>>>    {
>>>>        uint64_t size = amdgpu_bo_size(bo);
>>>> +     struct drm_gem_object *obj;
>>>>        unsigned int domain;
>>>> +     bool shared;
>>>>
>>>>        /* Abort if the BO doesn't currently have a backing store */
>>>>        if (!bo->tbo.resource)
>>>>                return;
>>>>
>>>> +     obj = &bo->tbo.base;
>>>> +     shared = obj->handle_count > 1;
>>> Interesting approach but I don't think that this is correct.
>>>
>>> The handle_count is basically how many GEM handles are there for BO, so
>>> for example it doesn't catch sharing things with V4L.
>>>
>>> What we should probably rather do is to take a look if
>>> bo->tbo.base.dma_buf is NULL or not.
>> +Rob, dri-devel
>>
>> This is what the generic drm helper code does.  See
>> drm_show_memory_stats().  If that is not correct that code should
>> probably be fixed too.
> OTOH, v4l doesn't expose fdinfo.  What "shared" is telling you is
> whether the BO is counted multiple times when you look at all
> processes fdinfo.

Oh, then that's not fully correct either.

You can have multiple handles for the same GEM object in a single client 
as well.

This for example happens when you interact with KMS to get an handle for 
a displayed BO.

DRM flink was one of the major other reasons, but I hope we are not 
using that widely any more.

What exactly is the purpose? To avoid counting a BO multiple times 
because you go over the handles in the common code?

If yes than I would say use obj->handle_count in the common code and 
ob->dma_buf in amdgpu because that is certainly unique.

Regards,
Christian.

>
> But I guess it would be ok to look for obj->handle_count > 1 || obj->dma_buf
>
> BR,
> -R
>
>> Alex
>>
>>> Regards,
>>> Christian.
>>>
>>>
>>>> +
>>>>        domain = amdgpu_mem_type_to_domain(bo->tbo.resource->mem_type);
>>>>        switch (domain) {
>>>>        case AMDGPU_GEM_DOMAIN_VRAM:
>>>>                stats->vram += size;
>>>>                if (amdgpu_bo_in_cpu_visible_vram(bo))
>>>>                        stats->visible_vram += size;
>>>> +             if (shared)
>>>> +                     stats->vram_shared += size;
>>>>                break;
>>>>        case AMDGPU_GEM_DOMAIN_GTT:
>>>>                stats->gtt += size;
>>>> +             if (shared)
>>>> +                     stats->gtt_shared += size;
>>>>                break;
>>>>        case AMDGPU_GEM_DOMAIN_CPU:
>>>>        default:
>>>>                stats->cpu += size;
>>>> +             if (shared)
>>>> +                     stats->cpu_shared += size;
>>>>                break;
>>>>        }
>>>>
>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>>> index d28e21baef16..0503af75dc26 100644
>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.h
>>>> @@ -138,12 +138,18 @@ struct amdgpu_bo_vm {
>>>>    struct amdgpu_mem_stats {
>>>>        /* current VRAM usage, includes visible VRAM */
>>>>        uint64_t vram;
>>>> +     /* current shared VRAM usage, includes visible VRAM */
>>>> +     uint64_t vram_shared;
>>>>        /* current visible VRAM usage */
>>>>        uint64_t visible_vram;
>>>>        /* current GTT usage */
>>>>        uint64_t gtt;
>>>> +     /* current shared GTT usage */
>>>> +     uint64_t gtt_shared;
>>>>        /* current system memory usage */
>>>>        uint64_t cpu;
>>>> +     /* current shared system memory usage */
>>>> +     uint64_t cpu_shared;
>>>>        /* sum of evicted buffers, includes visible VRAM */
>>>>        uint64_t evicted_vram;
>>>>        /* sum of evicted buffers due to CPU access */



More information about the dri-devel mailing list