[Intel-xe] [v2 1/2] drm/xe: Add a new memory directory under tile

Sundaresan, Sujaritha sujaritha.sundaresan at intel.com
Thu Dec 7 07:28:14 UTC 2023


On 12/7/2023 12:08 PM, Sundaresan, Sujaritha wrote:
>
> On 12/7/2023 11:36 AM, Riana Tauro wrote:
>>
>>
>> On 12/7/2023 10:51 AM, Sundaresan, Sujaritha wrote:
>>>
>>> On 12/7/2023 10:42 AM, Upadhyay, Tejas wrote:
>>>>
>>>>> -----Original Message-----
>>>>> From: Intel-xe <intel-xe-bounces at lists.freedesktop.org> On Behalf Of
>>>>> Sundaresan, Sujaritha
>>>>> Sent: Wednesday, December 6, 2023 5:44 PM
>>>>> To: Gupta, Anshuman <anshuman.gupta at intel.com>; intel-
>>>>> xe at lists.freedesktop.org
>>>>> Cc: Vivi, Rodrigo <rodrigo.vivi at intel.com>
>>>>> Subject: Re: [Intel-xe] [v2 1/2] drm/xe: Add a new memory 
>>>>> directory under
>>>>> tile
>>>>>
>>>>>
>>>>> On 12/6/2023 5:38 PM, Sundaresan, Sujaritha wrote:
>>>>>> On 12/6/2023 5:23 PM, Gupta, Anshuman wrote:
>>>>>>>> -----Original Message-----
>>>>>>>> From: Intel-xe <intel-xe-bounces at lists.freedesktop.org> On 
>>>>>>>> Behalf Of
>>>>>>>> Sujaritha Sundaresan
>>>>>>>> Sent: Wednesday, December 6, 2023 10:18 AM
>>>>>>>> To: intel-xe at lists.freedesktop.org
>>>>>>>> Cc: Sundaresan, Sujaritha <sujaritha.sundaresan at intel.com>; Vivi,
>>>>>>>> Rodrigo <rodrigo.vivi at intel.com>
>>>>>>>> Subject: [Intel-xe] [v2 1/2] drm/xe: Add a new memory directory
>>>>>>>> under tile
>>>>>>>>
>>>>>>>> Add a new memory directory under /device/tile<n> and move
>>>>>>>> physical_vram_size attribute to the new directory.
>>>>>>>>
>>>>>>>> New hierarchy:
>>>>>>>>
>>>>>>>> /device/tile<n>/memory/physical_vram_size_bytes
>>>>>>>>
>>>>>>>> v2: Fix heading typo (Riana)
>>>>>>>>       Fix cleanup error on unload/reload cycle
>>>>>>>>
>>>>>>>> Signed-off-by: Sujaritha Sundaresan 
>>>>>>>> <sujaritha.sundaresan at intel.com>
>>>>>>>> ---
>>>>>>>>    drivers/gpu/drm/xe/xe_tile_sysfs.c | 15 ++++++++++++---
>>>>>>>>    1 file changed, 12 insertions(+), 3 deletions(-)
>>>>>>>>
>>>>>>>> diff --git a/drivers/gpu/drm/xe/xe_tile_sysfs.c
>>>>>>>> b/drivers/gpu/drm/xe/xe_tile_sysfs.c
>>>>>>>> index 16376607c68f..e8ce4d9270e6 100644
>>>>>>>> --- a/drivers/gpu/drm/xe/xe_tile_sysfs.c
>>>>>>>> +++ b/drivers/gpu/drm/xe/xe_tile_sysfs.c
>>>>>>>> @@ -24,7 +24,8 @@ static ssize_t
>>>>>>>>    physical_vram_size_bytes_show(struct device *kdev, struct
>>>>>>>> device_attribute *attr,
>>>>>>>>                      char *buf)
>>>>>>>>    {
>>>>>>>> -    struct xe_tile *tile = kobj_to_tile(&kdev->kobj);
>>>>>>>> +    struct kobject *kobj = &kdev->kobj;
>>>>>>>> +    struct xe_tile *tile = kobj_to_tile(kobj->parent);
>>>>>>>>
>>>>>>>>        return sysfs_emit(buf, "%llu\n",
>>>>>>>> tile->mem.vram.actual_physical_size);
>>>>>>>>    }
>>>>>>>> @@ -38,7 +39,7 @@ static void tile_sysfs_fini(struct drm_device
>>>>>>>> *drm, void
>>>>>>>> *arg)  {
>>>>>>>>        struct xe_tile *tile = arg;
>>>>>>>>
>>>>>>>> -    kobject_put(tile->sysfs);
>>>>>>>> +    kobject_del(tile->sysfs);
>>>>>>> Why kobekct_del instead of kobject_put?
>>>>>>> Thanks,
>>>>>>> Anshuman Gupta.
>>>>>> Hi Anshuman,
>>>>>>
>>>>>> Basically when sanity checking, after reload we see that we are not
>>>>>> doing a proper cleanup.
>>>>>>
>>>>>> kobject_put will only decrement the ref count and possibly free the
>>>>>> kobject.
>>>>>>
>>>>>> But that is not happening in this case. There is a duplicate 
>>>>>> remaining
>>>>>> of the tile directory.
>>>>>>
>>>>>> This required a clean unregister of the parent from sysfs hence the
>>>>>> use of kobject_del.
>>>>>>
>>>>>> Thanks,
>>>>>>
>>>>>> Suja
>>>>> As a continuation of the above response;
>>>>>
>>>>> I can probably add a kobject_put call as well to ensure that we 
>>>>> are cleaning
>>>>> up the memory side of
>>>>>
>>>>> things as well. Will add.
>>>>>
>>>>> Thanks,
>>>>>
>>>>> Suja
>>>>>
>>>>>>>>    }
>>>>>>>>
>>>>>>>>    void xe_tile_sysfs_init(struct xe_tile *tile) @@ -46,6 +47,7 @@
>>>>>>>> void xe_tile_sysfs_init(struct xe_tile *tile)
>>>>>>>>        struct xe_device *xe = tile_to_xe(tile);
>>>>>>>>        struct device *dev = xe->drm.dev;
>>>>>>>>        struct kobj_tile *kt;
>>>>>>>> +    struct kobject *kobj;
>>>>>>>>        int err;
>>>>>>>>
>>>>>>>>        kt = kzalloc(sizeof(*kt), GFP_KERNEL); @@ -64,8 +66,15 
>>>>>>>> @@ void
>>>>>>>> xe_tile_sysfs_init(struct xe_tile *tile)
>>>>>>>>
>>>>>>>>        tile->sysfs = &kt->base;
>>>>>>>>
>>>>>>>> +    kobj = kobject_create_and_add("memory", tile->sysfs);
>>>>>>>> +    if (!kobj) {
>>>>>>>> +        kobject_put(kobj);
>>>> Do you mean to put kobject_put(tile->sysfs) instead of 
>>>> kobject_put(kobj) ? as there was no Kobj created by the time you 
>>>> reached here!
>>>>
>>>> Tejas
>>>
>>> Yup this should be fixed.
>> Hi Suja
>>
>> Removing tile won't be right, as there are other directories (gt#) 
>> dependent on it. Simple return should be good with a warn?
>>
>> Thanks
>> Riana
> Sure. We can probably have the original cleanup in fini.
>>>
>>> Thanks.
>>>
>>> Suja
>>>
>>>>
>>>>>>>> + drm_warn(&xe->drm, "%s failed, err: %d\n", __func__, -
>>>>>>>> ENOMEM);
>>>>>>>> +        return;
>>>>>>>> +    }
>>>>>>>> +
>>>>>>>>        if (IS_DGFX(xe) && xe->info.platform != XE_DG1 &&
>>>>>>>> -        sysfs_create_file(tile->sysfs, physical_memsize_attr))
>>>>>>>> +        sysfs_create_file(kobj, physical_memsize_attr))
>>>>>>>>            drm_warn(&xe->drm,
>>>>>>>>                 "Sysfs creation to read addr_range per tile
>>>>>>>> failed\n");
>>>>>>>>
>>>>>>>> -- 
>>>>>>>> 2.25.1

Hi all,

So after all of this discussion, here's the final results after testing 
the cleanup.

Regardless of anything we do in the init function, it looks like without 
having the

two-step kobject_del and kobject_put cleanup in the sysfs_fini function 
we will

see an error on reload with the tile directory not being fully cleaned 
up and reporting

duplicate creation.

The solution seems to be just to add a kobject_del before kobject_put in 
the fini

function.

I am only having warnings and simple returns on the init side.

If there can be a consensus about this from the reviewers, I can float 
the next version

accordingly.

Thanks,

Suja



More information about the Intel-xe mailing list