[PATCH] drm/ttm: stop warning on TT shrinker failure
Christian König
ckoenig.leichtzumerken at gmail.com
Sat Mar 20 09:04:31 UTC 2021
Am 19.03.21 um 20:06 schrieb Daniel Vetter:
> On Fri, Mar 19, 2021 at 07:53:48PM +0100, Christian König wrote:
>> Am 19.03.21 um 18:52 schrieb Daniel Vetter:
>>> On Fri, Mar 19, 2021 at 03:08:57PM +0100, Christian König wrote:
>>>> Don't print a warning when we fail to allocate a page for swapping things out.
>>>>
>>>> Also rely on memalloc_nofs_save/memalloc_nofs_restore instead of GFP_NOFS.
>>> Uh this part doesn't make sense. Especially since you only do it for the
>>> debugfs file, not in general. Which means you've just completely broken
>>> the shrinker.
>> Are you sure? My impression is that GFP_NOFS should now work much more out
>> of the box with the memalloc_nofs_save()/memalloc_nofs_restore().
> Yeah, if you'd put it in the right place :-)
>
> But also -mm folks are very clear that memalloc_no*() family is for dire
> situation where there's really no other way out. For anything where you
> know what you're doing, you really should use explicit gfp flags.
My impression is just the other way around. You should try to avoid the
NOFS/NOIO flags and use the memalloc_no* approach instead.
>>> If this is just to paper over the seq_printf doing the wrong allocations,
>>> then just move that out from under the fs_reclaim_acquire/release part.
>> No, that wasn't the problem.
>>
>> We have just seen to many failures to allocate pages for swapout and I think
>> that would improve this because in a lot of cases we can then immediately
>> swap things out instead of having to rely on upper layers.
> Yeah, you broke it. Now the real shrinker is running with GFP_KERNEL,
> because your memalloc_no is only around the debugfs function. And ofc it's
> much easier to allocate with GFP_KERNEL, right until you deadlock :-)
The problem here is that for example kswapd calls the shrinker without
holding a FS lock as far as I can see.
And it is rather sad that we can't optimize this case directly.
Anyway you are right if some caller doesn't use the memalloc_no*()
approach we are busted.
Going to change the patch to only not warn for the moment.
Regards,
Christian.
>
> Shrinking is hard, there's no easy way out here.
>
> Cheers, Daniel
>
>> Regards,
>> Christian.
>>
>>
>>> __GFP_NOWARN should be there indeed I think.
>>> -Daniel
>>>
>>>> Signed-off-by: Christian König <christian.koenig at amd.com>
>>>> ---
>>>> drivers/gpu/drm/ttm/ttm_tt.c | 5 ++++-
>>>> 1 file changed, 4 insertions(+), 1 deletion(-)
>>>>
>>>> diff --git a/drivers/gpu/drm/ttm/ttm_tt.c b/drivers/gpu/drm/ttm/ttm_tt.c
>>>> index 2f0833c98d2c..86fa3e82dacc 100644
>>>> --- a/drivers/gpu/drm/ttm/ttm_tt.c
>>>> +++ b/drivers/gpu/drm/ttm/ttm_tt.c
>>>> @@ -369,7 +369,7 @@ static unsigned long ttm_tt_shrinker_scan(struct shrinker *shrink,
>>>> };
>>>> int ret;
>>>> - ret = ttm_bo_swapout(&ctx, GFP_NOFS);
>>>> + ret = ttm_bo_swapout(&ctx, GFP_KERNEL | __GFP_NOWARN);
>>>> return ret < 0 ? SHRINK_EMPTY : ret;
>>>> }
>>>> @@ -389,10 +389,13 @@ static unsigned long ttm_tt_shrinker_count(struct shrinker *shrink,
>>>> static int ttm_tt_debugfs_shrink_show(struct seq_file *m, void *data)
>>>> {
>>>> struct shrink_control sc = { .gfp_mask = GFP_KERNEL };
>>>> + unsigned int flags;
>>>> fs_reclaim_acquire(GFP_KERNEL);
>>>> + flags = memalloc_nofs_save();
>>>> seq_printf(m, "%lu/%lu\n", ttm_tt_shrinker_count(&mm_shrinker, &sc),
>>>> ttm_tt_shrinker_scan(&mm_shrinker, &sc));
>>>> + memalloc_nofs_restore(flags);
>>>> fs_reclaim_release(GFP_KERNEL);
>>>> return 0;
>>>> --
>>>> 2.25.1
>>>>
>>>> _______________________________________________
>>>> dri-devel mailing list
>>>> dri-devel at lists.freedesktop.org
>>>> https://lists.freedesktop.org/mailman/listinfo/dri-devel
More information about the amd-gfx
mailing list