[Intel-gfx] [PATCH] drm/i915/selftests: fixup igt_shrink_thp
Tvrtko Ursulin
tvrtko.ursulin at linux.intel.com
Mon Sep 6 15:52:23 UTC 2021
On 06/09/2021 14:48, Matthew Auld wrote:
> On 06/09/2021 13:53, Tvrtko Ursulin wrote:
>>
>> On 06/09/2021 13:30, Matthew Auld wrote:
>>> On 06/09/2021 13:19, Tvrtko Ursulin wrote:
>>>>
>>>> On 06/09/2021 10:17, Matthew Auld wrote:
>>>>> Since the object might still be active here, the shrink_all will
>>>>> simply
>>>>> ignore it, which blows up in the test, since the pages will still be
>>>>> there. Currently THP is disabled which should result in the test being
>>>>> skipped, but if we ever re-enable THP we might start seeing the
>>>>> failure.
>>>>> Fix this by forcing I915_SHRINK_ACTIVE.
>>>>>
>>>>> v2: Some machine in the shard runs doesn't seem to have any available
>>>>> swap when running this test. Try to handle this.
>>>>>
>>>>> Signed-off-by: Matthew Auld <matthew.auld at intel.com>
>>>>> Cc: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>>>> Cc: Thomas Hellström <thomas.hellstrom at linux.intel.com>
>>>>> Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin at intel.com> #v1
>>>>> ---
>>>>> .../gpu/drm/i915/gem/selftests/huge_pages.c | 31
>>>>> ++++++++++++++-----
>>>>> 1 file changed, 24 insertions(+), 7 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
>>>>> b/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
>>>>> index a094f3ce1a90..46ea1997c114 100644
>>>>> --- a/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
>>>>> +++ b/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
>>>>> @@ -1519,6 +1519,7 @@ static int igt_shrink_thp(void *arg)
>>>>> struct i915_vma *vma;
>>>>> unsigned int flags = PIN_USER;
>>>>> unsigned int n;
>>>>> + bool should_swap;
>>>>> int err = 0;
>>>>> /*
>>>>> @@ -1567,23 +1568,39 @@ static int igt_shrink_thp(void *arg)
>>>>> break;
>>>>> }
>>>>> i915_gem_context_unlock_engines(ctx);
>>>>> + /*
>>>>> + * Nuke everything *before* we unpin the pages so we can be
>>>>> reasonably
>>>>> + * sure that when later checking get_nr_swap_pages() that some
>>>>> random
>>>>> + * leftover object doesn't steal the remaining swap space.
>>>>> + */
>>>>> + i915_gem_shrink(NULL, i915, -1UL, NULL,
>>>>> + I915_SHRINK_BOUND |
>>>>> + I915_SHRINK_UNBOUND |
>>>>> + I915_SHRINK_ACTIVE);
>>>>> i915_vma_unpin(vma);
>>>>> if (err)
>>>>> goto out_put;
>>>>> +
>>>>> /*
>>>>> - * Now that the pages are *unpinned* shrink-all should invoke
>>>>> - * shmem to truncate our pages.
>>>>> + * Now that the pages are *unpinned* shrinking should invoke
>>>>> + * shmem to truncate our pages, if we have available swap.
>>>>> */
>>>>> - i915_gem_shrink_all(i915);
>>>>> - if (i915_gem_object_has_pages(obj)) {
>>>>> - pr_err("shrink-all didn't truncate the pages\n");
>>>>> + should_swap = get_nr_swap_pages() > 0;
>>>>> + i915_gem_shrink(NULL, i915, -1UL, NULL,
>>>>> + I915_SHRINK_BOUND |
>>>>> + I915_SHRINK_UNBOUND |
>>>>> + I915_SHRINK_ACTIVE);
>>>>> + if (should_swap == i915_gem_object_has_pages(obj)) {
>>>>
>>>> Hmm is there any value running the test if no swap (given objects
>>>> used by the test are "willneed"), or you could simplify and just do
>>>> early skip?
>>>
>>> Maybe. My thinking was that this adds some coverage if say the device
>>> is not configured with swap. i.e assert that the pages don't
>>> magically disappear, and that their contents still persist etc.
>>>
>>> Happy to make it skip instead though?
>>
>> So reducing it to a basic shrinker test in that case. Hm.. do you know
>> if we have a non THP specific tests for that already somewhere in
>> selftests (I can't spot any), or just in IGT?
>
> Just IGT I think, outside of some cases where we call gem_shrink in very
> specific places, which would be hard to do from an IGT.
>
>>
>> If we indeed don't have it in selftests, then I guess question is
>> whether it is warranted to "hide" such a basic test in the THP
>> "drawer", or instead adding a generic shrinker test should be
>> considered. (And one could then follow with a question should a basic
>> generic test have a THP sub-test.)
>
> The reason for the selftest vs IGT is mostly because userspace doesn't
> have any knowledge of the underlying pages, or whether THP is used. IIRC
> there was some issue with THP + our shmem backend in the past, so also
> adding some basic coverage for THP + i915-gem shrinker seemed
> reasonable. Even if we don't have swap space, I think it still makes
> some sense to call into gem_shrink with our target THP object.
Okay, as you have probably guessed I have no strong feelings either way,
so you can freely upgrade my r-b to current.
Regards,
Tvrtko
>
>>
>> It's hard to say where the boundary for selftests-vs-IGT coverage
>> should be in this case. I mean would it be warranted to add such a
>> generic shrinker selftest. It is mostly testable from userspace, but
>> kernel can do a few more introspections and sanity checks at cost of
>> growing kernel code.
>>
>> Regards,
>>
>> Tvrtko
>>
>>>
>>>>
>>>> Regards,
>>>>
>>>> Tvrtko
>>>>
>>>>> + pr_err("unexpected pages mismatch, should_swap=%s\n",
>>>>> + yesno(should_swap));
>>>>> err = -EINVAL;
>>>>> goto out_put;
>>>>> }
>>>>> - if (obj->mm.page_sizes.sg || obj->mm.page_sizes.phys) {
>>>>> - pr_err("residual page-size bits left\n");
>>>>> + if (should_swap == (obj->mm.page_sizes.sg ||
>>>>> obj->mm.page_sizes.phys)) {
>>>>> + pr_err("unexpected residual page-size bits,
>>>>> should_swap=%s\n",
>>>>> + yesno(should_swap));
>>>>> err = -EINVAL;
>>>>> goto out_put;
>>>>> }
>>>>>
More information about the Intel-gfx
mailing list