[PATCH] i915/gt/selftest_lrc: Remove timestamp test
Tvrtko Ursulin
tursulin at ursulin.net
Fri Mar 7 08:14:14 UTC 2025
On 06/03/2025 10:37, Chris Wilson wrote:
> Quoting Tvrtko Ursulin (2025-03-04 16:43:45)
>>
>> On 04/03/2025 13:09, Mikolaj Wasiak wrote:
>>> This test exposes bug in tigerlake hardware which prevents it from
>>> succeeding. Since the tested feature is only available on bugged hardware
>>> and we won't support any new hardware, this test is obsolete and
>>> should be removed.
>>
>> I randomly clicked on one TGL, one DG2, one MTL and one RKL in the CI
>> and only saw test passes. Then I looked at the patch below to see if
>> there is a skip condition but don't see one. So I end up confused since
>> commit message is making it sound like this only exists on Tigerlake and
>> it's failing all the time. Is it perhaps a sporadic failure? On all
>> platforms or just TGL? What am I missing?
>
> The HW issue affects all gen12 platforms currently supported by i915. I
> don't have any data for derivatives, so I cannot confirm if this bug was
> fixed. The lrc_timestamp test was written to demonstrate this HW bug, to
> isolate it from (and explain) the pphwsp runtime discrepancies, covered
> by another selftest. The question is whether we want to keep a selftest
> that is expected to sporadically fail, that exists purely to hunt for
> those failures.
>
> In the past, we have kept such selftests, but hidden them behind
> !IS_ENABLED(CONFIG_DRM_I915_SELFTEST_BROKEN).
>
> So,
> - keep the selftest and expect sporadic failures in BAT, or
Up to Intel - it's not the first sporadically failing test and in the
past at least those were handled.
> - remove the selftest and completely forget about the HW issue, or
> - hide the selftest and stop it running on known bad platforms?
Either of these two are also fine I think, as long as, if the removal is
chosen, it is made sure that either we already have the comment briefly
explaining the above somewhere in code, at a suitable location, or that
a brief comment is added with the removal. And commit message improved
to be less misleading about the failure frequency.
Regards,
Tvrtko
More information about the Intel-gfx
mailing list