[Bug 110429] [CI][SHARDS] igt@i915_selftest@live_hangcheck - dmesg-fail - igt_atomic_reset_engine timed out, cancelling test.

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Apr 25 11:18:31 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=110429

--- Comment #3 from Arek Hiler <arkadiusz.hiler at intel.com> ---
(In reply to Francesco Balestrieri from comment #2)
> From:
> 
> <3> [2204.524458] [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
> 
> It seems that the HW failed to respond and the test timed out. We should
> increase the timeout of the test to get the actual failure.

Has anything been done about this yet? Was the timeout increased? The issue
has been seen only once, in CI_DRM_5922.

If it is a real issue rather than a one-off fluke, it means we may occasionally
fail to reset from atomic context, which would leave the GPU wedged if it ever
happened on a live system. But that's a stretch.

TBH, I don't think this particular issue has anything to do with atomic
contexts. It looks like a single engine getting stuck at random or taking
longer to reset than expected. So it is not that serious, especially at this
failure rate.

Let's keep an eye on this.
