[Bug 107809] [CI][DRMTIP] igt at gem_exec_capture@capture-render - dmesg-fail - Failed assertion: !wedged / Failed to reset chip
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Mon Sep 3 14:42:42 UTC 2018
https://bugs.freedesktop.org/show_bug.cgi?id=107809
--- Comment #1 from Chris Wilson <chris at chris-wilson.co.uk> ---
<5>[ 56.294173] i915 0000:00:02.0: Resetting chip for Manually set wedged
engine mask = ffffffffffffffff
<7>[ 56.296291] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 56.797734] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 57.300501] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 57.910710] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 58.412420] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 58.915581] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 59.526644] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 60.027767] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7>[ 60.530446] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<3>[ 61.133603] i915 0000:00:02.0: Failed to reset chip
<7>[ 61.133777] i915_gem_set_wedged rcs0
<7>[ 61.133784] i915_gem_set_wedged current seqno 2332c, last 1, hangcheck
2332c [4868 ms]
<7>[ 61.133789] i915_gem_set_wedged Reset count: 0 (global 1)
<7>[ 61.133821] i915_gem_set_wedged Requests:
<7>[ 61.133885] i915_gem_set_wedged first 1! [4:2374a] @ 4868ms:
rcs0
<7>[ 61.133892] i915_gem_set_wedged last 1! [4:2374a] @ 4868ms:
rcs0
<7>[ 61.133956] i915_gem_set_wedged RING_START: 0x00002000
<7>[ 61.133961] i915_gem_set_wedged RING_HEAD: 0x00000000
<7>[ 61.133966] i915_gem_set_wedged RING_TAIL: 0x00000000
<7>[ 61.133972] i915_gem_set_wedged RING_CTL: 0x00000400 [waiting]
<7>[ 61.133978] i915_gem_set_wedged RING_MODE: 0x00000100
<7>[ 61.133983] i915_gem_set_wedged ACTHD: 0x00000000_0081eac0
<7>[ 61.133989] i915_gem_set_wedged BBADDR: 0x00000000_00000000
<7>[ 61.133994] i915_gem_set_wedged DMA_FADDR: 0x00000000_0081eb80
<7>[ 61.133999] i915_gem_set_wedged IPEIR: 0x00000000
<7>[ 61.134005] i915_gem_set_wedged IPEHR: 0x00000000
<7>[ 61.134039] i915_gem_set_wedged E 1! [4:2374a] @ 4868ms: rcs0
<7>[ 61.134094] i915_gem_set_wedged Queue priority: -2147483648
<7>[ 61.134224] i915_gem_set_wedged IRQ? 0x0 (breadcrumbs? no)
<7>[ 61.134229] i915_gem_set_wedged HWSP:
<7>[ 61.134237] i915_gem_set_wedged [0000] 00000000 00000000 00000000
00000000 00000000 00000000 00000000 00000000
<7>[ 61.134242] i915_gem_set_wedged *
<7>[ 61.134249] i915_gem_set_wedged [00c0] 0002332c 00000000 00000000
00000000 00000000 00000000 00000000 00000000
<7>[ 61.134255] i915_gem_set_wedged [00e0] 00000000 00000000 00000000
00000000 00000000 00000000 00000000 00000000
<7>[ 61.134260] i915_gem_set_wedged *
<7>[ 61.134268] i915_gem_set_wedged Idle? no
That the gpu stops responding appears to be just one of those things (e.g.
RING_CTL we do clear which should undo the wait).
But the big standout is that seqno=0x2332c but we have had previously reset the
next seqno (i.e. the current request has seqno 1 and believes to be completed).
That the seqno is stale (and so we treat new requests as completed even before
they run on hw) may well explain some of the other bogosity.
--
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20180903/ab70128f/attachment.html>
More information about the intel-gfx-bugs
mailing list