[Bug 111424] Random recoverable GPU hangs in trivial GpuTest Triangle test

bugzilla-daemon at freedesktop.org
Mon Sep 16 15:33:03 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=111424

Eero Tamminen <eero.t.tamminen at intel.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |regression

--- Comment #7 from Eero Tamminen <eero.t.tamminen at intel.com> ---
(In reply to Eero Tamminen from comment #3)
> > If the drm-tip result is typical, the engine reset should be harmless.
> 
> Last night SKL GT4e didn't recover from the Triangle hang, but got stuck.

For example, last night on KBL GT3e the GPU hangs added about 20s of run-time
to the 40s GpuTest Triangle runs, although the tests that followed worked fine.
In summary, the hangs don't seem harmless.

[ 4866.563824] Iteration 3/3: /opt/benchmarks/GpuTest07/GpuTest /test=triangle
/width=1366 /height=768 /msaa=1 /no_scorebox /benchmark
/benchmark_duration_ms=35000
[ 4902.983138] [drm:intel_cpu_fifo_underrun_irq_handler [i915]] *ERROR* CPU
pipe A FIFO underrun
[ 4910.877780] i915 0000:00:02.0: GPU HANG: ecode 9:1:0x00000000, hang on rcs0
[ 4910.877782] GPU hangs can indicate a bug anywhere in the entire gfx stack,
including userspace.
[ 4910.877783] Please file a _new_ bug report on bugs.freedesktop.org against
DRI -> DRM/Intel
[ 4910.877784] drm/i915 developers can then reassign to the right component if
it's not a kernel issue.
[ 4910.877785] The GPU crash dump is required to analyze GPU hangs, so please
always attach it.
[ 4910.877786] GPU crash dump saved to /sys/class/drm/card0/error
[ 4910.878798] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 4910.879554] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
...
[ 4924.831512] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 4924.833273] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 4924.834022] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 4926.579189] Iteration 1/3: /opt/benchmarks/GpuTest07/GpuTest /test=triangle
/width=1920 /height=1080 /fullscreen /msaa=1 /no_scorebox /benchmark
/benchmark_duration_ms=35000
[ 4926.878687] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 4926.879436] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
...
[ 4974.877657] i915 0000:00:02.0: GPU recovery timed out, cancelling all
in-flight rendering.
[ 4974.879438] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 4974.880194] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 4974.880895] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 4986.598036] Iteration 2/3: /opt/benchmarks/GpuTest07/GpuTest /test=triangle
/width=1920 /height=1080 /fullscreen /msaa=1 /no_scorebox /benchmark
/benchmark_duration_ms=35000
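
The "20s of extra run-time" figure can be roughly cross-checked against the
timestamps in the log above. The following is a minimal sketch, not part of the
original report: it assumes the "Iteration" marker lines are written to the
kernel log at the start of each run, and that a normal run takes about 40s as
stated in the comment. The function name iteration_gaps is hypothetical, and
the measured gap also includes any setup time between runs.

```python
import re

# "Iteration" marker lines copied from the log above (first line of each entry).
LOG = """\
[ 4866.563824] Iteration 3/3: /opt/benchmarks/GpuTest07/GpuTest /test=triangle
[ 4926.579189] Iteration 1/3: /opt/benchmarks/GpuTest07/GpuTest /test=triangle
[ 4986.598036] Iteration 2/3: /opt/benchmarks/GpuTest07/GpuTest /test=triangle
"""

# Matches a dmesg-style timestamp followed by an "Iteration N/M:" marker.
MARKER = re.compile(r"^\[\s*(\d+\.\d+)\]\s+Iteration\s+\d+/\d+:", re.MULTILINE)

NOMINAL_RUN_S = 40.0  # assumption: normal run length, taken from the comment


def iteration_gaps(log_text):
    """Return seconds elapsed between consecutive 'Iteration' marker lines."""
    stamps = [float(m.group(1)) for m in MARKER.finditer(log_text)]
    return [later - earlier for earlier, later in zip(stamps, stamps[1:])]


gaps = iteration_gaps(LOG)
for gap in gaps:
    print(f"{gap:.1f}s between starts, {gap - NOMINAL_RUN_S:.1f}s over nominal")
```

Both gaps come out at about 60s, which is consistent with roughly 20s of extra
time per affected run.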
