<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - [BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread [4018], reason: Hang on bsd ring,"
href="https://bugs.freedesktop.org/show_bug.cgi?id=106262">106262</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>[BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread [4018], reason: Hang on bsd ring,
</td>
</tr>
<tr>
<th>Product</th>
<td>DRI
</td>
</tr>
<tr>
<th>Version</th>
<td>unspecified
</td>
</tr>
<tr>
<th>Hardware</th>
<td>x86-64 (AMD64)
</td>
</tr>
<tr>
<th>OS</th>
<td>Linux (All)
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>normal
</td>
</tr>
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Component</th>
<td>DRM/Intel
</td>
</tr>
<tr>
<th>Assignee</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Reporter</th>
<td>patrick.beaulieu@avigilon.com
</td>
</tr>
<tr>
<th>QA Contact</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>CC</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr></table>
<p>
<div>
<pre>After running for about 4-6wks of constant decoding h264 and copying back to
processor via ffmpeg+libva+beignet we are seeing a GPU hang. Has occurred 7-8
times on 5 of 20 different N4200 machines.
This is a headless system with nothing else using the GPU (no display
connected).
Environment:
ApolloLake/Broxton N4200
Linux Kernel 4.10 (yocto project pyro 2.3.3 with matched meta-intel)
libva 1.7.3
# vainfo
libva info: VA-API version 0.39.4
libva info: va_getDriverName() returns 0
libva info: Trying to open /usr/lib/dri/i965_drv_video.so
libva info: Found init function __vaDriverInit_0_39
libva info: va_openDriver() returns 0
vainfo: VA-API version: 0.39 (libva 1.7.3)
vainfo: Driver version: Intel i965 driver for Intel(R) Broxton - 1.7.3
vainfo: Supported profile and entrypoints
VAProfileMPEG2Simple : VAEntrypointVLD
VAProfileMPEG2Main : VAEntrypointVLD
VAProfileH264ConstrainedBaseline: VAEntrypointVLD
VAProfileH264ConstrainedBaseline: VAEntrypointEncSlice
VAProfileH264Main : VAEntrypointVLD
VAProfileH264Main : VAEntrypointEncSlice
VAProfileH264High : VAEntrypointVLD
VAProfileH264High : VAEntrypointEncSlice
VAProfileH264MultiviewHigh : VAEntrypointVLD
VAProfileH264MultiviewHigh : VAEntrypointEncSlice
VAProfileH264StereoHigh : VAEntrypointVLD
VAProfileH264StereoHigh : VAEntrypointEncSlice
VAProfileVC1Simple : VAEntrypointVLD
VAProfileVC1Main : VAEntrypointVLD
VAProfileVC1Advanced : VAEntrypointVLD
VAProfileNone : VAEntrypointVideoProc
VAProfileJPEGBaseline : VAEntrypointVLD
VAProfileJPEGBaseline : VAEntrypointEncPicture
VAProfileVP8Version0_3 : VAEntrypointVLD
VAProfileVP8Version0_3 : VAEntrypointEncSlice
VAProfileHEVCMain : VAEntrypointVLD
VAProfileHEVCMain : VAEntrypointEncSlice
VAProfileHEVCMain10 : VAEntrypointVLD
VAProfileVP9Profile0 : VAEntrypointVLD
# uname -a
Linux N4200-test-machine 4.10.17-yocto-standard #2 SMP PREEMPT Tue Apr 24
00:12:09 PDT 2018 x86_64 x86_64 x86_64 GNU/Linux
Tried cmdline i915.enable_rc6=0 and it did not help.
Will try switching to yocto rocko (kernel 4.12, new libva) but it will take a
very long time to confirm if it is fixed by the upgrade since this is a pretty
rare hang.
The userspace application tying into ffmpeg/libva/beignet is fairly mature and
has been used with a baytrail system for a year without issue on yocto krogoth
(kernel 4.4).
I can try to reproduce on a NUC6CAYS but it would take weeks to confirm/deny an
issue there.
Will Attach /sys/class/drm/card0/error and dmesg
One question/clarification: Is the additional linux sideloaded gpu firmware
mandatory to be loaded for proper operation? I was under the impression it is
optional and only required if advanced dmc power states are required.
# dmesg | grep "firmware"
[ 1.147206] i915 0000:00:02.0: Direct firmware load for
i915/bxt_dmc_ver1_07.bin failed with error -2
[ 1.147228] i915 0000:00:02.0: Failed to load DMC firmware
[<a href="https://01.org/linuxgraphics/intel-linux-graphics-firmwares">https://01.org/linuxgraphics/intel-linux-graphics-firmwares</a>], disabling
runtime power management.
[ 2.130898] [drm] GuC firmware load skipped</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
<li>You are on the CC list for the bug.</li>
<li>You are the QA Contact for the bug.</li>
</ul>
</body>
</html>