<html>
    <head>
      <base href="https://bugs.freedesktop.org/">
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - [BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread [4018], reason: Hang on bsd ring,"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=106262">106262</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>[BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread [4018], reason: Hang on bsd ring,
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>DRI
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>unspecified
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>x86-64 (AMD64)
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Linux (All)
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>normal
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>medium
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>DRM/Intel
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>patrick.beaulieu@avigilon.com
          </td>
        </tr>

        <tr>
          <th>QA Contact</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr></table>
      <p>
        <div>
        <pre>After about 4-6 weeks of continuous H.264 decoding, with frames copied back to
the CPU via ffmpeg+libva+beignet, we are seeing a GPU hang. It has occurred 7-8
times on 5 of 20 different N4200 machines.
This is a headless system with nothing else using the GPU (no display
connected).
Environment:
ApolloLake/Broxton N4200
Linux Kernel 4.10 (yocto project pyro 2.3.3 with matched meta-intel)
libva 1.7.3

# vainfo
libva info: VA-API version 0.39.4
libva info: va_getDriverName() returns 0
libva info: Trying to open /usr/lib/dri/i965_drv_video.so
libva info: Found init function __vaDriverInit_0_39
libva info: va_openDriver() returns 0
vainfo: VA-API version: 0.39 (libva 1.7.3)
vainfo: Driver version: Intel i965 driver for Intel(R) Broxton - 1.7.3
vainfo: Supported profile and entrypoints
      VAProfileMPEG2Simple            : VAEntrypointVLD
      VAProfileMPEG2Main              : VAEntrypointVLD
      VAProfileH264ConstrainedBaseline: VAEntrypointVLD
      VAProfileH264ConstrainedBaseline: VAEntrypointEncSlice
      VAProfileH264Main               : VAEntrypointVLD
      VAProfileH264Main               : VAEntrypointEncSlice
      VAProfileH264High               : VAEntrypointVLD
      VAProfileH264High               : VAEntrypointEncSlice
      VAProfileH264MultiviewHigh      : VAEntrypointVLD
      VAProfileH264MultiviewHigh      : VAEntrypointEncSlice
      VAProfileH264StereoHigh         : VAEntrypointVLD
      VAProfileH264StereoHigh         : VAEntrypointEncSlice
      VAProfileVC1Simple              : VAEntrypointVLD
      VAProfileVC1Main                : VAEntrypointVLD
      VAProfileVC1Advanced            : VAEntrypointVLD
      VAProfileNone                   : VAEntrypointVideoProc
      VAProfileJPEGBaseline           : VAEntrypointVLD
      VAProfileJPEGBaseline           : VAEntrypointEncPicture
      VAProfileVP8Version0_3          : VAEntrypointVLD
      VAProfileVP8Version0_3          : VAEntrypointEncSlice
      VAProfileHEVCMain               : VAEntrypointVLD
      VAProfileHEVCMain               : VAEntrypointEncSlice
      VAProfileHEVCMain10             : VAEntrypointVLD
      VAProfileVP9Profile0            : VAEntrypointVLD

# uname -a
Linux N4200-test-machine 4.10.17-yocto-standard #2 SMP PREEMPT Tue Apr 24 00:12:09 PDT 2018 x86_64 x86_64 x86_64 GNU/Linux

Booting with i915.enable_rc6=0 on the kernel command line did not help.
Will try switching to yocto rocko (kernel 4.12, new libva), but it will take a
very long time to confirm whether the upgrade fixes it, since this is a pretty
rare hang.
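
To rule out a typo in the boot configuration, a small check like the following
could confirm the token was really on the kernel command line. This is only a
sketch: the function name has_cmdline_token is made up for illustration, and
/proc/cmdline is assumed to be the source of the booted command line.

```shell
# Hypothetical helper: succeed if an exact token (e.g. i915.enable_rc6=0)
# appears in a kernel command line file. Defaults to /proc/cmdline.
has_cmdline_token() {
    token="$1"
    cmdline_file="${2:-/proc/cmdline}"
    # Put each space-separated argument on its own line, then look for an
    # exact, fixed-string, whole-line match.
    cat "$cmdline_file" | tr ' ' '\n' | grep -Fqx -- "$token"
}
```

Calling has_cmdline_token i915.enable_rc6=0 returns success only when that exact
token is present, so a partial match such as enable_rc6=0 does not count.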

The userspace application tying into ffmpeg/libva/beignet is fairly mature and
has run on a Bay Trail system for a year without issue on yocto krogoth
(kernel 4.4).
I can try to reproduce on a NUC6CAYS, but it would take weeks to confirm or rule
out the issue there.

Will attach /sys/class/drm/card0/error and dmesg.
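
For the attachments, a capture step along these lines could be run right after
a hang is noticed. It is a sketch under assumptions: collect_hang_logs and the
default output directory are hypothetical names, and the default error node is
the card0 path mentioned above.

```shell
# Hypothetical helper: copy the i915 error state and the kernel log into a
# directory so they can be attached to a bug report.
#   $1 = error state node (assumed default: /sys/class/drm/card0/error)
#   $2 = output directory (assumed default: /tmp/gpu-hang-logs)
collect_hang_logs() {
    error_node="${1:-/sys/class/drm/card0/error}"
    out_dir="${2:-/tmp/gpu-hang-logs}"
    mkdir -p "$out_dir" || return 1
    # Save the error state first; it is the more important attachment.
    cat "$error_node" > "$out_dir/card0-error.txt" || return 1
    # dmesg may need root on some systems; do not fail the capture if it does.
    dmesg > "$out_dir/dmesg.txt" 2>/dev/null || true
    # Print where the files were written.
    echo "$out_dir"
}
```

Running collect_hang_logs with no arguments uses the defaults and prints the
directory holding card0-error.txt and dmesg.txt.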

One question/clarification: is loading the additional sideloaded Linux GPU
firmware mandatory for proper operation? I was under the impression it is
optional and only needed if advanced DMC power states are required.
# dmesg | grep "firmware"
[    1.147206] i915 0000:00:02.0: Direct firmware load for i915/bxt_dmc_ver1_07.bin failed with error -2
[    1.147228] i915 0000:00:02.0: Failed to load DMC firmware [<a href="https://01.org/linuxgraphics/intel-linux-graphics-firmwares">https://01.org/linuxgraphics/intel-linux-graphics-firmwares</a>], disabling runtime power management.
[    2.130898] [drm] GuC firmware load skipped</pre>
        </div>
      </p>


    </body>
</html>