[Bug 106262] New: [BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread [4018], reason: Hang on bsd ring,

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Apr 26 22:56:40 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=106262

            Bug ID: 106262
           Summary: [BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread
                    [4018], reason: Hang on bsd ring,
           Product: DRI
           Version: unspecified
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: DRM/Intel
          Assignee: intel-gfx-bugs at lists.freedesktop.org
          Reporter: patrick.beaulieu at avigilon.com
        QA Contact: intel-gfx-bugs at lists.freedesktop.org
                CC: intel-gfx-bugs at lists.freedesktop.org

After about 4-6 weeks of continuously decoding H.264 and copying the result
back to the processor via ffmpeg+libva+beignet, we are seeing a GPU hang. It
has occurred 7-8 times on 5 of 20 different N4200 machines.
This is a headless system with nothing else using the GPU (no display
connected).
Environment:
ApolloLake/Broxton N4200
Linux Kernel 4.10 (yocto project pyro 2.3.3 with matched meta-intel)
libva 1.7.3

# vainfo
libva info: VA-API version 0.39.4
libva info: va_getDriverName() returns 0
libva info: Trying to open /usr/lib/dri/i965_drv_video.so
libva info: Found init function __vaDriverInit_0_39
libva info: va_openDriver() returns 0
vainfo: VA-API version: 0.39 (libva 1.7.3)
vainfo: Driver version: Intel i965 driver for Intel(R) Broxton - 1.7.3
vainfo: Supported profile and entrypoints
      VAProfileMPEG2Simple            : VAEntrypointVLD
      VAProfileMPEG2Main              : VAEntrypointVLD
      VAProfileH264ConstrainedBaseline: VAEntrypointVLD
      VAProfileH264ConstrainedBaseline: VAEntrypointEncSlice
      VAProfileH264Main               : VAEntrypointVLD
      VAProfileH264Main               : VAEntrypointEncSlice
      VAProfileH264High               : VAEntrypointVLD
      VAProfileH264High               : VAEntrypointEncSlice
      VAProfileH264MultiviewHigh      : VAEntrypointVLD
      VAProfileH264MultiviewHigh      : VAEntrypointEncSlice
      VAProfileH264StereoHigh         : VAEntrypointVLD
      VAProfileH264StereoHigh         : VAEntrypointEncSlice
      VAProfileVC1Simple              : VAEntrypointVLD
      VAProfileVC1Main                : VAEntrypointVLD
      VAProfileVC1Advanced            : VAEntrypointVLD
      VAProfileNone                   : VAEntrypointVideoProc
      VAProfileJPEGBaseline           : VAEntrypointVLD
      VAProfileJPEGBaseline           : VAEntrypointEncPicture
      VAProfileVP8Version0_3          : VAEntrypointVLD
      VAProfileVP8Version0_3          : VAEntrypointEncSlice
      VAProfileHEVCMain               : VAEntrypointVLD
      VAProfileHEVCMain               : VAEntrypointEncSlice
      VAProfileHEVCMain10             : VAEntrypointVLD
      VAProfileVP9Profile0            : VAEntrypointVLD

# uname -a
Linux N4200-test-machine 4.10.17-yocto-standard #2 SMP PREEMPT Tue Apr 24 00:12:09 PDT 2018 x86_64 x86_64 x86_64 GNU/Linux

Booting with i915.enable_rc6=0 on the kernel command line did not help.
I will try switching to yocto rocko (kernel 4.12, newer libva), but it will
take a very long time to confirm whether the upgrade fixes it, since this is a
pretty rare hang.
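
Before waiting weeks for a rare hang, it is worth checking that the
i915.enable_rc6=0 option really reached the kernel on each test machine. The
following is a minimal sketch, not part of the original setup; the helper name
i915_params is hypothetical, and it only splits a command-line string into the
i915.* options it contains.

```python
def i915_params(cmdline):
    """Return the i915.* options of a kernel command line as a dict.

    Hypothetical helper for illustration: it performs plain string
    parsing and does not query the driver itself.
    """
    params = {}
    for token in cmdline.split():
        # Module options on the command line look like "i915.name=value".
        if token.startswith("i915.") and "=" in token:
            key, value = token[len("i915."):].split("=", 1)
            params[key] = value
    return params


# Example with a made-up command line; only the i915 option is from the report.
example = "root=/dev/sda2 i915.enable_rc6=0 quiet"
print(i915_params(example))
```

On a running system the string to pass in would be the contents of
/proc/cmdline.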

The userspace application tying into ffmpeg/libva/beignet is fairly mature and
has been used on a Bay Trail system for a year without issue on yocto krogoth
(kernel 4.4).
I can try to reproduce on a NUC6CAYS but it would take weeks to confirm/deny an
issue there.

Will attach /sys/class/drm/card0/error and dmesg.
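
Since the hang has shown up on several machines, it can help to split the hang
summary line from each dmesg into fields and compare them across occurrences.
The sketch below is only an illustration: parse_hang is a hypothetical helper,
and the field names "gen" and "ring" reflect my reading of the message (the
first number appears to be the GPU generation and the second the engine
index), which is an assumption and not something confirmed in this report.

```python
import re

# Pattern for the hang summary as it appears in the bug title.
HANG_RE = re.compile(
    r"GPU HANG: ecode (\d+):(\d+):(0x[0-9a-fA-F]+), "
    r"in (.+?) \[(\d+)\], reason: (.*)"
)


def parse_hang(line):
    """Split a 'GPU HANG' summary line into fields, or return None."""
    m = HANG_RE.search(line)
    if m is None:
        return None
    gen, ring, code, proc, pid, reason = m.groups()
    return {
        "gen": int(gen),        # assumed meaning: GPU generation
        "ring": int(ring),      # assumed meaning: engine index
        "hash": int(code, 16),  # the hexadecimal part of the ecode
        "process": proc,
        "pid": int(pid),
        # The summary in this report ends with a trailing comma; drop it.
        "reason": reason.strip().rstrip(","),
    }


summary = ("[BXT] GPU HANG: ecode 9:2:0xbefffffe, in Main Thread [4018], "
           "reason: Hang on bsd ring,")
print(parse_hang(summary))
```

Two hangs with the same ecode and reason on different machines would suggest a
common cause; differing values would point the other way.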

One question/clarification: is the additional sideloaded Linux GPU firmware
mandatory for proper operation? I was under the impression that it is optional
and only needed if advanced DMC power states are required.
# dmesg | grep "firmware"
[    1.147206] i915 0000:00:02.0: Direct firmware load for i915/bxt_dmc_ver1_07.bin failed with error -2
[    1.147228] i915 0000:00:02.0: Failed to load DMC firmware [https://01.org/linuxgraphics/intel-linux-graphics-firmwares], disabling runtime power management.
[    2.130898] [drm] GuC firmware load skipped
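
To check the same thing across all 20 machines, the failed firmware loads can
be pulled out of saved dmesg output. This is a rough sketch under the
assumption that the messages have the wording shown above; failed_firmware is a
hypothetical helper name. Error -2 is -ENOENT, meaning the file was not found.

```python
import re

# Matches the kernel's "Direct firmware load for <file> failed with error <n>"
# message. \s+ also tolerates the line being wrapped by a mail client.
FW_FAIL_RE = re.compile(
    r"Direct firmware load for\s+(\S+)\s+failed with error\s+(-?\d+)"
)


def failed_firmware(dmesg_text):
    """Return a list of (firmware_path, errno) for failed firmware loads."""
    return [(path, int(err)) for path, err in FW_FAIL_RE.findall(dmesg_text)]


# Sample input copied from the dmesg excerpt in this report.
sample = """\
[    1.147206] i915 0000:00:02.0: Direct firmware load for
i915/bxt_dmc_ver1_07.bin failed with error -2
[    1.147228] i915 0000:00:02.0: Failed to load DMC firmware
[    2.130898] [drm] GuC firmware load skipped
"""
print(failed_firmware(sample))
```

An empty list means no failed firmware load of this form was logged; it does
not say whether any firmware was actually loaded.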
