<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - Skylake GPU HANG while gstreamer H264 vaapi encoding from MJPEG vaapi decode on drm-tip"
href="https://bugs.freedesktop.org/show_bug.cgi?id=110394">110394</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>Skylake GPU HANG while gstreamer H264 vaapi encoding from MJPEG vaapi decode on drm-tip
</td>
</tr>
<tr>
<th>Product</th>
<td>DRI
</td>
</tr>
<tr>
<th>Version</th>
<td>DRI git
</td>
</tr>
<tr>
<th>Hardware</th>
<td>x86-64 (AMD64)
</td>
</tr>
<tr>
<th>OS</th>
<td>Linux (All)
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>critical
</td>
</tr>
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Component</th>
<td>DRM/Intel
</td>
</tr>
<tr>
<th>Assignee</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Reporter</th>
<td>andy.nicholas@shield.ai
</td>
</tr>
<tr>
<th>QA Contact</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>CC</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr></table>
<p>
<div>
<pre>Created <span class=""><a href="attachment.cgi?id=143919" name="attach_143919" title="GPU crash info plus dmesg log">attachment 143919</a> <a href="attachment.cgi?id=143919&action=edit" title="GPU crash info plus dmesg log">[details]</a></span>
GPU crash info plus dmesg log
Similar to <a class="bz_bug_link
bz_status_NEW "
title="NEW - GPU HANG when transcoding to H.264 using VAAPI on drm-tip"
href="show_bug.cgi?id=110297">Bug #110297</a> which I filed.
Skylake GPU hang when encoding video stream to H.264 using VAAPI. The stream is
decoded from a VAAPI MJPEG stream from a file. We run test loops where we
transcode this stream over and over, thousands of times. This GPU hang happened
on iteration 1026.
Running on Intel Compute Stick STK2mV64CC. We have locked the minimum and
maximum clock speeds of the GPU to 500 Mhz to attempt to avoid... this issue.
We are running this test because one of our products needs to have this exact
configuration: read an MJPEG stream from a V4L camera and transcode into H264.
This configuration needs to be super stable. Crashing once in 1026 iterations
is not considered "stable".
Using Ubuntu 18.04 plus DRM-TIP kernel from about 3 weeks ago which corresponds
with 5.1-rc1.
Using GStreamer 1.14.1:
shield@tobeprovisioned1804:~$ gst-launch-1.0 --version
gst-launch-1.0 version 1.14.1
GStreamer 1.14.1
<a href="https://launchpad.net/distros/ubuntu/+source/gstreamer1.0">https://launchpad.net/distros/ubuntu/+source/gstreamer1.0</a>
Full GPU hang log and dmesg enclosed. This is related to a similar bug which I
previously filed.
Especially concerning is that the machine is usable (but the GPU seems dead)
after this crash. We would like to figure out a way of determining that the GPU
has died and to kernel panic so that we can, eventually, reboot. Modifying the
kernel is A-OK to avoid this issue, so if Intel doesn't have a mechanism then I
will try to add something myself.
Leaving the machine in this "half dead" state is bad. We can't use the
gstreamer process termination as the "reboot the machine" trigger as we may
have other, less severe, bugs where we simply want to restart the gstreamer
process.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
<li>You are on the CC list for the bug.</li>
<li>You are the QA Contact for the bug.</li>
</ul>
</body>
</html>