<html>
    <head>
      <base href="https://bugs.freedesktop.org/">
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - Unrecoverable GPU hang with 5.4.0 kernel"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=112428">112428</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>Unrecoverable GPU hang with 5.4.0 kernel
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>DRI
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>unspecified
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>x86-64 (AMD64)
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Linux (All)
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>major
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>not set
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>DRM/Intel
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>L.Bonnaud@laposte.net
          </td>
        </tr>

        <tr>
          <th>QA Contact</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr></table>
      <p>
        <div>
        <pre>Hi,

I was using my system, doing nothing special, and the GPU hung.

There are many reports about GPU hangs but this one seems different:
 - it occurred with kernel 5.4.0 instead of 5.3.x kernels (my Intel GPU also
had many problems with 5.3.x kernels)
 - the GPU never recovered (which BTW caused some data loss).  I had to ssh
into the system to get debug info.

Here is some system info (full details below):

Kernel: Linux xeelee 5.4.0-050400-generic #201911242031 SMP Mon Nov 25 01:35:10
UTC 2019 x86_64 x86_64 x86_64 GNU/Linux

Distribution: Ubuntu 19.10

Machine: Intel NUC7i5BNB

Display connector: HDMI 2.0

[233850.738984] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[233850.739750] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request
timed out: {request: 00000001, RESET_CTL: 00000001}                             
[233850.739824] i915 0000:00:02.0: Resetting chip for hang on rcs0
[233850.741595] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request
timed out: {request: 00000001, RESET_CTL: 00000001}                             
[233850.742349] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request
timed out: {request: 00000001, RESET_CTL: 00000001}                             
[234291.141681] INFO: task kworker/0:0:5853 blocked for more than 120 seconds.
[234291.141690]       Not tainted 5.4.0-050400-generic #201911242031
[234291.141693] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables
this message.
[234291.141697] kworker/0:0     D    0  5853      2 0x80004000
[234291.141823] Workqueue: events i915_hotplug_work_func [i915]
[234291.141826] Call Trace:
[234291.141839]  __schedule+0x2e3/0x740
[234291.141846]  schedule+0x42/0xb0
[234291.141852]  schedule_preempt_disabled+0xe/0x10
[234291.141857]  __ww_mutex_lock.isra.0+0x261/0x7f0
[234291.141864]  __ww_mutex_lock_slowpath+0x16/0x20
[234291.141869]  ww_mutex_lock+0x38/0x90
[234291.141916]  drm_modeset_lock+0x35/0xb0 [drm]
[234291.142025]  intel_dp_retrain_link+0x94/0x1c0 [i915]
[234291.142122]  intel_ddi_hotplug+0x7a/0x350 [i915]
[234291.142130]  ? __switch_to_asm+0x40/0x70
[234291.142135]  ? __switch_to_asm+0x34/0x70
[234291.142140]  ? __switch_to_asm+0x40/0x70
[234291.142146]  ? __switch_to_asm+0x40/0x70
[234291.142238]  i915_hotplug_work_func+0x18b/0x280 [i915]
[234291.142249]  process_one_work+0x1ec/0x3a0
[234291.142256]  worker_thread+0x4d/0x400
[234291.142262]  kthread+0x104/0x140
[234291.142268]  ? process_one_work+0x3a0/0x3a0
[234291.142274]  ? kthread_park+0x90/0x90
[234291.142281]  ret_from_fork+0x35/0x40</pre>
        </div>
      </p>


      <hr>
      <span>You are receiving this mail because:</span>

      <ul>
          <li>You are on the CC list for the bug.</li>
          <li>You are the QA Contact for the bug.</li>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>