<html>
    <head>
      <base href="https://bugs.freedesktop.org/">
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - [KBL] hangcheck /reset fails, keeps GPU at max"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=98748">98748</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>[KBL] hangcheck /reset fails, keeps GPU at max
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>DRI
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>DRI git
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>Other
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>All
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>normal
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>medium
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>DRM/Intel
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>eero.t.tamminen@intel.com
          </td>
        </tr>

        <tr>
          <th>QA Contact</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>i915 platform</th>
          <td>KBL
          </td>
        </tr>

        <tr>
          <th>i915 features</th>
          <td>GPU hang
          </td>
        </tr></table>
      <p>
        <div>
        <pre>Created <span class=""><a href="attachment.cgi?id=128014" name="attach_128014" title="Dmesg">attachment 128014</a> <a href="attachment.cgi?id=128014&action=edit" title="Dmesg">[details]</a></span>
Dmesg

Test setup:
- KBL-U QL9J (haven't seen this yet on other platforms)
- Fairly up-to-date Ubuntu 16.04 with DRI3 and Unity desktop
- Kernel and the rest of the 3D stack are at most a few weeks old
kernel git://anongit.freedesktop.org/drm-intel at
04145fe15cf8c81c221e62fc9d65d93053f9bd1a 2016-11-15_14-49-57

Test-case:
- Boot
- Run the Unigine, GLBenchmark 2.7, GfxBench 4.0 and SynMark 7.0 benchmarks
several times

Expected outcome:
- Everything works fine

Actual outcome:
- After SynMark CSDof (spilling compute shader test), all remaining tests fail
with:
    intel_do_flush_locked failed: Input/output error
- A few minutes after the 3D tests have been stopped, device idle power usage
is still very high (3x normal)

Logs show that when the device is idling afterwards:
- Package and cores are in lower power states, as expected
- GPU frequency is still at max (allowed by TDP), with 0% in RC6*
- compiz is 100% in (GPU?) IOWAIT
- Unlike in normal situations, powertop shows:
-------------------
Usage;Wakeups/s;GPU ops/s;Disk IO/s;GFX Wakeups/s;Category;Description
 77,9 ms/s;;;;;kWork;i915_hangcheck_elapsed
-------------------
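
To track this across runs, the powertop line above can be pulled out of the
CSV report with a small script. This is only a minimal sketch: it assumes the
report is semicolon-separated with a comma as decimal separator, exactly as in
the excerpt, and the function names are made up for illustration.

```python
import csv
import io

# Header and row copied from the powertop CSV excerpt above.
POWERTOP_CSV = (
    "Usage;Wakeups/s;GPU ops/s;Disk IO/s;GFX Wakeups/s;Category;Description\n"
    " 77,9 ms/s;;;;;kWork;i915_hangcheck_elapsed\n"
)


def parse_usage_ms_per_s(field):
    """Convert a usage field such as ' 77,9 ms/s' to a float (77.9).

    Assumes a comma decimal separator and an 'ms/s' unit, as in the excerpt.
    """
    value, unit = field.split()
    if unit != "ms/s":
        raise ValueError("unexpected unit: " + unit)
    return float(value.replace(",", "."))


def hangcheck_usage(csv_text, name="i915_hangcheck_elapsed"):
    """Return the usage in ms/s for the named entry, or None if absent."""
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=";")
    for row in reader:
        if row["Description"] == name:
            return parse_usage_ms_per_s(row["Usage"])
    return None


usage = hangcheck_usage(POWERTOP_CSV)
print(usage)
```

77.9 ms/s means the hangcheck worker is busy for roughly 7.8% of wall-clock
time on an otherwise idle device.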

The same issue happens with last night's versions of X server, Intel DDX and
Mesa (which should fix one issue with spilling), and with versions of them
that are a few weeks older.

Dmesg attached. Earlier GEN <a class="bz_bug_link 
          bz_status_CLOSED  bz_closed"
   title="CLOSED WONTFIX - [BSW] GPU reset fails after GPU HANG: *ERROR* Failed to reset chip: -5"
   href="show_bug.cgi?id=92774">bug 92774</a> seems to have had a similar issue.</pre>
        </div>
      </p>


    </body>
</html>