<html>
    <head>
      <base href="https://bugs.freedesktop.org/">
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - [KBL] drm/i915: Resetting chip after gpu hang, RC6 on, TF2 segfault"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=103405">103405</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>[KBL] drm/i915: Resetting chip after gpu hang, RC6 on, TF2 segfault
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>Mesa
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>17.2
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>x86-64 (AMD64)
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Linux (All)
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>normal
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>medium
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>Drivers/DRI/i965
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>intel-3d-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>frail.knight@gmail.com
          </td>
        </tr>

        <tr>
          <th>QA Contact</th>
          <td>intel-3d-bugs@lists.freedesktop.org
          </td>
        </tr></table>
      <p>
        <div>
        <pre>Team Fortress 2 is crashing quite often.  All it takes is a few minutes of play
or a map switch/load.  This is what is being reported in dmesg:

[20194.868764] drm/i915: Resetting chip after gpu hang
[20194.868965] [drm] RC6 on
[20206.868784] drm/i915: Resetting chip after gpu hang
[20206.868919] [drm] RC6 on
[20219.860883] drm/i915: Resetting chip after gpu hang
[20219.861028] [drm] RC6 on
[20229.876940] drm/i915: Resetting chip after gpu hang
[20229.877087] [drm] RC6 on
[20239.861012] drm/i915: Resetting chip after gpu hang
[20239.861162] [drm] RC6 on
[20240.110535] MatQueue0[7395]: segfault at fffffffc ip 00000000dbade0f8 sp
00000000c95cba10 error 4 in client.so[dab98000+2041000]

I checked /sys/class/drm/card0/error and the last error report in that file
does not appear to match this set of crashes.  The date was current, but the
time was a few hours earlier.  dmesg consistently reported the same error with
each crash.

Dell XPS 13 9360 DE
Ubuntu 17.10

[tag] [reply] [−] Description Robert 2017-08-27 18:46:18 UTC
Created <span class=""><a href="attachment.cgi?id=133817" name="attach_133817" title="GPU dump file, CSGO dump file and dmesg output">attachment 133817</a> <a href="attachment.cgi?id=133817&action=edit" title="GPU dump file, CSGO dump file and dmesg output">[details]</a></span>
GPU dump file, CSGO dump file and dmesg output

CSGO crashed after playing ~2 hours in and out of matches.  The following was
reported in dmesg:

[ 7987.649974] [drm] GPU HANG: ecode 9:0:0x86df7cf9, in csgo_linux64 [4947],
reason: Hang on rcs, action: reset
[ 7987.649976] [drm] GPU hangs can indicate a bug anywhere in the entire gfx
stack, including userspace.
[ 7987.649978] [drm] Please file a _new_ bug report on bugs.freedesktop.org
against DRI -> DRM/Intel
[ 7987.649979] [drm] drm/i915 developers can then reassign to the right
component if it's not a kernel issue.
[ 7987.649980] [drm] The gpu crash dump is required to analyze gpu hangs, so
please always attach it.
[ 7987.649981] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[ 7987.650057] drm/i915: Resetting chip after gpu hang
[ 7987.650622] [drm] RC6 on
[ 8001.652386] drm/i915: Resetting chip after gpu hang
[ 8001.652537] [drm] RC6 on
[ 8013.652392] drm/i915: Resetting chip after gpu hang
[ 8013.652531] [drm] RC6 on
[ 8027.636176] drm/i915: Resetting chip after gpu hang
[ 8027.636314] [drm] RC6 on
[ 8038.644153] drm/i915: Resetting chip after gpu hang
[ 8038.644306] [drm] RC6 on
[ 8038.843763] show_signal_msg: 65 callbacks suppressed
[ 8038.843765] csgo_linux64[5008]: segfault at 1338 ip 00007f04bfe3f2a9 sp
00007f0444182710 error 6 in client_client.so[7f04bf1c6000+17cf000]

I've included this as well as the GPU crash dump in the attachment.
[tag] [reply] [−] <a href="show_bug.cgi?id=103405#c1">Comment 1</a> Robert 2017-08-27 18:53:48 UTC
I'd also like to mention:

Dell XPS 13 9360 DE
Ubuntu(Xubuntu) 17.10 (in development with current updates)
Mesa 17.2.2</pre>
        </div>
      </p>


      <hr>
      <span>You are receiving this mail because:</span>

      <ul>
          <li>You are the QA Contact for the bug.</li>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>