<html>
    <head>
      <base href="https://bugs.freedesktop.org/" />
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - [BDW/BSW Regression]igt/gem_reloc_vs_gpu/forked-faulting-reloc-thrashing-hang causes GPU reset fail"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=90732">90732</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>[BDW/BSW Regression]igt/gem_reloc_vs_gpu/forked-faulting-reloc-thrashing-hang causes GPU reset fail
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>DRI
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>unspecified
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>All
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Linux (All)
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>major
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>high
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>DRM/Intel
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>huax.lu@intel.com
          </td>
        </tr>

        <tr>
          <th>QA Contact</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>intel-gfx-bugs@lists.freedesktop.org
          </td>
        </tr></table>
      <p>
        <div>
        <pre>Created <span class=""><a href="attachment.cgi?id=116131" name="attach_116131" title="error state">attachment 116131</a> <a href="attachment.cgi?id=116131&action=edit" title="error state">[details]</a></span>
error state

==System Environment==
--------------------------
Regression: yes

good commit:  65de797816eadb227c45b0127d7ff92410fa3814(dinq)
bad commit: 99c044d7d5cc65661436f271754c011d0f1a02de(dinq)

Non-working platforms: BDW/BSW

==kernel==
--------------------------
drm-intel-nightly/b44f6771cba2cc90525d037445330ed766377aa9
commit b44f6771cba2cc90525d037445330ed766377aa9
Author: Daniel Vetter <<a href="mailto:daniel.vetter@ffwll.ch">daniel.vetter@ffwll.ch</a>>
Date:   Thu May 28 13:39:29 2015 +0200

    drm-intel-nightly: 2015y-05m-28d-11h-38m-51s UTC integration manifest


==Bug detailed description==
-----------------------------
Run ./gem_reloc_vs_gpu --run-subtest forked-faulting-reloc-thrashing-hang, gpu
reset fail.
Following cases also have this issue:
igt@gem_reloc_vs_gpu@forked-interruptible-thrashing-hang
igt@gem_reloc_vs_gpu@forked-thrashing-hang

dmesg:
[   91.753899] [drm] stuck on blitter ring
[   91.754663] [drm] GPU HANG: ecode 8:2:0xe77ffff2, in gem_reloc_vs_gp [4986],
reason: Ring hung, action: reset
[   91.754665] [drm] GPU hangs can indicate a bug anywhere in the entire gfx
stack, including userspace.
[   91.754666] [drm] Please file a _new_ bug report on bugs.freedesktop.org
against DRI -> DRM/Intel
[   91.754668] [drm] drm/i915 developers can then reassign to the right
component if it's not a kernel issue.
[   91.754669] [drm] The gpu crash dump is required to analyze gpu hangs, so
please always attach it.
[   91.754670] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[   91.754705] [drm:i915_reset_and_wakeup] resetting chip
[  101.748383] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748413] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748442] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748477] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748500] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748525] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748547] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748570] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.748617] [drm:i915_gem_wait_for_error.part.25 [i915]] *ERROR* Timed out
waiting for the gpu reset to complete
[  101.750656] Setting dangerous option prefault_disable - tainting kernel
[  101.751194] Setting dangerous option prefault_disable - tainting kernel
[  101.751291] Setting dangerous option prefault_disable - tainting kernel

[  240.060726] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
message.
[  240.060767] kworker/u16:3   D ffff8800a7c77aa8     0  1237      2 0x00000000
[  240.060797] Workqueue: i915-hangcheck i915_hangcheck_elapsed [i915]
[  240.060799]  ffff8800a7c77aa8 ffff880002ae0000 ffff8800a7f82120
ffff8800a7c77ad8
[  240.060802]  0000000000000246 0000000000000000 ffff8800a7c78000
0000000000000246
[  240.060805]  0000000000000000 ffff88000355c068 ffff8800a7f82120
ffff8800a7c77ac8
[  240.060808] Call Trace:
[  240.060814]  [<ffffffff81896db4>] schedule+0x75/0x84
[  240.060816]  [<ffffffff81897011>] schedule_preempt_disabled+0xe/0x10
[  240.060818]  [<ffffffff818986c5>] mutex_lock_nested+0x17c/0x2cb
[  240.060833]  [<ffffffffa0094a13>] ? i915_reset+0x3a/0x13e [i915]
[  240.060847]  [<ffffffffa0094a13>] i915_reset+0x3a/0x13e [i915]
[  240.060866]  [<ffffffffa00c80e2>] i915_reset_and_wakeup+0xd3/0x133 [i915]
[  240.060885]  [<ffffffffa00cbd51>] i915_handle_error+0x5ab/0x5bd [i915]
[  240.060905]  [<ffffffffa00dda30>] ? gen6_read32+0x11a/0x18b [i915]
[  240.060910]  [<ffffffff8109352f>] ? vprintk_default+0x1d/0x1f
[  240.060913]  [<ffffffff8188f3e9>] ? printk+0x46/0x48
[  240.060930]  [<ffffffffa00cc14f>] i915_hangcheck_elapsed+0x3a3/0x3c3 [i915]
[  240.060933]  [<ffffffff8105ab88>] ? process_one_work+0x1ba/0x409
[  240.060935]  [<ffffffff8105abf3>] process_one_work+0x225/0x409
[  240.060937]  [<ffffffff8105ab74>] ? process_one_work+0x1a6/0x409
[  240.060940]  [<ffffffff8105b694>] worker_thread+0x275/0x369
[  240.060942]  [<ffffffff8107c63a>] ? complete+0x42/0x4a
[  240.060944]  [<ffffffff8105b41f>] ? cancel_delayed_work_sync+0x15/0x15
[  240.060947]  [<ffffffff81060039>] kthread+0xf6/0xfe
[  240.060950]  [<ffffffff8105ff43>] ? kthread_create_on_node+0x1ac/0x1ac
[  240.060953]  [<ffffffff8189b892>] ret_from_fork+0x42/0x70
[  240.060955]  [<ffffffff8105ff43>] ? kthread_create_on_node+0x1ac/0x1ac
[  240.060957] INFO: lockdep is turned off.
[  240.060966] INFO: task gem_reloc_vs_gp:4986 blocked for more than 120
seconds.

==Reproduce steps==
---------------------------- 
1.  ./gem_reloc_vs_gpu --run-subtest forked-faulting-reloc-thrashing-hang</pre>
        </div>
      </p>
      <hr>
      <span>You are receiving this mail because:</span>
      
      <ul>
          <li>You are the QA Contact for the bug.</li>
          <li>You are on the CC list for the bug.</li>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>