[Bug 98748] New: [KBL] hangcheck /reset fails, keeps GPU at max

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Nov 16 16:23:47 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=98748

            Bug ID: 98748
           Summary: [KBL] hangcheck /reset fails, keeps GPU  at max
           Product: DRI
           Version: DRI git
          Hardware: Other
                OS: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: DRM/Intel
          Assignee: intel-gfx-bugs at lists.freedesktop.org
          Reporter: eero.t.tamminen at intel.com
        QA Contact: intel-gfx-bugs at lists.freedesktop.org
                CC: intel-gfx-bugs at lists.freedesktop.org
     i915 platform: KBL
     i915 features: GPU hang

Created attachment 128014
  --> https://bugs.freedesktop.org/attachment.cgi?id=128014&action=edit
Dmesg

Test setup:
- KBL-U QL9J (haven't seen this yet on other platforms)
- Fairly up to date Ubuntu 16.04 with DRI3 & Unity desktop
- Latest kernel and rest of 3D stack within a few weeks
kernel git://anongit.freedesktop.org/drm-intel at
04145fe15cf8c81c221e62fc9d65d93053f9bd1a 2016-11-15_14-49-57

Test-case:
- Boot
- Run Unigine, GLBenchmark 2.7, GfxBench 4.0, SynMark 7.0 benchmarks several
times

Expected outcome:
- Everything works fine

Actual outcome:
- After SynMark CSDof (spilling compute shader test), rest of tests fail to:
    intel_do_flush_locked failed: Input/output error
- After 3D tests have been stopped and few minutes have been waited, device
idle power usage is still very high (3x normal)

Logs show that when device is idling afterwards:
- Package & cores are in lower power states as expected
- GPU frequency is still at max (allowed by TDP), with 0% in RC6*
- compiz is 100% in (GPU?) IOWAIT
- Unlike in normal situations, powertop shows:
-------------------
Usage;Wakeups/s;GPU ops/s;Disk IO/s;GFX Wakeups/s;Category;Description
 77,9 ms/s;;;;;kWork;i915_hangcheck_elapsed
-------------------

Same issue happens with yesterday night version of X server, Intel DDX, Mesa
(which should fix one issue with spilling) and few week older versions of them.

Dmesg attached. Earlier GEN bug 92774 seems to have had similar issue.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20161116/ccc186db/attachment.html>


More information about the intel-gfx-bugs mailing list