[Bug 65495] New: [GM45 Bisected]igt/ZZ_hangman Aborted and [drm:i915_reset] *ERROR* Failed to reset chip

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Jun 7 00:43:14 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=65495

          Priority: high
            Bug ID: 65495
                CC: xunx.fang at intel.com, yangweix.shui at intel.com
          Assignee: intel-gfx-bugs at lists.freedesktop.org
           Summary: [GM45 Bisected]igt/ZZ_hangman  Aborted and
                    [drm:i915_reset] *ERROR* Failed to reset chip
        QA Contact: intel-gfx-bugs at lists.freedesktop.org
          Severity: major
    Classification: Unclassified
                OS: Linux (All)
          Reporter: huax.lu at intel.com
          Hardware: All
            Status: NEW
           Version: unspecified
         Component: DRM/Intel
           Product: DRI

System Environment:
--------------------------
Arch:           x86_64
Platform:       GM45
Kernel: drm-intel-next-queued cb8b2a30b32cde5ac9053d399d084c487598976a

Bug detailed description:
-------------------------
It happens on GM45 with drm-intel-next-queued kernel, It works well on
drm-intel-fixes kernel. Many igt cases will fail after run ZZ_hangman. It
caused by igt commit.
Bisect shows: 1cb4f90946289457c3b92773f2ce96b0b03e4a22 is the first bad commit
commit 1cb4f90946289457c3b92773f2ce96b0b03e4a22
Author:     Imre Deak <imre.deak at intel.com>
AuthorDate: Tue May 28 17:35:32 2013 +0300
Commit:     Daniel Vetter <daniel.vetter at ffwll.ch>
CommitDate: Tue May 28 18:32:32 2013 +0200

    tests/lib: make sure the GPU is idle at test start and exit

    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=64270

    v2:
    - Make sure also that the GPU is idle at start and error exit of any
      test using drm_open_any(). (Daniel)
    v3:
    - actually call gem_quiescent_gpu() at exit

    Signed-off-by: Imre Deak <imre.deak at intel.com>
    Signed-off-by: Daniel Vetter <daniel.vetter at ffwll.ch>

output:
rings stopped
gem_set_domain:467 failed, ret=-1, errno=5
./ZZ_hangman: line 30:  4247 Aborted                 (core dumped)
$SOURCE_DIR/gem_exec_big
gpu hang correctly dectected

dmesg:
[  120.374100] [drm:i915_ring_stop_set], Stopping rings 0x0000000f
[  120.376368] [drm:i915_driver_open],
[  120.376383] [drm:intel_crtc_set_config], [CRTC:3] [FB:37] #connectors=1 (x
y) (0 0)
[  120.376389] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  120.376392] [drm:intel_crtc_set_config], [CRTC:4] [NOFB]
[  120.376394] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  120.376400] [drm:i915_driver_open],
[  126.708148] [drm:i915_hangcheck_elapsed] *ERROR* render ring: stuck on addr
0xbac8
[  126.708224] [drm] capturing error event; look for more information in
/sys/kernel/debug/dri/0/i915_error_state
[  126.711675] [drm:i915_error_work_func], resetting chip
[  126.711720] [drm] Simulated gpu hang, resetting stop_rings
[  126.711765] [drm:i915_gem_context_init], Disabling HW Contexts; old hardware
[  126.711768] [drm:gm45_get_vblank_counter], trying to get vblank count for
disabled pipe B
[  126.711825] [drm:i9xx_update_plane], Writing base 00046000 00000000 0 0 5120
[  132.704157] [drm:i915_hangcheck_elapsed] *ERROR* bsd ring: stuck on addr
0x28
[  132.704310] [drm:i915_error_work_func], resetting chip
[  132.704370] [drm:i915_gem_context_init], Disabling HW Contexts; old hardware
[  132.704373] [drm:gm45_get_vblank_counter], trying to get vblank count for
disabled pipe B
[  132.704417] [drm:i9xx_update_plane], Writing base 00046000 00000000 0 0 5120
[  133.198449] [drm:intel_crtc_set_config], [CRTC:3] [FB:37] #connectors=1 (x
y) (0 0)
[  133.198455] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  133.198458] [drm:intel_crtc_set_config], [CRTC:4] [NOFB]
[  133.198460] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  133.208290] [drm:i915_driver_open],
[  133.208298] [drm:intel_crtc_set_config], [CRTC:3] [FB:37] #connectors=1 (x
y) (0 0)
[  133.208302] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  133.208304] [drm:intel_crtc_set_config], [CRTC:4] [NOFB]
[  133.208306] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  133.208311] [drm:i915_driver_open],
[  135.704156] [drm:i915_hangcheck_elapsed] *ERROR* bsd ring: stuck on addr
0x28
[  135.704837] [drm:i915_error_work_func], resetting chip
[  135.704878] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[  135.704921] [drm:i915_reset] *ERROR* Failed to reset chip.
[  135.704958] [drm:i9xx_update_plane], Writing base 00046000 00000000 0 0 5120
[  145.704169] [drm:i915_gem_wait_for_error] *ERROR* Timed out waiting for the
gpu reset to complete
[  146.090217] [drm:intel_crtc_set_config], [CRTC:3] [FB:37] #connectors=1 (x
y) (0 0)
[  146.090226] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  146.090229] [drm:intel_crtc_set_config], [CRTC:4] [NOFB]
[  146.090231] [drm:intel_modeset_stage_output_state], [CONNECTOR:5:LVDS-1] to
[CRTC:3]
[  146.486347] [drm:i915_error_state_write], Resetting error state

Reproduce steps:
----------------
1../ZZ_hangman

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20130607/12c9de05/attachment.html>


More information about the intel-gfx-bugs mailing list