[Bug 61477] [965g] batch corruption, clflush?

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Apr 26 13:18:45 PDT 2014


https://bugs.freedesktop.org/show_bug.cgi?id=61477

--- Comment #51 from Norman Yarvin <yarvin at yarchive.net> ---
Well, it took over two months for it to crash this time, but it did.  From the
system log:

Apr 26 15:36:21 muttonhead kernel: [drm] stuck on render ring
Apr 26 15:36:21 muttonhead kernel: [drm] GPU crash dump saved to
/sys/class/drm/card0/error
Apr 26 15:36:21 muttonhead kernel: [drm] GPU hangs can indicate a bug anywhere
in the entire gfx stack, including userspace.
Apr 26 15:36:21 muttonhead kernel: [drm] Please file a _new_ bug report on
bugs.freedesktop.org against DRI -> DRM/Intel
Apr 26 15:36:21 muttonhead kernel: [drm] drm/i915 developers can then reassign
to the right component if it's not a kernel issue.
Apr 26 15:36:21 muttonhead kernel: [drm] The gpu crash dump is required to
analyze gpu hangs, so please always attach it.
Apr 26 15:36:21 muttonhead kernel: [drm:i915_set_reset_status] *ERROR* render
ring hung inside bo (0x2380000 ctx 0) at 0x2380310
Apr 26 15:36:21 muttonhead kernel: [drm:i915_reset] *ERROR* Failed to reset
chip.

>From Xorg.0.log (I can post the whole thing, but these are what look like the
interesting bits):

[    20.562] (II) intel(0): SNA compiled from 2.99.910-49-g0b92b12
[    20.562] (II) intel(0): SNA compiled with assertions enabled
[    20.565] (--) intel(0): Integrated Graphics Chipset: Intel(R) 965G

....

[2205130.336] (EE) intel(0): Detected a hung GPU, disabling acceleration.
[2205130.336] batch[1/0]: 8170 8170 65344, nreloc=16, nexec=12, nfence=0,
aperture=3067, fenced=0, high=98304,131072: errno=22
[2205130.336] exec[0] = handle:216, presumed offset: 6189000, size: 7987200,
tiling 1, fenced 0, snooped 0, deleted 0
[2205130.336] exec[1] = handle:23, presumed offset: 2203000, size: 90112,
tiling 0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[2] = handle:154, presumed offset: 81fe000, size: 4096,
tiling 0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[3] = handle:109, presumed offset: af53000, size: 4, tiling
0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[4] = handle:25, presumed offset: 1dfb000, size: 4194304,
tiling 1, fenced 0, snooped 0, deleted 0
[2205130.336] exec[5] = handle:44, presumed offset: 83b8000, size: 4096, tiling
0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[6] = handle:90, presumed offset: 8a0b000, size: 4096, tiling
0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[7] = handle:45, presumed offset: e905000, size: 4096, tiling
0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[8] = handle:94, presumed offset: 8a0d000, size: 4096, tiling
0, fenced 0, snooped 0, deleted 0
[2205130.336] exec[9] = handle:210, presumed offset: 8225000, size: 4096,
tiling 0, fenced 0, snooped 0, deleted 0
[2205130.337] exec[10] = handle:53, presumed offset: 8470000, size: 262144,
tiling 0, fenced 0, snooped 0, deleted 0
[2205130.337] exec[11] = handle:73, presumed offset: 0, size: 36864, tiling 0,
fenced 0, snooped 0, deleted 0
[2205130.337] reloc[0] = pos:16, target:0, delta:0, read:2, write:2,
offset:6189000
[2205130.337] reloc[1] = pos:40, target:0, delta:0, read:2, write:2,
offset:6189000
[2205130.337] reloc[2] = pos:64, target:0, delta:0, read:2, write:2,
offset:6189000
[2205130.337] reloc[3] = pos:140, target:1, delta:1, read:10, write:0,
offset:2203000
[2205130.337] reloc[4] = pos:144, target:11, delta:-225279, read:10, write:0,
offset:0
[2205130.337] reloc[5] = pos:36772, target:0, delta:0, read:2, write:2,
offset:6189000
[2205130.337] reloc[6] = pos:36740, target:2, delta:0, read:4, write:0,
offset:81fe000
[2205130.337] reloc[7] = pos:36676, target:3, delta:80, read:4, write:0,
offset:af53000
[2205130.337] reloc[8] = pos:36644, target:4, delta:0, read:4, write:0,
offset:1dfb000
[2205130.337] reloc[9] = pos:36580, target:5, delta:0, read:4, write:0,
offset:83b8000
[2205130.337] reloc[10] = pos:36420, target:6, delta:0, read:4, write:0,
offset:8a0b000
[2205130.337] reloc[11] = pos:36324, target:7, delta:0, read:4, write:0,
offset:e905000
[2205130.337] reloc[12] = pos:36228, target:8, delta:0, read:4, write:0,
offset:8a0d000
[2205130.337] reloc[13] = pos:36132, target:9, delta:0, read:4, write:0,
offset:8225000
[2205130.337] reloc[14] = pos:288, target:10, delta:0, read:20, write:0,
offset:8470000
[2205130.337] reloc[15] = pos:416, target:10, delta:0, read:20, write:0,
offset:8470000
[2205130.337] Aperture size 536870912, available 513679360

(and there the file ends)

That second excerpt was also printed on stdout or stderr, but other than that,
there were no assertions triggered.  I'll attach the i915_error_state file in
the next comment.

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20140426/3e70a84d/attachment.html>


More information about the intel-gfx-bugs mailing list