[Bug 59986] New: GPU hung when starting to play video

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Jan 28 11:52:47 PST 2013


https://bugs.freedesktop.org/show_bug.cgi?id=59986

          Priority: medium
            Bug ID: 59986
          Assignee: intel-gfx-bugs at lists.freedesktop.org
           Summary: GPU hung when starting to play video
        QA Contact: intel-gfx-bugs at lists.freedesktop.org
          Severity: normal
    Classification: Unclassified
                OS: Linux (All)
          Reporter: jkt at gentoo.org
          Hardware: x86-64 (AMD64)
            Status: NEW
           Version: XOrg CVS
         Component: DRM/Intel
           Product: DRI

- Thinkpad T420s 4171-6SG, "Intel Corporation 2nd Generation Core Processor
Family Integrated Graphics Controller (rev 09)", amd64 Gentoo Linux
- kernel 3.8.0-rc3 (pcie_aspm=force intel_iommu=off i915.i915_enable_rc6=1
i915.i915_enable_fbc=1 i915.lvds_downclock=1 drm.vblankoffdelay=1)
-  xf86-video-intel 47caffc50b5cdd288ad868fa9a697f0d4e2d28dc
- libdrm 2.4.40
- mesa 9.0.1
- kwin 4.9.5
- mplayer2-2.0_p20121128
- error state at http://dev.gentoo.org/~jkt/tmp/2013-01-28-i915_error_state

I started to play a video via mplayer2 (youtube's 92D5LgLnMxU.mp4 downloaded
via youtube-dl); previously, I've watched a couple of other videos without any
trouble. The display got stuck after a few frames; I tried to stop the video,
but it took a couple of seconds. This appeared at the terminal:

jkt at svist ~ $ mplayer2 92D5LgLnMxU.mp4 
MPlayer2 2.0-691-g661a350 (C) 2000-2012 MPlayer Team
MMX2 supported but disabled
Cannot open file '/home/jkt/.mplayer/input.conf': No such file or directory
Failed to open /home/jkt/.mplayer/input.conf.
Can't open joystick device /dev/input/js0: No such file or directory
Can't init input joystick

Playing 92D5LgLnMxU.mp4.
Detected file format: QuickTime/MPEG-4/Motion JPEG 2000 format (libavformat)
[lavf] stream 0: video (h264), -vid 0
[lavf] stream 1: audio (aac), -aid 0, -alang und
Clip info:
 major_brand: mp42
 minor_version: 0
 compatible_brands: isommp42
 creation_time: 2011-12-22 12:13:54
Load subtitles in .
Failed to open VDPAU backend libvdpau_nvidia.so: cannot open shared object
file: No such file or directory
[vdpau] Error when calling vdp_device_create_x11: 1
[ass] auto-open
Selected video codec: H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 [libavcodec]
Selected audio codec: Advanced Audio Coding [libavcodec]
AUDIO: 44100 Hz, 2 ch, s16le, 152.0 kbit/10.77% (ratio: 19000->176400)
AO: [alsa] 48000Hz 2ch s16le (2 bytes per sample)
Starting playback...
VIDEO:  1920x1080  25.000 fps  4223.3 kbps (527.9 kB/s)
VO: [xv] 1920x1080 => 1920x1080 Planar YV12 
Colorspace details not fully supported by selected vo.
A:   4.5 V:   4.4 A-V:  0.120 ct:  0.000   0/  0  2% 128%  1.0% 0 0 
No bind found for key 'MOUSE_BTN3_DBL'.
No bind found for key 'MOUSE_BTN3_DBL'.
No bind found for key 'MOUSE_BTN3_DBL'.
No bind found for key 'MOUSE_BTN3_DBL'.                                         
No bind found for key 'MOUSE_BTN3_DBL'.                                         
A:   4.7 V:   4.4 A-V:  0.251 ct:  0.004   0/  0  2% 127%  1.0% 1 0 
X11 error: BadAlloc (insufficient resources for operation)

This appeared in the dmesg:

[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[drm] capturing error event; look for more information in
/debug/dri/0/i915_error_state
[drm:kick_ring] *ERROR* Kicking stuck wait on render ring

When trying to obtain the error state from debugfs, I got "out of memory" even
though `free` said that I had >500MB free (~1.7GB not counting buffers/cache):

cp: page allocation failure: order:9, mode:0x1040d0
Pid: 22036, comm: cp Not tainted 3.8.0-rc3 #1
Call Trace:
 [<ffffffff810b6a9e>] warn_alloc_failed+0x108/0x11d
 [<ffffffff810b8507>] ? drain_local_pages+0x11/0x13
 [<ffffffff810b99b7>] __alloc_pages_nodemask+0x769/0x7fc
 [<ffffffff810b9ac5>] __get_free_pages+0x12/0x41
 [<ffffffff810e6013>] kmalloc_order_trace+0x2e/0x83
 [<ffffffff810e7798>] __kmalloc+0x36/0x12e
 [<ffffffff810b89db>] ? __free_memcg_kmem_pages+0x9/0xb
 [<ffffffff81109c9c>] seq_read+0x1e9/0x353
 [<ffffffff810ef140>] vfs_read+0xad/0x129
 [<ffffffff810ef20b>] sys_read+0x4f/0x79
 [<ffffffff8149a752>] system_call_fastpath+0x16/0x1b
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
CPU    1: hi:    0, btch:   1 usd:   0
CPU    2: hi:    0, btch:   1 usd:   0
CPU    3: hi:    0, btch:   1 usd:   0
DMA32 per-cpu:
CPU    0: hi:  186, btch:  31 usd:   0
CPU    1: hi:  186, btch:  31 usd:   0
CPU    2: hi:  186, btch:  31 usd:   0
CPU    3: hi:  186, btch:  31 usd:   0
Normal per-cpu:
CPU    0: hi:  186, btch:  31 usd:   0
CPU    1: hi:  186, btch:  31 usd:   0
CPU    2: hi:  186, btch:  31 usd:   0
CPU    3: hi:  186, btch:  31 usd:   0
active_anon:552924 inactive_anon:67302 isolated_anon:0
 active_file:58757 inactive_file:123989 isolated_file:0
 unevictable:1 dirty:387 writeback:0 unstable:0
 free:129169 slab_reclaimable:11537 slab_unreclaimable:8605
 mapped:108589 shmem:125321 pagetables:8688 bounce:0
 free_cma:0
DMA free:15900kB min:524kB low:652kB high:784kB active_anon:0kB
inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB
isolated(anon):0kB isolated(file):0kB present:15644kB managed:15900kB
mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB
slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB
free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
lowmem_reserve[]: 0 3422 3900 3900
DMA32 free:478140kB min:118124kB low:147652kB high:177184kB
active_anon:1909096kB inactive_anon:240536kB active_file:234424kB
inactive_file:495032kB unevictable:0kB isolated(anon):0kB isolated(file):0kB
present:3504436kB managed:3482280kB mlocked:0kB dirty:1456kB writeback:0kB
mapped:402060kB shmem:416912kB slab_reclaimable:26848kB
slab_unreclaimable:17508kB kernel_stack:1184kB pagetables:23240kB unstable:0kB
bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 478 478
Normal free:22636kB min:16512kB low:20640kB high:24768kB active_anon:302600kB
inactive_anon:28672kB active_file:604kB inactive_file:924kB unevictable:4kB
isolated(anon):0kB isolated(file):0kB present:489888kB managed:424996kB
mlocked:4kB dirty:92kB writeback:0kB mapped:32296kB shmem:84372kB
slab_reclaimable:19300kB slab_unreclaimable:16912kB kernel_stack:1808kB
pagetables:11512kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB
pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 0 0
DMA: 1*4kB (U) 1*8kB (U) 1*16kB (U) 0*32kB 2*64kB (U) 1*128kB (U) 1*256kB (U)
0*512kB 1*1024kB (U) 1*2048kB (U) 3*4096kB (MR) = 15900kB
DMA32: 4549*4kB (UEM) 4145*8kB (UEM) 4811*16kB (UEM) 3143*32kB (UEM) 1581*64kB
(UEM) 587*128kB (UEM) 212*256kB (UEM) 29*512kB (UEM) 4*1024kB (UM) 0*2048kB
0*4096kB = 478444kB
Normal: 953*4kB (UEM) 848*8kB (UEM) 423*16kB (UEM) 75*32kB (UEM) 0*64kB 6*128kB
(E) 2*256kB (E) 0*512kB 0*1024kB 1*2048kB (R) 0*4096kB = 23092kB
308067 total pagecache pages
0 pages in swap cache
Swap cache stats: add 0, delete 0, find 0/0
Free swap  = 0kB
Total swap = 0kB
1041904 pages RAM
60024 pages reserved
1344329 pages shared
667265 pages non-shared
cp: page allocation failure: order:9, mode:0x1040d0
Pid: 22037, comm: cp Not tainted 3.8.0-rc3 #1
Call Trace:
 [<ffffffff810b6a9e>] warn_alloc_failed+0x108/0x11d
 [<ffffffff810b8507>] ? drain_local_pages+0x11/0x13
 [<ffffffff810b99b7>] __alloc_pages_nodemask+0x769/0x7fc
 [<ffffffff810b9ac5>] __get_free_pages+0x12/0x41
 [<ffffffff810e6013>] kmalloc_order_trace+0x2e/0x83
 [<ffffffff810e7798>] __kmalloc+0x36/0x12e
 [<ffffffff810b89db>] ? __free_memcg_kmem_pages+0x9/0xb
 [<ffffffff81109c9c>] seq_read+0x1e9/0x353
 [<ffffffff810ef140>] vfs_read+0xad/0x129
 [<ffffffff810ef20b>] sys_read+0x4f/0x79
 [<ffffffff8149a752>] system_call_fastpath+0x16/0x1b
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
CPU    1: hi:    0, btch:   1 usd:   0
CPU    2: hi:    0, btch:   1 usd:   0
CPU    3: hi:    0, btch:   1 usd:   0
DMA32 per-cpu:
CPU    0: hi:  186, btch:  31 usd:   0
CPU    1: hi:  186, btch:  31 usd:   0
CPU    2: hi:  186, btch:  31 usd:   0
CPU    3: hi:  186, btch:  31 usd:   0
Normal per-cpu:
CPU    0: hi:  186, btch:  31 usd:   0
CPU    1: hi:  186, btch:  31 usd:   0
CPU    2: hi:  186, btch:  31 usd:   0
CPU    3: hi:  186, btch:  31 usd:   0
active_anon:560184 inactive_anon:60102 isolated_anon:0
 active_file:58765 inactive_file:123808 isolated_file:0
 unevictable:1 dirty:389 writeback:0 unstable:0
 free:130066 slab_reclaimable:11537 slab_unreclaimable:8518
 mapped:108590 shmem:125321 pagetables:8687 bounce:0
 free_cma:0
DMA free:15900kB min:524kB low:652kB high:784kB active_anon:0kB
inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB
isolated(anon):0kB isolated(file):0kB present:15644kB managed:15900kB
mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB
slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB
free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
lowmem_reserve[]: 0 3422 3900 3900
DMA32 free:480688kB min:118124kB low:147652kB high:177184kB
active_anon:1933040kB inactive_anon:216468kB active_file:234432kB
inactive_file:495152kB unevictable:0kB isolated(anon):0kB isolated(file):0kB
present:3504436kB managed:3482280kB mlocked:0kB dirty:1464kB writeback:0kB
mapped:402064kB shmem:416792kB slab_reclaimable:26848kB
slab_unreclaimable:17160kB kernel_stack:1184kB pagetables:23236kB unstable:0kB
bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 478 478
Normal free:23676kB min:16512kB low:20640kB high:24768kB active_anon:307696kB
inactive_anon:23940kB active_file:628kB inactive_file:80kB unevictable:4kB
isolated(anon):0kB isolated(file):0kB present:489888kB managed:424996kB
mlocked:4kB dirty:92kB writeback:0kB mapped:32296kB shmem:84492kB
slab_reclaimable:19300kB slab_unreclaimable:16912kB kernel_stack:1808kB
pagetables:11512kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB
pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 0 0
DMA: 1*4kB (U) 1*8kB (U) 1*16kB (U) 0*32kB 2*64kB (U) 1*128kB (U) 1*256kB (U)
0*512kB 1*1024kB (U) 1*2048kB (U) 3*4096kB (MR) = 15900kB
DMA32: 4897*4kB (UEM) 4247*8kB (UEM) 4823*16kB (UEM) 3147*32kB (UEM) 1581*64kB
(UEM) 587*128kB (UEM) 212*256kB (UEM) 29*512kB (UEM) 4*1024kB (UM) 0*2048kB
0*4096kB = 480972kB
Normal: 1174*4kB (UEM) 879*8kB (UEM) 431*16kB (UEM) 76*32kB (UM) 0*64kB 6*128kB
(UE) 2*256kB (E) 0*512kB 0*1024kB 1*2048kB (R) 0*4096kB = 24384kB
307706 total pagecache pages
0 pages in swap cache
Swap cache stats: add 0, delete 0, find 0/0
Free swap  = 0kB
Total swap = 0kB
1041904 pages RAM
60024 pages reserved
1324062 pages shared
683152 pages non-shared

However, killing opera helped and I got the error_state.

This is in the X's log:

[ 17072.505] [mi] EQ overflowing.  Additional events will be discarded until
existing events are processed.
[ 17072.505] 
[ 17072.505] Backtrace:
[ 17072.509] 0: /usr/bin/X (xorg_backtrace+0x4c) [0x7f6b9ed1048c]
[ 17072.509] 1: /usr/bin/X (mieqEnqueue+0x2b3) [0x7f6b9ecee7e3]
[ 17072.509] 2: /usr/bin/X (0x7f6b9eb61000+0x68f6e) [0x7f6b9ebc9f6e]
[ 17072.509] 3: /usr/bin/X (xf86PostMotionEventP+0x4f) [0x7f6b9ec0b2cf]
[ 17072.509] 4: /usr/lib64/xorg/modules/input/evdev_drv.so
(0x7f6b99b45000+0x4e5a) [0x7f6b99b49e5a]
[ 17072.509] 5: /usr/bin/X (0x7f6b9eb61000+0x97517) [0x7f6b9ebf8517]
[ 17072.509] 6: /usr/bin/X (0x7f6b9eb61000+0xc24ec) [0x7f6b9ec234ec]
[ 17072.509] 7: /lib64/libpthread.so.0 (0x7f6b9de7b000+0x10460)
[0x7f6b9de8b460]
[ 17072.509] 8: /lib64/libc.so.6 (ioctl+0x7) [0x7f6b9cdca567]
[ 17072.509] 9: /usr/lib64/libdrm.so.2 (drmIoctl+0x38) [0x7f6b9b32c368]
[ 17072.509] 10: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x1bd00) [0x7f6b9ae3bd00]
[ 17072.509] 11: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x1cdf7) [0x7f6b9ae3cdf7]
[ 17072.509] 12: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x451a1) [0x7f6b9ae651a1]
[ 17072.509] 13: /usr/bin/X (BlockHandler+0x60) [0x7f6b9ebb7ca0]
[ 17072.509] 14: /usr/bin/X (WaitForSomething+0x12a) [0x7f6b9ed0d26a]
[ 17072.509] 15: /usr/bin/X (0x7f6b9eb61000+0x52232) [0x7f6b9ebb3232]
[ 17072.509] 16: /usr/bin/X (0x7f6b9eb61000+0x3f09a) [0x7f6b9eba009a]
[ 17072.509] 17: /lib64/libc.so.6 (__libc_start_main+0xfd) [0x7f6b9cd094fd]
[ 17072.509] 18: /usr/bin/X (0x7f6b9eb61000+0x3ec19) [0x7f6b9eb9fc19]
[ 17072.509] 
[ 17072.509] [mi] These backtraces from mieqEnqueue may point to a culprit
higher up the stack.
[ 17072.509] [mi] mieq is *NOT* the cause.  It is a victim.
[ 17072.905] [mi] EQ overflow continuing.  100 events have been dropped.
[ 17072.905] 
[ 17072.905] Backtrace:
[ 17072.905] 0: /usr/bin/X (xorg_backtrace+0x4c) [0x7f6b9ed1048c]
[ 17072.905] 1: /usr/bin/X (0x7f6b9eb61000+0x68f6e) [0x7f6b9ebc9f6e]
[ 17072.905] 2: /usr/bin/X (xf86PostMotionEventP+0x4f) [0x7f6b9ec0b2cf]
[ 17072.905] 3: /usr/lib64/xorg/modules/input/evdev_drv.so
(0x7f6b99b45000+0x4e5a) [0x7f6b99b49e5a]
[ 17072.905] 4: /usr/bin/X (0x7f6b9eb61000+0x97517) [0x7f6b9ebf8517]
[ 17072.905] 5: /usr/bin/X (0x7f6b9eb61000+0xc24ec) [0x7f6b9ec234ec]
[ 17072.905] 6: /lib64/libpthread.so.0 (0x7f6b9de7b000+0x10460)
[0x7f6b9de8b460]
[ 17072.905] 7: /lib64/libc.so.6 (ioctl+0x7) [0x7f6b9cdca567]
[ 17072.905] 8: /usr/lib64/libdrm.so.2 (drmIoctl+0x38) [0x7f6b9b32c368]
[ 17072.905] 9: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x1bd00) [0x7f6b9ae3bd00]
[ 17072.905] 10: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x1cdf7) [0x7f6b9ae3cdf7]
[ 17072.905] 11: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x451a1) [0x7f6b9ae651a1]
[ 17072.905] 12: /usr/bin/X (BlockHandler+0x60) [0x7f6b9ebb7ca0]
[ 17072.905] 13: /usr/bin/X (WaitForSomething+0x12a) [0x7f6b9ed0d26a]
[ 17072.905] 14: /usr/bin/X (0x7f6b9eb61000+0x52232) [0x7f6b9ebb3232]
[ 17072.905] 15: /usr/bin/X (0x7f6b9eb61000+0x3f09a) [0x7f6b9eba009a]
[ 17072.905] 16: /lib64/libc.so.6 (__libc_start_main+0xfd) [0x7f6b9cd094fd]
[ 17072.905] 17: /usr/bin/X (0x7f6b9eb61000+0x3ec19) [0x7f6b9eb9fc19]
[ 17072.905] 
[ 17077.323] [mi] EQ overflow continuing.  200 events have been dropped.
[ 17077.323] 
[ 17077.323] Backtrace:
[ 17077.323] 0: /usr/bin/X (xorg_backtrace+0x4c) [0x7f6b9ed1048c]
[ 17077.323] 1: /usr/bin/X (0x7f6b9eb61000+0x68f6e) [0x7f6b9ebc9f6e]
[ 17077.323] 2: /usr/bin/X (xf86PostKeyboardEvent+0x50) [0x7f6b9ec0bd60]
[ 17077.323] 3: /usr/lib64/xorg/modules/input/evdev_drv.so
(0x7f6b99b45000+0x4eb0) [0x7f6b99b49eb0]
[ 17077.323] 4: /usr/bin/X (0x7f6b9eb61000+0x97517) [0x7f6b9ebf8517]
[ 17077.323] 5: /usr/bin/X (0x7f6b9eb61000+0xc24ec) [0x7f6b9ec234ec]
[ 17077.323] 6: /lib64/libpthread.so.0 (0x7f6b9de7b000+0x10460)
[0x7f6b9de8b460]
[ 17077.323] 7: /lib64/libc.so.6 (ioctl+0x7) [0x7f6b9cdca567]
[ 17077.323] 8: /usr/lib64/libdrm.so.2 (drmIoctl+0x38) [0x7f6b9b32c368]
[ 17077.323] 9: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x1bd00) [0x7f6b9ae3bd00]
[ 17077.323] 10: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x1cdf7) [0x7f6b9ae3cdf7]
[ 17077.323] 11: /usr/lib64/xorg/modules/drivers/intel_drv.so
(0x7f6b9ae20000+0x451a1) [0x7f6b9ae651a1]
[ 17077.323] 12: /usr/bin/X (BlockHandler+0x60) [0x7f6b9ebb7ca0]
[ 17077.323] 13: /usr/bin/X (WaitForSomething+0x12a) [0x7f6b9ed0d26a]
[ 17077.323] 14: /usr/bin/X (0x7f6b9eb61000+0x52232) [0x7f6b9ebb3232]
[ 17077.323] 15: /usr/bin/X (0x7f6b9eb61000+0x3f09a) [0x7f6b9eba009a]
[ 17077.323] 16: /lib64/libc.so.6 (__libc_start_main+0xfd) [0x7f6b9cd094fd]
[ 17077.323] 17: /usr/bin/X (0x7f6b9eb61000+0x3ec19) [0x7f6b9eb9fc19]
[ 17077.323] 
[ 17077.717] (EE) intel(0): Detected a hung GPU, disabling acceleration.
[ 17077.717] (EE) intel(0): When reporting this, please include
i915_error_state from debugfs and the full dmesg.
[ 17077.717] [mi] Increasing EQ size to 512 to prevent dropped events.
[ 17077.717] [mi] EQ processing has resumed after 202 dropped events.
[ 17077.717] [mi] This may be caused my a misbehaving driver monopolizing the
server's resources.

I've used gdb's disas function to convert the offsets into symbols. 0x451a1 is
sna_accel_block_handler, the 0x1cdf7 is kgem_throttle and 0x1bd00 the
__kgem_throttle.

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20130128/9e3b4f0c/attachment-0001.html>


More information about the intel-gfx-bugs mailing list