[Bug 57136] [GM45 regression] GPU hang during disk io

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Jan 5 11:01:10 PST 2013


https://bugs.freedesktop.org/show_bug.cgi?id=57136

--- Comment #23 from Tom London <selinux at gmail.com> ---
Updated to xorg-x11-drv-intel-2.20.17-1.fc19.x86_64, reran my "disk load" test
("cat bigfiles >/dev/null"), and waited.

Within about 2 minutes gdm/Xorg hard crashed, the screen was black, and the
system was unresponsive to the usual keyboard entries (i.e., ctrl-alt-F2,
ctrl-alt-bksp, ctrl-alt-delete).

I did not get the "gdm Ooops something has gone wrong" screen.

I had to hard power reset the system.

On rebooting, I see this in /var/log/messages.


Jan  5 10:27:59 tlondon kernel: [ 2017.404040] [drm:i915_hangcheck_hung]
*ERROR* Hangcheck timer elapsed... GPU hung
Jan  5 10:27:59 tlondon kernel: [ 2017.404047] [drm] capturing error event;
look for more information in /debug/dri/0/i915_error_state
Jan  5 10:28:05 tlondon kernel: [ 2023.424023] [drm:i915_hangcheck_hung]
*ERROR* Hangcheck timer elapsed... GPU hung
Jan  5 10:28:05 tlondon kernel: [ 2023.475044] [drm:init_ring_common] *ERROR*
render ring initialization failed ctl 0001f001 head 00003000 tail 00000000
start 00003000
Jan  5 10:28:06 tlondon kernel: [ 2025.140021] [drm:i915_hangcheck_hung]
*ERROR* Hangcheck timer elapsed... GPU hung
Jan  5 10:28:06 tlondon kernel: [ 2025.140106] [drm:i915_reset] *ERROR* GPU
hanging too fast, declaring wedged!
Jan  5 10:28:06 tlondon kernel: [ 2025.140108] [drm:i915_reset] *ERROR* Failed
to reset chip.
Jan  5 10:28:07 tlondon kernel: [ 2025.214077] ------------[ cut here
]------------
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] kernel BUG at
drivers/gpu/drm/i915/i915_gem.c:3476!
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] invalid opcode: 0000 [#1] SMP 
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] Modules linked in: fuse(F)
ip6table_filter(F) ip6_tables(F) ebtable_nat(F) ebtables(F) ipt_MASQUERADE(F)
iptable_nat(F) nf_nat_ipv4(F) nf_nat(F) nf_conntrack_ipv4(F) nf_defrag_ipv4(F)
xt_conntrack(F) nf_conntrack(F) xt_CHECKSUM(F) iptable_mangle(F) bridge(F)
stp(F) llc(F) lockd(F) sunrpc(F) snd_usb_audio(F) snd_hda_codec_conexant(F)
snd_usbmidi_lib(F) arc4(F) iwldvm(F) snd_hda_intel(F) snd_hda_codec(F)
uvcvideo(F) snd_hwdep(F) snd_rawmidi(F) snd_seq(F) snd_seq_device(F)
mac80211(F) videobuf2_vmalloc(F) videobuf2_memops(F) videobuf2_core(F)
videodev(F) snd_pcm(F) thinkpad_acpi(F) iwlwifi(F) snd_page_alloc(F) media(F)
snd_timer(F) snd(F) cfg80211(F) soundcore(F) e1000e(F) btusb(F) iTCO_wdt(F)
bluetooth(F) coretemp(F) iTCO_vendor_support(F) mei(F) tpm_tis(F) tpm(F)
lpc_ich(F) rfkill(F) mfd_core(F) i2c_i801(F) tpm_bios(F) microcode(F)
vhost_net(F) tun(F) macvtap(F) macvlan(F) kvm_intel(F) kvm(F) binfmt_misc(F)
uinput(F) i915(F) i2c_algo_bit(F) drm_kms_helper(F) drm(F) i2c_core(F) wmi(F)
video(F)
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] CPU 0 
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] Pid: 660, comm: Xorg Tainted: GF
           3.7.1-1.local2.fc19.x86_64 #1 LENOVO 74585FU/74585FU
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] RIP: 0010:[<ffffffffa009c847>] 
[<ffffffffa009c847>] i915_gem_object_unpin+0x47/0x50 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] RSP: 0018:ffff880134be7938 
EFLAGS: 00010246
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] RAX: ffff880130a78000 RBX:
ffff880130da3800 RCX: 0000000000000000
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] RDX: 0000000000000002 RSI:
0000000000070008 RDI: ffff8801262db400
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] RBP: ffff880134be7938 R08:
0000000000000030 R09: 0000000000000006
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] R10: 0000000000000000 R11:
0000000000000001 R12: ffff880130da0800
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] R13: ffff880130da0820 R14:
0000000000000000 R15: ffff880130da0800
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] FS:  00007fc5f1d5f940(0000)
GS:ffff88013bc00000(0000) knlGS:0000000000000000
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] CS:  0010 DS: 0000 ES: 0000 CR0:
000000008005003b
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] CR2: 00000000008054bc CR3:
0000000130822000 CR4: 00000000000007f0
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] Process Xorg (pid: 660,
threadinfo ffff880134be6000, task ffff880130964560)
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] Stack:
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  ffff880134be7948
ffffffffa00adf5e ffff880134be7978 ffffffffa00b17e6
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  ffff8801338497d8
ffff880130da3800 0000000000000001 ffff880130da0c50
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  ffff880134be7c08
ffffffffa00b43d2 ffff880100000001 000000008121ac18
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] Call Trace:
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa00adf5e>]
intel_unpin_fb_obj+0x3e/0x40 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa00b17e6>]
intel_crtc_disable+0x96/0x130 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa00b43d2>]
intel_set_mode+0x262/0xa50 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff8121d26c>] ?
ext4_dirty_inode+0x3c/0x60
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff8125b182>] ?
jbd2_journal_stop+0x1b2/0x2a0
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff81237dc6>] ?
__ext4_journal_stop+0x76/0xa0
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff8121badd>] ?
ext4_da_write_end+0x9d/0x350
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff812f1a31>] ?
vsnprintf+0x461/0x600
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff812f1c74>] ?
snprintf+0x34/0x40
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa00b4d11>] ?
intel_crtc_set_config+0x151/0x970 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa00b52d6>]
intel_crtc_set_config+0x716/0x970 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff81633af6>] ?
__schedule+0x3c6/0x7a0
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa0037286>]
drm_framebuffer_remove+0xc6/0x150 [drm]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa003ac75>]
drm_mode_rmfb+0xd5/0xe0 [drm]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa002a4a3>]
drm_ioctl+0x4d3/0x580 [drm]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff811d3402>] ?
send_to_group+0x182/0x250
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffffa003aba0>] ?
drm_mode_addfb2+0x6d0/0x6d0 [drm]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff811d372f>] ?
fsnotify+0x25f/0x340
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff811a6649>]
do_vfs_ioctl+0x99/0x580
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff8128b94a>] ?
inode_has_perm.isra.31.constprop.61+0x2a/0x30
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff8128cd17>] ?
file_has_perm+0x97/0xb0
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff811a6bc1>]
sys_ioctl+0x91/0xb0
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff810dc8cc>] ?
__audit_syscall_exit+0x3ec/0x450
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  [<ffffffff8163d9d9>]
system_call_fastpath+0x16/0x1b
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] Code: 00 74 2a 89 d0 83 e2 0f c0
e8 04 83 e8 01 83 e0 0f 89 c1 c1 e1 04 09 ca 84 c0 88 97 e9 00 00 00 75 07 80
a7 ea 00 00 00 fb 5d c3 <0f> 0b 0f 0b 0f 1f 44 00 00 66 66 66 66 90 55 48 89 e5
41 57 41 
Jan  5 10:28:07 tlondon kernel: [ 2025.215017] RIP  [<ffffffffa009c847>]
i915_gem_object_unpin+0x47/0x50 [i915]
Jan  5 10:28:07 tlondon kernel: [ 2025.215017]  RSP <ffff880134be7938>

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20130105/b4df13a1/attachment-0001.html>


More information about the intel-gfx-bugs mailing list