[Bug 111601] regression: deadlock-freeze due to kernel commit aa56a292ce623734ddd30f52d73f527d1f3529b5
bugzilla-daemon at freedesktop.org
Tue Sep 10 14:48:21 UTC 2019
https://bugs.freedesktop.org/show_bug.cgi?id=111601
--- Comment #11 from howaboutsynergy <howaboutsynergy at pm.me> ---
(In reply to Leho Kraav (:macmaN :lkraav) from comment #10)
Ha! That's very interesting! I can actually ssh into it while it's frozen, just
like you.
I had assumed it was completely frozen (even though sysrq still worked): during
`stress` the CPU and fans run at high speed, but when the freeze happens the
fans go low/silent/off and not even the mouse pointer moves in X.
So over ssh I can now see the hung tasks in dmesg, like this:
```
[ 2062.553078] glibc64:getnameinfo.c:559/getnameinfo[6929]:
sshd[6929](full:'sshd: user at pts/7') for user user(1000(eff:user(1000))) 2of2
successfully reverse-resolved requested IP address:
192.168.0.78 192.168.0.78
[ 2098.511916] INFO: task kworker/10:0H:63 blocked for more than 36 seconds.
[ 2098.511923] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2098.511926] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2098.511929] kworker/10:0H D 0 63 2 0x80004000
[ 2098.511944] Workqueue: events_highpri intel_atomic_cleanup_work
[ 2098.511948] Call Trace:
[ 2098.511960] ? __schedule+0x2c6/0x580
[ 2098.511966] schedule+0x34/0xa0
[ 2098.511972] schedule_preempt_disabled+0x5/0x10
[ 2098.511978] __mutex_lock.isra.0+0x25f/0x4b0
[ 2098.511986] ? pick_next_task_fair+0x1eb/0x700
[ 2098.511993] intel_cleanup_plane_fb+0x28/0x70
[ 2098.512000] drm_atomic_helper_cleanup_planes+0x4a/0x60
[ 2098.512005] intel_atomic_cleanup_work+0x17/0x80
[ 2098.512011] process_one_work+0x16b/0x2a0
[ 2098.512016] worker_thread+0x48/0x390
[ 2098.512023] kthread+0xee/0x130
[ 2098.512028] ? process_one_work+0x2a0/0x2a0
[ 2098.512034] ? kthread_park+0x70/0x70
[ 2098.512038] ret_from_fork+0x1f/0x30
[ 2098.512076] INFO: task kworker/u24:1:5567 blocked for more than 36 seconds.
[ 2098.512079] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2098.512081] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2098.512083] kworker/u24:1 D 0 5567 2 0x80004000
[ 2098.512093] Workqueue: i915 __i915_gem_free_work
[ 2098.512095] Call Trace:
[ 2098.512102] ? __schedule+0x2c6/0x580
[ 2098.512107] schedule+0x34/0xa0
[ 2098.512113] schedule_preempt_disabled+0x5/0x10
[ 2098.512119] __mutex_lock.isra.0+0x25f/0x4b0
[ 2098.512125] ? __pm_runtime_resume+0x53/0x70
[ 2098.512130] __i915_gem_free_objects+0x71/0x1d0
[ 2098.512135] __i915_gem_free_work+0x5f/0x90
[ 2098.512140] process_one_work+0x16b/0x2a0
[ 2098.512144] worker_thread+0x48/0x390
[ 2098.512151] kthread+0xee/0x130
[ 2098.512155] ? process_one_work+0x2a0/0x2a0
[ 2098.512160] ? kthread_park+0x70/0x70
[ 2098.512164] ret_from_fork+0x1f/0x30
[ 2098.512171] INFO: task stress:6620 blocked for more than 36 seconds.
[ 2098.512174] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2098.512176] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2098.512178] stress D 0 6620 6618 0x00000004
[ 2098.512183] Call Trace:
[ 2098.512189] ? __schedule+0x2c6/0x580
[ 2098.512194] schedule+0x34/0xa0
[ 2098.512199] io_schedule+0xd/0x40
[ 2098.512206] __lock_page+0x13a/0x230
[ 2098.512212] ? file_fdatawait_range+0x20/0x20
[ 2098.512217] set_page_dirty_lock+0x44/0x50
[ 2098.512224] i915_gem_userptr_put_pages+0x135/0x180
[ 2098.512229] __i915_gem_object_put_pages+0x50/0x90
[ 2098.512235] userptr_mn_invalidate_range_start+0x17f/0x210
[ 2098.512243] __mmu_notifier_invalidate_range_start+0x4f/0x90
[ 2098.512249] try_to_unmap_one+0x718/0x820
[ 2098.512254] rmap_walk_file+0xe4/0x250
[ 2098.512260] try_to_unmap+0xc1/0xf0
[ 2098.512264] ? page_remove_rmap+0x2a0/0x2a0
[ 2098.512268] ? page_not_mapped+0x10/0x10
[ 2098.512272] ? page_get_anon_vma+0x70/0x70
[ 2098.512277] migrate_pages+0x7aa/0x9a0
[ 2098.512285] ? isolate_freepages_block+0x340/0x340
[ 2098.512291] ? move_freelist_tail+0xd0/0xd0
[ 2098.512295] compact_zone+0x656/0xa70
[ 2098.512300] ? set_next_entity+0x9e/0x1a0
[ 2098.512304] compact_zone_order+0xde/0x120
[ 2098.512309] try_to_compact_pages+0x187/0x240
[ 2098.512316] __alloc_pages_direct_compact+0x87/0x170
[ 2098.512323] __alloc_pages_slowpath+0x1f8/0xc10
[ 2098.512329] ? get_page_from_freelist+0xe80/0x1330
[ 2098.512337] __alloc_pages_nodemask+0x268/0x2b0
[ 2098.512342] alloc_pages_vma+0xc1/0x160
[ 2098.512348] do_huge_pmd_anonymous_page+0x271/0x610
[ 2098.512354] __handle_mm_fault+0xbfc/0x12f0
[ 2098.512360] handle_mm_fault+0xa9/0x1d0
[ 2098.512367] __do_page_fault+0x23a/0x480
[ 2098.512373] do_page_fault+0x1a/0x64
[ 2098.512378] page_fault+0x39/0x40
[ 2098.512383] RIP: 0033:0x6453194e9c10
[ 2098.512391] Code: Bad RIP value.
[ 2098.512394] RSP: 002b:00007ffdcdfbbbc0 EFLAGS: 00010206
[ 2098.512398] RAX: 00000006c5c79000 RBX: 00007a6cfe787010 RCX: 00007a73db2596fb
[ 2098.512401] RDX: 0000000000000001 RSI: 00000006dc9ad000 RDI: 00007a6cfe787000
[ 2098.512404] RBP: 00006453194eaa54 R08: 00007a6cfe787010 R09: 0000000000000000
[ 2098.512406] R10: 0000000000000022 R11: 00000006dc9ac000 R12: ffffffffffffffff
[ 2098.512409] R13: 0000000000000002 R14: 0000000000001000 R15: 00000006dc9ac000
[ 2117.676377] gpg-agent[1084]: handler 0x7c2858d5b700 for fd 10 started
[ 2117.741468] gpg-agent[1084]: handler 0x7c2858d5b700 for fd 10 terminated
[ 2135.375908] INFO: task kworker/10:0H:63 blocked for more than 73 seconds.
[ 2135.375914] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2135.375917] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2135.375920] kworker/10:0H D 0 63 2 0x80004000
[ 2135.375935] Workqueue: events_highpri intel_atomic_cleanup_work
[ 2135.375938] Call Trace:
[ 2135.375950] ? __schedule+0x2c6/0x580
[ 2135.375957] schedule+0x34/0xa0
[ 2135.375963] schedule_preempt_disabled+0x5/0x10
[ 2135.375969] __mutex_lock.isra.0+0x25f/0x4b0
[ 2135.375977] ? pick_next_task_fair+0x1eb/0x700
[ 2135.375984] intel_cleanup_plane_fb+0x28/0x70
[ 2135.375991] drm_atomic_helper_cleanup_planes+0x4a/0x60
[ 2135.375996] intel_atomic_cleanup_work+0x17/0x80
[ 2135.376002] process_one_work+0x16b/0x2a0
[ 2135.376007] worker_thread+0x48/0x390
[ 2135.376014] kthread+0xee/0x130
[ 2135.376019] ? process_one_work+0x2a0/0x2a0
[ 2135.376025] ? kthread_park+0x70/0x70
[ 2135.376029] ret_from_fork+0x1f/0x30
[ 2135.376061] INFO: task kworker/u24:1:5567 blocked for more than 73 seconds.
[ 2135.376064] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2135.376066] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2135.376069] kworker/u24:1 D 0 5567 2 0x80004000
[ 2135.376078] Workqueue: i915 __i915_gem_free_work
[ 2135.376080] Call Trace:
[ 2135.376087] ? __schedule+0x2c6/0x580
[ 2135.376093] schedule+0x34/0xa0
[ 2135.376098] schedule_preempt_disabled+0x5/0x10
[ 2135.376104] __mutex_lock.isra.0+0x25f/0x4b0
[ 2135.376110] ? __pm_runtime_resume+0x53/0x70
[ 2135.376114] __i915_gem_free_objects+0x71/0x1d0
[ 2135.376119] __i915_gem_free_work+0x5f/0x90
[ 2135.376124] process_one_work+0x16b/0x2a0
[ 2135.376129] worker_thread+0x48/0x390
[ 2135.376135] kthread+0xee/0x130
[ 2135.376139] ? process_one_work+0x2a0/0x2a0
[ 2135.376145] ? kthread_park+0x70/0x70
[ 2135.376149] ret_from_fork+0x1f/0x30
[ 2135.376155] INFO: task stress:6620 blocked for more than 73 seconds.
[ 2135.376158] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2135.376160] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2135.376163] stress D 0 6620 6618 0x00000004
[ 2135.376166] Call Trace:
[ 2135.376172] ? __schedule+0x2c6/0x580
[ 2135.376178] schedule+0x34/0xa0
[ 2135.376183] io_schedule+0xd/0x40
[ 2135.376190] __lock_page+0x13a/0x230
[ 2135.376197] ? file_fdatawait_range+0x20/0x20
[ 2135.376201] set_page_dirty_lock+0x44/0x50
[ 2135.376209] i915_gem_userptr_put_pages+0x135/0x180
[ 2135.376214] __i915_gem_object_put_pages+0x50/0x90
[ 2135.376220] userptr_mn_invalidate_range_start+0x17f/0x210
[ 2135.376228] __mmu_notifier_invalidate_range_start+0x4f/0x90
[ 2135.376234] try_to_unmap_one+0x718/0x820
[ 2135.376239] rmap_walk_file+0xe4/0x250
[ 2135.376244] try_to_unmap+0xc1/0xf0
[ 2135.376248] ? page_remove_rmap+0x2a0/0x2a0
[ 2135.376252] ? page_not_mapped+0x10/0x10
[ 2135.376257] ? page_get_anon_vma+0x70/0x70
[ 2135.376261] migrate_pages+0x7aa/0x9a0
[ 2135.376269] ? isolate_freepages_block+0x340/0x340
[ 2135.376275] ? move_freelist_tail+0xd0/0xd0
[ 2135.376279] compact_zone+0x656/0xa70
[ 2135.376284] ? set_next_entity+0x9e/0x1a0
[ 2135.376288] compact_zone_order+0xde/0x120
[ 2135.376293] try_to_compact_pages+0x187/0x240
[ 2135.376300] __alloc_pages_direct_compact+0x87/0x170
[ 2135.376307] __alloc_pages_slowpath+0x1f8/0xc10
[ 2135.376314] ? get_page_from_freelist+0xe80/0x1330
[ 2135.376321] __alloc_pages_nodemask+0x268/0x2b0
[ 2135.376327] alloc_pages_vma+0xc1/0x160
[ 2135.376332] do_huge_pmd_anonymous_page+0x271/0x610
[ 2135.376338] __handle_mm_fault+0xbfc/0x12f0
[ 2135.376344] handle_mm_fault+0xa9/0x1d0
[ 2135.376351] __do_page_fault+0x23a/0x480
[ 2135.376357] do_page_fault+0x1a/0x64
[ 2135.376362] page_fault+0x39/0x40
[ 2135.376367] RIP: 0033:0x6453194e9c10
[ 2135.376375] Code: Bad RIP value.
[ 2135.376379] RSP: 002b:00007ffdcdfbbbc0 EFLAGS: 00010206
[ 2135.376383] RAX: 00000006c5c79000 RBX: 00007a6cfe787010 RCX: 00007a73db2596fb
[ 2135.376386] RDX: 0000000000000001 RSI: 00000006dc9ad000 RDI: 00007a6cfe787000
[ 2135.376389] RBP: 00006453194eaa54 R08: 00007a6cfe787010 R09: 0000000000000000
[ 2135.376392] R10: 0000000000000022 R11: 00000006dc9ac000 R12: ffffffffffffffff
[ 2135.376395] R13: 0000000000000002 R14: 0000000000001000 R15: 00000006dc9ac000
[ 2172.239905] INFO: task kworker/10:0H:63 blocked for more than 110 seconds.
[ 2172.239912] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2172.239914] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2172.239918] kworker/10:0H D 0 63 2 0x80004000
[ 2172.239932] Workqueue: events_highpri intel_atomic_cleanup_work
[ 2172.239935] Call Trace:
[ 2172.239947] ? __schedule+0x2c6/0x580
[ 2172.239953] schedule+0x34/0xa0
[ 2172.239959] schedule_preempt_disabled+0x5/0x10
[ 2172.239965] __mutex_lock.isra.0+0x25f/0x4b0
[ 2172.239973] ? pick_next_task_fair+0x1eb/0x700
[ 2172.239980] intel_cleanup_plane_fb+0x28/0x70
[ 2172.239986] drm_atomic_helper_cleanup_planes+0x4a/0x60
[ 2172.239992] intel_atomic_cleanup_work+0x17/0x80
[ 2172.239997] process_one_work+0x16b/0x2a0
[ 2172.240003] worker_thread+0x48/0x390
[ 2172.240010] kthread+0xee/0x130
[ 2172.240014] ? process_one_work+0x2a0/0x2a0
[ 2172.240020] ? kthread_park+0x70/0x70
[ 2172.240024] ret_from_fork+0x1f/0x30
[ 2172.240054] INFO: task kworker/u24:1:5567 blocked for more than 110 seconds.
[ 2172.240057] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2172.240059] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2172.240062] kworker/u24:1 D 0 5567 2 0x80004000
[ 2172.240071] Workqueue: i915 __i915_gem_free_work
[ 2172.240073] Call Trace:
[ 2172.240080] ? __schedule+0x2c6/0x580
[ 2172.240085] schedule+0x34/0xa0
[ 2172.240091] schedule_preempt_disabled+0x5/0x10
[ 2172.240097] __mutex_lock.isra.0+0x25f/0x4b0
[ 2172.240102] ? __pm_runtime_resume+0x53/0x70
[ 2172.240107] __i915_gem_free_objects+0x71/0x1d0
[ 2172.240112] __i915_gem_free_work+0x5f/0x90
[ 2172.240117] process_one_work+0x16b/0x2a0
[ 2172.240122] worker_thread+0x48/0x390
[ 2172.240128] kthread+0xee/0x130
[ 2172.240132] ? process_one_work+0x2a0/0x2a0
[ 2172.240137] ? kthread_park+0x70/0x70
[ 2172.240141] ret_from_fork+0x1f/0x30
[ 2172.240148] INFO: task stress:6620 blocked for more than 110 seconds.
[ 2172.240151] Tainted: G U 5.3.0-rc8-gf74c2bb98776 #62
[ 2172.240153] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2172.240155] stress D 0 6620 6618 0x00000004
[ 2172.240159] Call Trace:
[ 2172.240165] ? __schedule+0x2c6/0x580
[ 2172.240170] schedule+0x34/0xa0
[ 2172.240176] io_schedule+0xd/0x40
[ 2172.240182] __lock_page+0x13a/0x230
[ 2172.240188] ? file_fdatawait_range+0x20/0x20
[ 2172.240193] set_page_dirty_lock+0x44/0x50
[ 2172.240200] i915_gem_userptr_put_pages+0x135/0x180
[ 2172.240206] __i915_gem_object_put_pages+0x50/0x90
[ 2172.240212] userptr_mn_invalidate_range_start+0x17f/0x210
[ 2172.240219] __mmu_notifier_invalidate_range_start+0x4f/0x90
[ 2172.240224] try_to_unmap_one+0x718/0x820
[ 2172.240229] rmap_walk_file+0xe4/0x250
[ 2172.240234] try_to_unmap+0xc1/0xf0
[ 2172.240239] ? page_remove_rmap+0x2a0/0x2a0
[ 2172.240243] ? page_not_mapped+0x10/0x10
[ 2172.240247] ? page_get_anon_vma+0x70/0x70
[ 2172.240251] migrate_pages+0x7aa/0x9a0
[ 2172.240259] ? isolate_freepages_block+0x340/0x340
[ 2172.240265] ? move_freelist_tail+0xd0/0xd0
[ 2172.240269] compact_zone+0x656/0xa70
[ 2172.240274] ? set_next_entity+0x9e/0x1a0
[ 2172.240278] compact_zone_order+0xde/0x120
[ 2172.240283] try_to_compact_pages+0x187/0x240
[ 2172.240290] __alloc_pages_direct_compact+0x87/0x170
[ 2172.240297] __alloc_pages_slowpath+0x1f8/0xc10
[ 2172.240304] ? get_page_from_freelist+0xe80/0x1330
[ 2172.240311] __alloc_pages_nodemask+0x268/0x2b0
[ 2172.240317] alloc_pages_vma+0xc1/0x160
[ 2172.240322] do_huge_pmd_anonymous_page+0x271/0x610
[ 2172.240328] __handle_mm_fault+0xbfc/0x12f0
[ 2172.240334] handle_mm_fault+0xa9/0x1d0
[ 2172.240341] __do_page_fault+0x23a/0x480
[ 2172.240347] do_page_fault+0x1a/0x64
[ 2172.240352] page_fault+0x39/0x40
[ 2172.240357] RIP: 0033:0x6453194e9c10
[ 2172.240365] Code: Bad RIP value.
[ 2172.240369] RSP: 002b:00007ffdcdfbbbc0 EFLAGS: 00010206
[ 2172.240373] RAX: 00000006c5c79000 RBX: 00007a6cfe787010 RCX: 00007a73db2596fb
[ 2172.240376] RDX: 0000000000000001 RSI: 00000006dc9ad000 RDI: 00007a6cfe787000
[ 2172.240379] RBP: 00006453194eaa54 R08: 00007a6cfe787010 R09: 0000000000000000
[ 2172.240382] R10: 0000000000000022 R11: 00000006dc9ac000 R12: ffffffffffffffff
[ 2172.240384] R13: 0000000000000002 R14: 0000000000001000 R15: 00000006dc9ac000
[ 2177.737800] gpg-agent[1084]: handler 0x7c285955c700 for fd 10 started
[ 2177.781519] gpg-agent[1084]: handler 0x7c285955c700 for fd 10 terminated
```
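The same three tasks repeat in every report above, so the log is easier to read once condensed. The following is only a rough sketch of how that could be done, not anything used in this bug: `summarize_hung_tasks` and the `WAIT_PRIMITIVES` set are hypothetical names, and which frames count as generic sleep/lock primitives is my assumption. For each hung task it keeps the latest "blocked for more than N seconds" value, the innermost wait primitive, and the first reliable frame that called into it (frames marked `?` are skipped).

```python
import re

# Header printed by the hung-task detector; task names may themselves contain ':'.
HUNG_RE = re.compile(r"INFO: task (.+):(\d+) blocked for more than (\d+) seconds")
# A stack frame such as "schedule+0x34/0xa0"; a leading "? " marks an unreliable frame.
FRAME_RE = re.compile(r"\]\s+(\? )?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+")
# Assumption: frames treated as generic sleeping/locking primitives.
WAIT_PRIMITIVES = {"__schedule", "schedule", "schedule_preempt_disabled",
                   "io_schedule", "__mutex_lock", "__lock_page"}


def summarize_hung_tasks(dmesg_text):
    """Map (task name, pid) to its latest report: seconds, wait frame, caller frame."""
    tasks = {}
    current = None
    for line in dmesg_text.splitlines():
        header = HUNG_RE.search(line)
        if header:
            # A later report for the same task replaces the earlier one.
            current = (header.group(1), int(header.group(2)))
            tasks[current] = {"seconds": int(header.group(3)),
                              "wait": None, "caller": None}
            continue
        frame = FRAME_RE.search(line)
        if current is None or frame is None or frame.group(1):
            continue  # not a frame, no task seen yet, or an unreliable '?' frame
        entry = tasks[current]
        if entry["caller"] is not None:
            continue  # already found the first frame above the wait primitives
        symbol = frame.group(2)
        # Compare without compiler suffixes such as ".isra.0".
        if symbol.split(".")[0] in WAIT_PRIMITIVES:
            entry["wait"] = symbol
        else:
            entry["caller"] = symbol
    return tasks
```

Fed the text of `dmesg` captured over ssh, this would list each blocked task once, with the function that is waiting on a mutex or page lock.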