[Bug 111601] regression: deadlock-freeze due to kernel commit aa56a292ce623734ddd30f52d73f527d1f3529b5

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Sep 10 10:36:19 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=111601

--- Comment #8 from howaboutsynergy <howaboutsynergy at pm.me> ---
Created attachment 145318
  --> https://bugs.freedesktop.org/attachment.cgi?id=145318&action=edit
semi-full log of stacktraces at the point of `return -EINTR`

I got the stack traces by adding a `WARN_ON(1)` just before the `return -EINTR`:
```diff
diff --git a/drivers/gpu/drm/i915/gem/i915_gem_userptr.c b/drivers/gpu/drm/i915/gem/i915_gem_userptr.c
index 2caa594322bc..5a9dd14d3bea 100644
--- a/drivers/gpu/drm/i915/gem/i915_gem_userptr.c
+++ b/drivers/gpu/drm/i915/gem/i915_gem_userptr.c
@@ -138,6 +138,7 @@ userptr_mn_invalidate_range_start(struct mmu_notifier *_mn,
                        case MUTEX_TRYLOCK_FAILED:
                                if (mutex_lock_killable_nested(unlock, I915_MM_SHRINKER)) {
                                        i915_gem_object_put(obj);
+                                       WARN_ON(1);
                                        return -EINTR;
                                }
                                /* fall through */
```


They look like this:
```
[  154.134495] ------------[ cut here ]------------
[  154.134498] WARN_ON(1)
[  154.134642] WARNING: CPU: 7 PID: 1931 at drivers/gpu/drm/i915/gem/i915_gem_userptr.c:141 userptr_mn_invalidate_range_start+0x176/0x220 [i915]
[  154.134684] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic xt_TCPMSS iptable_mangle iptable_security iptable_nat nf_nat iptable_raw nf_log_ipv4 nf_log_common xt_conntrack xt_LOG xt_connlimit nf_conncount nf_conntrack nf_defrag_ipv4 xt_hashlimit xt_multiport xt_owner xt_addrtype intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp i915 crct10dif_pclmul crc32_pclmul crc32c_intel i2c_algo_bit snd_hda_intel ghash_clmulni_intel drm_kms_helper snd_hda_codec syscopyarea snd_hwdep sysfillrect sysimgblt snd_hda_core fb_sys_fops snd_pcm iTCO_wdt intel_cstate drm snd_timer iTCO_vendor_support intel_uncore snd intel_rapl_perf e1000e drm_panel_orientation_quirks soundcore bfq i2c_i801 pcspkr xhci_pci xhci_hcd
[  154.134720] CPU: 7 PID: 1931 Comm: stress Kdump: loaded Tainted: G     U  W        5.3.0-rc8-gf74c2bb98776 #56
[  154.134723] Hardware name: System manufacturer System Product Name/PRIME Z370-A, BIOS 2201 05/27/2019
[  154.134804] RIP: 0010:userptr_mn_invalidate_range_start+0x176/0x220 [i915]
[  154.134809] Code: ff ff ff 48 89 ef e8 a9 37 cc cb 84 c0 74 08 48 89 ef e8 1d db f1 ff 48 c7 c6 81 00 89 c0 48 c7 c7 7e 00 89 c0 e8 63 ae 8e cb <0f> 0b 41 bf fc ff ff ff e9 97 fe ff ff be 01 00 00 00 48 89 ef e8
[  154.134816] RSP: 0018:ffffaf0607e03768 EFLAGS: 00010286
[  154.134820] RAX: 0000000000000000 RBX: ffffaf0607e03820 RCX: 0000000000000000
[  154.134824] RDX: 000000000000000a RSI: ffffffff8d56c9ca RDI: ffffffff8d56d9ca
[  154.134828] RBP: ffff94de3c88f900 R08: 00000023e321caf3 R09: 000000000000000a
[  154.134832] R10: 0000000000000000 R11: 00000000fffffffe R12: ffff94de3c418068
[  154.134837] R13: ffff94dea9dbfd98 R14: ffff94de47e70b90 R15: 0000000000000000
[  154.134842] FS:  0000766ecc735740(0000) GS:ffff94dead9c0000(0000) knlGS:0000000000000000
[  154.134846] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  154.134849] CR2: 00007ded1ef46740 CR3: 00000007bdffe006 CR4: 00000000003606e0
[  154.134852] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  154.134859] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  154.134865] Call Trace:
[  154.134877]  __mmu_notifier_invalidate_range_start+0x4f/0x90
[  154.134897]  try_to_unmap_one+0x718/0x820
[  154.134905]  rmap_walk_file+0xe4/0x250
[  154.134933]  try_to_unmap+0xc1/0xf0
[  154.134942]  ? page_remove_rmap+0x2a0/0x2a0
[  154.134952]  ? page_not_mapped+0x10/0x10
[  154.134974]  ? page_get_anon_vma+0x70/0x70
[  154.134986]  migrate_pages+0x7aa/0x9a0
[  154.134999]  ? isolate_freepages_block+0x340/0x340
[  154.135007]  ? move_freelist_tail+0xd0/0xd0
[  154.135019]  compact_zone+0x656/0xa70
[  154.135029]  compact_zone_order+0xde/0x120
[  154.135042]  try_to_compact_pages+0x187/0x240
[  154.135065]  __alloc_pages_direct_compact+0x87/0x170
[  154.135084]  __alloc_pages_slowpath+0x1f8/0xc10
[  154.135096]  ? get_page_from_freelist+0xe80/0x1330
[  154.135117]  __alloc_pages_nodemask+0x268/0x2b0
[  154.135141]  alloc_pages_vma+0xc1/0x160
[  154.135149]  do_huge_pmd_anonymous_page+0x271/0x610
[  154.135166]  __handle_mm_fault+0xbfc/0x12f0
[  154.135173]  handle_mm_fault+0xa9/0x1d0
[  154.135187]  __do_page_fault+0x23a/0x480
[  154.135199]  do_page_fault+0x1a/0x64
[  154.135221]  page_fault+0x39/0x40
[  154.135229] RIP: 0033:0x5bfeb4f67c10
[  154.135244] Code: c0 0f 84 53 02 00 00 8b 54 24 0c 31 c0 85 d2 0f 94 c0 89 04 24 41 83 fd 02 0f 8f fa 00 00 00 31 c0 4d 85 ff 7e 10 0f 1f 40 00 <c6> 04 03 5a 4c 01 f0 49 39 c7 7f f4 4d 85 e4 0f 84 f4 01 00 00 7e
[  154.135266] RSP: 002b:00007ffcc3719e40 EFLAGS: 00010206
[  154.135272] RAX: 0000000374e52000 RBX: 00007667ebdae010 RCX: 0000766ecc85a6fb
[  154.135281] RDX: 0000000000000001 RSI: 00000006e0987000 RDI: 00007667ebdae000
[  154.135295] RBP: 00005bfeb4f68a54 R08: 00007667ebdae010 R09: 0000000000000000
[  154.135305] R10: 0000000000000022 R11: 00000006e0986000 R12: ffffffffffffffff
[  154.135315] R13: 0000000000000002 R14: 0000000000001000 R15: 00000006e0986000
[  154.135331] ---[ end trace 28cc7c376d08f23b ]---
[  154.135452] userptr_mn_invalidate_range_start+0x0/0x220 [i915] callback failed with -4 in blockable context.
```
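
For anyone decoding the last log line: assuming the usual Linux errno numbering, -4 is -EINTR, which matches the instrumented `return -EINTR` path in the patch above. A minimal userspace sketch of that check (the helper name `is_eintr` is hypothetical and not part of the i915 driver):

```c
#include <errno.h>

/*
 * Hypothetical helper, for illustration only: returns 1 if a
 * kernel-style negative return code corresponds to -EINTR, else 0.
 * Assumes the Linux errno numbering, where EINTR is 4.
 */
static int is_eintr(int ret)
{
	return ret == -EINTR;
}
```

Calling `is_eintr(-4)` with the value from the "callback failed with -4" line should return 1 on Linux.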

A semi-full log is attached.

Hope this is helpful.
