<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - [BAT] IGT gem_exec_parallel hangs half of the time on BDW+ testhosts"
href="https://bugs.freedesktop.org/show_bug.cgi?id=100232">100232</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>[BAT] IGT gem_exec_parallel hangs half of the time on BDW+ testhosts
</td>
</tr>
<tr>
<th>Product</th>
<td>DRI
</td>
</tr>
<tr>
<th>Version</th>
<td>DRI git
</td>
</tr>
<tr>
<th>Hardware</th>
<td>Other
</td>
</tr>
<tr>
<th>OS</th>
<td>All
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>normal
</td>
</tr>
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Component</th>
<td>DRM/Intel
</td>
</tr>
<tr>
<th>Assignee</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Reporter</th>
<td>tomi.p.sarvela@intel.com
</td>
</tr>
<tr>
<th>QA Contact</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>CC</th>
<td>intel-gfx-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>i915 platform</th>
<td>BDW, BXT, KBL, SKL
</td>
</tr>
<tr>
<th>i915 features</th>
<td>GEM/execlists
</td>
</tr></table>
<p>
<div>
<pre>One of the subtests in gem_exec_parallel often hangs the host. Below is dump
from SKL6700K on Z170 MB, hanged hard on igt@gem_exec_parallel@render-fds after
running tests/intel-ci/fast-feedback.testlist
CI_DRM_2352 is drm-tip, todays build. For details
<a href="https://intel-gfx-ci.01.org/CI/">https://intel-gfx-ci.01.org/CI/</a>
[ 947.215802] general protection fault: 0000 [#1] PREEMPT SMP
[ 947.221439] Modules linked in: snd_hda_intel i915 vgem snd_hda_codec_hdmi
snd_hda_codec_realtek x
86_pkg_temp_thermal snd_hda_codec_generic intel_powerclamp coretemp
crct10dif_pclmul crc32_pclmul gh
ash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm mei_me mei
e1000e igb ptp pps_core pr
ime_numbers pinctrl_sunrisepoint pinctrl_intel i2c_hid [last unloaded: i915]
[ 947.254918] CPU: 6 PID: 47 Comm: ksoftirqd/6 Tainted: G U
4.11.0-rc2-CI-CI_DRM_2352+
#1
[ 947.264181] Hardware name: Gigabyte Technology Co., Ltd.
Z170X-UD5/Z170X-UD5-CF, BIOS F21 01/06/2
017
[ 947.273499] task: ffff88042bdaa7c0 task.stack: ffffc900001fc000
[ 947.279489] RIP: 0010:notifier_call_chain+0x59/0xa0
[ 947.284426] RSP: 0018:ffffc900001ffd38 EFLAGS: 00010286
[ 947.289698] RAX: 0000000000000001 RBX: 00000000ffffffff RCX:
00000000ffffffff
[ 947.296917] RDX: ffff8803bf65d5c0 RSI: 0000000000000001 RDI:
ffff88041d05e4c8
[ 947.304249] RBP: ffffc900001ffd70 R08: 0000000000000000 R09:
643e07b800000000
[ 947.311544] R10: 0000000000000000 R11: ffff88042bdaa7c0 R12:
0000000000000000
[ 947.318833] R13: 0000000000000000 R14: 00000000ffffffff R15:
6b6b6b6b6b6b6b6b
[ 947.326070] FS: 0000000000000000(0000) GS:ffff88043ed80000(0000)
knlGS:0000000000000000
[ 947.334408] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 947.340242] CR2: 00007f83d8000010 CR3: 0000000429021000 CR4:
00000000003406e0
[ 947.347512] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[ 947.354801] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
0000000000000400
[ 947.362080] Call Trace:
[ 947.364582] __atomic_notifier_call_chain+0x73/0x110
[ 947.369709] ? unregister_die_notifier+0x20/0x20
[ 947.374406] atomic_notifier_call_chain+0x11/0x20
[ 947.379276] intel_lrc_irq_handler+0x191/0x490 [i915]
[ 947.384458] tasklet_hi_action+0xf0/0x110
[ 947.388611] __do_softirq+0x116/0x4c0
[ 947.392321] run_ksoftirqd+0x22/0x50
[ 947.395960] smpboot_thread_fn+0x180/0x280
[ 947.400129] kthread+0x107/0x140
[ 947.403431] ? sort_range+0x20/0x20
[ 947.407002] ? kthread_create_on_node+0x40/0x40
[ 947.411621] ret_from_fork+0x2e/0x40
[ 947.415233] Code: 4c 89 ff 41 ff 17 4d 85 e4 41 89 c5 74 05 41 83 04 24 01
41 f7 c5 00 80 00 00 7
5 39 83 eb 01 4d 89 f7 4d 85 ff 74 2e 85 db 74 2a <49> 8b 3f 4d 8b 77 08 e8 cb
ca ff ff 85 c0 75 bd
48 c7 c2 04 f1
[ 947.434475] RIP: notifier_call_chain+0x59/0xa0 RSP: ffffc900001ffd38
[ 947.440931] ---[ end trace e6564010da93ee3e ]---
[ 947.608936] Kernel panic - not syncing: Fatal exception in interrupt
[ 947.615465] Kernel Offset: disabled
[ 947.791838] ---[ end Kernel panic - not syncing: Fatal exception in
interrupt
[ 947.799101] ------------[ cut here ]------------
[ 947.803805] WARNING: CPU: 6 PID: 47 at arch/x86/kernel/smp.c:127
native_smp_send_reschedule+0x3a/
0x40
[ 947.813181] Modules linked in: snd_hda_intel i915 vgem snd_hda_codec_hdmi
snd_hda_codec_realtek x
86_pkg_temp_thermal snd_hda_codec_generic intel_powerclamp coretemp
crct10dif_pclmul crc32_pclmul gh
ash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm mei_me mei
e1000e igb ptp pps_core pr
ime_numbers pinctrl_sunrisepoint pinctrl_intel i2c_hid [last unloaded: i915]
[ 947.846626] CPU: 6 PID: 47 Comm: ksoftirqd/6 Tainted: G UD
4.11.0-rc2-CI-CI_DRM_2352+
#1
[ 947.855898] Hardware name: Gigabyte Technology Co., Ltd.
Z170X-UD5/Z170X-UD5-CF, BIOS F21 01/06/2
017
[ 947.865208] Call Trace:
[ 947.867711] <IRQ>
[ 947.869768] dump_stack+0x67/0x92
[ 947.873129] __warn+0xc6/0xe0
[ 947.876145] warn_slowpath_null+0x18/0x20
[ 947.880218] native_smp_send_reschedule+0x3a/0x40
[ 947.885019] trigger_load_balance+0x2cd/0x580
[ 947.889448] ? trigger_load_balance+0x6f/0x580
[ 947.893956] scheduler_tick+0x97/0xc0
[ 947.897673] ? tick_sched_handle.isra.7+0x40/0x40
[ 947.902458] update_process_times+0x42/0x50
[ 947.906721] tick_sched_handle.isra.7+0x1c/0x40
[ 947.911356] tick_sched_timer+0x3d/0x70
[ 947.915249] __hrtimer_run_queues+0xf3/0x530
[ 947.919590] hrtimer_interrupt+0xb9/0x210
[ 947.923655] local_apic_timer_interrupt+0x31/0x50
[ 947.928449] smp_apic_timer_interrupt+0x33/0x50
[ 947.933084] apic_timer_interrupt+0x90/0xa0
[ 947.937330] RIP: 0010:panic+0x1c7/0x205
[ 947.941231] RSP: 0018:ffffc900001ffb90 EFLAGS: 00000246 ORIG_RAX:
ffffffffffffff10
[ 947.948909] RAX: 0000000000000041 RBX: 0000000000000000 RCX:
0000000000000000
[ 947.956191] RDX: 0000000000000101 RSI: ffffffff81c6e65d RDI:
ffffffff8117ef23
[ 947.963410] RBP: ffffc900001ffc00 R08: 0000000000000001 R09:
0000000000000000
[ 947.970664] R10: 0000000000000000 R11: 0000000000000000 R12:
0000000000000000
[ 947.977909] R13: 0000000000000000 R14: 0000000000000000 R15:
6b6b6b6b6b6b6b6b
[ 947.985170] </IRQ>
[ 947.987320] ? panic+0x1c4/0x205
[ 947.990604] ? kmsg_dump+0x11f/0x1c0
[ 947.994237] oops_end+0x78/0x90
[ 947.997435] die+0x46/0x60
[ 948.000172] do_general_protection+0xe0/0x1a0
[ 948.004610] general_protection+0x22/0x30
[ 948.008683] RIP: 0010:notifier_call_chain+0x59/0xa0
[ 948.013650] RSP: 0018:ffffc900001ffd38 EFLAGS: 00010286
[ 948.018963] RAX: 0000000000000001 RBX: 00000000ffffffff RCX:
00000000ffffffff
[ 948.026224] RDX: ffff8803bf65d5c0 RSI: 0000000000000001 RDI:
ffff88041d05e4c8
[ 948.033487] RBP: ffffc900001ffd70 R08: 0000000000000000 R09:
643e07b800000000
[ 948.040759] R10: 0000000000000000 R11: ffff88042bdaa7c0 R12:
0000000000000000
[ 948.048020] R13: 0000000000000000 R14: 00000000ffffffff R15:
6b6b6b6b6b6b6b6b
[ 948.055277] __atomic_notifier_call_chain+0x73/0x110
[ 948.060328] ? unregister_die_notifier+0x20/0x20
[ 948.065034] atomic_notifier_call_chain+0x11/0x20
[ 948.069835] intel_lrc_irq_handler+0x191/0x490 [i915]
[ 948.074960] tasklet_hi_action+0xf0/0x110
[ 948.079023] __do_softirq+0x116/0x4c0
[ 948.082766] run_ksoftirqd+0x22/0x50
[ 948.086406] smpboot_thread_fn+0x180/0x280
[ 948.090593] kthread+0x107/0x140
[ 948.093867] ? sort_range+0x20/0x20
[ 948.097421] ? kthread_create_on_node+0x40/0x40
[ 948.102041] ret_from_fork+0x2e/0x40
[ 948.105681] ---[ end trace e6564010da93ee3f ]---</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
<li>You are on the CC list for the bug.</li>
<li>You are the QA Contact for the bug.</li>
</ul>
</body>
</html>