[Bug 111385] [GEN9] (partly recoverable) GPU hang in (multi-context) SynMark HDRBloom
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Tue Sep 3 10:50:40 UTC 2019
https://bugs.freedesktop.org/show_bug.cgi?id=111385
--- Comment #9 from Eero Tamminen <eero.t.tamminen at intel.com> ---
This time SKL GT4e did have recoverable GPU hang in SynMark (CPU<->GPU sync)
Terrain tests, instead of in HdrBloom like SKL GT2 & KBL GT3e had.
KBL GT3e recoverable GPU hang dmesg shows some additional issue:
----------------------------------------------------------------
[ 1799.952461] Iteration 1/3: synmark2 OglHdrBloom
[ 1822.876411] i915 0000:00:02.0: GPU HANG: ecode 9:1:0x00000000, hang on rcs0
[ 1822.876414] GPU hangs can indicate a bug anywhere in the entire gfx stack,
including userspace.
[ 1822.876415] Please file a _new_ bug report on bugs.freedesktop.org against
DRI -> DRM/Intel
[ 1822.876416] drm/i915 developers can then reassign to the right component if
it's not a kernel issue.
[ 1822.876417] The GPU crash dump is required to analyze GPU hangs, so please
always attach it.
[ 1822.876418] GPU crash dump saved to /sys/class/drm/card0/error
[ 1822.877427] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1822.878178] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1822.878298] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 1822.880074] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1822.880814] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1830.876391] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1838.876410] i915 0000:00:02.0: Resetting rcs0 for stuck wait on rcs0
[ 1838.993330] Iteration 2/3: synmark2 OglHdrBloom
[ 1856.861369] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1856.862122] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1856.862206] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 1856.863980] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1856.864751] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1864.860345] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1872.861361] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1872.971858] Iteration 3/3: synmark2 OglHdrBloom
[ 1898.845367] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1898.846124] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1898.846214] i915 0000:00:02.0: Resetting chip for hang on rcs0
[ 1898.847988] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1898.848741] [drm:gen8_reset_engines [i915]] *ERROR* rcs0 reset request timed
out: {request: 00000001, RESET_CTL: 00000001}
[ 1906.844355] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
[ 1916.892378] i915 0000:00:02.0: Resetting rcs0 for stuck wait on rcs0
[ 1916.895299] ------------[ cut here ]------------
[ 1916.895392] WARNING: CPU: 2 PID: 0 at ./include/linux/dma-fence.h:532
i915_request_skip+0xa8/0xc0 [i915]
[ 1916.895393] Modules linked in: fuse i915 nfs lockd grace overlay
x86_pkg_temp_thermal coretemp crct10dif_pclmul mei_me e1000e crc32_pclmul mei
sunrpc
[ 1916.895405] CPU: 2 PID: 0 Comm: swapper/2 Not tainted
5.3.0-rc6-CI-Nightly_1860+ #1
[ 1916.895407] Hardware name: /NUC7i7BNB, BIOS
BNKBL357.86A.0062.2018.0222.1644 02/22/2018
[ 1916.895466] RIP: 0010:i915_request_skip+0xa8/0xc0 [i915]
[ 1916.895469] Code: eb c7 48 c7 c7 40 b8 2d a0 89 74 24 04 e8 5e e8 ee e0 0f
0b 8b 74 24 04 eb 93 48 c7 c7 40 b8 2d a0 89 74 24 04 e8 46 e8 ee e0 <0f> 0b 8b
74 24 04 e9 6b ff ff ff 0f 1f 00 66 2e 0f 1f 84 00 00 00
[ 1916.895471] RSP: 0018:ffffc9000011ce48 EFLAGS: 00010086
[ 1916.895473] RAX: 0000000000000024 RBX: ffff88820241c6c0 RCX:
0000000000000103
[ 1916.895475] RDX: 0000000000000000 RSI: ffff888276b163d8 RDI:
00000000ffffffff
[ 1916.895477] RBP: ffffc90031050000 R08: 00000000000002e3 R09:
0000000000000004
[ 1916.895478] R10: ffffc9000011ced8 R11: 0000000000000001 R12:
ffff88826796d2c0
[ 1916.895480] R13: ffff8882745da000 R14: ffff888241d02ac0 R15:
ffff888241d00fc0
[ 1916.895482] FS: 0000000000000000(0000) GS:ffff888276b00000(0000)
knlGS:0000000000000000
[ 1916.895484] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1916.895485] CR2: 0000558f16d1e848 CR3: 000000000340a004 CR4:
00000000003606e0
[ 1916.895487] Call Trace:
[ 1916.895491] <IRQ>
[ 1916.895546] __i915_request_submit+0x11d/0x150 [i915]
[ 1916.895595] execlists_dequeue+0x60c/0xda0 [i915]
[ 1916.895641] execlists_submission_tasklet+0x59/0x60 [i915]
[ 1916.895648] tasklet_action_common.isra.4+0x3d/0xa0
[ 1916.895654] __do_softirq+0xf7/0x34b
[ 1916.895659] irq_exit+0x98/0xb0
[ 1916.895663] smp_apic_timer_interrupt+0x8e/0x190
[ 1916.895666] apic_timer_interrupt+0xf/0x20
[ 1916.895668] </IRQ>
[ 1916.895673] RIP: 0010:cpuidle_enter_state+0xae/0x450
[ 1916.895676] Code: 49 89 c4 0f 1f 44 00 00 31 ff e8 2d a5 8f ff 45 84 f6 74
12 9c 58 f6 c4 02 0f 85 73 03 00 00 31 ff e8 c6 a4 94 ff fb 45 85 ed <0f> 88 c9
02 00 00 4c 2b 24 24 48 ba cf f7 53 e3 a5 9b c4 20 49 63
[ 1916.895678] RSP: 0018:ffffc900000b7e80 EFLAGS: 00000202 ORIG_RAX:
ffffffffffffff13
[ 1916.895680] RAX: ffff888276b00000 RBX: ffffffff824b4340 RCX:
000000000000001f
[ 1916.895682] RDX: 000001be4fdc7708 RSI: 000000002487924d RDI:
0000000000000000
[ 1916.895684] RBP: ffff888276b31e00 R08: 0000000000000002 R09:
0000000000028dc0
[ 1916.895685] R10: ffffc900000b7e60 R11: 000000000000024a R12:
000001be4fdc7708
[ 1916.895687] R13: 0000000000000004 R14: 0000000000000000 R15:
0000000000000004
[ 1916.895694] cpuidle_enter+0x29/0x40
[ 1916.895698] do_idle+0x1e9/0x240
[ 1916.895702] cpu_startup_entry+0x19/0x20
[ 1916.895705] start_secondary+0x159/0x1a0
[ 1916.895709] secondary_startup_64+0xa4/0xb0
[ 1916.895767] WARNING: CPU: 2 PID: 0 at ./include/linux/dma-fence.h:532
i915_request_skip+0xa8/0xc0 [i915]
[ 1916.895769] ---[ end trace 0ec701c0ac1a866b ]---
----------------------------------------------------------------
--
You are receiving this mail because:
You are the QA Contact for the bug.
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-3d-bugs/attachments/20190903/45f517dd/attachment.html>
More information about the intel-3d-bugs
mailing list