[Bug 111597] [CI][RESUME] igt@* - fail - Failed assertion: !"GPU hung"

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Sep 11 10:31:49 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=111597

--- Comment #5 from Stanislav Lisovskiy <stanislav.lisovskiy at intel.com> ---
Doing now some testing with tgl, when I do submit multiple gpgpu_fill commands
I constantly get this:

(kms_plane_stress:3092) gpu_cmds-CRITICAL: Test assertion failure function
gen7_render_flush, file ../lib/gpu_cmds.c:36:
(kms_plane_stress:3092) gpu_cmds-CRITICAL: Failed assertion: ret == 0
(kms_plane_stress:3092) gpu_cmds-CRITICAL: Last errno: 5, Input/output error
Pausing GPU thread 0 
Stack trace:
  #0 ../lib/igt_core.c:1694 __igt_fail_assert()
  #1 ../lib/gpu_cmds.c:36 gen7_render_flush()
  #2 ../lib/gpgpu_fill.c:356 gen12p1_gpgpu_fillfunc()
  #3 ../tests/kms_plane_stress.c:318 gpu_load()
  #4 /build/glibc-OTsEL5/glibc-2.27/nptl/pthread_create.c:463 start_thread()
  #5 ../sysdeps/unix/sysv/linux/x86_64/clone.S:97 __clone()

Which works quite fine with ICL and other platforms. In dmesg I have this:

[ 3108.643351] hangcheck rcs0
[ 3108.643420] hangcheck        Awake? 2
[ 3108.643428] hangcheck        Hangcheck: 6016 ms ago
[ 3108.643434] hangcheck        Reset count: 0 (global 0)
[ 3108.643440] hangcheck        Requests:
[ 3108.643628] hangcheck                active  1a:4*  prio=2 @ 7900ms:
kms_plane_stres[1347]
[ 3108.643689] hangcheck                ring->start:  0x00008000
[ 3108.643708] hangcheck                ring->head:   0x00000048
[ 3108.643724] hangcheck                ring->tail:   0x00003078
[ 3108.643733] hangcheck                ring->emit:   0x00003080
[ 3108.643738] hangcheck                ring->space:  0x00000f88
[ 3108.643745] hangcheck                ring->hwsp:   0xffff81c0
[ 3108.643753] hangcheck [head 0080, postfix 00c8, tail 0100, batch
0x00000000_007ea000]:
[ 3108.643820] hangcheck [0000] 7a000004 21144c1c fffff080 00000000 00000000
00000000 02800000 00000000
[ 3108.643832] hangcheck [0020] 10400002 ffff81c0 00000000 00000003 04000001
18800101 007ea000 00000000
[ 3108.643841] hangcheck [0040] 04000000 00000000 7a000004 111050a1 ffff81c0
00000000 00000004 00000000
[ 3108.643849] hangcheck [0060] 01000000 04000001 0e40c002 00000000 ffffe0c8
00000000 02800000 00000000
[ 3108.644037] hangcheck        MMIO base:  0x00002000
[ 3108.644085] hangcheck        RING_START: 0x00008000
[ 3108.644098] hangcheck        RING_HEAD:  0x000000c0
[ 3108.644110] hangcheck        RING_TAIL:  0x00003078
[ 3108.644139] hangcheck        RING_CTL:   0x00003001
[ 3108.644158] hangcheck        RING_MODE:  0x00000000
[ 3108.644173] hangcheck        RING_IMR: 00000000
[ 3108.644198] hangcheck        ACTHD:  0x00000000_007ea884
[ 3108.644223] hangcheck        BBADDR: 0x00000000_007ea885
[ 3108.644246] hangcheck        DMA_FADDR: 0x00000000_007eaa80
[ 3108.644257] hangcheck        IPEIR: 0x00000000
[ 3108.644267] hangcheck        IPEHR: 0x25014100
[ 3108.644286] hangcheck        Execlist status: 0x00002098 00000040, entries
12
[ 3108.644295] hangcheck        Execlist CSB read 8, write 8, tasklet queued?
no (enabled)
[ 3108.644318] hangcheck                Active[0: ring:{start:00008000,
hwsp:ffff81c0, seqno:00000003}, rq:  1a:c2  prio=2 @ 7748ms:
kms_plane_stres[1347]
[ 3108.644343] hangcheck                E  1a:4*  prio=2 @ 7901ms:
kms_plane_stres[1347]
[ 3108.644352] hangcheck                E  1a:6  prio=2 @ 7900ms:
kms_plane_stres[1347]
[ 3108.644360] hangcheck                E  1a:8  prio=2 @ 7899ms:
kms_plane_stres[1347]
[ 3108.644368] hangcheck                E  1a:a  prio=2 @ 7898ms:
kms_plane_stres[1347]
[ 3108.644377] hangcheck                E  1a:c  prio=2 @ 7898ms:
kms_plane_stres[1347]
[ 3108.644384] hangcheck                E  1a:e  prio=2 @ 7897ms:
kms_plane_stres[1347]
[ 3108.644392] hangcheck                E  1a:10  prio=2 @ 7896ms:
kms_plane_stres[1347]
[ 3108.644442] hangcheck                ...skipping 88 executing requests...
[ 3108.644450] hangcheck                E  1a:c2  prio=2 @ 7748ms:
kms_plane_stres[1347]
[ 3108.644457] hangcheck HWSP:
[ 3108.644470] hangcheck [0000] 00000000 00000000 00000000 00000000 00000000
00000000 00000000 00000000
[ 3108.644475] hangcheck *
[ 3108.644486] hangcheck [0040] 00010001 00010005 00010001 00010005 00010001
00010005 00010001 00010005
[ 3108.644491] hangcheck *
[ 3108.644499] hangcheck [00a0] 00000000 00000000 00000000 00000000 00000000
00000000 00000000 00000008
[ 3108.644508] hangcheck [00c0] 00000000 00000000 00000000 00000000 00000000
00000000 00000000 00000000
[ 3108.644513] hangcheck *
[ 3108.644563] hangcheck Idle? no
[ 3108.644578] hangcheck Signals:
[ 3108.644676] hangcheck        [1a:44] @ 7846ms
[ 3108.651414] i915 0000:00:02.0: GPU HANG: ecode 12:1:0xdadebeff, in
kms_plane_stres [1347], hang on rcs0
[ 3108.651930] GPU hangs can indicate a bug anywhere in the entire gfx stack,
including userspace.
[ 3108.651945] Please file a _new_ bug report on bugs.freedesktop.org against
DRI -> DRM/Intel
[ 3108.651953] drm/i915 developers can then reassign to the right component if
it's not a kernel issue.
[ 3108.651958] The GPU crash dump is required to analyze GPU hangs, so please
always attach it.
[ 3108.651965] GPU crash dump saved to /sys/class/drm/card0/error

kms_plane_stress is not yet in IGT however, I think there is definitely a bug,
however I don't have any clue what gpu hang might mean.

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are on the CC list for the bug.
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20190911/01dd39c6/attachment-0001.html>


More information about the intel-gfx-bugs mailing list