[Bug 103647] [CI] double incomplete - owatch reset

bugzilla-daemon at freedesktop.org
Thu Nov 16 13:48:59 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=103647

--- Comment #1 from Marta Löfstedt <marta.lofstedt at intel.com> ---
Here is another very strange run:
https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_7089/fi-kbl-r/igt@chamelium@dp-hpd-fast.html

run.log:
running: igt/chamelium/dp-hpd-fast

[000/289]  |                      
owatch: TIMEOUT!
FATAL: command execution failed
java.io.EOFException
...
Finished: FAILURE
Completed CI_IGT_test Patchwork_7089 at fi-kbl-r : FAILURE
CI_IGT_test runtime 186 seconds

dmesg is empty!

There are 2 pstore files:
https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_7089/fi-kbl-r/dmesg-1510579136_Oops_1.log
<7>[    6.831092] [drm:intel_dump_pipe_config [i915]] requested mode:
...
<7>[    8.322952] [drm:intel_dp_read_dpcd [i915]] DPCD: 12 14 84 41 00 00 01 01 02 00 00 00 00 0b 00
<1>[    8.323437] BUG: unable to handle kernel paging request at ffffffff810dd07d
<1>[    8.323441] IP: __lock_acquire+0x109/0x1b00
<6>[    8.323441] PGD 3e10067 P4D 3e10067 PUD 3e11063 PMD 30001e1 
<4>[    8.323444] Oops: 0003 [#1] PREEMPT SMP
<4>[    8.323445] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic asix usbnet mii i915 snd_hda_intel x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec crct10dif_pclmul crc32_pclmul e1000e snd_hwdep snd_hda_core ghash_clmulni_intel snd_pcm ptp pps_core mei_me mei prime_numbers i2c_hid pinctrl_sunrisepoint pinctrl_intel
<4>[    8.323457] CPU: 0 PID: 5 Comm: kworker/u16:0 Not tainted 4.14.0-rc8-CI-Patchwork_7089+ #1
<4>[    8.323457] Hardware name: Intel Corporation Kabylake Client platform/Kabylake R DDR4 RVP, BIOS KBLSE2R1.R00.X078.P02.1703030515 03/03/2017
<4>[    8.323497] Workqueue: i915-dp i915_digport_work_func [i915]
<4>[    8.323498] task: ffff8802b4c6d0c0 task.stack: ffffc900000a8000
<4>[    8.323500] RIP: 0010:__lock_acquire+0x109/0x1b00
<4>[    8.323501] RSP: 0018:ffffc900000abcb0 EFLAGS: 00010082
<4>[    8.323502] RAX: ffffffff810dcf45 RBX: ffff8802b4c6d9d0 RCX: 0000000000000000
<4>[    8.323503] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffc900000abd80
<4>[    8.323503] RBP: ffffc900000abd70 R08: 0000000000000001 R09: 0000000000000000
<4>[    8.323504] R10: 0000000000000000 R11: ffffffffa025cdc2 R12: ffffc900000abd80
<4>[    8.323504] R13: ffff8802b4c6d0c0 R14: 0000000000000001 R15: 0000000000000000
<4>[    8.323505] FS:  0000000000000000(0000) GS:ffff8802bec00000(0000) knlGS:0000000000000000
<4>[    8.323506] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[    8.323506] CR2: ffffffff810dd07d CR3: 0000000003e0f003 CR4: 00000000003606f0
<4>[    8.323507] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
<4>[    8.323507] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
<4>[    8.323508] Call Trace:
<4>[    8.323510]  ? try_to_wake_up+0x2b4/0x620
<4>[    8.323512]  ? __queue_work+0x19c/0x5a0
<4>[    8.323551]  ? intel_dp_hpd_pulse+0xd2/0x3a0 [i915]
<4>[    8.323553]  reacquire_held_locks+0xa5/0x170
<4>[    8.323554]  ? __this_cpu_preempt_check+0x13/0x20
<4>[    8.323556]  ? reacquire_held_locks+0xa5/0x170
<4>[    8.323591]  ? intel_dp_hpd_pulse+0xd2/0x3a0 [i915]
<4>[    8.323593]  ? find_held_lock+0xae/0xc0
<4>[    8.323595]  ? process_one_work+0x23d/0x650
<4>[    8.323596]  lock_release+0x107/0x310
<4>[    8.323598]  process_one_work+0x252/0x650
<4>[    8.323600]  worker_thread+0x4e/0x3c0
<4>[    8.323601]  kthread+0x114/0x150
<4>[    8.323602]  ? process_one_work+0x650/0x650
<4>[    8.323603]  ? kthread_create_on_node+0x40/0x40
<4>[    8.323605]  ret_from_fork+0x27/0x40
<4>[    8.323607] Code: 8d 62 f8 c3 49 81 3c 24 20 2b 11 82 41 be 00 00 00 00 45 0f 45 f0 83 fe 01 77 86 89 f0 49 8b 44 c4 08 48 85 c0 0f 84 76 ff ff ff <f0> ff 80 38 01 00 00 8b 1d 22 7a bc 01 45 8b 85 b8 08 00 00 85
<1>[    8.323627] RIP: __lock_acquire+0x109/0x1b00 RSP: ffffc900000abcb0
<4>[    8.323627] CR2: ffffffff810dd07d

https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_7089/fi-kbl-r/dmesg-1510579281_Panic_3.log
<12>[   60.890787] owatch: Using watchdog device /dev/watchdog0
<12>[   60.890809] owatch: Watchdog /dev/watchdog0 is a software watchdog
<12>[   60.891113] owatch: timeout for /dev/watchdog0 set to 100 (requested 100)
<12>[  153.731546] owatch: TIMEOUT!
<12>[  153.731606] owatch: timeout for /dev/watchdog0 set to 10 (requested 10)
<6>[  153.731685] sysrq: SysRq : Trigger a crash
<1>[  153.731694] BUG: unable to handle kernel NULL pointer dereference at           (null)
<1>[  153.731703] IP: sysrq_handle_crash+0x45/0x80
<6>[  153.731705] PGD 2aab19067 P4D 2aab19067 PUD 29d5d2067 PMD 0 
<4>[  153.731712] Oops: 0002 [#2] PREEMPT SMP
<4>[  153.731715] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic asix usbnet mii i915 snd_hda_intel x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec crct10dif_pclmul crc32_pclmul e1000e snd_hwdep snd_hda_core ghash_clmulni_intel snd_pcm ptp pps_core mei_me mei prime_numbers i2c_hid pinctrl_sunrisepoint pinctrl_intel
<4>[  153.731749] CPU: 4 PID: 1048 Comm: owatch Tainted: G      D W       4.14.0-rc8-CI-Patchwork_7089+ #1
<4>[  153.731751] Hardware name: Intel Corporation Kabylake Client platform/Kabylake R DDR4 RVP, BIOS KBLSE2R1.R00.X078.P02.1703030515 03/03/2017
<4>[  153.731753] task: ffff88029d5c50c0 task.stack: ffffc90000ee0000
<4>[  153.731756] RIP: 0010:sysrq_handle_crash+0x45/0x80
<4>[  153.731758] RSP: 0018:ffffc90000ee3de0 EFLAGS: 00010282
<4>[  153.731762] RAX: ffff88029d5c50c0 RBX: 0000000000000063 RCX: 0000000000000000
<4>[  153.731763] RDX: ffffffff8159d19b RSI: 0000000000000001 RDI: ffffffff81e4d600
<4>[  153.731765] RBP: ffffc90000ee3de0 R08: 0000000000000001 R09: 0000000000000000
<4>[  153.731766] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4>[  153.731768] R13: 000000000000000f R14: ffffffff81ebefc0 R15: ffffc90000ee3f20
<4>[  153.731770] FS:  00007fbf6a3ec700(0000) GS:ffff8802bed00000(0000) knlGS:0000000000000000
<4>[  153.731772] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[  153.731774] CR2: 0000000000000000 CR3: 00000002b042f003 CR4: 00000000003606e0
<4>[  153.731775] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
<4>[  153.731777] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
<4>[  153.731778] Call Trace:
<4>[  153.731782]  __handle_sysrq+0x132/0x210
<4>[  153.731786]  write_sysrq_trigger+0x51/0x60
<4>[  153.731791]  proc_reg_write+0x42/0x70
<4>[  153.731795]  __vfs_write+0x28/0x130
<4>[  153.731799]  ? rcu_sync_lockdep_assert+0x12/0x60
<4>[  153.731803]  ? __sb_start_write+0x10c/0x200
<4>[  153.731807]  vfs_write+0xc8/0x1c0
<4>[  153.731811]  SyS_write+0x49/0xb0
<4>[  153.731817]  entry_SYSCALL_64_fastpath+0x1c/0xb1
<4>[  153.731820] RIP: 0033:0x7fbf69f12290
<4>[  153.731822] RSP: 002b:00007ffcb99f86b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
<4>[  153.731825] RAX: ffffffffffffffda RBX: 0000000001bf3020 RCX: 00007fbf69f12290
<4>[  153.731826] RDX: 0000000000000001 RSI: 0000000001bf3250 RDI: 0000000000000003
<4>[  153.731827] RBP: 0000000000000000 R08: 00007fbf6a3ec700 R09: 0000000000000001
<4>[  153.731829] R10: 0000000000000449 R11: 0000000000000246 R12: 0000000000000000
<4>[  153.731831] R13: 00007ffcb99f88f0 R14: 0000000000000000 R15: 0000000000000000
<4>[  153.731839] Code: 34 e8 80 ea b5 ff 48 c7 c2 9b d1 59 81 be 01 00 00 00 48 c7 c7 00 d6 e4 81 e8 a8 05 b4 ff c7 05 4e 0c b7 00 01 00 00 00 0f ae f8 <c6> 04 25 00 00 00 00 01 5d c3 e8 0c 19 b6 ff 84 c0 75 c3 48 c7
<1>[  153.731925] RIP: sysrq_handle_crash+0x45/0x80 RSP: ffffc90000ee3de0
<4>[  153.731927] CR2: 0000000000000000
<4>[  153.731933] ---[ end trace 0cbf3056b4970310 ]---
<0>[  153.764966] Kernel panic - not syncing: Fatal exception
