[Bug 103738] [IGT] igt@gem_exec/igt@gem_reloc/igt@sync some subtests have a dmesg-warn watchdog: BUG: soft lockup - CPU#1 stuck for 23s!

bugzilla-daemon at freedesktop.org
Thu Apr 5 18:43:15 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=103738

--- Comment #27 from Ricardo Perez <ricardo.o.perez at intel.com> ---
For the following test on CNL (Cannon Lake):

igt@gem_exec_reloc@cpu-31

Software version:

IGT-Version: 1.22-g0721161 (x86_64) (Linux: 4.16.0-rc6-drm-intel-qa-ww12-commit-4db112a+ x86_64)
Subtest cpu-31: SUCCESS (4.202s)

We are seeing the following dmesg-warn:

[  562.977732] Setting dangerous option reset - tainting kernel
[  567.865254] Setting dangerous option reset - tainting kernel
[  577.855254] Setting dangerous option reset - tainting kernel
[  579.573912] Setting dangerous option reset - tainting kernel
[  604.039990] watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [gem_exec_reloc:2144]
[  604.039997] Modules linked in: snd_hda_codec_hdmi bnep 8250_dw nls_iso8859_1 arc4 snd_soc_skl x86_pkg_temp_thermal intel_powerclamp snd_soc_skl_ipc coretemp snd_soc_sst_ipc snd_soc_sst_dsp kvm_intel snd_hda_ext_core snd_hda_codec_realtek snd_soc_acpi snd_hda_codec_generic kvm snd_soc_core snd_compress snd_pcm_dmaengine irqbypass ac97_bus crct10dif_pclmul iwlmvm crc32_pclmul ghash_clmulni_intel pcbc mac80211 aesni_intel snd_hda_intel aes_x86_64 crypto_simd glue_helper cryptd snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi serio_raw snd_seq wmi_bmof btusb btrtl snd_seq_device btbcm iwlwifi asix snd_timer btintel usbnet mii bluetooth snd input_leds soundcore ecdh_generic idma64 shpchp virt_dma mei_me intel_lpss_pci cfg80211 mei intel_pch_thermal intel_lpss acpi_pad
[  604.040035]  mac_hid parport_pc ppdev lp parport ip_tables x_tables autofs4 uas usb_storage hid_generic usbhid hid i915 dwc3 udc_core ulpi e1000e dwc3_pci prime_numbers wmi video
[  604.040049] CPU: 3 PID: 2144 Comm: gem_exec_reloc Tainted: G     U  W        4.16.0-rc6-drm-intel-qa-ww12-commit-4db112a+ #1
[  604.040049] Hardware name: Intel Corporation CannonLake Client Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X124.B02.1802051422 02/05/2018
[  604.040086] RIP: 0010:i915_exit+0x44/0x3b7 [i915]
[  604.040087] RSP: 0018:ffffbba98215f8d0 EFLAGS: 00050246 ORIG_RAX: ffffffffffffff12
[  604.040089] RAX: 0000000000000000 RBX: ffffbba98215fb58 RCX: ffff9eb231e55004
[  604.040090] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  604.040091] RBP: ffff9eb22d796100 R08: 0000000000001000 R09: 0000000001e2ed51
[  604.040092] R10: ffff9eb22d796100 R11: 0000000000000001 R12: 00007f7ce694ac10
[  604.040092] R13: 00007f7ce694aa30 R14: ffffbba98215f900 R15: ffff9eb22d2295c0
[  604.040094] FS:  00007f7ced73a8c0(0000) GS:ffff9eb23f980000(0000) knlGS:0000000000000000
[  604.040095] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  604.040095] CR2: 00007f7ce694aa30 CR3: 00000002af862002 CR4: 0000000000760ee0
[  604.040096] PKRU: 55555554
[  604.040097] Call Trace:
[  604.040124]  i915_gem_do_execbuffer+0x591/0x1020 [i915]
[  604.040131]  ? shmem_getpage_gfp+0x7a1/0xcf0
[  604.040135]  ? is_bpf_text_address+0xa/0x20
[  604.040138]  ? __save_stack_trace+0x92/0x100
[  604.040140]  ? create_object+0x24e/0x300
[  604.040161]  i915_gem_execbuffer2_ioctl+0xe7/0x340 [i915]
[  604.040180]  ? i915_gem_execbuffer_ioctl+0x2b0/0x2b0 [i915]
[  604.040184]  drm_ioctl_kernel+0x67/0xb0
[  604.040186]  drm_ioctl+0x2d4/0x3c0
[  604.040205]  ? i915_gem_execbuffer_ioctl+0x2b0/0x2b0 [i915]
[  604.040208]  ? vma_merge+0xc8/0x330
[  604.040211]  do_vfs_ioctl+0xa2/0x610
[  604.040214]  SyS_ioctl+0x74/0x80
[  604.040217]  do_syscall_64+0x6e/0x120
[  604.040220]  entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[  604.040222] RIP: 0033:0x7f7ceb91aef7
[  604.040222] RSP: 002b:00007ffebad27fd8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[  604.040224] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007f7ceb91aef7
[  604.040225] RDX: 00007ffebad28090 RSI: 0000000040406469 RDI: 0000000000000003
[  604.040225] RBP: 00007ffebad28090 R08: ffffffffffffffff R09: 0000000000000000
[  604.040226] R10: 000000000000049e R11: 0000000000000246 R12: 0000000040406469
[  604.040227] R13: 0000000000000003 R14: 00007ffebad28090 R15: 0000000002000000
[  604.040228] Code: ff ff ff e9 75 11 f1 ff b9 f2 ff ff ff e9 7d 11 f1 ff b8 f2 ff ff ff 40 30 f6 e9 d3 3a f2 ff b8 f2 ff ff ff 30 d2 e9 db 3a f2 ff <ba> f2 ff ff ff e9 6b 55 f2 ff b8 f2 ff ff ff e9 1c 5a f2 ff b8
[  641.307239] Setting dangerous option reset - tainting kernel
[  643.512634] Setting dangerous option reset - tainting kernel
[  653.165474] Setting dangerous option reset - tainting kernel
[  673.001889] Setting dangerous option reset - tainting kernel
[  676.264418] Setting dangerous option reset - tainting kernel
[  700.039987] watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [gem_exec_reloc:9761]
[  700.039992] Modules linked in: snd_hda_codec_hdmi bnep 8250_dw nls_iso8859_1 arc4 snd_soc_skl x86_pkg_temp_thermal intel_powerclamp snd_soc_skl_ipc coretemp snd_soc_sst_ipc snd_soc_sst_dsp kvm_intel snd_hda_ext_core snd_hda_codec_realtek snd_soc_acpi snd_hda_codec_generic kvm snd_soc_core snd_compress snd_pcm_dmaengine irqbypass ac97_bus crct10dif_pclmul iwlmvm crc32_pclmul ghash_clmulni_intel pcbc mac80211 aesni_intel snd_hda_intel aes_x86_64 crypto_simd glue_helper cryptd snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi serio_raw snd_seq wmi_bmof btusb btrtl snd_seq_device btbcm iwlwifi asix snd_timer btintel usbnet mii bluetooth snd input_leds soundcore ecdh_generic idma64 shpchp virt_dma mei_me intel_lpss_pci cfg80211 mei intel_pch_thermal intel_lpss acpi_pad
[  700.040022]  mac_hid parport_pc ppdev lp parport ip_tables x_tables autofs4 uas usb_storage hid_generic usbhid hid i915 dwc3 udc_core ulpi e1000e dwc3_pci prime_numbers wmi video
[  700.040032] CPU: 2 PID: 9761 Comm: gem_exec_reloc Tainted: G     U  W    L   4.16.0-rc6-drm-intel-qa-ww12-commit-4db112a+ #1
[  700.040033] Hardware name: Intel Corporation CannonLake Client Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X124.B02.1802051422 02/05/2018
[  700.040077] RIP: 0010:i915_exit+0x44/0x3b7 [i915]
[  700.040077] RSP: 0018:ffffbba983da78d0 EFLAGS: 00050246 ORIG_RAX: ffffffffffffff12
[  700.040079] RAX: 0000000000000000 RBX: ffffbba983da7b58 RCX: ffff9eb232a2d004
[  700.040079] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  700.040080] RBP: ffff9eb201f1a400 R08: 0000000000001000 R09: 00000000021c42c8
[  700.040081] R10: ffff9eb201f1a400 R11: 0000000000000001 R12: 00007f80c7a13a10
[  700.040082] R13: 00007f80c7a13910 R14: ffffbba983da79e0 R15: ffff9eb1d976d700
[  700.040083] FS:  00007f81075588c0(0000) GS:ffff9eb23f900000(0000) knlGS:0000000000000000
[  700.040083] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  700.040084] CR2: 00007f80c7a13910 CR3: 00000002af480006 CR4: 0000000000760ee0
[  700.040085] PKRU: 55555554
[  700.040085] Call Trace:
[  700.040104]  i915_gem_do_execbuffer+0x591/0x1020 [i915]
[  700.040108]  ? is_bpf_text_address+0xa/0x20
[  700.040111]  ? __save_stack_trace+0x92/0x100
[  700.040113]  ? create_object+0x24e/0x300
[  700.040128]  i915_gem_execbuffer2_ioctl+0xe7/0x340 [i915]
[  700.040143]  ? i915_gem_execbuffer_ioctl+0x2b0/0x2b0 [i915]
[  700.040146]  drm_ioctl_kernel+0x67/0xb0
[  700.040147]  drm_ioctl+0x2d4/0x3c0
[  700.040161]  ? i915_gem_execbuffer_ioctl+0x2b0/0x2b0 [i915]
[  700.040164]  ? vma_merge+0xc8/0x330
[  700.040167]  do_vfs_ioctl+0xa2/0x610
[  700.040169]  SyS_ioctl+0x74/0x80
[  700.040171]  do_syscall_64+0x6e/0x120
[  700.040173]  entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[  700.040174] RIP: 0033:0x7f8105738ef7
[  700.040175] RSP: 002b:00007ffd7a1aef18 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[  700.040176] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007f8105738ef7
[  700.040177] RDX: 00007ffd7a1aefd0 RSI: 0000000040406469 RDI: 0000000000000003
[  700.040177] RBP: 00007ffd7a1aefd0 R08: ffffffffffffffff R09: 0000000000000000
[  700.040178] R10: 000000000000049e R11: 0000000000000246 R12: 0000000040406469
[  700.040179] R13: 0000000000000003 R14: 00007ffd7a1aefd0 R15: 0000000004000000
[  700.040180] Code: ff ff ff e9 75 11 f1 ff b9 f2 ff ff ff e9 7d 11 f1 ff b8 f2 ff ff ff 40 30 f6 e9 d3 3a f2 ff b8 f2 ff ff ff 30 d2 e9 db 3a f2 ff <ba> f2 ff ff ff e9 6b 55 f2 ff b8 f2 ff ff ff e9 1c 5a f2 ff b8
[  789.011827] Setting dangerous option reset - tainting kernel
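
For anyone triaging a saved copy of this dmesg, a minimal Python sketch for pulling out the soft-lockup reports (timestamp, CPU, stuck time, task). This is only an illustration and not how IGT or the CI classifies a dmesg-warn; the function name find_soft_lockups is hypothetical, and the pattern assumes the "watchdog: BUG: soft lockup" wording exactly as it appears in the log above.

```python
import re

# Matches the watchdog header as it appears in the log above, e.g.
# "[  604.039990] watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [gem_exec_reloc:2144]"
# The "[comm:pid]" part is optional and may sit on the next line if the
# log was wrapped by a mail client (\s* also matches a newline).
LOCKUP_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\] watchdog: BUG: soft lockup - "
    r"CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s!\s*"
    r"(?:\[(?P<comm>[^:\]]+):(?P<pid>\d+)\])?"
)


def find_soft_lockups(dmesg_text):
    """Return one dict per soft-lockup header found in dmesg_text."""
    return [
        {
            "timestamp": float(m["ts"]),
            "cpu": int(m["cpu"]),
            "stuck_seconds": int(m["secs"]),
            "comm": m["comm"],  # None if no [comm:pid] follows the header
            "pid": int(m["pid"]) if m["pid"] else None,
        }
        for m in LOCKUP_RE.finditer(dmesg_text)
    ]
```

Run against the log in this comment, it should report two lockups, both in gem_exec_reloc (PIDs 2144 and 9761).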
