[Intel-gfx] ✗ Fi.CI.BAT: failure for Stop users from using the device on driver unbind

Chris Wilson chris at chris-wilson.co.uk
Fri Apr 5 16:26:46 UTC 2019


Quoting Patchwork (2019-04-05 17:20:39)
> == Series Details ==
> 
> Series: Stop users from using the device on driver unbind
> URL   : https://patchwork.freedesktop.org/series/59064/
> State : failure
> 
> == Summary ==
> 
> CI Bug Log - changes from CI_DRM_5881 -> Patchwork_12699
> ====================================================
> 
> Summary
> -------
> 
>   **FAILURE**
> 
>   Serious unknown changes coming with Patchwork_12699 absolutely need to be
>   verified manually.
>   
>   If you think the reported changes have nothing to do with the changes
>   introduced in Patchwork_12699, please notify your bug team to allow them
>   to document this new failure mode, which will reduce false positives in CI.
> 
>   External URL: https://patchwork.freedesktop.org/api/1.0/series/59064/revisions/1/mbox/
> 
> Possible new issues
> -------------------
> 
>   Here are the unknown changes that may have been introduced in Patchwork_12699:
> 
> ### IGT changes ###
> 
> #### Possible regressions ####
> 
>   * igt at i915_module_load@reload:

2 issues, it appears:

<4> [271.799080] WARN_ON(dev_priv->mm.object_count)
<4> [271.799241] WARNING: CPU: 0 PID: 3288 at drivers/gpu/drm/i915/i915_gem.c:5145 i915_gem_cleanup_early+0x104/0x110 [i915]
<4> [271.799249] Modules linked in: vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic i915(-) mei_hdcp x86_pkg_temp_thermal btusb coretemp btrtl btbcm btintel bluetooth crct10dif_pclmul crc32_pclmul snd_hda_codec snd_hwdep ghash_clmulni_intel snd_hda_core e1000e ecdh_generic snd_pcm mei_me ptp prime_numbers pps_core mei [last unloaded: snd_hda_intel]
<4> [271.799302] CPU: 0 PID: 3288 Comm: i915_module_loa Tainted: G     U            5.1.0-rc3-CI-Patchwork_12699+ #1
<4> [271.799307] Hardware name:  /NUC6i7KYB, BIOS KYSKLi70.86A.0059.2018.1122.1431 11/22/2018
<4> [271.799406] RIP: 0010:i915_gem_cleanup_early+0x104/0x110 [i915]
<4> [271.799412] Code: 00 00 48 c7 c2 d0 6b 3d a0 48 c7 c7 ca 5c 2c a0 e8 c1 b5 ec e0 0f 0b 48 c7 c6 68 c0 3f a0 48 c7 c7 63 88 42 a0 e8 9c 77 de e0 <0f> 0b e9 40 ff ff ff 0f 1f 44 00 00 e8 5b 7e 00 00 31 c0 c3 0f 1f
<4> [271.799417] RSP: 0018:ffffc90000453dd0 EFLAGS: 00010282
<4> [271.799423] RAX: 0000000000000000 RBX: ffff88849afd0000 RCX: 0000000000000000
<4> [271.799428] RDX: 0000000000000006 RSI: ffff88849ee130b8 RDI: ffffffff8211dc4d
<4> [271.799432] RBP: ffff88849afd7630 R08: 00000000028bc995 R09: 0000000000000000
<4> [271.799436] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffa04a81e0
<4> [271.799440] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffffa04a82d0
<4> [271.799446] FS:  00007f31e8cec980(0000) GS:ffff8884aee00000(0000) knlGS:0000000000000000
<4> [271.799451] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [271.799455] CR2: 00007ffea58773d8 CR3: 000000044cfc6003 CR4: 00000000003606f0
<4> [271.799459] Call Trace:
<4> [271.799531]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [271.799603]  i915_driver_release+0xa/0x30 [i915]
<4> [271.799672]  i915_driver_unload+0x6a/0x120 [i915]
<4> [271.799748]  i915_pci_remove+0x19/0x30 [i915]
<4> [271.799765]  pci_device_remove+0x36/0xb0

we leaked objects (don't recall that in recent times)

and

<4> [271.812054] general protection fault: 0000 [#1] PREEMPT SMP PTI
<4> [271.812079] CPU: 3 PID: 3288 Comm: i915_module_loa Tainted: G     U  W         5.1.0-rc3-CI-Patchwork_12699+ #1
<4> [271.812094] Hardware name:  /NUC6i7KYB, BIOS KYSKLi70.86A.0059.2018.1122.1431 11/22/2018
<4> [271.812172] RIP: 0010:intel_uncore_forcewake_get+0x0/0xa0 [i915]
<4> [271.812183] Code: e8 fd ff ff 66 0f 1f 44 00 00 48 83 c7 60 e9 17 e0 d9 e0 0f 1f 80 00 00 00 00 e9 5b ed f6 ff 90 66 2e 0f 1f 84 00 00 00 00 00 <48> 83 7f 78 00 74 49 41 55 41 54 55 53 89 f5 4c 8b 67 08 48 89 fb
<4> [271.812208] RSP: 0018:ffffc90000453d78 EFLAGS: 00010282
<4> [271.812217] RAX: 6b6b6b6b6b6b6b6b RBX: ffff88849afd75f0 RCX: 0000000000000000
<4> [271.812228] RDX: 0000000000000001 RSI: 00000000000001ff RDI: 6b6b6b6b6b6b7a1b
<4> [271.812239] RBP: ffff888498992158 R08: 000000000029049a R09: ffff8884ac8af000
<4> [271.812249] R10: ffff8884ac8afc40 R11: ffff8884ad266f98 R12: ffff88849afd7630
<4> [271.812260] R13: ffff88849afd0000 R14: 0000000000000000 R15: ffffffffa04a82d0
<4> [271.812271] FS:  00007f31e8cec980(0000) GS:ffff8884aeec0000(0000) knlGS:0000000000000000
<4> [271.812283] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [271.812292] CR2: 000055f1c5db3128 CR3: 000000044cfc6004 CR4: 00000000003606e0
<4> [271.812303] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
<4> [271.812314] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
<4> [271.812324] Call Trace:
<4> [271.812367]  reset_prepare+0x34/0x60 [i915]
<4> [271.812411]  __i915_gem_set_wedged.part.4+0x55/0x190 [i915]
<4> [271.812425]  ? _raw_spin_unlock_irqrestore+0x39/0x60
<4> [271.812469]  i915_gem_set_wedged+0x56/0x60 [i915]
<4> [271.812511]  i915_driver_unload+0x72/0x120 [i915]
<4> [271.812553]  i915_pci_remove+0x19/0x30 [i915]
<4> [271.812565]  pci_device_remove+0x36/0xb0
<4> [271.812574]  device_release_driver_internal+0xdf/0x1d0
<4> [271.812584]  driver_detach+0x3e/0x80
<4> [271.812593]  bus_remove_driver+0x53/0xd0
<4> [271.812602]  pci_unregister_driver+0x25/0xa0
<4> [271.812669]  i915_exit+0x16/0x1c [i915]

where we try to wedge but the device is full of poison 0x6b. We are not
even in the fault-injection test, so that device should be after a
successful load.

Hmm, certainly didn't expect the latter. 
-Chris


More information about the Intel-gfx mailing list