[Bug 103036] New: [BAT] igt at gem_exec_suspend@basic-s3 causing circular dependency

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Sep 29 07:47:15 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=103036

            Bug ID: 103036
           Summary: [BAT] igt at gem_exec_suspend@basic-s3 causing circular
                    dependency
           Product: DRI
           Version: DRI git
          Hardware: Other
                OS: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: DRM/Intel
          Assignee: intel-gfx-bugs at lists.freedesktop.org
          Reporter: jani.saarinen at intel.com
        QA Contact: intel-gfx-bugs at lists.freedesktop.org
                CC: intel-gfx-bugs at lists.freedesktop.org

On CI_DRM_3152 on cnl system in CI igt at gem_exec_suspend@basic-s3
causing error.

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3152/fi-cnl-y/igt@gem_exec_suspend@basic-s3.html

Results for igt at gem_exec_suspend@basic-s3

Result: dmesg-warn
Out     
IGT-Version: 1.19-g3df22e0d (x86_64) (Linux: 4.14.0-rc2-CI-CI_DRM_3152+ x86_64)
rtcwake: wakeup from "mem" using /dev/rtc0 at Thu Sep 28 20:29:11 2017
Subtest basic-S3: SUCCESS (9.471s)

Environment     
PIGLIT_PLATFORM="mixed_glx_egl" PIGLIT_SOURCE_DIR="/opt/igt/piglit"
Command 
/opt/igt/libexec/intel-gpu-tools/gem_exec_suspend --run-subtest basic-S3

Dmesg   
[  277.886969] Setting dangerous option reset - tainting kernel
[  279.334108] Suspending console(s) (use no_console_suspend to debug)
[  279.890551] sd 0:0:0:0: [sda] Synchronizing SCSI cache
[  283.912201] snd_hda_intel 0000:00:1f.3: azx_get_response timeout, switching
to polling mode: last cmd=0x201f0500
[  284.914685] snd_hda_intel 0000:00:1f.3: No response from codec, disabling
MSI: last cmd=0x201f0500
[  285.918189] snd_hda_intel 0000:00:1f.3: azx_get_response timeout, switching
to single_cmd mode: last cmd=0x201f0500

[  286.158699] ======================================================
[  286.158700] WARNING: possible circular locking dependency detected
[  286.158703] 4.14.0-rc2-CI-CI_DRM_3152+ #1 Tainted: G     U         
[  286.158704] ------------------------------------------------------
[  286.158706] rtcwake/3262 is trying to acquire lock:
[  286.158708]  ((complete)&st->done){+.+.}, at: [<ffffffff81909e0d>]
wait_for_completion+0x1d/0x20
[  286.158718] 
               but task is already holding lock:
[  286.158719]  (sparse_irq_lock){+.+.}, at: [<ffffffff810f22e7>]
irq_lock_sparse+0x17/0x20
[  286.158726] 
               which lock already depends on the new lock.

[  286.158727] 
               the existing dependency chain (in reverse order) is:
[  286.158728] 
               -> #1 (sparse_irq_lock){+.+.}:
[  286.158734]        __mutex_lock+0x86/0x9b0
[  286.158736]        mutex_lock_nested+0x1b/0x20
[  286.158740]        irq_lock_sparse+0x17/0x20
[  286.158743]        irq_affinity_online_cpu+0x18/0xd0
[  286.158746]        cpuhp_invoke_callback+0xa3/0x840
[  286.158746] 
               -> #0 ((complete)&st->done){+.+.}:
[  286.158752]        check_prev_add+0x430/0x840
[  286.158755]        __lock_acquire+0x1420/0x15e0
[  286.158758]        lock_acquire+0xb0/0x200
[  286.158760]        wait_for_common+0x58/0x210
[  286.158762]        wait_for_completion+0x1d/0x20
[  286.158764]        takedown_cpu+0x89/0xf0
[  286.158766]        cpuhp_invoke_callback+0xa3/0x840
[  286.158769]        cpuhp_down_callbacks+0x42/0x80
[  286.158770]        _cpu_down+0xb9/0xf0
[  286.158773]        freeze_secondary_cpus+0xa3/0x390
[  286.158775]        suspend_devices_and_enter+0x2fd/0xce0
[  286.158778]        pm_suspend+0x4f0/0x9d0
[  286.158779]        state_store+0x82/0xf0
[  286.158783]        kobj_attr_store+0xf/0x20
[  286.158787]        sysfs_kf_write+0x45/0x60
[  286.158790]        kernfs_fop_write+0x124/0x1c0
[  286.158792]        __vfs_write+0x28/0x130
[  286.158794]        vfs_write+0xcb/0x1c0
[  286.158796]        SyS_write+0x49/0xb0
[  286.158799]        entry_SYSCALL_64_fastpath+0x1c/0xb1
[  286.158800] 
               other info that might help us debug this:

[  286.158800]  Possible unsafe locking scenario:

[  286.158801]        CPU0                    CPU1
[  286.158802]        ----                    ----
[  286.158803]   lock(sparse_irq_lock);
[  286.158805]                                lock((complete)&st->done);
[  286.158807]                                lock(sparse_irq_lock);
[  286.158809]   lock((complete)&st->done);
[  286.158811] 
                *** DEADLOCK ***

[  286.158813] 8 locks held by rtcwake/3262:
[  286.158814]  #0:  (sb_writers#5){.+.+}, at: [<ffffffff81220701>]
vfs_write+0x171/0x1c0
[  286.158820]  #1:  (&of->mutex){+.+.}, at: [<ffffffff812a39d2>]
kernfs_fop_write+0xf2/0x1c0
[  286.158826]  #2:  (kn->count#210){.+.+}, at: [<ffffffff812a39db>]
kernfs_fop_write+0xfb/0x1c0
[  286.158832]  #3:  (pm_mutex){+.+.}, at: [<ffffffff810e60a9>]
pm_suspend+0xa9/0x9d0
[  286.158837]  #4:  (acpi_scan_lock){+.+.}, at: [<ffffffff8153b8b7>]
acpi_scan_lock_acquire+0x17/0x20
[  286.158844]  #5:  (cpu_add_remove_lock){+.+.}, at: [<ffffffff810811ce>]
freeze_secondary_cpus+0x2e/0x390
[  286.158849]  #6:  (cpu_hotplug_lock.rw_sem){++++}, at: [<ffffffff810d676b>]
percpu_down_write+0x2b/0x110
[  286.158855]  #7:  (sparse_irq_lock){+.+.}, at: [<ffffffff810f22e7>]
irq_lock_sparse+0x17/0x20
[  286.158861] 
               stack backtrace:
[  286.158864] CPU: 2 PID: 3262 Comm: rtcwake Tainted: G     U         
4.14.0-rc2-CI-CI_DRM_3152+ #1
[  286.158866] Hardware name: Intel Corporation CannonLake Client
Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X095.A01.1707101424
07/10/2017
[  286.158867] Call Trace:
[  286.158872]  dump_stack+0x68/0x9f
[  286.158876]  print_circular_bug+0x235/0x3c0
[  286.158879]  ? lockdep_init_map_crosslock+0x20/0x20
[  286.158882]  check_prev_add+0x430/0x840
[  286.158888]  __lock_acquire+0x1420/0x15e0
[  286.158890]  ? __lock_acquire+0x1420/0x15e0
[  286.158893]  ? lockdep_init_map_crosslock+0x20/0x20
[  286.158897]  lock_acquire+0xb0/0x200
[  286.158900]  ? wait_for_completion+0x1d/0x20
[  286.158903]  wait_for_common+0x58/0x210
[  286.158905]  ? wait_for_completion+0x1d/0x20
[  286.158907]  ? cpuhp_invoke_callback+0x840/0x840
[  286.158912]  ? stop_machine_cpuslocked+0xc1/0xd0
[  286.158914]  ? cpuhp_invoke_callback+0x840/0x840
[  286.158916]  wait_for_completion+0x1d/0x20
[  286.158919]  takedown_cpu+0x89/0xf0
[  286.158921]  ? cpuhp_complete_idle_dead+0x20/0x20
[  286.158924]  cpuhp_invoke_callback+0xa3/0x840
[  286.158927]  cpuhp_down_callbacks+0x42/0x80
[  286.158930]  _cpu_down+0xb9/0xf0
[  286.158933]  freeze_secondary_cpus+0xa3/0x390
[  286.158936]  suspend_devices_and_enter+0x2fd/0xce0
[  286.158940]  pm_suspend+0x4f0/0x9d0
[  286.158943]  state_store+0x82/0xf0
[  286.158947]  kobj_attr_store+0xf/0x20
[  286.158950]  sysfs_kf_write+0x45/0x60
[  286.158953]  kernfs_fop_write+0x124/0x1c0
[  286.158956]  __vfs_write+0x28/0x130
[  286.158960]  ? rcu_read_lock_sched_held+0x7a/0x90
[  286.158962]  ? rcu_sync_lockdep_assert+0x2f/0x60
[  286.158965]  ? __sb_start_write+0x108/0x200
[  286.158968]  vfs_write+0xcb/0x1c0
[  286.158970]  SyS_write+0x49/0xb0
[  286.158974]  entry_SYSCALL_64_fastpath+0x1c/0xb1
[  286.158977] RIP: 0033:0x7f2b7e8378f0
[  286.158979] RSP: 002b:00007fffb950fe08 EFLAGS: 00000246 ORIG_RAX:
0000000000000001
[  286.158981] RAX: ffffffffffffffda RBX: ffffffff81493023 RCX:
00007f2b7e8378f0
[  286.158983] RDX: 0000000000000004 RSI: 0000558d2b42e060 RDI:
0000000000000007
[  286.158984] RBP: ffffc90000293f88 R08: 0000558d2b42bdc0 R09:
00007f2b7ed0b700
[  286.158986] R10: 00007f2b7eb00b58 R11: 0000000000000246 R12:
0000558d2b42bce0
[  286.158987] R13: 0000000000000001 R14: 0000000000000004 R15:
0000000000000004
[  286.158993]  ? __this_cpu_preempt_check+0x13/0x20
[  286.169772] IRQ 32: no longer affine to CPU2
[  286.169781] IRQ 120: no longer affine to CPU2
[  286.180638] IRQ 1: no longer affine to CPU3
[  286.180646] IRQ 8: no longer affine to CPU3
[  286.180651] IRQ 9: no longer affine to CPU3
[  286.180672] IRQ 12: no longer affine to CPU3
[  286.180699] IRQ 16: no longer affine to CPU3
[  286.180726] IRQ 17: no longer affine to CPU3
[  286.180761] IRQ 122: no longer affine to CPU3
[  286.195291]  cache: parent cpu1 should not be sleeping
[  286.198990]  cache: parent cpu2 should not be sleeping
[  286.202167]  cache: parent cpu3 should not be sleeping
[  286.231235] ACPI: button: The lid device is not compliant to SW_LID.
[  286.406396] HDA: we are doing full chip reset now
[  286.423171] snd_hda_codec_hdmi hdaudioC0D2: Unable to sync register
0x2f0d00. -5
[  286.440300] CSR SSP Base Not fine
[  286.440315] ------------[ cut here ]------------
[  286.440334] WARNING: CPU: 2 PID: 5 at
drivers/gpu/drm/i915/intel_runtime_pm.c:577 assert_csr_loaded+0x8e/0xc0 [i915]
[  286.440335] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek
snd_hda_codec_generic i915 snd_hda_intel snd_hda_codec asix snd_hwdep usbnet
snd_hda_core mii e1000e snd_pcm x86_pkg_temp_thermal intel_powerclamp coretemp
crct10dif_pclmul crc32_pclmul ptp ghash_clmulni_intel pps_core prime_numbers
i2c_hid
[  286.440354] CPU: 2 PID: 5 Comm: kworker/u8:0 Tainted: G     U         
4.14.0-rc2-CI-CI_DRM_3152+ #1
[  286.440355] Hardware name: Intel Corporation CannonLake Client
Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X095.A01.1707101424
07/10/2017
[  286.440358] Workqueue: events_unbound async_run_entry_fn
[  286.440360] task: ffff8802677d0040 task.stack: ffffc90000068000
[  286.440377] RIP: 0010:assert_csr_loaded+0x8e/0xc0 [i915]
[  286.440378] RSP: 0000:ffffc9000006bac0 EFLAGS: 00010282
[  286.440380] RAX: 0000000000000015 RBX: ffff880255510000 RCX:
0000000000000000
[  286.440381] RDX: 0000000080000001 RSI: 0000000000000001 RDI:
ffffffff810eef26
[  286.440382] RBP: ffffc9000006bac8 R08: 0000000000000001 R09:
0000000000000000
[  286.440382] R10: 0000000000000000 R11: 0000000000000000 R12:
ffff880255510000
[  286.440383] R13: ffffffffa0258d38 R14: ffff880255510000 R15:
0000000400000000
[  286.440385] FS:  0000000000000000(0000) GS:ffff880271100000(0000)
knlGS:0000000000000000
[  286.440386] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  286.440387] CR2: 0000000000000000 CR3: 0000000003e10001 CR4:
00000000006606e0
[  286.440387] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[  286.440388] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
0000000000000400
[  286.440389] PKRU: 00000000
[  286.440390] Call Trace:
[  286.440408]  skl_enable_dc6+0x40/0xb0 [i915]
[  286.440425]  gen9_dc_off_power_well_disable+0x31/0x40 [i915]
[  286.440442]  intel_power_well_disable+0x3e/0x50 [i915]
[  286.440459]  intel_display_power_put+0xb5/0x110 [i915]
[  286.440490]  intel_display_set_init_power+0x2a/0x40 [i915]
[  286.440516]  intel_modeset_setup_hw_state+0xc5b/0xda0 [i915]
[  286.440519]  ? ww_mutex_lock+0x42/0xb0
[  286.440544]  __intel_display_resume+0x1f/0xc0 [i915]
[  286.440568]  intel_display_resume+0xc7/0x100 [i915]
[  286.440585]  i915_pm_restore+0xf4/0x190 [i915]
[  286.440600]  i915_pm_resume+0xe/0x10 [i915]
[  286.440603]  pci_pm_resume+0x74/0xb0
[  286.440606]  dpm_run_callback+0x6f/0x310
[  286.440608]  ? pci_pm_suspend+0x140/0x140
[  286.440611]  device_resume+0xb4/0x1e0
[  286.440614]  ? dpm_watchdog_set+0x70/0x70
[  286.440617]  async_resume+0x1d/0x50
[  286.440619]  async_run_entry_fn+0x38/0x160
[  286.440622]  process_one_work+0x233/0x660
[  286.440625]  worker_thread+0x4e/0x3b0
[  286.440628]  kthread+0x152/0x190
[  286.440629]  ? process_one_work+0x660/0x660
[  286.440631]  ? kthread_create_on_node+0x40/0x40
[  286.440633]  ret_from_fork+0x27/0x40
[  286.440636] Code: 05 77 7d 15 00 01 e8 f3 83 fb e0 0f ff eb af 80 3d 66 7d
15 00 00 75 bd 48 c7 c7 f7 82 24 a0 c6 05 56 7d 15 00 01 e8 d3 83 fb e0 <0f> ff
eb a6 80 3d 45 7d 15 00 00 75 b4 48 c7 c7 0e 83 24 a0 c6 
[  286.440674] ---[ end trace 623896c0d288792c ]---
[  286.440676] CSR HTP Not fine
[  286.440686] ------------[ cut here ]------------
[  286.440704] WARNING: CPU: 2 PID: 5 at
drivers/gpu/drm/i915/intel_runtime_pm.c:578 assert_csr_loaded+0xae/0xc0 [i915]
[  286.440705] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek
snd_hda_codec_generic i915 snd_hda_intel snd_hda_codec asix snd_hwdep usbnet
snd_hda_core mii e1000e snd_pcm x86_pkg_temp_thermal intel_powerclamp coretemp
crct10dif_pclmul crc32_pclmul ptp ghash_clmulni_intel pps_core prime_numbers
i2c_hid
[  286.440720] CPU: 2 PID: 5 Comm: kworker/u8:0 Tainted: G     U  W      
4.14.0-rc2-CI-CI_DRM_3152+ #1
[  286.440721] Hardware name: Intel Corporation CannonLake Client
Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X095.A01.1707101424
07/10/2017
[  286.440724] Workqueue: events_unbound async_run_entry_fn
[  286.440725] task: ffff8802677d0040 task.stack: ffffc90000068000
[  286.440742] RIP: 0010:assert_csr_loaded+0xae/0xc0 [i915]
[  286.440743] RSP: 0000:ffffc9000006bac0 EFLAGS: 00010282
[  286.440744] RAX: 0000000000000010 RBX: ffff880255510000 RCX:
0000000000000000
[  286.440745] RDX: 0000000080000001 RSI: 0000000000000001 RDI:
ffffffff810eef26
[  286.440746] RBP: ffffc9000006bac8 R08: 0000000000000001 R09:
0000000000000000
[  286.440747] R10: 0000000000000000 R11: 0000000000000000 R12:
ffff880255510000
[  286.440748] R13: ffffffffa0258d38 R14: ffff880255510000 R15:
0000000400000000
[  286.440749] FS:  0000000000000000(0000) GS:ffff880271100000(0000)
knlGS:0000000000000000
[  286.440750] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  286.440751] CR2: 0000000000000000 CR3: 0000000003e10001 CR4:
00000000006606e0
[  286.440752] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[  286.440753] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
0000000000000400
[  286.440753] PKRU: 00000000
[  286.440754] Call Trace:
[  286.440771]  skl_enable_dc6+0x40/0xb0 [i915]
[  286.440787]  gen9_dc_off_power_well_disable+0x31/0x40 [i915]
[  286.440803]  intel_power_well_disable+0x3e/0x50 [i915]
[  286.440820]  intel_display_power_put+0xb5/0x110 [i915]
[  286.440836]  intel_display_set_init_power+0x2a/0x40 [i915]
[  286.440860]  intel_modeset_setup_hw_state+0xc5b/0xda0 [i915]
[  286.440861]  ? ww_mutex_lock+0x42/0xb0
[  286.440885]  __intel_display_resume+0x1f/0xc0 [i915]
[  286.440907]  intel_display_resume+0xc7/0x100 [i915]
[  286.440924]  i915_pm_restore+0xf4/0x190 [i915]
[  286.440939]  i915_pm_resume+0xe/0x10 [i915]
[  286.440941]  pci_pm_resume+0x74/0xb0
[  286.440944]  dpm_run_callback+0x6f/0x310
[  286.440946]  ? pci_pm_suspend+0x140/0x140
[  286.440949]  device_resume+0xb4/0x1e0
[  286.440951]  ? dpm_watchdog_set+0x70/0x70
[  286.440955]  async_resume+0x1d/0x50
[  286.440957]  async_run_entry_fn+0x38/0x160
[  286.440959]  process_one_work+0x233/0x660
[  286.440962]  worker_thread+0x4e/0x3b0
[  286.440964]  kthread+0x152/0x190
[  286.440965]  ? process_one_work+0x660/0x660
[  286.440967]  ? kthread_create_on_node+0x40/0x40
[  286.440969]  ret_from_fork+0x27/0x40
[  286.440972] Code: 05 56 7d 15 00 01 e8 d3 83 fb e0 0f ff eb a6 80 3d 45 7d
15 00 00 75 b4 48 c7 c7 0e 83 24 a0 c6 05 35 7d 15 00 01 e8 b3 83 fb e0 <0f> ff
eb 9d 0f 1f 40 00 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 
[  286.441010] ---[ end trace 623896c0d288792d ]---
[  286.655622] atkbd serio0: Failed to deactivate keyboard on isa0060/serio0
[  286.686680] snd_hda_codec_realtek hdaudioC0D0: Unable to sync register
0x2b8000. -5
[  286.686799] snd_hda_codec_realtek hdaudioC0D0: Unable to sync register
0x2b8000. -5
[  287.039766] atkbd serio0: Failed to enable keyboard on isa0060/serio0
[  287.368209] Setting dangerous option reset - tainting kernel

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20170929/bb726a3f/attachment-0001.html>


More information about the intel-gfx-bugs mailing list