[Bug 102037] New: [KBL][IGT] kernel BUG at drivers/gpu/drm/i915/intel_lrc.c:539!

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Aug 4 14:45:16 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=102037

            Bug ID: 102037
           Summary: [KBL][IGT] kernel BUG at drivers/gpu/drm/i915/intel_lrc.c:539!
           Product: DRI
           Version: XOrg git
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: major
          Priority: medium
         Component: DRM/Intel
          Assignee: intel-gfx-bugs at lists.freedesktop.org
          Reporter: tomi.p.sarvela at intel.com
        QA Contact: intel-gfx-bugs at lists.freedesktop.org
                CC: intel-gfx-bugs at lists.freedesktop.org

Boom on an Intel Kaby Lake NUC7i5BNH during IGT testing. Recent (today's)
DRM-Tip with a lot of debug enabled.

Short version:
ickle> first 2, file a bug
ickle> context-switch interrupt when waking up before we even have submitted
anything?

<6>[   33.522783] PM: late suspend of devices complete after 16.768 msecs
<6>[   33.554237] PM: noirq suspend of devices complete after 31.445 msecs
<6>[   33.555474] ACPI: Preparing to enter system sleep state S3
<6>[   33.564375] ACPI: EC: event blocked
<6>[   33.564378] ACPI: EC: EC stopped
<6>[   33.564380] PM: Saving platform NVS memory
<6>[   33.564716] Disabling non-boot CPUs ...
<6>[   33.578858] smpboot: CPU 1 is now offline
<6>[   33.591791] smpboot: CPU 2 is now offline
<4>[   33.601468] IRQ 8: no longer affine to CPU3
<4>[   33.601481] IRQ 9: no longer affine to CPU3
<4>[   33.601493] IRQ 126: no longer affine to CPU3
<4>[   33.601499] IRQ 132: no longer affine to CPU3
<6>[   33.602611] smpboot: CPU 3 is now offline
<6>[   33.606640] ACPI: Low-level resume complete
<6>[   33.606831] ACPI: EC: EC started
<6>[   33.606832] PM: Restoring platform NVS memory
<6>[   33.607698] Suspended for 16.777 seconds
<6>[   33.607755] Enabling non-boot CPUs ...
<6>[   33.607803] x86: Booting SMP configuration:
<6>[   33.607804] smpboot: Booting Node 0 Processor 1 APIC 0x2
<4>[   33.609060]  cache: parent cpu1 should not be sleeping
<6>[   33.609563] CPU1 is up
<6>[   33.609587] smpboot: Booting Node 0 Processor 2 APIC 0x1
<4>[   33.610591]  cache: parent cpu2 should not be sleeping
<6>[   33.611006] CPU2 is up
<6>[   33.611029] smpboot: Booting Node 0 Processor 3 APIC 0x3
<4>[   33.611871]  cache: parent cpu3 should not be sleeping
<6>[   33.612319] CPU3 is up
<6>[   33.615531] ACPI: Waking up from system sleep state S3
<6>[   33.685469] PM: noirq resume of devices complete after 25.560 msecs
<7>[   33.685673] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 00
<7>[   33.685722] [drm:intel_power_well_enable [i915]] enabling power well 1
<7>[   33.685768] [drm:intel_power_well_enable [i915]] enabling MISC IO power well
<7>[   33.685824] [drm:skl_init_cdclk [i915]] Sanitizing cdclk programmed by pre-os
<7>[   33.687722] [drm:intel_update_cdclk [i915]] Current CD clock rate: 337500 kHz, VCO: 8100000 kHz, ref: 24000 kHz
<7>[   33.688549] [drm:intel_power_well_enable [i915]] enabling always-on
<7>[   33.688576] [drm:intel_power_well_enable [i915]] enabling DC off
<7>[   33.688604] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 00
<7>[   33.688636] [drm:intel_power_well_enable [i915]] enabling power well 2
<7>[   33.688674] [drm:intel_power_well_enable [i915]] enabling DDI A/E IO power well
<7>[   33.688702] [drm:intel_power_well_enable [i915]] enabling DDI B IO power well
<7>[   33.688729] [drm:intel_power_well_enable [i915]] enabling DDI C IO power well
<7>[   33.688757] [drm:intel_power_well_enable [i915]] enabling DDI D IO power well
<6>[   33.691248] PM: early resume of devices complete after 5.736 msecs
<7>[   33.691480] [drm:intel_opregion_setup [i915]] graphic opregion physical addr: 0x7af8a018
<7>[   33.691562] [drm:intel_opregion_setup [i915]] Public ACPI methods supported
<7>[   33.691612] [drm:intel_opregion_setup [i915]] SWSCI supported
<6>[   33.691762] ACPI: EC: event unblocked
<7>[   33.698713] [drm:intel_opregion_setup [i915]] SWSCI GBDA callbacks 00000cb3, SBCB callbacks 00300483
<7>[   33.698755] [drm:intel_opregion_setup [i915]] ASLE supported
<7>[   33.698793] [drm:intel_opregion_setup [i915]] ASLE extension supported
<7>[   33.698831] [drm:intel_opregion_setup [i915]] Found valid VBT in ACPI OpRegion (Mailbox #4)
<7>[   33.699372] [drm:lspcon_wake_native_aux_ch [i915]] Native AUX CH up, DPCD version: 15.14
<7>[   33.772767] [drm:lspcon_resume [i915]] LSPCON recovering in PCON mode after 73 ms
<7>[   33.773204] [drm:drm_dp_i2c_do_msg] native defer
<7>[   33.774375] [drm:drm_dp_i2c_do_msg] native defer
<7>[   33.775592] [drm:drm_dp_i2c_do_msg] native defer
<7>[   33.776796] [drm:drm_dp_i2c_do_msg] native defer
<7>[   33.778014] [drm:drm_dp_i2c_do_msg] native defer
<7>[   33.778813] [drm:lspcon_wait_mode [i915]] Current LSPCON mode PCON
<7>[   33.779222] [drm:gen8_init_common_ring [i915]] Execlists enabled for rcs0
<7>[   33.779244] [drm:gen8_init_common_ring [i915]] Restarting rcs0:0 from 0x73
<7>[   33.779282] [drm:init_workarounds_ring [i915]] rcs0: Number of context specific w/a: 17
<7>[   33.779352] [drm:gen8_init_common_ring [i915]] Execlists enabled for bcs0
<4>[   33.779409] ------------[ cut here ]------------
<2>[   33.779410] kernel BUG at drivers/gpu/drm/i915/intel_lrc.c:539!
<4>[   33.779412] invalid opcode: 0000 [#1] PREEMPT SMP
<4>[   33.779413] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic i915 x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_intel e1000e snd_hda_codec ptp snd_hwdep snd_hda_core pps_core snd_pcm mei_me mei prime_numbers pinctrl_sunrisepoint pinctrl_intel i2c_hid
<4>[   33.779429] CPU: 0 PID: 1335 Comm: kworker/u8:50 Tainted: G        W       4.13.0-rc3-CI-CI_DRM_2919+ #1
<4>[   33.779430] Hardware name:                  /NUC7i5BNB, BIOS BNKBL357.86A.0048.2017.0704.1415 07/04/2017
<4>[   33.779433] Workqueue: events_unbound async_run_entry_fn
<4>[   33.779434] task: ffff88025ef80040 task.stack: ffffc90001e74000
<4>[   33.779454] RIP: 0010:intel_lrc_irq_handler+0x25e/0x500 [i915]
<4>[   33.779455] RSP: 0018:ffff88027ec03f08 EFLAGS: 00010246
<4>[   33.779456] RAX: 0000000000000000 RBX: ffff88026bce4588 RCX: 0000000000000000
<4>[   33.779457] RDX: 0000000080010001 RSI: ffffffff81c9a92b RDI: ffff88026bce42a8
<4>[   33.779458] RBP: ffff88027ec03f30 R08: ffff88026cc30000 R09: 0000000000000000
<4>[   33.779459] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88026bce4590
<4>[   33.779459] R13: 0000000000000000 R14: ffffffff81cfbae7 R15: 0000000000000000
<4>[   33.779460] FS:  0000000000000000(0000) GS:ffff88027ec00000(0000) knlGS:0000000000000000
<4>[   33.779461] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[   33.779462] CR2: 000000edb033ad80 CR3: 0000000003e0f000 CR4: 00000000003406f0
<4>[   33.779462] Call Trace:
<4>[   33.779463]  <IRQ>
<4>[   33.779466]  ? tasklet_hi_action+0x93/0x120
<4>[   33.779468]  __do_softirq+0xbb/0x4b0
<4>[   33.779471]  irq_exit+0xa9/0xc0
<4>[   33.779472]  do_IRQ+0x6c/0x130
<4>[   33.779475]  common_interrupt+0x90/0x90
<4>[   33.779476] RIP: 0010:_raw_spin_unlock_irq+0x2d/0x50
<4>[   33.779477] RSP: 0018:ffffc90001e77c30 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff6d
<4>[   33.779478] RAX: ffffffffffffffff RBX: ffff88025e030060 RCX: 0000000000000000
<4>[   33.779479] RDX: ffffffffa01d2e9a RSI: 0000000000000001 RDI: 0000000000000001
<4>[   33.779479] RBP: ffffc90001e77c38 R08: 0000000000000019 R09: 0000000000000000
<4>[   33.779480] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88025e030060
<4>[   33.779481] R13: ffff88025e030010 R14: 0000000000000000 R15: ffffffff81ccf32f
<4>[   33.779482]  </IRQ>
<4>[   33.779502]  ? intel_engine_reset_breadcrumbs+0x4a/0x60 [i915]
<4>[   33.779521]  intel_engine_reset_breadcrumbs+0x4a/0x60 [i915]
<4>[   33.779540]  gen8_init_common_ring+0x39/0x150 [i915]
<4>[   33.779558]  i915_gem_init_hw+0xce/0x2c0 [i915]
<4>[   33.779571]  i915_pm_restore+0x90/0x190 [i915]
<4>[   33.779584]  i915_pm_resume+0x9/0x10 [i915]
<4>[   33.779586]  pci_pm_resume+0x5f/0x90
<4>[   33.779588]  dpm_run_callback+0x6a/0x310
<4>[   33.779590]  ? pci_pm_freeze+0xe0/0xe0
<4>[   33.779592]  device_resume+0xac/0x1e0
<4>[   33.779593]  ? dpm_watchdog_set+0x60/0x60
<4>[   33.779595]  async_resume+0x18/0x40
<4>[   33.779597]  async_run_entry_fn+0x33/0x160
<4>[   33.779599]  process_one_work+0x21f/0x630
<4>[   33.779601]  worker_thread+0x49/0x3b0
<4>[   33.779603]  kthread+0x10f/0x150
<4>[   33.779604]  ? process_one_work+0x630/0x630
<4>[   33.779605]  ? kthread_create_on_node+0x40/0x40
<4>[   33.779607]  ret_from_fork+0x27/0x40
<4>[   33.779609] Code: 83 e5 08 0f 85 d8 fe ff ff 0f 0b 0f 0b 0f 0b 48 89 cf 4c 89 45 c0 4c 89 55 c8 e8 7e 8c 40 e1 4c 8b 45 c0 4c 8b 55 c8 eb 92 0f 0b <0f> 0b 49 8d 84 24 40 03 00 00 48 83 e2 fc 48 89 45 a8 74 14 8b 
<1>[   33.779658] RIP: intel_lrc_irq_handler+0x25e/0x500 [i915] RSP: ffff88027ec03f08
<4>[   33.779663] ---[ end trace e6fc4c5de8ebe67b ]---
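
For triage, the key fields of the oops above can be pulled out mechanically. The following is only a minimal Python sketch, not an existing tool: the function name `summarize_oops` and the regular expressions are made up for illustration, and the input is assumed to have its mail-wrapped lines rejoined. It extracts the BUG site, the faulting symbol and offset from the RIP line, and checks that the byte marked `<..>` in the Code line starts a `0f 0b` (ud2) sequence, which is what a BUG() trap looks like on x86 and matches the "invalid opcode" report.

```python
import re

# Excerpt copied from the oops above, with mail-wrapped lines rejoined.
OOPS = """\
<2>[   33.779410] kernel BUG at drivers/gpu/drm/i915/intel_lrc.c:539!
<4>[   33.779454] RIP: 0010:intel_lrc_irq_handler+0x25e/0x500 [i915]
<4>[   33.779609] Code: 83 e5 08 0f 85 d8 fe ff ff 0f 0b 0f 0b 0f 0b 48 89 cf 4c 89 45 c0 4c 89 55 c8 e8 7e 8c 40 e1 4c 8b 45 c0 4c 8b 55 c8 eb 92 0f 0b <0f> 0b 49 8d 84 24 40 03 00 00 48 83 e2 fc 48 89 45 a8 74 14 8b
"""

# Hypothetical patterns, written against the line shapes seen in this log only.
BUG_RE = re.compile(r"kernel BUG at (?P<file>\S+):(?P<line>\d+)!")
RIP_RE = re.compile(
    r"RIP: [0-9a-f]{4}:(?P<sym>\w+)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?: \[(?P<mod>\w+)\])?"
)
CODE_RE = re.compile(r"Code: (?P<bytes>.+)")


def summarize_oops(text):
    """Return a dict with the BUG site, faulting symbol and trap bytes."""
    summary = {}

    m = BUG_RE.search(text)
    if m:
        summary["file"] = m.group("file")
        summary["line"] = int(m.group("line"))

    # The first RIP line is the one inside the interrupt handler.
    m = RIP_RE.search(text)
    if m:
        summary["symbol"] = m.group("sym")
        summary["offset"] = int(m.group("off"), 16)
        summary["size"] = int(m.group("size"), 16)
        summary["module"] = m.group("mod")

    m = CODE_RE.search(text)
    if m:
        tokens = m.group("bytes").split()
        # The byte wrapped in <..> is the one at the faulting RIP.
        idx = next((i for i, t in enumerate(tokens) if t.startswith("<")), None)
        if idx is not None:
            faulting = [tokens[idx].strip("<>")] + tokens[idx + 1: idx + 2]
            summary["faulting_bytes"] = faulting
            # 0f 0b encodes ud2, the instruction BUG() traps with on x86.
            summary["is_ud2"] = faulting == ["0f", "0b"]

    return summary


if __name__ == "__main__":
    for key, value in summarize_oops(OOPS).items():
        print(f"{key}: {value}")
```

The same function can be pointed at the full dmesg files linked under "Long version", provided the text is read from disk unwrapped.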

Long version:
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_2919/shard-kbl4/dmesg-1501855809_Oops_1.log
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_2919/shard-kbl4/dmesg-1501855809_Panic_2.log
