[Bug 107927] Errors on wake up from hibernation

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Sep 14 06:47:36 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=107927

            Bug ID: 107927
           Summary: Errors on wake up from hibernation
           Product: DRI
           Version: unspecified
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: major
          Priority: medium
         Component: DRM/AMDgpu
          Assignee: dri-devel at lists.freedesktop.org
          Reporter: tomas.volf at showmax.com

Created attachment 141558
  --> https://bugs.freedesktop.org/attachment.cgi?id=141558&action=edit
full dmesg

The system does wake up from hibernation, but dmesg shows errors and the
system is unresponsive.

The errors are:

        [  248.007506] [drm:gfx_v9_0_hw_init [amdgpu]] *ERROR* KCQ enable failed (scratch(0xC040)=0xCAFEDEAD)
        [  248.007531] [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v9_0> failed -22
        [  248.007555] [drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_device_ip_resume failed (-22).
        [  248.007561] dpm_run_callback(): pci_pm_thaw+0x0/0x90 returns -22
        [  248.007567] PM: Device 0000:05:00.0 failed to thaw async: error -22
        ...
        [  252.009448] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  252.293606] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  252.579115] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  252.863401] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  253.147310] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  253.431661] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  253.715567] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
        [  254.001175] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)

Full dmesg is attached.
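
For anyone who wants to pull only the amdgpu errors out of the full dmesg, a
minimal Python sketch follows. It assumes the log is plain text with the usual
`[ seconds] message` prefix and one message per line. The name
`extract_drm_errors` and the `SAMPLE` string are illustrative only and are not
part of any tool.

```python
import re

# Matches lines such as:
# [  248.007506] [drm:gfx_v9_0_hw_init [amdgpu]] *ERROR* KCQ enable failed (...)
DRM_ERROR = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+"
    r"\[drm:(?P<func>\w+) \[amdgpu\]\] \*ERROR\* (?P<msg>.*)"
)


def extract_drm_errors(text):
    """Return (timestamp, function, message) for each amdgpu *ERROR* line."""
    results = []
    for line in text.splitlines():
        match = DRM_ERROR.search(line)
        if match:
            results.append(
                (
                    float(match.group("ts")),
                    match.group("func"),
                    match.group("msg").strip(),
                )
            )
    return results


# Three lines copied from the report above (hypothetical stand-in for the
# attached file; read the real dmesg from disk instead).
SAMPLE = """\
[  248.007506] [drm:gfx_v9_0_hw_init [amdgpu]] *ERROR* KCQ enable failed (scratch(0xC040)=0xCAFEDEAD)
[  248.007561] dpm_run_callback(): pci_pm_thaw+0x0/0x90 returns -22
[  252.009448] [drm:gfx_v9_0_hw_fini [amdgpu]] *ERROR* KCQ disabled failed (scratch(0xC040)=0xCAFEDEAD)
"""

if __name__ == "__main__":
    for ts, func, msg in extract_drm_errors(SAMPLE):
        print(f"{ts:12.6f} {func}: {msg}")
```

The pattern deliberately skips non-DRM lines such as the `dpm_run_callback()`
and `PM:` messages; widen it if those are of interest too.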

I boot with `amd_iommu=off` because the system does not boot otherwise (see
https://bugzilla.kernel.org/show_bug.cgi?id=201123 ). My lspci output:

        00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15d0
        00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Device 15d1
        00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge
        00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 15d3
        00:01.3 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 15d3
        00:01.4 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 15d3
        00:01.6 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 15d3
        00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge
        00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 15db
        00:08.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 15dc
        00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 61)
        00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
        00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15e8
        00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15e9
        00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15ea
        00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15eb
        00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15ec
        00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15ed
        00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15ee
        00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Device 15ef
        01:00.0 Network controller: Realtek Semiconductor Co., Ltd. RTL8822BE 802.11a/b/g/n/ac WiFi adapter
        02:00.0 Non-Volatile memory controller: Lenovo Device 0006
        03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 0e)
        03:00.1 Serial controller: Realtek Semiconductor Co., Ltd. Device 816a (rev 0e)
        03:00.2 Serial controller: Realtek Semiconductor Co., Ltd. Device 816b (rev 0e)
        03:00.3 IPMI Interface: Realtek Semiconductor Co., Ltd. Device 816c (rev 0e)
        03:00.4 USB controller: Realtek Semiconductor Co., Ltd. Device 816d (rev 0e)
        04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 10)
        05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Raven Ridge [Radeon Vega Series / Radeon Vega Mobile Series] (rev d0)
        05:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device 15de
        05:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Device 15df
        05:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Device 15e0
        05:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Device 15e1
        05:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Device 15e3
        06:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 61)

If more information is needed, I would be happy to provide it.

I've looked at the next branch in git (drm-next-4.19), and there seem to be
some commits that could be relevant:
https://cgit.freedesktop.org/~agd5f/linux/log/?h=drm-next-4.19&qt=grep&q=KCQ
However, I'm not very familiar with kernel code, so I don't know whether I
should try any of those commits :)

-- 
You are receiving this mail because:
You are the assignee for the bug.