[Bug 107432] Periodic complete system lockup with Vega M and Kernel 4.18-rc6+

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Jul 31 01:10:56 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=107432

            Bug ID: 107432
           Summary: Periodic complete system lockup with Vega M and Kernel
                    4.18-rc6+
           Product: DRI
           Version: unspecified
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: DRM/AMDgpu
          Assignee: dri-devel at lists.freedesktop.org
          Reporter: rstrube at gmail.com

Created attachment 140902
  --> https://bugs.freedesktop.org/attachment.cgi?id=140902&action=edit
System log leading up to hard crash

Description:

Periodically my system will begin to slow down dramatically (the mouse cursor
hitches as I try to move it) and I am unable to interact with anything on the
screen.  Eventually the mouse cursor disappears altogether.  Trying to switch
to a tty I do get prompted to login, but after entering my credentials nothing
happens.  It appears to be a hard lockup.  The only solution is to manually
power down my machine and reboot.

This probably happens one or two times a day, normally after starting a new
application.

Hardware:
Dell XPS 15 9575 2 in 1 (Kaby Lake G)

Versions:
Kernel 4.18-rc7
Mesa 18.1.5
Xorg 1.19.6
uCode for Vega M from Linux Firmware git (master) which includes the latest
18.20 uCode from AMD that was recently merged into Linux Firmware

I do have the two sinks available (one for the Intel iGPU and one for the AMD
Vega M), running:

xrandr --listproviders

Lists the following:

Providers: number : 2
Provider 0: id: 0x6f cap: 0x9, Source Output, Sink Offload crtcs: 3 outputs: 7
associated providers: 1 name:modesetting
Provider 1: id: 0x45 cap: 0x6, Sink Output, Source Offload crtcs: 6 outputs: 0
associated providers: 1 name:Unknown AMD Radeon GPU @ pci:0000:01:00.0

And running:

env DRI_PRIME=1 glxinfo | grep "OpenGL renderer"

Lists:

OpenGL renderer string: AMD VEGAM (DRM 3.26.0, 4.18.0-041800rc7-generic, LLVM
6.0.0)

So the Vega M is active and available in my system.

I noticed that this problem started happening after the release of kernel
4.18-rc6 and continues with 4.18-rc7. I've been using 4.18 since rc1 without
issue.  This entry in the changelog caught my eye:

Leo Liu (1):
      drm/amdgpu: Make sure IB tests flushed after IP resume

Not sure if this is at all related, but the reason I bring this up is because 
the errors I see in my logs everytime I encounter this problem are:

kernel: amdgpu 0000:01:00.0: GPU pci config reset
kernel: [drm:amdgpu_device_ip_suspend [amdgpu]] *ERROR* suspend of IP block
<uvd_v6_0> failed -12

Please note that so far I have only encountered this problem when launching
applications that use my Intel iGPU (i.e. I am not setting DRI_PRIME=1).

I've attached my entire log to provide more context.

Thanks!

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20180731/85326149/attachment-0001.html>


More information about the dri-devel mailing list