[bugzilla-daemon at bugzilla.kernel.org: [Bug 202493] New: Soft lockup ryzen]

Borislav Petkov bp at alien8.de
Sat Feb 2 13:32:22 UTC 2019


FYI:

First splat triggers the REG_WAIT timeout warning:

[drm:generic_reg_wait [amdgpu]] *ERROR* REG_WAIT timeout 10us * 3000 tries - dce110_stream_encoder_dp_blank line:944
WARNING: CPU: 14 PID: 1613 at drivers/gpu/drm/amd/amdgpu/../display/dc/dc_helper.c:249 generic_reg_wait+0xdc/0x140 [amdgpu]

Can ppl have a look pls?

Thx.

----- Forwarded message from bugzilla-daemon at bugzilla.kernel.org -----

Date: Sat, 02 Feb 2019 13:13:59 +0000
From: bugzilla-daemon at bugzilla.kernel.org
To: bp at alien8.de
Subject: [Bug 202493] New: Soft lockup ryzen
Message-ID: <bug-202493-6385 at https.bugzilla.kernel.org/>

https://bugzilla.kernel.org/show_bug.cgi?id=202493

            Bug ID: 202493
           Summary: Soft lockup ryzen
           Product: Platform Specific/Hardware
           Version: 2.5
    Kernel Version: 4.20.5
          Hardware: x86-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: x86-64
          Assignee: platform_x86_64 at kernel-bugs.osdl.org
          Reporter: jon780 at gmail.com
        Regression: No

Created attachment 280929
  --> https://bugzilla.kernel.org/attachment.cgi?id=280929&action=edit
journalctl output during lockup

Fedora 29
Ryzen 1700X
AMD RX 560

Kernel is currently 4.20.5, compiled from kernel.org using the Fedora .config.
I have the same issues with every kernel I've tried in the Fedora 29 repos.

Machine freezes, usually multiple times per day. Mouse cursor moves, but won't
respond to clicks. No other input works. Num lock light on keyboard is frozen
(can't toggle it on/off from num lock key). Cannot switch to virtual terminals,
cannot sysrq+reisub, nothing. No response via ICMP, no other services
respond. Seems to happen most frequently when waking from sleep, but that might
just be my impression.

What I have tried:
Set power supply to "Typical Current Idle" in BIOS
Disabled C-state control in BIOS
Compiled kernel with RCU_NOCB_CPU (it was already configured in Fedora kernels)
Added rcu_nocbs=0-15 to kernel boot
Added idle=nomwait to kernel boot
Added processor.max_cstate=5

(I realize some of this is redundant, I was grasping at straws)

For reference, here is my kernel boot line:

BOOT_IMAGE=/vmlinuz-4.20.5-jmd
root=/dev/mapper/fedora_localhost--live-root ro
resume=/dev/mapper/fedora_localhost--live-swap
rd.lvm.lv=fedora_localhost-live/root rd.lvm.lv=fedora_localhost-live/swap rhgb
quiet LANG=en_US.UTF-8 idle=nomwait processor.max_cstate=5 rcu_nocbs=0-15

Attached is journalctl output covering the time period, and a little further
back.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.

----- End forwarded message -----

-- 
Regards/Gruss,
    Boris.

Good mailing practices for 400: avoid top-posting and trim the reply.