[Bug 111481] AMD Navi GPU frequent freezes on both Manjaro/Ubuntu with kernel 5.3 and mesa 19.2 -git/llvm9

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Oct 24 13:25:58 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=111481

L.S.S. <ragnaros39216 at yandex.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |ragnaros39216 at yandex.com

--- Comment #151 from L.S.S. <ragnaros39216 at yandex.com> ---
Created attachment 145807
  --> https://bugs.freedesktop.org/attachment.cgi?id=145807&action=edit
captured GCVM_L2_PROTECTION_FAULT errors in the log. This was captured on
5.4(rc) kernel.

I'm having similar issues with Navi on Manjaro (both 5.3 and 5.4 kernels). Both
kernels were from official Manjaro repos.

It's almost 100% reproducible using Cinnamon's file manager, Nemo. It can
happen right after I start it, or after I click something (such as opening a
folder). Interestingly, I haven't gotten a freeze from use web browsers
(Firefox, Chromium) just yet.

When the system froze, the rest of the stuffs are still running. The froze
happened in the morning and since I was about to leave for work I left the
system as is (until I get back home in the evening). The xmrig (CPU) mining
session in the background continued to work as normal as observed from the
pool's dashboard.

It seems the protection fault errors would appear after the system has frozen
long enough (I only saw it appear at the time I left it on frozen for a while,
and the rest of the times I reset my system right after it froze). If resetting
the system only a short a while after the freeze happened, the log will end
only at "ring sdma0 timeout".

It seems the "nodma nongg" trick partially worked on 5.3 (5.3.6 to be precise)
as the system hasn't frozen for the time being (even when using Nemo). It
however, doesn't work with the 5.4 (rc) kernel as I still got a freeze caused
by the same "ring sdma0 timeout" error.

Off-topic: On 5.3 kernel, the mouse cursor feels sluggish as if my monitor is
running at 30Hz (while xrandr reports it's indeed 60Hz), while the mouse cursor
works fine on 5.4(rc) kernel.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20191024/5e9ced94/attachment-0001.html>


More information about the dri-devel mailing list