[Bug 111481] AMD Navi GPU frequent freezes on both Manjaro/Ubuntu with kernel 5.3 and mesa 19.2 -git/llvm9

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Nov 6 08:38:20 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=111481

--- Comment #210 from Lazy <ousleyaric at gmail.com> ---
To clarify, first: it's an Asus reference (blower-style) 5700XT
I can't use the overclock utilities without a crash coming within the hour on
Windows 10 or any of my Linux installs, no fan profiles, no manual control of
fans, no setting it to "high performance" on the dynamic clock or it crashes
within the hour. No exceptions, no setting then resetting the setting to
default to get around it.

Generally speaking, it maxes around 75C, but that's mostly due to the default
fan profile only ramping up enough to negate further gains at that point (I'm
guessing that's to do with trying to keep the card quiet). If I supply cool
air, it'll slow the fan, and the heat still comes eventually.

Some things that may or may not be relevant:

This card crashes mostly around times that the clock rate adjusts more often;
If the card goes from, say, max freq to a step below and back, there's a chance
of a crash. (maybe coincidence, maybe not, I don't know to be bluntly honest)
This is a constant I've noticed on both OSes. Windows 10 tends to keep things
relatively stable in that regard, while Manjaro tends to see a lot of spiking
and sudden drops. SteamVR definitely instigates that kind of behavior in my
experience on my old Vega 56 as well (Which with nodma set on Navi, is actually
not much different tbh). Probably explains why ever since the latest set of
patches, the majority of the time it crashes is after an hour or two of
gameplay in Manjaro. (also no idea why Manjaro switches more often..)

To be blunt, though, in both OSes, seemingly random hangs are also a common
occurrence for me. I had Win10 just yesterday, hang completely, no recovery,
simply animating a minimizing window as SteamVR first opened. Granted, this
also coincided with a rapid up-tick in clock speed most likely, as I've
observed this massive spike on launching SteamVR via GPU adjusting utilities
before I realized they instigate the issue as well.

Setting nodma does get rid of some of the more random crashes, but these ones
stick around in my experience so far. Maybe 75C is a bit high, but in neither
OS can I manage to adjust the fans without the same issue, so.. No idea what to
do, here.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20191106/f2bcb3f4/attachment.html>


More information about the dri-devel mailing list