[Bug 100964] RX-480 [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Aug 13 01:29:38 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=100964

--- Comment #6 from Maxim Cournoyer <maxim.cournoyer at gmail.com> ---
Hi! I have the exact same issue ``[drm:gfx_v8_0_ring_test_ring [amdgpu]]
*ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)`` with a R9 285
GPU. I've had this problems for ages and had been using nomodeset to get by.

I'm trying this on Debian 9 (stretch), with kernel ``Linux debian 4.9.0-3-amd64
#1 SMP Debian 4.9.30-2+deb9u2 (2017-06-26) x86_64 GNU/Linux``. I've attached
the full dmesg.

The interesting thing is that this seems to be related to the motherboard; when
using the very same card (R9 285) in another system, *with the same software*
(Debian 9), it works! It is not a hardware problem: the power supply is brand
new (Seasonic G550W), the RAM tests fine, the SSD is brand new, the CMOS
battery too, etc.

The problem occurs on Asus M2N SLI Deluxe motherboard based system; and it
disappears when using it with an equally old Asus P5W DH Deluxe based system.

I notice there is a message saying that the clock source is unstable right
before the error occurs; could it be related?

Here is an excerpt of the dmesg: 
[   11.552754] 
                failed to send pre message 5b ret is 0 
[   11.748489] 
                failed to send message 5b ret is 0 
[   11.748536] clocksource: timekeeping watchdog on CPU0: Marking clocksource
'tsc' as unstable because the skew is too large:
[   11.748538] clocksource:                       'acpi_pm' wd_now: f69088
wd_last: ba62a0 mask: ffffff
[   11.748539] clocksource:                       'tsc' cs_now: 167976623b
cs_last: 12698f30d7 mask: ffffffffffffffff
[   11.957731] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0
test failed (scratch(0xC040)=0xCAFEDEAD)
[   11.957840] [drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init of IP block
<gfx_v8_0> failed -22
[   11.957879] amdgpu 0000:03:00.0: amdgpu_init failed
[   12.171671] 
                failed to send pre message 133 ret is 0 
[   12.385424] 
                failed to send message 133 ret is 0 
[   12.385433] DPM is not running right now, no need to disable DPM!
[   12.386774] clocksource: Switched to clocksource acpi_pm
[   12.772038]

I can provide dmesg from the working system (using the same software with the
same card) if judged useful.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20170813/f9f896c1/attachment.html>


More information about the dri-devel mailing list