[Bug 215727] New: drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout when using firefox, chrome or icaclient

bugzilla-daemon at kernel.org bugzilla-daemon at kernel.org
Tue Mar 22 20:30:18 UTC 2022


https://bugzilla.kernel.org/show_bug.cgi?id=215727

            Bug ID: 215727
           Summary: drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring
                    gfx_0.0.0 timeout when using firefox, chrome or
                    icaclient
           Product: Drivers
           Version: 2.5
    Kernel Version: 5.16.15-arch1-1
          Hardware: Intel
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: high
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri at kernel-bugs.osdl.org
          Reporter: scallar at poczta.fm
        Regression: No

Created attachment 300599
  --> https://bugzilla.kernel.org/attachment.cgi?id=300599&action=edit
Dmesg

Hi,

Symptoms:
I have installed an AMD Radeon RX 6700-XT card and started having following
random crashes when using a browser or icaclient (Citrix client):
[   85.861734] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0
timeout, signaled seq=13365, emitted seq=13367
[   85.862162] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information:
process kwin_x11 pid 819 thread kwin_x11:cs0 pid 838
Display hangs/ becomes glitched.

Steps to reproduce:
Happens randomly when using a browser (tested firefox and chrome-based) or
icaclient.
I get this error several times every day.
Happens in Xorg, also in Wayland.
Process mentioned in the error is not always window manager (kwin_x11).
Sometimes it's Xorg (or Xwayland), sometimes app (i.e. firefox).
System: Archlinux (linux-firmware 20220309.cd01f85-1)
DE: KDE 5.24.3 / mesa 21.3.7

Logs:
In this case of attached dmesg I was using kwin on Xorg and just started
firefox (hardware acceleration was on). Same thing happens when using icaclient
(very frequent crashes, but hard to reproduce on demand).
Afterwards, i have also tried collecting gfx_0.0.0 data with umr:
umr -R gfx_0.0.0

This also resulted with crash:
[  171.047397] BUG: unable to handle page fault for address: ffffb34e820ffffc

(full log at the end of attached dmesg).

If you need additional data I can reproduce this error.

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list