[Bug 210415] New: [amdgpu] constant GPU hangs followed by kernel "BUG" and following kernel oops

bugzilla-daemon at bugzilla.kernel.org bugzilla-daemon at bugzilla.kernel.org
Sun Nov 29 19:12:08 UTC 2020


https://bugzilla.kernel.org/show_bug.cgi?id=210415

            Bug ID: 210415
           Summary: [amdgpu] constant GPU hangs followed by kernel "BUG"
                    and following kernel oops
           Product: Drivers
           Version: 2.5
    Kernel Version: 5.9.11
          Hardware: x86-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri at kernel-bugs.osdl.org
          Reporter: david.alejandro.rubio at gmail.com
        Regression: No

Created attachment 293863
  --> https://bugzilla.kernel.org/attachment.cgi?id=293863&action=edit
dmesg output

I have an RX 480. Every few hours after kernel 5.4 (!) I've been getting random
GPU hangs, and after kernel 5.9, they became not only more frequent, but
afterwards the kernel sent messages like 

Nov 29 15:44:31 reimu kernel: [drm] Bailing on TDR for s_job:34a, as another
already in progress
Nov 29 15:44:31 reimu kernel: BUG: kernel NULL pointer dereference, address:
0000000000000020
Nov 29 15:44:31 reimu kernel: #PF: supervisor write access in kernel mode
Nov 29 15:44:31 reimu kernel: #PF: error_code(0x0002) - not-present page

And an Oops right afterwards
Oops: 0002 [#2] PREEMPT SMP NOPTI

The full dmesg is attached. Kernel is compiled with Archlinux kernel
preferences, but using a kernel directly from kernel.org and compiled with the
modules I need give me the same error.

Attached error.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list