[Bug 201957] amdgpu: ring gfx timeout

bugzilla-daemon at bugzilla.kernel.org bugzilla-daemon at bugzilla.kernel.org
Mon Jan 24 23:17:23 UTC 2022


https://bugzilla.kernel.org/show_bug.cgi?id=201957

--- Comment #57 from Spencer (smp at nandre.com) ---
Created attachment 300315
  --> https://bugzilla.kernel.org/attachment.cgi?id=300315&action=edit
Kernel config

OS: Gentoo
Kernel: 5.15.16, config attached, built with make -j12
Launch options: root=/dev/sda2 ro quiet

I'd like to be able to boot with amdgpu.dpm=0, as this seems to fix the bug
with minor tradeoffs, however:
When I boot with dpm disabled, my screen will freeze and leave this nice little
stinker to ruin my day

Jan 24 16:33:05 [kernel] [    2.572474] Loading firmware: amdgpu/navi10_pfp.bin
Jan 24 16:33:05 [kernel] [    2.572475] Loading firmware: amdgpu/navi10_me.bin
Jan 24 16:33:05 [kernel] [    2.572476] Loading firmware: amdgpu/navi10_ce.bin
Jan 24 16:33:05 [kernel] [    2.572477] Loading firmware: amdgpu/navi10_rlc.bin
Jan 24 16:33:05 [kernel] [    2.572477] Loading firmware: amdgpu/navi10_mec.bin
Jan 24 16:33:05 [kernel] [    2.572478] Loading firmware:
amdgpu/navi10_mec2.bin
Jan 24 16:33:05 [kernel] [    2.572968] EXT4-fs (sdb1): mounted filesystem with
ordered data mode. Opts: discard. Quota mode: none.
Jan 24 16:33:05 [kernel] [    2.573030] Loading firmware:
amdgpu/navi10_sdma.bin
Jan 24 16:33:05 [kernel] [    2.573032] Loading firmware:
amdgpu/navi10_sdma1.bin
Jan 24 16:33:05 [kernel] [    2.573071] Loading firmware: amdgpu/navi10_vcn.bin
Jan 24 16:33:05 [kernel] [    2.573072] [drm] Found VCN firmware Version ENC:
1.14 DEC: 5 VEP: 0 Revision: 20
Jan 24 16:33:05 [kernel] [    2.573075] amdgpu 0000:28:00.0: amdgpu: Will use
PSP to load VCN firmware
Jan 24 16:33:05 [kernel] [    2.747244] [drm] reserve 0x900000 from
0x817e400000 for PSP TMR
Jan 24 16:33:05 [kernel] [    2.785931] amdgpu 0000:28:00.0: amdgpu: RAS:
optional ras ta ucode is not available
Jan 24 16:33:05 [kernel] [    2.790137] amdgpu 0000:28:00.0: amdgpu: RAP:
optional rap ta ucode is not available
Jan 24 16:33:05 [kernel] [    2.790138] amdgpu 0000:28:00.0: amdgpu:
SECUREDISPLAY: securedisplay ta ucode is not available
Jan 24 16:33:05 [kernel] [    2.790140] amdgpu: smu firmware loading failed
Jan 24 16:33:05 [kernel] [    2.790141] amdgpu 0000:28:00.0: amdgpu:
amdgpu_device_ip_init failed
Jan 24 16:33:05 [kernel] [    2.790143] amdgpu 0000:28:00.0: amdgpu: Fatal
error during GPU init
Jan 24 16:33:05 [kernel] [    2.790144] amdgpu 0000:28:00.0: amdgpu: amdgpu:
finishing device.
Jan 24 16:33:05 [kernel] [    2.793726] [drm] free PSP TMR buffer
Jan 24 16:33:05 [kernel] [    2.825874] amdgpu: probe of 0000:28:00.0 failed
with error -95
Jan 24 16:33:05 [kernel] [    2.825951] BUG: unable to handle page fault for
address: ffffa4af5100d000
Jan 24 16:33:05 [kernel] [    2.825954] #PF: supervisor write access in kernel
mode
Jan 24 16:33:05 [kernel] [    2.825955] #PF: error_code(0x0002) - not-present
page
Jan 24 16:33:05 [kernel] [    2.825957] PGD 100000067 P4D 100000067 PUD
100104067 PMD 0
Jan 24 16:33:05 [kernel] [    2.825960] Oops: 0002 [#1] SMP NOPTI
Jan 24 16:33:05 [kernel] [    2.825962] CPU: 6 PID: 759 Comm: systemd-udevd Not
tainted 5.15.16-gentoo #8
Jan 24 16:33:05 [kernel] [    2.825965] Hardware name: Micro-Star International
Co., Ltd MS-7B86/B450 GAMING PLUS MAX (MS-7B86), BIOS H.60 04/18/2020
Jan 24 16:33:05 [kernel] [    2.825967] RIP: 0010:vcn_v2_0_sw_fini+0x65/0x80
[amdgpu]
Jan 24 16:33:05 [kernel] [    2.826139] Code: 89 ef e8 fe 1b ff ff 85 c0 75 08
48 89 ef e8 42 1a ff ff 48 8b 54 24 08 65 48 2b 14 25 28 00 00 00 75 18 48 83
c4 10 5b 5d c3 <c7> 03 00 00 00 00 8b 7c 24 04 e8 4c c4 4d e9 eb bc e8 15 cd ab
e9
Jan 24 16:33:05 [kernel] [    2.826142] RSP: 0018:ffffa4af40bc7c30 EFLAGS:
00010202

TL;DR: amdgpu: smu firmware loading failed
What it means exactly, I know not, but I know it means my screen is frozen
Is there a trick? A workaround to this?
If there is some info I left out ask for it and I'll fetch it

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list