[Bug 201957] amdgpu: ring gfx timeout

bugzilla-daemon at kernel.org bugzilla-daemon at kernel.org
Wed Jun 22 12:56:38 UTC 2022


https://bugzilla.kernel.org/show_bug.cgi?id=201957

--- Comment #74 from Martin von Wittich (martin.von.wittich at iserv.eu) ---
> Did you try with the latest Linux Kernel? I had a lot of gpu lockups like this. Also try these kernel parameters : "amdgpu.ppfeaturemask=0xffffbffb  amdgpu.noretry=0 amdgpu.lockup_timeout=0 amdgpu.gpu_recovery=1 amdgpu.audio=0 amdgpu.deep_color=1 amd_iommu=on iommu=pt"" ( you might also try with amdgpu.ppfeaturemask=0xfffd7fff or amdgpu.ppfeaturemask=0xffffffff )

I can confirm that at least on the current Ubuntu linux-image-oem-20.04d
kernel, these options do not resolve the issue:

```
martin at martin ~ % uname -a
Linux martin 5.14.0-1042-oem #47-Ubuntu SMP Fri Jun 3 18:17:11 UTC 2022 x86_64
x86_64 x86_64 GNU/Linux
martin at martin ~ % cat /proc/cmdline 
BOOT_IMAGE=/vmlinuz-5.14.0-1042-oem
root=UUID=1bd000ac-1487-4457-be1a-5ea901ded9e9 ro
amdgpu.ppfeaturemask=0xffffbffb amdgpu.noretry=0 amdgpu.lockup_timeout=0
amdgpu.gpu_recovery=1 amdgpu.audio=0 amdgpu.deep_color=1 amd_iommu=on iommu=pt
quiet
martin at martin ~ % dmesg -T | grep 'ring gfx timeout'
[Mi Jun 22 14:48:07 2022] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx
timeout, signaled seq=1820983, emitted seq=1820985
[Mi Jun 22 14:48:18 2022] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx
timeout, signaled seq=1820987, emitted seq=1820990
```

I had enabled these options on 2022-06-20 14:14 UTC+2, this is the first crash
I've encountered since then.

I have no idea how to build the latest kernel and therefore haven't tested that
yet.

I'll now revert back to amdgpu.dpm=0.

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list