amdgpu 100% CPU usage causing freeze 1002:15d8
Alex Deucher
alexdeucher at gmail.com
Tue Jan 14 22:25:46 UTC 2025
On Tue, Jan 14, 2025 at 3:49 PM Marco Moock <mm at dorfdsl.de> wrote:
>
> Hello!
>
> This might be related to
> https://lists.freedesktop.org/archives/amd-gfx/2025-January/118759.html
> As I subscribed just now, I can't reply there and can't get the
> Message-I
>
> description: Motherboard
> product: Pro A520M-C II
> vendor: ASUSTeK COMPUTER INC.
> physical id: 0
> version: Rev X.0x
> slot: Default string
> *-firmware
> description: BIOS
> vendor: American Megatrends Inc.
> physical id: 0
> version: 3612
> date: 12/03/2024
>
> I updated the UEFI yesterday to the latest version, problem still
> exists.
>
> 08:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc.
> [AMD/ATI] Picasso/Raven 2 [Radeon Vega Series / Radeon Vega Mobile
> Series] [1002:15d8] (rev c9) Subsystem: ASUSTeK Computer Inc. Device
> [1043:876b] Kernel driver in use: amdgpu Kernel modules: amdgpu
>
> Linux ryz 6.12.9-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.12.9-1
> (2025-01-10) x86_64 GNU/Linux
>
> un mesa-common-dev <none> <none> (no description available)
> un mesa-glide2-dev <none> <none> (no description available)
> ii mesa-libgallium:amd64 24.3.3-1 amd64 shared infrastructure for Mesa drivers
> ii mesa-libgallium:i386 24.3.3-1 i386 shared infrastructure for Mesa drivers
> un mesa-opencl-icd <none> <none> (no description available)
> ii mesa-utils 9.0.0-2+b1 amd64 Miscellaneous Mesa utilities -- symlinks
> ii mesa-utils-bin:amd64 9.0.0-2+b1 amd64 Miscellaneous Mesa utilities -- native applications
> un mesa-utils-extra <none> <none> (no description available)
> ii mesa-va-drivers:amd64 24.3.3-1 amd64 Mesa VA-API video acceleration drivers
> ii mesa-vdpau-drivers:amd64 24.3.3-1 amd64 Mesa VDPAU video acceleration drivers
> ii mesa-vulkan-drivers:amd64 24.3.3-1 amd64 Mesa Vulkan graphics drivers
> ii mesa-vulkan-drivers:i386 24.3.3-1 i386 Mesa Vulkan graphics drivers
> un mesag-dev <none> <none> (no description available)
> un mesag3 <none> <none> (no description available)
> un mesag3+ggi-dev <none> <none> (no description
> available)
>
>
> I am running Debian Unstable and encounter 100% CPU usage after some
> hours, reproducible. I have to shut off the system with sysrq, I can't
> shut it down the normal way as it is non-responsive.
>
> I tried 6.12.9 and 6.12.8.
>
> 6.12.8 gave some dmesg error messages:
>
> Jan 13 11:09:44 ryz kernel: amdgpu 0000:08:00.0: amdgpu: Dumping IP State
> Jan 13 11:09:48 ryz kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:73:crtc-0] hw_done or flip_done timed out
> Jan 13 11:09:55 ryz kernel: amdgpu 0000:08:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
> Jan 13 11:10:00 ryz kernel: sysrq: Keyboard mode set to system default
> Jan 13 11:10:00 ryz kernel: sysrq: This sysrq operation is disabled.
> Jan 13 11:10:00 ryz kernel: sysrq: This sysrq operation is disabled.
> Jan 13 11:10:01 ryz kernel: sysrq: Emergency Sync
>
>
> 6.12.9 doesn't gave me them, but it doesn't list sysrq calls either, so
> I assume it didn't manage to store them in dmesg.
>
> I remember the first occurrence last Friday, some mesa packages were
> updated and I assume it was running 6.12.8 according to the apt logs.
>
>
>
>
> Please tell me which further info you need to track down the issue.
What kernel version(s) is it working properly with? Can you bisect?
Alex
More information about the amd-gfx
mailing list