[Mesa-dev] Screen freeze AMD TURKS/JUNIPER

Guus Ellenkamp guus at activediscovery.net
Wed Oct 20 10:20:08 UTC 2021


My screen freezes every now and then (once a day before, now more often) 
in an Ubuntu 20.04 system. It started using a TURKS AMD graphics adapter 
and after trying many things I thought the adapter might be defective, 
but replacing it with a JUNIPER type the problem still remains.

Mostly I was able to switch to a terminal screen and often I can use ssh 
to reboot properly, but after upgrading to newer drivers (kisak 
repository), the problem seems to get worse.

I just rebooted in 18.04 mode, which seems more stable, but I didn't use 
it that long.

It seems the freeze occurs after 'something' goes wrong, like I mostly 
see network errors before the card problem output.

The problem is pretty urgent, as I am using this computer for work and 
random reboots are very annoying. I thought Linux was supposed to be 
more stable than Windows LOL.

Hope someone can help sort this out.



Some log file, but I see different errors (have more log output):

[ 4605.729370] CIFS VFS: SMB signature verification returned error = -13
[ 4605.841441] CIFS VFS: SMB signature verification returned error = -13
[ 4606.016338] CIFS VFS: SMB signature verification returned error = -13
[ 4606.033629] audit: type=1400 audit(1630738964.068:89): 
apparmor="ALLOWED" operation="rename_src" profile="libreoffice-soffice" 
name=2F6E6574776F726B2F636F6D70616E792D6F6E2D6D6F6F6E2F437573746F6D657220646F63756D656E74732F456967656E20486F72656361204D616B656C6161722F6C7531363634373566646475692E746D70 
pid=16647 comm="soffice.bin" requested_mask="wrd" denied_mask="wrd" 
fsuid=1000 ouid=1000
[ 7384.500862] perf: interrupt took too long (2540 > 2500), lowering 
kernel.perf_event_max_sample_rate to 78500
[ 8291.818014] radeon 0000:01:00.0: ring 0 stalled for more than 10228msec
[ 8291.818023] radeon 0000:01:00.0: GPU lockup (current fence id 
0x00000000000500f9 last fence id 0x000000000005010a on ring 0)
[ 8291.936996] radeon 0000:01:00.0: Saved 535 dwords of commands on ring 0.
[ 8291.937007] radeon 0000:01:00.0: GPU softreset: 0x00000008
[ 8291.937008] radeon 0000:01:00.0:   GRBM_STATUS               = 0xA0003828
[ 8291.937009] radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000007
[ 8291.937010] radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000007
[ 8291.937011] radeon 0000:01:00.0:   SRBM_STATUS               = 0x20000AC0
[ 8291.937012] radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
[ 8291.937013] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[ 8291.937014] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00010002
[ 8291.937015] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00020186
[ 8291.937016] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80038647
[ 8291.937017] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[ 8291.951250] radeon 0000:01:00.0: GRBM_SOFT_RESET=0x00004001
[ 8291.951302] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[ 8291.952448] radeon 0000:01:00.0:   GRBM_STATUS               = 0x00003828
[ 8291.952449] radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000007
[ 8291.952450] radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000007
[ 8291.952451] radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
[ 8291.952452] radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
[ 8291.952453] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[ 8291.952454] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
[ 8291.952455] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
[ 8291.952456] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x00000000
[ 8291.952457] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[ 8291.952470] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[ 8291.974662] [drm] enabling PCIE gen 2 link speeds, disable with 
radeon.pcie_gen2=0
[ 8291.978998] [drm] PCIE GART of 1024M enabled (table at 
0x0000000000162000).
[ 8291.979111] radeon 0000:01:00.0: WB enabled
[ 8291.979113] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 
0x00000000e0000c00 and cpu addr 0x000000001ebd2b6b
[ 8291.979114] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 
0x00000000e0000c0c and cpu addr 0x00000000fc505e35
[ 8291.979874] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 
0x0000000000072118 and cpu addr 0x000000003a66b0fa
[ 8291.996180] [drm] ring test on 0 succeeded in 3 usecs
[ 8291.996191] [drm] ring test on 3 succeeded in 7 usecs
[ 8292.171926] [drm] ring test on 5 succeeded in 2 usecs
[ 8292.171934] [drm] UVD initialized successfully.
[ 8293.322021] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait 
timed out.
[ 8293.322080] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: 
failed testing IB on GFX ring (-110).
[ 8299.068114] show_signal: 7 callbacks suppressed
[ 8299.068117] traps: Core[4211] general protection fault 
ip:7f45295949f6 sp:7f451e5d8930 error:0 in libc-2.31.so[7f4529570000+178000]
[ 8300.172199] pcmanfm-qt[4191]: segfault at 30 ip 00007f013b88a60a sp 
00007ffc256c0e50 error 4 in libQt5Core.so.5.12.8[7f013b68a000+2e0000]
[ 8300.172205] Code: 53 48 83 ec 28 8b 05 8d 23 2c 00 83 f8 ff 7c 16 0f 
b6 05 29 23 2c 00 84 c0 0f 84 c9 00 00 00 4c 8d 35 3a 23 2c 00 4d 8d 7e 
30 <41> 8b 2f 41 89 ec 41 89 ed 41 81 e4 ff ff ff 00 41 81 e5 c0 ff ff
[ 8453.468876] device enp3s0 left promiscuous mode
[ 8460.611133] vboxnetflt: 0 out of 346316 packets were not sent 
(directed to host)




More information about the mesa-dev mailing list