[Bug 105819] Window system hang due to GPU Fault

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Jun 30 17:42:44 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=105819

--- Comment #5 from Kertesz Laszlo <laszlo.kertesz at gmail.com> ---
I have this issue too. 
Debian testing, kernel compiled from mainline git

It begun with the 4.18 kernels (mainline), now i am on 4.18 rc2+ and still
happens. I did not see this with the 4.17 kernels.

For me it happened a few times, most times i was clicking around in Firefox and
once when i let the computer idle (Firefox was still in the foreground though).
I logged in via ssh and captured these from dmesg:

One instance (i think i reset the system with the magic key combination so it
didn't get to the hung timeout:
[ 3459.767019] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout,
last signaled seq=92850, last emitted seq=92853
[ 3459.767028] amdgpu 0000:06:00.0: GPU reset begin!

Another one:

[275981.536711] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout,
last signaled seq=5720217, last emitted seq=5720220
[275981.536720] amdgpu 0000:06:00.0: GPU reset begin!
[276099.291632] INFO: task kworker/u32:3:15729 blocked for more than 120
seconds.
[276099.291639]       Tainted: G        W   E     4.18.0-rc1 #1
[276099.291641] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables
this message.
[276099.291643] kworker/u32:3   D    0 15729      2 0x80000000
[276099.291661] Workqueue: events_unbound commit_work [drm_kms_helper]
[276099.291664] Call Trace:
[276099.291674]  ? __schedule+0x2b7/0x890
[276099.291680]  ? __update_load_avg_se.isra.38+0x1cf/0x1e0
[276099.291684]  schedule+0x28/0x80
[276099.291688]  schedule_timeout+0x1ee/0x380
[276099.291754]  ? generic_reg_get+0x20/0x30 [amdgpu]
[276099.291815]  ? optc1_get_crtc_scanoutpos+0x68/0xa0 [amdgpu]
[276099.291820]  dma_fence_default_wait+0x1fd/0x280
[276099.291823]  ? dma_fence_release+0x90/0x90
[276099.291826]  dma_fence_wait_timeout+0x39/0xf0
[276099.291830]  reservation_object_wait_timeout_rcu+0x17b/0x370
[276099.291892]  amdgpu_dm_do_flip+0x112/0x350 [amdgpu]
[276099.291898]  ? __wake_up_common+0x76/0x170
[276099.291955]  amdgpu_dm_atomic_commit_tail+0xb91/0xd90 [amdgpu]
[276099.291961]  ? __switch_to+0x16f/0x440
[276099.291970]  commit_tail+0x3d/0x70 [drm_kms_helper]
[276099.291974]  process_one_work+0x195/0x370
[276099.291978]  worker_thread+0x30/0x390
[276099.291981]  ? process_one_work+0x370/0x370
[276099.291984]  kthread+0x113/0x130
[276099.291987]  ? kthread_create_worker_on_cpu+0x70/0x70
[276099.291990]  ret_from_fork+0x22/0x40

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20180630/0c99dddd/attachment.html>


More information about the dri-devel mailing list