回复: Bug: amdgpu drm driver cause process into Disk sleep state

78666679 78666679 at qq.com
Tue Sep 3 08:27:33 UTC 2019


Hi, Christian:
     Thanks very much for you so fast reply. I will try this commit patch first. My kernel version is 4.19.36(for some reasons,  this version should not be upgraded).





----
yanhua

------------------ 原始邮件 ------------------
发件人: "Koenig, Christian"<Christian.Koenig at amd.com>;
发送时间: 2019年9月3日(星期二) 下午4:21
收件人: ""<78666679 at qq.com>;"amd-gfx"<amd-gfx at lists.freedesktop.org>;
抄送: "Deucher, Alexander"<Alexander.Deucher at amd.com>;
主题: Re: Bug: amdgpu drm driver cause process into Disk sleep state



Hi Yanhua,

please update your kernel first, cause that looks like a known issue 
which was recently fixed by patch "drm/scheduler: use job count instead 
of peek".

Probably best to try the latest bleeding edge kernel and if that doesn't 
help please open up a bug report on https://bugs.freedesktop.org/.

Regards,
Christian.

Am 03.09.19 um 09:35 schrieb 78666679:
> Hi, Sirs:
>         I have a wx5100 amdgpu card, It randomly come into failure.  sometimes, it will cause processes into uninterruptible wait state.
>
>
> cps-new-ondemand-0587:~ # ps aux|grep -w D
> root      11268  0.0  0.0 260628  3516 ?        Ssl  8月26   0:00 /usr/sbin/gssproxy -D
> root     136482  0.0  0.0 212500   572 pts/0    S+   15:25   0:00 grep --color=auto -w D
> root     370684  0.0  0.0  17972  7428 ?        Ss   9月02   0:04 /usr/sbin/sshd -D
> 10066    432951  0.0  0.0      0     0 ?        D    9月02   0:00 [FakeFinalizerDa]
> root     496774  0.0  0.0      0     0 ?        D    9月02   0:17 [kworker/8:1+eve]
> cps-new-ondemand-0587:~ # cat /proc/496774/stack
> [<0>] __switch_to+0x94/0xe8
> [<0>] drm_sched_entity_flush+0xf8/0x248 [gpu_sched]
> [<0>] amdgpu_ctx_mgr_entity_flush+0xac/0x148 [amdgpu]
> [<0>] amdgpu_flush+0x2c/0x50 [amdgpu]
> [<0>] filp_close+0x40/0xa0
> [<0>] put_files_struct+0x118/0x120
> [<0>] put_files_struct+0x30/0x68 [binder_linux]
> [<0>] binder_deferred_func+0x4d4/0x658 [binder_linux]
> [<0>] process_one_work+0x1b4/0x3f8
> [<0>] worker_thread+0x54/0x470
> [<0>] kthread+0x134/0x138
> [<0>] ret_from_fork+0x10/0x18
> [<0>] 0xffffffffffffffff
>
>
>
> This issue troubled me a long time.  looking eagerly to get help from you!
>
>
> -----
> Yanhua
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/amd-gfx/attachments/20190903/63bd0093/attachment-0001.html>


More information about the amd-gfx mailing list