Deadlocks with multiple applications on AMD RX 460 and RX 550 - Update 2

Luís Mendes luis.p.mendes at gmail.com
Wed Feb 7 15:50:48 UTC 2018


Hi Christian, Alexander,

Kmemleak reported leaked data structures, and the GPU hung shortly afterwards.
Could this be caused by DC?
Details are in the attachments.


I'm not sure whether my previous email was overlooked or whether there
are simply no suggestions at the moment. Sorry for more or less
re-sending it.


Regards,
Luís

On Mon, Feb 5, 2018 at 12:40 PM, Luís Mendes <luis.p.mendes at gmail.com> wrote:
> Hi everyone,
>
> I have some updates. I left the system idle most of the time during
> the weekend, and from time to time I played a video on YouTube and
> turned off the screen. Last night I did the same, and this morning
> I checked the system and found that it had hung during the night. This
> time it took a lot longer to hang, but I think it was related to a
> Flash animation ad that was only present on the YouTube page the last
> time I switched off the screen. In all my attempts to reproduce the
> crash, amdgpu always seems to hang when that Flash animation is
> present.
> There is a memory leak according to kmemleak; I attach its report along
> with the crash dmesg log.
>
> The kernel and patches are the same as in my previous email. I ended
> up changing neither the Mesa version nor the kernel version and
> patches.
>
> Regards,
> Luís
>
>
> On Fri, Feb 2, 2018 at 6:46 PM, Luís Mendes <luis.p.mendes at gmail.com> wrote:
>> Hi Christian, Alexander,
>>
>> I have enabled kmemleak, but it didn't detect anything unusual.
>> In fact, this time, for reasons I don't know, I didn't get any allocation
>> failure at all, but the GPU did hang after around 4h 6m of uptime with
>> Xorg.
>> The log is attached. I will try again to see whether the
>> allocation failure reappears, or whether it has become less apparent due to
>> kmemleak scans.
>>
>> The kernel stack trace is similar to the GPU hangs I was getting on
>> earlier kernel versions with Kodi, or Firefox when watching videos
>> with either one, but if I left Xorg idle, it would remain up and
>> available without hanging for more than one day.
>> This stack trace also looks quite similar to what Daniel Andersson
>> reported in "[BUG] Intermittent hang/deadlock when opening browser tab
>> with Vega gpu"; it looks like another manifestation of the same bug on
>> different architectures.
>>
>> Regards,
>> Luís
>>
>> On Fri, Feb 2, 2018 at 7:48 AM, Christian König
>> <ckoenig.leichtzumerken at gmail.com> wrote:
>>> Hi Luis,
>>>
>>> please enable kmemleak in your build and watch out for any suspicious
>>> messages in the system log.
>>>
>>> Regards,
>>> Christian.
>>>
>>>
>>> Am 02.02.2018 um 00:03 schrieb Luís Mendes:
>>>>
>>>> Hi Alexander,
>>>>
>>>> I didn't notice any improvement on this issue with that particular patch
>>>> applied. It still ends up failing to allocate kernel memory after a
>>>> few hours of uptime with Xorg.
>>>>
>>>> I will try to upgrade to mesa 18.0.0-rc3 and to amd-staging-drm-next
>>>> head, to see if the issue still occurs with those versions.
>>>>
>>>> If you have additional suggestions I'll be happy to try them.
>>>>
>>>> Regards,
>>>> Luís Mendes
>>>>
>>>> On Thu, Feb 1, 2018 at 2:30 AM, Alex Deucher <alexdeucher at gmail.com>
>>>> wrote:
>>>>>
>>>>> On Wed, Jan 31, 2018 at 6:57 PM, Luís Mendes <luis.p.mendes at gmail.com>
>>>>> wrote:
>>>>>>
>>>>>> Hi everyone,
>>>>>>
>>>>>> I am getting a new issue with amdgpu on the RX460. I can now
>>>>>> play any video with Kodi, play web videos with Firefox and run
>>>>>> OpenGL applications without running into any issues. However, after
>>>>>> some uptime with Xorg, even when almost inactive, I get a kmalloc
>>>>>> allocation failure, normally followed by a GPU hang a while after
>>>>>> the allocation failure.
>>>>>> I had a terminal window open under Ubuntu MATE 17.10 and I was compiling
>>>>>> code when I got the kernel messages that can be found in the attachment.
>>>>>>
>>>>>> I am using the kernel identified in my previous email, which can be
>>>>>> found below.
>>>>>
>>>>> does this patch help?
>>>>> https://patchwork.freedesktop.org/patch/198258/
>>>>>
>>>>> Alex
>>>>>
>>>>>> Regards,
>>>>>> Luís Mendes
>>>>>>
>>>>>> On Wed, Jan 31, 2018 at 12:47 PM, Luís Mendes <luis.p.mendes at gmail.com>
>>>>>> wrote:
>>>>>>>
>>>>>>> Hi Alexander,
>>>>>>>
>>>>>>> I've cherry-picked the patch you pointed out into the kernel from
>>>>>>> amd-drm-next-4.17-wip at commit
>>>>>>> 9ab2894122275a6d636bb2654a157e88a0f7b9e2 (drm/amdgpu: set
>>>>>>> DRIVER_ATOMIC flag early) and tested it on ARMv7l; the problem is
>>>>>>> indeed gone.
>>>>>>>
>>>>>>>
>>>>>>> Working great on ARMv7l with AMD RX460.
>>>>>>>
>>>>>>> Thanks,
>>>>>>> Luís Mendes
>>>>>>>
>>>>>>>
>>>>>>> On Tue, Jan 30, 2018 at 6:44 PM, Deucher, Alexander
>>>>>>> <Alexander.Deucher at amd.com> wrote:
>>>>>>>>
>>>>>>>> Fixed with this patch:
>>>>>>>>
>>>>>>>>
>>>>>>>> https://lists.freedesktop.org/archives/amd-gfx/2018-January/018472.html
>>>>>>>>
>>>>>>>>
>>>>>>>> Alex
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> amd-gfx mailing list
>>>>>> amd-gfx at lists.freedesktop.org
>>>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
>>>>>>
>>>
>>>
-------------- next part --------------
ubuntu@linux:~$ sudo cat /sys/kernel/debug/kmemleak
[sudo] password for ubuntu:
unreferenced object 0xb0fac380 (size 128):
  comm "Xorg", pid 3750, jiffies 5608934 (age 178088.970s)
  hex dump (first 32 bytes):
    00 4e 9f b9 00 f0 33 bb 80 1a 15 97 00 00 00 00  .N....3.........
    fa 00 00 00 82 01 00 00 80 00 00 00 80 00 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu]
    [<5c55b4e7>] drm_atomic_get_plane_state+0x74/0x118 [drm]
    [<03e85711>] drm_atomic_add_affected_planes+0x84/0xb0 [drm]
    [<d9340120>] drm_atomic_helper_check_modeset+0x4d4/0xb04 [drm_kms_helper]
    [<3c94e005>] amdgpu_dm_atomic_check+0x44/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper]
    [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm]
    [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
unreferenced object 0xa44c5800 (size 1024):
  comm "Xorg", pid 3750, jiffies 5608934 (age 178088.970s)
  hex dump (first 32 bytes):
    00 70 4c a4 40 05 00 00 00 00 00 00 00 04 00 00  .pL.@...........
    00 00 00 00 18 00 00 00 88 00 00 00 26 03 00 00  ............&...
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<a591e85d>] dc_create_stream_for_sink+0x30/0x15c [amdgpu]
    [<87d2cda1>] create_stream_for_sink+0x50/0x4b8 [amdgpu]
    [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu]
    [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper]
    [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm]
    [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
unreferenced object 0xb0fac080 (size 128):
  comm "Xorg", pid 3750, jiffies 5608935 (age 178088.960s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 08 00 08 00  ..3.............
    90 c0 fa b0 90 c0 fa b0 ff ff ff ff 0a 00 0a 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm]
    [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper]
    [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm]
    [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xba604680 (size 128):
  comm "Xorg", pid 3750, jiffies 5608936 (age 178088.960s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 04 00 04 00  ..3.............
    90 46 60 ba 90 46 60 ba ff ff ff ff 06 00 06 00  .F`..F`.........
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm]
    [<2b9bc4a3>] drm_atomic_connector_commit_dpms+0xec/0xfc [drm]
    [<9b7fdbd7>] drm_mode_obj_set_property_ioctl+0x1a8/0x318 [drm]
    [<be4c5272>] drm_mode_connector_property_set_ioctl+0x4c/0x68 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xb948fe00 (size 512):
  comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 01 01 00 21 00 00 00 02 00 00 00  ..3.....!.......
    02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<70e004db>] dm_crtc_duplicate_state+0x38/0x84 [amdgpu]
    [<b13e4c2f>] drm_atomic_get_crtc_state+0x78/0x10c [drm]
    [<1c6f2b44>] page_flip_common+0x28/0xcc [drm_kms_helper]
    [<cc45921a>] drm_atomic_helper_page_flip+0x50/0xac [drm_kms_helper]
    [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xb0face00 (size 128):
  comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s)
  hex dump (first 32 bytes):
    00 7a 81 b9 00 f0 33 bb 40 28 84 be 00 00 00 00  .z....3.@(......
    00 00 00 00 00 00 00 00 00 04 00 00 00 03 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu]
    [<5c55b4e7>] drm_atomic_get_plane_state+0x74/0x118 [drm]
    [<2802153b>] page_flip_common+0x50/0xcc [drm_kms_helper]
    [<cc45921a>] drm_atomic_helper_page_flip+0x50/0xac [drm_kms_helper]
    [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xb0facf80 (size 128):
  comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 05 00 05 00  ..3.............
    90 cf fa b0 90 cf fa b0 ff ff ff ff 06 00 06 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<270dc6b5>] drm_atomic_nonblocking_commit+0x54/0x58 [drm]
    [<2e425aac>] drm_atomic_helper_page_flip+0x9c/0xac [drm_kms_helper]
    [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bdcd800 (size 1024):
  comm "kworker/0:0", pid 5643, jiffies 5609303 (age 178085.640s)
  hex dump (first 32 bytes):
    04 00 00 00 00 01 00 00 00 ff ff ff ff ff ff 00  ................
    2e 83 54 21 34 00 00 00 29 15 01 03 80 30 1b 78  ..T!4...)....0.x
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<f1541e77>] dc_sink_create+0x2c/0x88 [amdgpu]
    [<b58321e8>] dc_link_detect+0x1f0/0x994 [amdgpu]
    [<1e160950>] handle_hpd_irq+0x40/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x933a6480 (size 64):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s)
  hex dump (first 32 bytes):
    01 00 00 00 00 a4 81 b9 01 00 00 00 00 c6 56 ba  ..............V.
    80 27 f4 9b 03 00 00 00 80 62 3a 93 00 00 00 00  .'.......b:.....
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<1a912412>] dm_atomic_state_alloc+0x2c/0x60 [amdgpu]
    [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm]
    [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bf42780 (size 128):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s)
  hex dump (first 32 bytes):
    00 f0 33 bb 00 fe 48 b9 00 fe 48 b9 00 fe 71 b1  ..3...H...H...q.
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<bac96619>] __kmalloc+0x1a0/0x284
    [<6315c347>] drm_atomic_state_init+0x50/0xcc [drm]
    [<b3875b9a>] dm_atomic_state_alloc+0x40/0x60 [amdgpu]
    [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm]
    [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0xba56c600 (size 192):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s)
  hex dump (first 32 bytes):
    00 7a 81 b9 00 ce fa b0 00 ce fa b0 00 28 f4 9b  .z...........(..
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<bac96619>] __kmalloc+0x1a0/0x284
    [<81bcb9b0>] drm_atomic_state_init+0x78/0xcc [drm]
    [<b3875b9a>] dm_atomic_state_alloc+0x40/0x60 [amdgpu]
    [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm]
    [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x933a6280 (size 64):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.780s)
  hex dump (first 32 bytes):
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 50 8c b9 80 c2 fa b0 80 c2 fa b0 80 20 f4 9b  .P........... ..
  backtrace:
    [<b51d17be>] __kmalloc_track_caller+0x1a0/0x284
    [<bfb2e13f>] krealloc+0x54/0xc0
    [<b6fa0df1>] drm_atomic_get_connector_state+0x134/0x180 [drm]
    [<1d297503>] dm_restore_drm_connector_state+0x90/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0xb772e000 (size 8192):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.780s)
  hex dump (first 32 bytes):
    00 c4 dc 9b 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 00 00 00 00 00 00 00 00 00 00 00 03 00 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<ecab1fe2>] dc_create_state+0x28/0x3c [amdgpu]
    [<3f29ed80>] amdgpu_dm_atomic_check+0xe0/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bdcc400 (size 1024):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.790s)
  hex dump (first 32 bytes):
    00 d8 dc 9b 40 05 00 00 00 00 00 00 00 04 00 00  ....@...........
    00 00 00 00 18 00 00 00 88 00 00 00 26 03 00 00  ............&...
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<a591e85d>] dc_create_stream_for_sink+0x30/0x15c [amdgpu]
    [<87d2cda1>] create_stream_for_sink+0x50/0x4b8 [amdgpu]
    [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu]
    [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bf30000 (size 24632):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.790s)
  hex dump (first 32 bytes):
    01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<8ff04208>] kmalloc_order+0x4c/0x54
    [<24677274>] kmalloc_order_trace+0x24/0xc8
    [<b006d2b2>] dc_create_transfer_func+0x20/0x30 [amdgpu]
    [<a9439ec9>] create_stream_for_sink+0xc0/0x4b8 [amdgpu]
    [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu]
    [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0xba604f80 (size 128):
  comm "kworker/0:0", pid 5643, jiffies 5609324 (age 178086.120s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 08 00 08 00  ..3.............
    90 4f 60 ba 90 4f 60 ba ff ff ff ff 0a 00 0a 00  .O`..O`.........
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
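
The kmemleak report above is long, so a summary per allocation site may be easier to read. What follows is only a minimal Python sketch and not part of the original attachment: it assumes the report format shown above, and the names summarize, print_summary and ALLOCATOR_FRAMES are illustrative. It attributes each unreferenced object to the first backtrace frame that is not a generic allocator entry point and adds up the reported sizes.

```python
import re
from collections import defaultdict

# Record header, e.g. "unreferenced object 0xb0fac380 (size 128):"
OBJECT_RE = re.compile(r"^unreferenced object 0x[0-9a-f]+ \(size (\d+)\):")

# Backtrace frame, e.g.
# "    [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu]"
# The trailing "[module]" is absent for frames in the core kernel.
FRAME_RE = re.compile(
    r"^\s*\[<[0-9a-f]+>\] (\S+?)\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(\w+)\])?"
)

# Generic allocator entry points seen in the report (assumed list); they do
# not identify the caller that leaked the object, so they are skipped.
ALLOCATOR_FRAMES = {
    "kmem_cache_alloc_trace",
    "__kmalloc",
    "__kmalloc_track_caller",
    "krealloc",
    "kmalloc_order",
    "kmalloc_order_trace",
}


def summarize(report):
    """Map (module, function) -> [object count, total bytes].

    Each object is attributed to its first non-allocator backtrace frame.
    """
    totals = defaultdict(lambda: [0, 0])
    pending_size = None  # size of the object still waiting for a frame
    for line in report.splitlines():
        header = OBJECT_RE.match(line)
        if header:
            pending_size = int(header.group(1))
            continue
        frame = FRAME_RE.match(line)
        if frame is None or pending_size is None:
            continue
        func, module = frame.group(1), frame.group(2) or "kernel"
        if func in ALLOCATOR_FRAMES:
            continue
        totals[(module, func)][0] += 1
        totals[(module, func)][1] += pending_size
        pending_size = None  # attribute each object only once
    return dict(totals)


def print_summary(report):
    """Print allocation sites, largest total first."""
    rows = sorted(summarize(report).items(), key=lambda kv: kv[1][1], reverse=True)
    for (module, func), (count, size) in rows:
        print(f"{size:8d} bytes  {count:3d} objects  {func} [{module}]")
```

Saving the output of the cat command to a file and passing its contents to print_summary would list the allocation sites ordered by leaked bytes.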
-------------- next part --------------
Feb  4 23:36:30 linux kernel: [188528.461609] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=2186227, last emitted seq=2186230
Feb  4 23:36:30 linux kernel: [188528.472965] [drm] IP block:gmc_v8_0 is hung!
Feb  4 23:36:30 linux kernel: [188528.472971] [drm] IP block:gfx_v8_0 is hung!
Feb  4 23:36:30 linux kernel: [188528.473019] [drm] GPU recovery disabled.
Feb  4 23:52:11 linux kernel: [189469.863152] INFO: task amdgpu_cs:0:3799 blocked for more than 120 seconds.
Feb  4 23:52:11 linux kernel: [189469.870134]       Not tainted 4.15.0-rc8-next2g-g9ab2894-dirty #3
Feb  4 23:52:11 linux kernel: [189469.876354] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Feb  4 23:52:11 linux kernel: [189469.884304] amdgpu_cs:0     D    0  3799   3088 0x00000000
Feb  4 23:52:11 linux kernel: [189469.884309] Backtrace: 
Feb  4 23:52:11 linux kernel: [189469.884320] [<80b571c8>] (__schedule) from [<80b578cc>] (schedule+0x44/0xa4)
Feb  4 23:52:11 linux kernel: [189469.884325]  r10:600c0013 r9:b6108000 r8:b6109bd4 r7:00000000 r6:7fffffff r5:81004c48
Feb  4 23:52:11 linux kernel: [189469.884327]  r4:ffffe000
Feb  4 23:52:11 linux kernel: [189469.884333] [<80b57888>] (schedule) from [<80b5b4f0>] (schedule_timeout+0x1e0/0x2e8)
Feb  4 23:52:11 linux kernel: [189469.884336]  r5:81004c48 r4:7fffffff
Feb  4 23:52:11 linux kernel: [189469.884344] [<80b5b310>] (schedule_timeout) from [<8065df3c>] (dma_fence_default_wait+0x218/0x2b0)
Feb  4 23:52:11 linux kernel: [189469.884348]  r10:600c0013 r9:b6108000 r8:b6109bd4 r7:00000000 r6:7fffffff r5:81004c48
Feb  4 23:52:11 linux kernel: [189469.884350]  r4:94953e80
Feb  4 23:52:11 linux kernel: [189469.884355] [<8065dd24>] (dma_fence_default_wait) from [<8065d6b4>] (dma_fence_wait_timeout+0x48/0x15c)
Feb  4 23:52:11 linux kernel: [189469.884360]  r10:ba77b000 r9:b9820000 r8:00000001 r7:91374968 r6:00000000 r5:94953e80
Feb  4 23:52:11 linux kernel: [189469.884361]  r4:81096c18
Feb  4 23:52:11 linux kernel: [189469.884566] [<8065d66c>] (dma_fence_wait_timeout) from [<7f1b5bc8>] (amdgpu_ctx_wait_prev_fence+0x48/0x80 [amdgpu])
Feb  4 23:52:11 linux kernel: [189469.884570]  r7:91374968 r6:00000001 r5:b6bc60c0 r4:00000001
Feb  4 23:52:11 linux kernel: [189469.884829] [<7f1b5b80>] (amdgpu_ctx_wait_prev_fence [amdgpu]) from [<7f19e780>] (amdgpu_cs_ioctl+0x428/0x1edc [amdgpu])
Feb  4 23:52:11 linux kernel: [189469.884832]  r5:b6bc60c0 r4:00000001
Feb  4 23:52:11 linux kernel: [189469.884995] [<7f19e358>] (amdgpu_cs_ioctl [amdgpu]) from [<7f045920>] (drm_ioctl_kernel+0x68/0xb4 [drm])
Feb  4 23:52:11 linux kernel: [189469.884999]  r10:00000018 r9:b6109e2c r8:7f19e358 r7:00000021 r6:00000000 r5:b981a400
Feb  4 23:52:11 linux kernel: [189469.885001]  r4:ba6ca240
Feb  4 23:52:11 linux kernel: [189469.885041] [<7f0458b8>] (drm_ioctl_kernel [drm]) from [<7f045dec>] (drm_ioctl+0x2cc/0x3b0 [drm])
Feb  4 23:52:11 linux kernel: [189469.885045]  r9:00000044 r8:c0186444 r7:ba6ca240 r6:7f19e358 r5:7f2fcba4 r4:81004c48
Feb  4 23:52:11 linux kernel: [189469.885193] [<7f045b20>] (drm_ioctl [drm]) from [<7f180010>] (amdgpu_drm_ioctl+0x10/0x14 [amdgpu])
Feb  4 23:52:11 linux kernel: [189469.885197]  r10:b9b28510 r9:b6108000 r8:732c5ac0 r7:0000000c r6:b6426480 r5:732c5ac0
Feb  4 23:52:11 linux kernel: [189469.885199]  r4:81004c48
Feb  4 23:52:11 linux kernel: [189469.885324] [<7f180000>] (amdgpu_drm_ioctl [amdgpu]) from [<8028e4b4>] (do_vfs_ioctl+0xb8/0x8cc)
Feb  4 23:52:11 linux kernel: [189469.885330] [<8028e3fc>] (do_vfs_ioctl) from [<8028ed04>] (SyS_ioctl+0x3c/0x60)
Feb  4 23:52:11 linux kernel: [189469.885334]  r10:00000000 r9:b6108000 r8:732c5ac0 r7:c0186444 r6:0000000c r5:b6426480
Feb  4 23:52:11 linux kernel: [189469.885336]  r4:b6426481
Feb  4 23:52:11 linux kernel: [189469.885343] [<8028ecc8>] (SyS_ioctl) from [<80108f00>] (ret_fast_syscall+0x0/0x54)
Feb  4 23:52:11 linux kernel: [189469.885347]  r9:b6108000 r8:801090e4 r7:00000036 r6:c0186444 r5:732c5ac0 r4:c0006400
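
As a reading aid for the first lines of the dmesg log above: the difference between the last emitted and the last signaled sequence number should be the number of fences still outstanding on the ring when the timeout fired. The following is a minimal Python sketch, assuming the message format shown above; the function name parse_hang and the regular expressions are illustrative and not part of any amdgpu tooling.

```python
import re

# Matches e.g.
# "[188528.461609] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout,
#  last signaled seq=2186227, last emitted seq=2186230"
TIMEOUT_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\] .*\*ERROR\* ring (?P<ring>\S+) timeout, "
    r"last signaled seq=(?P<signaled>\d+), last emitted seq=(?P<emitted>\d+)"
)

# Matches e.g. "[drm] IP block:gfx_v8_0 is hung!"
HUNG_BLOCK_RE = re.compile(r"IP block:(?P<block>\w+) is hung!")


def parse_hang(log):
    """Return (ring timeouts, hung IP block names) found in a dmesg excerpt."""
    timeouts = []
    hung_blocks = []
    for line in log.splitlines():
        match = TIMEOUT_RE.search(line)
        if match:
            signaled = int(match.group("signaled"))
            emitted = int(match.group("emitted"))
            timeouts.append({
                "timestamp": float(match.group("ts")),  # seconds since boot
                "ring": match.group("ring"),
                "signaled": signaled,
                "emitted": emitted,
                # fences submitted to the ring but never signaled
                "pending": emitted - signaled,
            })
            continue
        match = HUNG_BLOCK_RE.search(line)
        if match:
            hung_blocks.append(match.group("block"))
    return timeouts, hung_blocks
```

Applied to the log above, this would report one timeout on the gfx ring together with the IP blocks that the driver flagged as hung.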


ubuntu@linux:~$ uptime
 12:36:43 up 2 days, 17:22,  3 users,  load average: 1.02, 1.03, 1.00


