Deadlocks with multiple applications on AMD RX 460 and RX 550 - Update 2

Luís Mendes luis.p.mendes at gmail.com
Mon Feb 5 12:40:57 UTC 2018


Hi everyone,

I have some updates. I left the system idle most of the time during
the weekend and from time to time I played a video on youtube and
turned off the screen. Yesterday night I did the same and today
morning I checked the system and it got hung up during the night. This
time it took a lot longer to hang, but I think it was related to a
Flash animation add that was only present on the youtube page the last
time I switched off the screen. The amdgpu always seem to hang when
that flash animation is present, from all the crash attempts I have
made.
There is a memory leak according to kmemleak which I attach along with
the crash dmesg log.

The kernel and patches are the same as on my previous email. I ended
up not changing either the mesa version, nor the kernel version and
patches.

Regards,
Luís


On Fri, Feb 2, 2018 at 6:46 PM, Luís Mendes <luis.p.mendes at gmail.com> wrote:
> Hi Christian, Alexander,
>
> I have enabled kmemleak, but memleak didn't detect anything special,
> in fact this time, I don't know why, I didn't get any allocation
> failure at all, but the GPU did hang after around 4h 6m of uptime with
> Xorg.
> The log can be found in attachment. I will try again to see if the
> allocation failure reappears, or if it has become less apparent due to
> kmemleak scans.
>
> The kernel stack trace is similar to the GPU hangs I was getting on
> earlier kernel versions with Kodi, or Firefox when watching videos
> with either one, but if I left Xorg idle, it would remain up and
> available without hanging for more than one day.
> This stack trace also looks quite similar to what Daniel Andersson
> reported in "[BUG] Intermittent hang/deadlock when opening browser tab
> with Vega gpu", looks like another demonstration of the same bug on
> different architectures.
>
> Regards,
> Luís
>
> On Fri, Feb 2, 2018 at 7:48 AM, Christian König
> <ckoenig.leichtzumerken at gmail.com> wrote:
>> Hi Luis,
>>
>> please enable kmemleak in your build and watch out for any suspicious
>> messages in the system log.
>>
>> Regards,
>> Christian.
>>
>>
>> Am 02.02.2018 um 00:03 schrieb Luís Mendes:
>>>
>>> Hi Alexander,
>>>
>>> I didn't notice improvements on this issue with that particular patch
>>> applied. It still ends up failing to allocate kernel memory after a
>>> few hours of uptime with Xorg.
>>>
>>> I will try to upgrade to mesa 18.0.0-rc3 and to amd-staging-drm-next
>>> head, to see if the issue still occurs with those versions.
>>>
>>> If you have additional suggestions I'll be happy to try them.
>>>
>>> Regards,
>>> Luís Mendes
>>>
>>> On Thu, Feb 1, 2018 at 2:30 AM, Alex Deucher <alexdeucher at gmail.com>
>>> wrote:
>>>>
>>>> On Wed, Jan 31, 2018 at 6:57 PM, Luís Mendes <luis.p.mendes at gmail.com>
>>>> wrote:
>>>>>
>>>>> Hi everyone,
>>>>>
>>>>> I am getting a new issue with amdgpu with RX460, that is, now I can
>>>>> play any videos with Kodi or play web videos with firefox and run
>>>>> OpenGL applications without running into any issues, however after
>>>>> some uptime with XOrg even when almost inactive I get a kmalloc
>>>>> allocation failure, normally followed by a GPU hang a while after the
>>>>> the allocation failure.
>>>>> I had a terminal window under Ubuntu Mate 17.10 and I was compiling
>>>>> code when I got the kernel messages that can be found in attachment.
>>>>>
>>>>> I am using the kernel as identified on my previous email, which can be
>>>>> found below.
>>>>
>>>> does this patch help?
>>>> https://patchwork.freedesktop.org/patch/198258/
>>>>
>>>> Alex
>>>>
>>>>> Regards,
>>>>> Luís Mendes
>>>>>
>>>>> On Wed, Jan 31, 2018 at 12:47 PM, Luís Mendes <luis.p.mendes at gmail.com>
>>>>> wrote:
>>>>>>
>>>>>> Hi Alexander,
>>>>>>
>>>>>> I've cherry picked the patch you pointed out into kernel from
>>>>>> amd-drm-next-4.17-wip at commit
>>>>>> 9ab2894122275a6d636bb2654a157e88a0f7b9e2 ( drm/amdgpu: set
>>>>>> DRIVER_ATOMIC flag early) and tested it on ARMv7l and the problem has
>>>>>> gone indeed.
>>>>>>
>>>>>>
>>>>>> Working great on ARMv7l with AMD RX460.
>>>>>>
>>>>>> Thanks,
>>>>>> Luís Mendes
>>>>>>
>>>>>>
>>>>>> On Tue, Jan 30, 2018 at 6:44 PM, Deucher, Alexander
>>>>>> <Alexander.Deucher at amd.com> wrote:
>>>>>>>
>>>>>>> Fixed with this patch:
>>>>>>>
>>>>>>>
>>>>>>> https://lists.freedesktop.org/archives/amd-gfx/2018-January/018472.html
>>>>>>>
>>>>>>>
>>>>>>> Alex
>>>>>
>>>>> <>
>>>>>>>
>>>>>>> __________________
>>>>>
>>>>> _______________________________________________
>>>>> amd-gfx mailing list
>>>>> amd-gfx at lists.freedesktop.org
>>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
>>>>>
>>> _______________________________________________
>>> amd-gfx mailing list
>>> amd-gfx at lists.freedesktop.org
>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
>>
>>
-------------- next part --------------
ubuntu at linux:~$ sudo cat /sys/kernel/debug/kmemleak
[sudo] password for ubuntu:
unreferenced object 0xb0fac380 (size 128):
  comm "Xorg", pid 3750, jiffies 5608934 (age 178088.970s)
  hex dump (first 32 bytes):
    00 4e 9f b9 00 f0 33 bb 80 1a 15 97 00 00 00 00  .N....3.........
    fa 00 00 00 82 01 00 00 80 00 00 00 80 00 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu]
    [<5c55b4e7>] drm_atomic_get_plane_state+0x74/0x118 [drm]
    [<03e85711>] drm_atomic_add_affected_planes+0x84/0xb0 [drm]
    [<d9340120>] drm_atomic_helper_check_modeset+0x4d4/0xb04 [drm_kms_helper]
    [<3c94e005>] amdgpu_dm_atomic_check+0x44/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper]
    [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm]
    [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
unreferenced object 0xa44c5800 (size 1024):
  comm "Xorg", pid 3750, jiffies 5608934 (age 178088.970s)
  hex dump (first 32 bytes):
    00 70 4c a4 40 05 00 00 00 00 00 00 00 04 00 00  .pL. at ...........
    00 00 00 00 18 00 00 00 88 00 00 00 26 03 00 00  ............&...
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<a591e85d>] dc_create_stream_for_sink+0x30/0x15c [amdgpu]
    [<87d2cda1>] create_stream_for_sink+0x50/0x4b8 [amdgpu]
    [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu]
    [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper]
    [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm]
    [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
unreferenced object 0xb0fac080 (size 128):
  comm "Xorg", pid 3750, jiffies 5608935 (age 178088.960s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 08 00 08 00  ..3.............
    90 c0 fa b0 90 c0 fa b0 ff ff ff ff 0a 00 0a 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm]
    [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper]
    [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm]
    [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xba604680 (size 128):
  comm "Xorg", pid 3750, jiffies 5608936 (age 178088.960s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 04 00 04 00  ..3.............
    90 46 60 ba 90 46 60 ba ff ff ff ff 06 00 06 00  .F`..F`.........
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm]
    [<2b9bc4a3>] drm_atomic_connector_commit_dpms+0xec/0xfc [drm]
    [<9b7fdbd7>] drm_mode_obj_set_property_ioctl+0x1a8/0x318 [drm]
    [<be4c5272>] drm_mode_connector_property_set_ioctl+0x4c/0x68 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xb948fe00 (size 512):
  comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 01 01 00 21 00 00 00 02 00 00 00  ..3.....!.......
    02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<70e004db>] dm_crtc_duplicate_state+0x38/0x84 [amdgpu]
    [<b13e4c2f>] drm_atomic_get_crtc_state+0x78/0x10c [drm]
    [<1c6f2b44>] page_flip_common+0x28/0xcc [drm_kms_helper]
    [<cc45921a>] drm_atomic_helper_page_flip+0x50/0xac [drm_kms_helper]
    [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xb0face00 (size 128):
  comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s)
  hex dump (first 32 bytes):
    00 7a 81 b9 00 f0 33 bb 40 28 84 be 00 00 00 00  .z....3.@(......
    00 00 00 00 00 00 00 00 00 04 00 00 00 03 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu]
    [<5c55b4e7>] drm_atomic_get_plane_state+0x74/0x118 [drm]
    [<2802153b>] page_flip_common+0x50/0xcc [drm_kms_helper]
    [<cc45921a>] drm_atomic_helper_page_flip+0x50/0xac [drm_kms_helper]
    [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0xb0facf80 (size 128):
  comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 05 00 05 00  ..3.............
    90 cf fa b0 90 cf fa b0 ff ff ff ff 06 00 06 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<270dc6b5>] drm_atomic_nonblocking_commit+0x54/0x58 [drm]
    [<2e425aac>] drm_atomic_helper_page_flip+0x9c/0xac [drm_kms_helper]
    [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm]
    [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm]
    [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm]
    [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu]
    [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc
    [<95adff3a>] SyS_ioctl+0x3c/0x60
    [<4cde0ae2>] ret_fast_syscall+0x0/0x54
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bdcd800 (size 1024):
  comm "kworker/0:0", pid 5643, jiffies 5609303 (age 178085.640s)
  hex dump (first 32 bytes):
    04 00 00 00 00 01 00 00 00 ff ff ff ff ff ff 00  ................
    2e 83 54 21 34 00 00 00 29 15 01 03 80 30 1b 78  ..T!4...)....0.x
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<f1541e77>] dc_sink_create+0x2c/0x88 [amdgpu]
    [<b58321e8>] dc_link_detect+0x1f0/0x994 [amdgpu]
    [<1e160950>] handle_hpd_irq+0x40/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x933a6480 (size 64):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s)
  hex dump (first 32 bytes):
    01 00 00 00 00 a4 81 b9 01 00 00 00 00 c6 56 ba  ..............V.
    80 27 f4 9b 03 00 00 00 80 62 3a 93 00 00 00 00  .'.......b:.....
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<1a912412>] dm_atomic_state_alloc+0x2c/0x60 [amdgpu]
    [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm]
    [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bf42780 (size 128):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s)
  hex dump (first 32 bytes):
    00 f0 33 bb 00 fe 48 b9 00 fe 48 b9 00 fe 71 b1  ..3...H...H...q.
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<bac96619>] __kmalloc+0x1a0/0x284
    [<6315c347>] drm_atomic_state_init+0x50/0xcc [drm]
    [<b3875b9a>] dm_atomic_state_alloc+0x40/0x60 [amdgpu]
    [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm]
    [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0xba56c600 (size 192):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s)
  hex dump (first 32 bytes):
    00 7a 81 b9 00 ce fa b0 00 ce fa b0 00 28 f4 9b  .z...........(..
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<bac96619>] __kmalloc+0x1a0/0x284
    [<81bcb9b0>] drm_atomic_state_init+0x78/0xcc [drm]
    [<b3875b9a>] dm_atomic_state_alloc+0x40/0x60 [amdgpu]
    [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm]
    [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x933a6280 (size 64):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.780s)
  hex dump (first 32 bytes):
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 50 8c b9 80 c2 fa b0 80 c2 fa b0 80 20 f4 9b  .P........... ..
  backtrace:
    [<b51d17be>] __kmalloc_track_caller+0x1a0/0x284
    [<bfb2e13f>] krealloc+0x54/0xc0
    [<b6fa0df1>] drm_atomic_get_connector_state+0x134/0x180 [drm]
    [<1d297503>] dm_restore_drm_connector_state+0x90/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0xb772e000 (size 8192):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.780s)
  hex dump (first 32 bytes):
    00 c4 dc 9b 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 00 00 00 00 00 00 00 00 00 00 00 03 00 00 00  ................
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<ecab1fe2>] dc_create_state+0x28/0x3c [amdgpu]
    [<3f29ed80>] amdgpu_dm_atomic_check+0xe0/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bdcc400 (size 1024):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.790s)
  hex dump (first 32 bytes):
    00 d8 dc 9b 40 05 00 00 00 00 00 00 00 04 00 00  .... at ...........
    00 00 00 00 18 00 00 00 88 00 00 00 26 03 00 00  ............&...
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<a591e85d>] dc_create_stream_for_sink+0x30/0x15c [amdgpu]
    [<87d2cda1>] create_stream_for_sink+0x50/0x4b8 [amdgpu]
    [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu]
    [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0x9bf30000 (size 24632):
  comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.790s)
  hex dump (first 32 bytes):
    01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<8ff04208>] kmalloc_order+0x4c/0x54
    [<24677274>] kmalloc_order_trace+0x24/0xc8
    [<b006d2b2>] dc_create_transfer_func+0x20/0x30 [amdgpu]
    [<a9439ec9>] create_stream_for_sink+0xc0/0x4b8 [amdgpu]
    [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu]
    [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu]
    [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm]
    [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
unreferenced object 0xba604f80 (size 128):
  comm "kworker/0:0", pid 5643, jiffies 5609324 (age 178086.120s)
  hex dump (first 32 bytes):
    00 f0 33 bb 01 00 00 00 ff ff ff ff 08 00 08 00  ..3.............
    90 4f 60 ba 90 4f 60 ba ff ff ff ff 0a 00 0a 00  .O`..O`.........
  backtrace:
    [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c
    [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper]
    [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper]
    [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu]
    [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm]
    [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu]
    [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu]
    [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu]
    [<4bf849a6>] process_one_work+0x204/0x510
    [<37e481ef>] worker_thread+0x5c/0x5f0
    [<292a33bf>] kthread+0x164/0x16c
    [<c1559dd0>] ret_from_fork+0x14/0x2c
    [<1738b98c>] 0xffffffff
-------------- next part --------------
Feb  4 23:36:30 linux kernel: [188528.461609] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=2186227, last emitted seq=2186230
Feb  4 23:36:30 linux kernel: [188528.472965] [drm] IP block:gmc_v8_0 is hung!
Feb  4 23:36:30 linux kernel: [188528.472971] [drm] IP block:gfx_v8_0 is hung!
Feb  4 23:36:30 linux kernel: [188528.473019] [drm] GPU recovery disabled.
Feb  4 23:52:11 linux kernel: [189469.863152] INFO: task amdgpu_cs:0:3799 blocked for more than 120 seconds.
Feb  4 23:52:11 linux kernel: [189469.870134]       Not tainted 4.15.0-rc8-next2g-g9ab2894-dirty #3
Feb  4 23:52:11 linux kernel: [189469.876354] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Feb  4 23:52:11 linux kernel: [189469.884304] amdgpu_cs:0     D    0  3799   3088 0x00000000
Feb  4 23:52:11 linux kernel: [189469.884309] Backtrace: 
Feb  4 23:52:11 linux kernel: [189469.884320] [<80b571c8>] (__schedule) from [<80b578cc>] (schedule+0x44/0xa4)
Feb  4 23:52:11 linux kernel: [189469.884325]  r10:600c0013 r9:b6108000 r8:b6109bd4 r7:00000000 r6:7fffffff r5:81004c48
Feb  4 23:52:11 linux kernel: [189469.884327]  r4:ffffe000
Feb  4 23:52:11 linux kernel: [189469.884333] [<80b57888>] (schedule) from [<80b5b4f0>] (schedule_timeout+0x1e0/0x2e8)
Feb  4 23:52:11 linux kernel: [189469.884336]  r5:81004c48 r4:7fffffff
Feb  4 23:52:11 linux kernel: [189469.884344] [<80b5b310>] (schedule_timeout) from [<8065df3c>] (dma_fence_default_wait+0x218/0x2b0)
Feb  4 23:52:11 linux kernel: [189469.884348]  r10:600c0013 r9:b6108000 r8:b6109bd4 r7:00000000 r6:7fffffff r5:81004c48
Feb  4 23:52:11 linux kernel: [189469.884350]  r4:94953e80
Feb  4 23:52:11 linux kernel: [189469.884355] [<8065dd24>] (dma_fence_default_wait) from [<8065d6b4>] (dma_fence_wait_timeout+0x48/0x15c)
Feb  4 23:52:11 linux kernel: [189469.884360]  r10:ba77b000 r9:b9820000 r8:00000001 r7:91374968 r6:00000000 r5:94953e80
Feb  4 23:52:11 linux kernel: [189469.884361]  r4:81096c18
Feb  4 23:52:11 linux kernel: [189469.884566] [<8065d66c>] (dma_fence_wait_timeout) from [<7f1b5bc8>] (amdgpu_ctx_wait_prev_fence+0x48/0x80 [amdgpu])
Feb  4 23:52:11 linux kernel: [189469.884570]  r7:91374968 r6:00000001 r5:b6bc60c0 r4:00000001
Feb  4 23:52:11 linux kernel: [189469.884829] [<7f1b5b80>] (amdgpu_ctx_wait_prev_fence [amdgpu]) from [<7f19e780>] (amdgpu_cs_ioctl+0x428/0x1edc [amdgpu])
Feb  4 23:52:11 linux kernel: [189469.884832]  r5:b6bc60c0 r4:00000001
Feb  4 23:52:11 linux kernel: [189469.884995] [<7f19e358>] (amdgpu_cs_ioctl [amdgpu]) from [<7f045920>] (drm_ioctl_kernel+0x68/0xb4 [drm])
Feb  4 23:52:11 linux kernel: [189469.884999]  r10:00000018 r9:b6109e2c r8:7f19e358 r7:00000021 r6:00000000 r5:b981a400
Feb  4 23:52:11 linux kernel: [189469.885001]  r4:ba6ca240
Feb  4 23:52:11 linux kernel: [189469.885041] [<7f0458b8>] (drm_ioctl_kernel [drm]) from [<7f045dec>] (drm_ioctl+0x2cc/0x3b0 [drm])
Feb  4 23:52:11 linux kernel: [189469.885045]  r9:00000044 r8:c0186444 r7:ba6ca240 r6:7f19e358 r5:7f2fcba4 r4:81004c48
Feb  4 23:52:11 linux kernel: [189469.885193] [<7f045b20>] (drm_ioctl [drm]) from [<7f180010>] (amdgpu_drm_ioctl+0x10/0x14 [amdgpu])
Feb  4 23:52:11 linux kernel: [189469.885197]  r10:b9b28510 r9:b6108000 r8:732c5ac0 r7:0000000c r6:b6426480 r5:732c5ac0
Feb  4 23:52:11 linux kernel: [189469.885199]  r4:81004c48
Feb  4 23:52:11 linux kernel: [189469.885324] [<7f180000>] (amdgpu_drm_ioctl [amdgpu]) from [<8028e4b4>] (do_vfs_ioctl+0xb8/0x8cc)
Feb  4 23:52:11 linux kernel: [189469.885330] [<8028e3fc>] (do_vfs_ioctl) from [<8028ed04>] (SyS_ioctl+0x3c/0x60)
Feb  4 23:52:11 linux kernel: [189469.885334]  r10:00000000 r9:b6108000 r8:732c5ac0 r7:c0186444 r6:0000000c r5:b6426480
Feb  4 23:52:11 linux kernel: [189469.885336]  r4:b6426481
Feb  4 23:52:11 linux kernel: [189469.885343] [<8028ecc8>] (SyS_ioctl) from [<80108f00>] (ret_fast_syscall+0x0/0x54)
Feb  4 23:52:11 linux kernel: [189469.885347]  r9:b6108000 r8:801090e4 r7:00000036 r6:c0186444 r5:732c5ac0 r4:c0006400


ubuntu at linux:~$ uptime
 12:36:43 up 2 days, 17:22,  3 users,  load average: 1.02, 1.03, 1.00



More information about the amd-gfx mailing list