Deadlocks with multiple applications on AMD RX 460 and RX 550 - Update 2

Deucher, Alexander Alexander.Deucher at amd.com
Wed Feb 7 16:41:04 UTC 2018


We haven't had a chance to look yet.


Alex

________________________________
From: Luís Mendes <luis.p.mendes at gmail.com>
Sent: Wednesday, February 7, 2018 10:50:48 AM
To: Koenig, Christian
Cc: Alex Deucher; Deucher, Alexander; Zhou, David(ChunMing); Michel Dänzer; amd-gfx at lists.freedesktop.org
Subject: Re: Deadlocks with multiple applications on AMD RX 460 and RX 550 - Update 2

Hi Christian, Alexander,

kmemleak reported leaked data structures, and the GPU hung shortly
afterwards. Could this be caused by DC?
Details are in the attachments.


I'm not sure whether my previous email was overlooked or whether there
are simply no suggestions at the moment. Sorry for more or less
re-sending it.


Regards,
Luís

On Mon, Feb 5, 2018 at 12:40 PM, Luís Mendes <luis.p.mendes at gmail.com> wrote:
> Hi everyone,
>
> I have some updates. I left the system idle most of the time over
> the weekend; from time to time I played a video on YouTube and
> turned off the screen. Last night I did the same, and when I checked
> the system this morning it had hung during the night. This
> time it took a lot longer to hang, but I think the hang was related to
> a Flash animation ad that was present on the YouTube page only the
> last time I switched off the screen. In all my attempts to reproduce
> the crash, amdgpu has always seemed to hang when that Flash animation
> is present.
> According to kmemleak there is a memory leak; I attach its report
> along with the dmesg log from the crash.
>
> The kernel and patches are the same as in my previous email. I ended
> up changing neither the Mesa version nor the kernel version and
> patches.
>
> Regards,
> Luís
>
>
> On Fri, Feb 2, 2018 at 6:46 PM, Luís Mendes <luis.p.mendes at gmail.com> wrote:
>> Hi Christian, Alexander,
>>
>> I have enabled kmemleak, but it didn't detect anything unusual.
>> In fact, this time, for reasons I don't understand, I didn't get any
>> allocation failure at all, but the GPU did hang after around 4h 6m of
>> uptime with Xorg.
>> The log is attached. I will try again to see whether the
>> allocation failure reappears, or whether it has become less apparent
>> due to the kmemleak scans.
>>
>> The kernel stack trace is similar to the one from the GPU hangs I was
>> getting on earlier kernel versions when watching videos with Kodi or
>> Firefox; if I left Xorg idle, though, it would remain up and
>> available without hanging for more than a day.
>> This stack trace also looks quite similar to what Daniel Andersson
>> reported in "[BUG] Intermittent hang/deadlock when opening browser tab
>> with Vega gpu". It looks like another manifestation of the same bug on
>> different architectures.
>>
>> Regards,
>> Luís
>>
>> On Fri, Feb 2, 2018 at 7:48 AM, Christian König
>> <ckoenig.leichtzumerken at gmail.com> wrote:
>>> Hi Luis,
>>>
>>> Please enable kmemleak in your build and watch for any suspicious
>>> messages in the system log.
>>>
>>> Regards,
>>> Christian.
>>>
>>>
>>> On 02.02.2018 at 00:03, Luís Mendes wrote:
>>>>
>>>> Hi Alexander,
>>>>
>>>> I didn't notice improvements on this issue with that particular patch
>>>> applied. It still ends up failing to allocate kernel memory after a
>>>> few hours of uptime with Xorg.
>>>>
>>>> I will try to upgrade to mesa 18.0.0-rc3 and to amd-staging-drm-next
>>>> head, to see if the issue still occurs with those versions.
>>>>
>>>> If you have additional suggestions I'll be happy to try them.
>>>>
>>>> Regards,
>>>> Luís Mendes
>>>>
>>>> On Thu, Feb 1, 2018 at 2:30 AM, Alex Deucher <alexdeucher at gmail.com>
>>>> wrote:
>>>>>
>>>>> On Wed, Jan 31, 2018 at 6:57 PM, Luís Mendes <luis.p.mendes at gmail.com>
>>>>> wrote:
>>>>>>
>>>>>> Hi everyone,
>>>>>>
>>>>>> I am seeing a new issue with amdgpu on the RX 460. I can now
>>>>>> play any video with Kodi, play web videos with Firefox, and run
>>>>>> OpenGL applications without running into any issues. However, after
>>>>>> some uptime with Xorg, even when it is almost inactive, I get a
>>>>>> kmalloc allocation failure, normally followed a while later by a
>>>>>> GPU hang.
>>>>>> I had a terminal window open under Ubuntu MATE 17.10 and was
>>>>>> compiling code when I got the attached kernel messages.
>>>>>>
>>>>>> I am using the kernel identified in my previous email, which can be
>>>>>> found below.
>>>>>
>>>>> Does this patch help?
>>>>> https://patchwork.freedesktop.org/patch/198258/
>>>>>
>>>>> Alex
>>>>>
>>>>>> Regards,
>>>>>> Luís Mendes
>>>>>>
>>>>>> On Wed, Jan 31, 2018 at 12:47 PM, Luís Mendes <luis.p.mendes at gmail.com>
>>>>>> wrote:
>>>>>>>
>>>>>>> Hi Alexander,
>>>>>>>
>>>>>>> I've cherry-picked the patch you pointed out onto the kernel from
>>>>>>> amd-drm-next-4.17-wip at commit
>>>>>>> 9ab2894122275a6d636bb2654a157e88a0f7b9e2 (drm/amdgpu: set
>>>>>>> DRIVER_ATOMIC flag early) and tested it on ARMv7l. The problem is
>>>>>>> indeed gone.
>>>>>>>
>>>>>>>
>>>>>>> Working great on ARMv7l with AMD RX460.
>>>>>>>
>>>>>>> Thanks,
>>>>>>> Luís Mendes
>>>>>>>
>>>>>>>
>>>>>>> On Tue, Jan 30, 2018 at 6:44 PM, Deucher, Alexander
>>>>>>> <Alexander.Deucher at amd.com> wrote:
>>>>>>>>
>>>>>>>> Fixed with this patch:
>>>>>>>>
>>>>>>>>
>>>>>>>> https://lists.freedesktop.org/archives/amd-gfx/2018-January/018472.html
>>>>>>>>
>>>>>>>>
>>>>>>>> Alex
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> amd-gfx mailing list
>>>>>> amd-gfx at lists.freedesktop.org
>>>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
>>>>>>