After Vega 56/64 GPU hang I unable reboot system

Grodzovsky, Andrey Andrey.Grodzovsky at amd.com
Wed Jan 9 20:35:07 UTC 2019



On 01/09/2019 02:36 PM, Mikhail Gavrilov wrote:
> On Mon, 7 Jan 2019 at 23:47, Grodzovsky, Andrey
> <Andrey.Grodzovsky at amd.com> wrote:
>> I see 'no active waves' print meaning it's not shader hang.
>>
>> We can try and estimate around which commands the hang occurred - in
>> addition to what you already print please also dump
>>
>> sudo umr -O many,bits  -r *.*.mmGRBM_STATUS* && sudo umr -O many,bits
>> -r *.*.mmCP_EOP_* && sudo umr -O many,bits -r *.*.mmCP_PFP_HEADER_DUMP
>> && sudo umr -O many,bits  -r *.*.mmCP_ME_HEADER_DUMP
>>
>> Andrey
>>
> All new one logs attached here.
>
> Thanks.
>
> P.S. This time I had to terminate command `./umr -O verbose,follow -R
> gfx[.] > gfx.log 2>&1` cause it tried to write log infinitely.
> I also had to terminate command `./umr -O verbose,follow -R gfx[.] >
> gfx.log 2>&1` cause it stuck for a long time.
>
>
> --
> Best Regards,
> Mike Gavrilov.

I think the 'verbose' flag is what causes it to dump so much output; maybe try without it in ALL the commands above.
Are you aware of any particular application that is running when this happens?
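
As a rough illustration of dropping 'verbose', here is a minimal sketch based on the ring dump command from your P.S. The drop_verbose helper is hypothetical (it is not part of umr), and the script only prints the resulting command rather than running it. The ring pattern is quoted on the assumption that an unquoted gfx[.] could be glob-expanded by the shell.

```shell
#!/bin/sh
# Hypothetical helper (not part of umr): remove 'verbose' from a
# comma-separated -O option list and leave the other options untouched.
drop_verbose() {
    printf '%s\n' "$1" | tr ',' '\n' | grep -vx 'verbose' | paste -sd, -
}

# Options from the P.S. command, with 'verbose' stripped.
opts=$(drop_verbose 'verbose,follow')

# Print the command instead of executing it, so it can be reviewed first.
# The ring pattern is single-quoted to keep the shell from expanding it.
echo "./umr -O $opts -R 'gfx[.]' > gfx.log 2>&1"
```

Option lists that do not contain 'verbose', such as many,bits in the register dumps, pass through the helper unchanged.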

Andrey



