After Vega 56/64 GPU hang I unable reboot system

Mikhail Gavrilov mikhail.v.gavrilov at gmail.com
Wed Dec 19 21:14:57 UTC 2018


On Thu, 20 Dec 2018 at 01:56, StDenis, Tom <tom.stdenis at amd.com> wrote:
>
> Sorry missed the gfx ring in the reply.
>
> Um what kernel version?
4.20.0-0.rc6

> Is this the latest umr?
yes, master branch, commit 546c30a71f7b87f97f2a96eab184c3973b014711

> Maybe capture a trace of umr to see what is happening.

Cannot seek to MMIO address: Bad file descriptor
[ERROR]: Could not open ring debugfs file
Program received signal SIGSEGV, Segmentation fault.
umr_pm4_decode_ring (asic=asic at entry=0x1c08a50, ringname=<optimized
out="">, no_halt=no_halt at entry=1) at
/home/mikhail/packaging-work/umr/src/lib/umr_read_pm4_stream.c:333
333 ringdata[0] %= ringsize;
(gdb) thread apply all bt full

Thread 1 (Thread 0x7ffff7a22740 (LWP 7844)):
#0  umr_pm4_decode_ring (asic=asic at entry=0x1c08a50,
ringname=<optimized out="">, no_halt=no_halt at entry=1) at
/home/mikhail/packaging-work/umr/src/lib/umr_read_pm4_stream.c:333
        ps = <optimized out="">
        ringdata = 0x0
        ringsize = 8191
#1  0x00000000004b4ac6 in umr_print_waves (asic=asic at entry=0x1c08a50)
at /home/mikhail/packaging-work/umr/src/app/print_waves.c:52
        x = <optimized out="">
        y = <optimized out="">
        shift = <optimized out="">
        thread = <optimized out="">
        pgm_addr = <optimized out="">
        shader_addr = <optimized out="">
        wd = <optimized out="">
        owd = <optimized out="">
        first = 1
        col = 0
        shader = 0x0
        stream = <optimized out="">
#2  0x0000000000496952 in main (argc=<optimized out="">,
argv=<optimized out="">) at
/home/mikhail/packaging-work/umr/src/app/main.c:285
        i = 3
        j = <optimized out="">
        k = <optimized out="">
        l = <optimized out="">
        asic = 0x1c08a50
        blockname = <optimized out="">
        str = <optimized out="">
        str2 = <optimized out="">
        asicname = "\000\000\000\000\004", '\000' <repeats 19=""
times="">, "F;\226\000\000\000\000\000\000\000\000\000\004", '\000'
<repeats 19="" times="">, "\a", '\000' <repeats 11="" times="">,
"\004", '\000' <repeats 19="" times="">,
"\027\362\321\000\000\000\000\000\000\000\000\000\004", '\000'
<repeats 31="" times="">, "\004", '\000' <repeats 31="" times="">,
"\004", '\000' <repeats 31="" times="">, "\004", '\000' <repeats 31=""
times="">...
        ipname = '\000' <repeats 24="" times="">, "F;\226", '\000'
<repeats 29="" times="">, "l-option", '\000' <repeats 24="" times="">,
"\006\000\000\000\000\000\000\200", '\000' <repeats 56="" times="">,
"\027\362\321", '\000' <repeats 29="" times="">, "\037", '\000'
<repeats 31="" times="">...
        regname = "\000\000\000\000\000 ", '\000' <repeats 18=""
times="">, "\017\004", '\000' <repeats 11="" times="">, " ", '\000'
<repeats 18="" times="">, "\220\377\377\377\377\377\377\377", '\000'
<repeats 16="" times="">, "\031", '\000' <repeats 15="" times="">,
"\a\000\000\000\000\000\000\000\037\000\000\000\000\000\000\000\003\000\000\000\000\000\000\000\030\220\275\001\000\000\000\000P\000\000\000\000\000\000\000\220\377\377\377\377\377\377\377\000\000\000\000\000\000\000\000\003\000\000\000w\000\000\000[\000\000\000\060",
'\000' <repeats 27="" times="">, "n\000\000\000|", '\000' <repeats
19="" times="">...
        req = {tv_sec = 0, tv_nsec = 7310868735956184161}
(gdb)

> It works just fine on my raven1.
>

$ inxi -bM
System:    Host: localhost.localdomain Kernel:
4.20.0-0.rc6.git2.3.fc30.x86_64 x86_64 bits: 64 Desktop: Gnome 3.31.2
           Distro: Fedora release 30 (Rawhide)
Machine:   Type: Desktop Mobo: ASUSTeK model: ROG STRIX X470-I GAMING
v: Rev 1.xx serial: <root required="">
           UEFI: American Megatrends v: 1103 date: 11/16/2018
CPU:       8-Core: AMD Ryzen 7 2700X type: MT MCP speed: 2086 MHz
min/max: 2200/3700 MHz
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Vega 10 XL/XT
[Radeon RX Vega 56/64] driver: amdgpu v: kernel
           Display: wayland server: Fedora Project X.org 1.20.3
driver: amdgpu resolution: 3840x2160~60Hz
           OpenGL: renderer: Radeon RX Vega (VEGA10 DRM 3.27.0
4.20.0-0.rc6.git2.3.fc30.x86_64 LLVM 7.0.0) v: 4.5 Mesa 18.3.0
Network:   Device-1: Intel I211 Gigabit Network driver: igb
           Device-2: Realtek RTL8822BE 802.11a/b/g/n/ac WiFi adapter
driver: r8822be
Drives:    Local Storage: total: 11.35 TiB used: 7.54 TiB (66.4%)
Info:      Processes: 435 Uptime: 22m Memory: 31.35 GiB used: 19.69
GiB (62.8%) Shell: bash inxi: 3.0.29


--
Best Regards,
Mike Gavrilov.


More information about the amd-gfx mailing list