[Bug 59649] [r600][RV635] GPU lockup CP stall / GPU resets over and over - Kernel 3.7 to 3.11 inclusive

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Sep 16 20:15:40 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=59649

--- Comment #14 from Shawn Starr <shawn.starr at rogers.com> ---
This is not Second Life related at all, I manged to get GPU to reset in the
following way:

1) Set /sys/class/drm/card0/device/power_dpm_state to Battery and leave
/sys/class/drm/card0/device/power_dpm_force_performance_level as 'auto mode.

2) have kwin enabled wih composite, rendering: XRender (not OpenGL as this will
show black windows with GLAMOR)
3) Browsed a webpage in Chromium/Chrome and it suddenly GPU reset

if I recall, In both places even when playing with Second Life, I set DPM power
state to Battery even though the laptop has AC plugged in as seen in this log
from the latest reset:

[   55.572222] bridge0: port 2(vnet0) entered forwarding state
[   55.572229] bridge0: port 2(vnet0) entered forwarding state
[   70.624026] bridge0: port 2(vnet0) entered forwarding state
[  591.264107] device vnet1 entered promiscuous mode
[  591.273419] bridge0: port 3(vnet1) entered forwarding state
[  591.273425] bridge0: port 3(vnet1) entered forwarding state
[  606.303032] bridge0: port 3(vnet1) entered forwarding state
[  610.073896] perf samples too long (2506 > 2500), lowering
kernel.perf_event_max_sample_rate to 50000
[ 1924.749108] switching from power state:
[ 1924.749113]  ui class: performance
[ 1924.749115]  internal class: none
[ 1924.749116]  caps: single_disp video 
[ 1924.749118]  uvd    vclk: 0 dclk: 0
[ 1924.749120]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[ 1924.749121]          power level 1    sclk: 30000 mclk: 70000 vddc: 1100
[ 1924.749123]          power level 2    sclk: 60000 mclk: 70000 vddc: 1100
[ 1924.749124]  status: c 
[ 1924.749125] switching to power state:
[ 1924.749126]  ui class: battery
[ 1924.749127]  internal class: none
[ 1924.749128]  caps: single_disp video 
[ 1924.749130]  uvd    vclk: 0 dclk: 0
[ 1924.749131]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[ 1924.749132]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[ 1924.749133]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[ 1924.749134]  status: r 
[ 6797.378014] hrtimer: interrupt took 14736 ns
[15919.834055] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[15919.839527] radeon 0000:01:00.0: GPU lockup (waiting for 0x000000000011ebf9)
[15919.839532] radeon 0000:01:00.0: failed to get a new IB (-35)
[15919.845308] [drm:radeon_cs_ib_chunk] *ERROR* Failed to get ib !
[15920.072129] radeon 0000:01:00.0: Saved 1081 dwords of commands on ring 0.
[15920.072146] radeon 0000:01:00.0: GPU softreset: 0x00000009
[15920.072149] radeon 0000:01:00.0:   R_008010_GRBM_STATUS      = 0xE4723030
[15920.072152] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2     = 0x00110103
[15920.072154] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS      = 0x200000C0
[15920.072156] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[15920.072159] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00008002
[15920.072161] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00008086
[15920.072163] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80018645
[15920.072166] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[15920.129823] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00007FEF
[15920.129880] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[15920.131986] radeon 0000:01:00.0:   R_008010_GRBM_STATUS      = 0xA0003030
[15920.131989] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2     = 0x00000003
[15920.131991] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS      = 0x200080C0
[15920.131993] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[15920.131995] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
[15920.131998] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
[15920.132011] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80100000
[15920.132014] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[15920.132021] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[15920.149897] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[15920.149928] radeon 0000:01:00.0: WB enabled
[15920.149931] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr
0x0000000020000c00 and cpu addr 0xffff88003715bc00
[15920.149934] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr
0x0000000020000c0c and cpu addr 0xffff88003715bc0c
[15920.181446] [drm] ring test on 0 succeeded in 1 usecs
[15920.389589] [drm:r600_dma_ring_test] *ERROR* radeon: ring 3 test failed
(0xCAFEDEAD)
[15920.397386] [drm:r600_resume] *ERROR* r600 startup failed on resume
[15930.402047] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[15930.409147] radeon 0000:01:00.0: GPU lockup (waiting for 0x000000000011ec1b
last fence id 0x000000000011ebff)
[15930.409150] [drm:r600_ib_test] *ERROR* radeon: fence wait failed (-35).
[15930.415268] [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on
GFX ring (-35).
[15930.422761] radeon 0000:01:00.0: ib ring test failed (-35).
[15930.430056] radeon 0000:01:00.0: GPU softreset: 0x00000009
[15930.430059] radeon 0000:01:00.0:   R_008010_GRBM_STATUS      = 0xA0783030
[15930.430061] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2     = 0x00000103
[15930.430064] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS      = 0x200020C0
[15930.430066] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[15930.430068] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00008002
[15930.430070] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00008086
[15930.430072] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80018645
[15930.430074] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[15930.635438] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00007FEF
[15930.635495] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[15930.637603] radeon 0000:01:00.0:   R_008010_GRBM_STATUS      = 0xA0003030
[15930.637606] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2     = 0x00000003
[15930.637608] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS      = 0x2000A0C0
[15930.637610] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[15930.637612] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
[15930.637614] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
[15930.637617] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80100000
[15930.637619] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[15930.637624] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[15930.800267] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[15930.800293] radeon 0000:01:00.0: WB enabled
[15930.800297] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr
0x0000000020000c00 and cpu addr 0xffff88003715bc00
[15930.800299] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr
0x0000000020000c0c and cpu addr 0xffff88003715bc0c
[15930.831855] [drm] ring test on 0 succeeded in 1 usecs
[15931.040164] [drm:r600_dma_ring_test] *ERROR* radeon: ring 3 test failed
(0xCAFEDEAD)
[15931.047446] [drm:r600_resume] *ERROR* r600 startup failed on resume
[15931.052132] [drm] ib test on ring 0 succeeded in 0 usecs
[15931.052586] switching from power state:
[15931.052588]  ui class: none
[15931.052590]  internal class: boot 
[15931.052591]  caps: video 
[15931.052593]  uvd    vclk: 0 dclk: 0
[15931.052594]          power level 0    sclk: 60000 mclk: 70000 vddc: 1100
[15931.052596]          power level 1    sclk: 60000 mclk: 70000 vddc: 1100
[15931.052597]          power level 2    sclk: 60000 mclk: 70000 vddc: 1100
[15931.052598]  status: c b 
[15931.052599] switching to power state:
[15931.052600]  ui class: battery
[15931.052601]  internal class: none
[15931.052602]  caps: single_disp video 
[15931.052604]  uvd    vclk: 0 dclk: 0
[15931.052605]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15931.052620]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15931.052621]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15931.052622]  status: r 
[15938.711325] switching from power state:
[15938.711327]  ui class: battery
[15938.711328]  internal class: none
[15938.711328]  caps: single_disp video 
[15938.711329]  uvd    vclk: 0 dclk: 0
[15938.711330]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15938.711331]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15938.711331]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15938.711332]  status: c 
[15938.711332] switching to power state:
[15938.711333]  ui class: battery
[15938.711333]  internal class: none
[15938.711334]  caps: video 
[15938.711335]  uvd    vclk: 0 dclk: 0
[15938.711335]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15938.711336]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15938.711336]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15938.711337]  status: r 
[15946.896158] switching from power state:
[15946.896164]  ui class: battery
[15946.896165]  internal class: none
[15946.896167]  caps: video 
[15946.896169]  uvd    vclk: 0 dclk: 0
[15946.896170]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15946.896172]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15946.896173]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15946.896174]  status: c 
[15946.896175] switching to power state:
[15946.896176]  ui class: battery
[15946.896177]  internal class: none
[15946.896178]  caps: single_disp video 
[15946.896180]  uvd    vclk: 0 dclk: 0
[15946.896181]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15946.896182]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15946.896184]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15946.896184]  status: r 
[15954.045444] switching from power state:
[15954.045446]  ui class: battery
[15954.045447]  internal class: none
[15954.045448]  caps: single_disp video 
[15954.045449]  uvd    vclk: 0 dclk: 0
[15954.045450]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15954.045450]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15954.045451]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15954.045451]  status: c 
[15954.045452] switching to power state:
[15954.045452]  ui class: battery
[15954.045453]  internal class: none
[15954.045454]  caps: video 
[15954.045454]  uvd    vclk: 0 dclk: 0
[15954.045455]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15954.045455]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15954.045456]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15954.045456]  status: r 
[15973.562587] switching from power state:
[15973.562591]  ui class: battery
[15973.562593]  internal class: none
[15973.562594]  caps: video 
[15973.562596]  uvd    vclk: 0 dclk: 0
[15973.562597]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15973.562599]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15973.562600]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15973.562601]  status: c 
[15973.562602] switching to power state:
[15973.562603]  ui class: battery
[15973.562604]  internal class: none
[15973.562605]  caps: single_disp video 
[15973.562607]  uvd    vclk: 0 dclk: 0
[15973.562608]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15973.562609]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15973.562610]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15973.562611]  status: r 
[15979.422353] switching from power state:
[15979.422355]  ui class: battery
[15979.422356]  internal class: none
[15979.422357]  caps: single_disp video 
[15979.422358]  uvd    vclk: 0 dclk: 0
[15979.422359]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15979.422359]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15979.422360]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15979.422361]  status: c 
[15979.422361] switching to power state:
[15979.422361]  ui class: battery
[15979.422362]  internal class: none
[15979.422363]  caps: video 
[15979.422363]  uvd    vclk: 0 dclk: 0
[15979.422364]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15979.422364]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15979.422365]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15979.422365]  status: r 
[15985.278874] switching from power state:
[15985.278878]  ui class: battery
[15985.278880]  internal class: none
[15985.278881]  caps: video 
[15985.278883]  uvd    vclk: 0 dclk: 0
[15985.278884]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15985.278886]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15985.278887]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15985.278888]  status: c 
[15985.278889] switching to power state:
[15985.278890]  ui class: battery
[15985.278891]  internal class: none
[15985.278892]  caps: single_disp video 
[15985.278894]  uvd    vclk: 0 dclk: 0
[15985.278895]          power level 0    sclk: 11000 mclk: 40500 vddc: 900
[15985.278896]          power level 1    sclk: 30000 mclk: 40500 vddc: 900
[15985.278897]          power level 2    sclk: 30000 mclk: 40500 vddc: 900
[15985.278898]  status: r

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/dri-devel/attachments/20130917/2fc93c23/attachment-0001.html>


More information about the dri-devel mailing list