<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - GPU fault unless R600_DEBUG=nodma"
href="https://bugs.freedesktop.org/show_bug.cgi?id=62997">62997</a>
</td>
</tr>
<tr>
<th>Assignee</th>
<td>dri-devel@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Summary</th>
<td>GPU fault unless R600_DEBUG=nodma
</td>
</tr>
<tr>
<th>Severity</th>
<td>major
</td>
</tr>
<tr>
<th>Classification</th>
<td>Unclassified
</td>
</tr>
<tr>
<th>OS</th>
<td>Linux (All)
</td>
</tr>
<tr>
<th>Reporter</th>
<td>udovdh@xs4all.nl
</td>
</tr>
<tr>
<th>Hardware</th>
<td>x86-64 (AMD64)
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Version</th>
<td>git
</td>
</tr>
<tr>
<th>Component</th>
<td>Drivers/Gallium/r600
</td>
</tr>
<tr>
<th>Product</th>
<td>Mesa
</td>
</tr></table>
<p>
<div>
<pre>Ever since booting into kernel.org 3.8.4 on my AMD A10-5800K (ARUBA graphics),
running git mesa and git xf86-video-ati, I get short uptimes (typically 15
minutes, about one hour at most) due to crashes.
The logs contain entries like:
[ 1332.480233] radeon 0000:00:01.0: GPU fault detected: 146 0x0134710c
[ 1332.480243] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000813
[ 1332.480250] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0407100C
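
A minimal Python sketch for pulling fault records like the ones above out of a
saved dmesg log. The function name parse_faults and the field names code, data
and registers are illustrative assumptions, not radeon driver terminology; the
patterns only cover the line shapes shown in this report, including register
values that were wrapped onto the next line.

```python
import re

# One pattern with two alternatives:
#   groups 1-2: the number and hex word after "GPU fault detected:"
#   groups 3-4: a VM_CONTEXTn_PROTECTION_FAULT_ADDR/STATUS name and its value
# \s+ lets the value sit on the same line or on the following (wrapped) line.
TOKEN_RE = re.compile(
    r"GPU fault detected: (\d+) (0x[0-9a-fA-F]+)"
    r"|(VM_CONTEXT\d_PROTECTION_FAULT_(?:ADDR|STATUS))\s+(0x[0-9a-fA-F]+)"
)


def parse_faults(log_text):
    """Return one dict per 'GPU fault detected' line, with the register
    values that follow it attached under the 'registers' key."""
    faults = []
    for match in TOKEN_RE.finditer(log_text):
        if match.group(1) is not None:
            # Start of a new fault record.
            faults.append({
                "code": int(match.group(1)),
                "data": int(match.group(2), 16),
                "registers": {},
            })
        elif faults:
            # Register line: attach it to the most recent fault record.
            faults[-1]["registers"][match.group(3)] = int(match.group(4), 16)
    return faults


SAMPLE = """\
[ 1332.480233] radeon 0000:00:01.0: GPU fault detected: 146 0x0134710c
[ 1332.480243] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000813
[ 1332.480250] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x0407100C
"""

if __name__ == "__main__":
    for fault in parse_faults(SAMPLE):
        print(fault["code"], hex(fault["data"]), fault["registers"])
```

Register lines that appear before any fault line (for example the dump printed
during a soft reset) are skipped by this sketch.
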
Watching YouTube seems to help trigger the issue (so far only a correlation,
no causation established).
Having R600_DEBUG=nodma in the environment solves the problem.
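
A minimal Python sketch of that workaround, for starting a single program with
the variable set instead of exporting it for the whole session. The helper
names nodma_env and run_with_nodma are hypothetical, and the sketch overwrites
any R600_DEBUG value that is already set.

```python
import os
import subprocess
import sys


def nodma_env(base=None):
    """Return a copy of the environment with R600_DEBUG set to nodma."""
    env = dict(os.environ if base is None else base)
    env["R600_DEBUG"] = "nodma"  # replaces any existing R600_DEBUG value
    return env


def run_with_nodma(argv):
    """Run argv as a child process that sees R600_DEBUG=nodma."""
    return subprocess.run(argv, env=nodma_env(), check=False)


if __name__ == "__main__":
    # Demonstration only: a child Python process echoes the variable back.
    run_with_nodma(
        [sys.executable, "-c", "import os; print(os.environ['R600_DEBUG'])"]
    )
```

Only the child process sees the variable; the calling environment is left
unchanged.
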
Occasionally I also see a GPU lockup, which may be related:
[29648.098135] disk 0, wo:0, o:1, dev:sda2
[29648.098140] disk 1, wo:0, o:1, dev:sdb2
[29648.098142] disk 2, wo:0, o:1, dev:sdc2
[29648.098145] disk 3, wo:0, o:1, dev:sdd2
[68707.166021] radeon 0000:00:01.0: GPU fault detected: 146 0x0d4c2604
[68707.166030] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x000008D4
[68707.166043] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C026004
[70621.378798] radeon 0000:00:01.0: GPU fault detected: 146 0x013c710c
[70621.378808] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000813
[70621.378815] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C07100C
[70621.378837] radeon 0000:00:01.0: GPU fault detected: 147 0x0f0c7102
[70621.378843] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
[70621.378848] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
[70621.378854] radeon 0000:00:01.0: GPU fault detected: 147 0x0f1c7102
[70621.378859] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
[70621.378864] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
[70631.857918] radeon 0000:00:01.0: GPU lockup CP stall for more than 10000msec
[70631.857927] radeon 0000:00:01.0: GPU lockup (waiting for 0x00000000007e1fe5 last fence id 0x00000000007e1fe3)
[70631.858436] radeon 0000:00:01.0: sa_manager is not empty, clearing anyway
[70631.859755] radeon 0000:00:01.0: Saved 951 dwords of commands on ring 0.
[70631.859761] radeon 0000:00:01.0: GPU softreset: 0x00000003
[70631.859766] radeon 0000:00:01.0: VM_CONTEXT0_PROTECTION_FAULT_ADDR 0x00000000
[70631.859770] radeon 0000:00:01.0: VM_CONTEXT0_PROTECTION_FAULT_STATUS 0x00000000
[70631.859774] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
[70631.859778] radeon 0000:00:01.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
[70631.867299] radeon 0000:00:01.0: GRBM_STATUS = 0xA2703828
[70631.867305] radeon 0000:00:01.0: GRBM_STATUS_SE0 = 0x1D000007
[70631.867309] radeon 0000:00:01.0: GRBM_STATUS_SE1 = 0x00000007
[70631.867313] radeon 0000:00:01.0: SRBM_STATUS = 0x20000040
[70631.867317] radeon 0000:00:01.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[70631.867321] radeon 0000:00:01.0: R_008678_CP_STALLED_STAT2 = 0x00018000
[70631.867325] radeon 0000:00:01.0: R_00867C_CP_BUSY_STAT = 0x00008006
[70631.867328] radeon 0000:00:01.0: R_008680_CP_STAT = 0x80038647
[70631.867332] radeon 0000:00:01.0: GRBM_SOFT_RESET=0x0000DF7B
[70631.867386] radeon 0000:00:01.0: GRBM_STATUS = 0x00003828
[70631.867390] radeon 0000:00:01.0: GRBM_STATUS_SE0 = 0x00000007
[70631.867393] radeon 0000:00:01.0: GRBM_STATUS_SE1 = 0x00000007
[70631.867397] radeon 0000:00:01.0: SRBM_STATUS = 0x20000040
[70631.867400] radeon 0000:00:01.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[70631.867404] radeon 0000:00:01.0: R_008678_CP_STALLED_STAT2 = 0x00000000
[70631.867408] radeon 0000:00:01.0: R_00867C_CP_BUSY_STAT = 0x00000000
[70631.867411] radeon 0000:00:01.0: R_008680_CP_STAT = 0x00000000
[70631.883681] radeon 0000:00:01.0: GPU reset succeeded, trying to resume
[70631.916445] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[70631.916534] radeon 0000:00:01.0: WB enabled
[70631.916536] radeon 0000:00:01.0: fence driver on ring 0 use gpu addr 0x0000000030000c00 and cpu addr 0xffff880235891c00
[70631.916538] radeon 0000:00:01.0: fence driver on ring 1 use gpu addr 0x0000000030000c04 and cpu addr 0xffff880235891c04
[70631.916540] radeon 0000:00:01.0: fence driver on ring 2 use gpu addr 0x0000000030000c08 and cpu addr 0xffff880235891c08
[70631.916541] radeon 0000:00:01.0: fence driver on ring 3 use gpu addr 0x0000000030000c0c and cpu addr 0xffff880235891c0c
[70631.916543] radeon 0000:00:01.0: fence driver on ring 4 use gpu addr 0x0000000030000c10 and cpu addr 0xffff880235891c10
[70631.935206] [drm] ring test on 0 succeeded in 3 usecs
[70631.935264] [drm] ring test on 3 succeeded in 2 usecs
[70631.935271] [drm] ring test on 4 succeeded in 1 usecs
[70631.949531] [drm] ib test on ring 0 succeeded in 0 usecs
[70631.950057] [drm] ib test on ring 3 succeeded in 0 usecs
[70631.950576] [drm] ib test on ring 4 succeeded in 1 usecs</pre>
</div>
</p>
</body>
</html>