<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW - Tonga GPU lock/reset fail with Unigine Valley"
href="https://bugs.freedesktop.org/show_bug.cgi?id=91278#c17">Comment # 17</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW - Tonga GPU lock/reset fail with Unigine Valley"
href="https://bugs.freedesktop.org/show_bug.cgi?id=91278">bug 91278</a>
from <span class="vcard"><a class="email" href="mailto:adf.lists@gmail.com" title="Andy Furniss <adf.lists@gmail.com>"> <span class="fn">Andy Furniss</span></a>
</span></b>
<pre>Haven't had time yet to hang with the patches.
Yesterday without I them I hung, rebooted, did the memsleep, then tested the
rest of the day trying to lock valley and unreal but couldn't. For the whole
day, the only logging I got was a few hundred -
Sep 29 18:10:47 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from
'TC6' (0x54433600) (72)
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0: GPU fault detected: 146
0x0be84804
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0017557D
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x08048004
Sep 29 18:10:49 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from
'TC6' (0x54433600) (72)
Last thing I applied the patches to couple of days old llvm and mesa gits.
This morning ran valley from power off boot after a bit of browsing/mail
(yesterday this hung).
Only a quick test which I stopped, looked OK but in dmesg I have >10k of -
[ 1792.292640] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404
[ 1792.292643] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00136F23
[ 1792.292644] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x060C4004
[ 1792.292646] VM fault (0x04, vmid 3) at page 1273635, read from 'TC4'
(0x54433400) (196)
[ 1792.292650] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.292651] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000000
[ 1792.292652] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292654] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.292658] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404
[ 1792.292659] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000000
[ 1792.292660] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292661] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.292666] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404
[ 1792.292667] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000000
[ 1792.292668] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292669] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375515] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404
[ 1792.375518] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00136F23
[ 1792.375519] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06084004
[ 1792.375521] VM fault (0x04, vmid 3) at page 1273635, read from 'TC10'
(0x54433130) (132)
[ 1792.375526] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.375527] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000000
[ 1792.375528] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375530] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375534] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404
[ 1792.375535] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000000
[ 1792.375536] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375538] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375542] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404
[ 1792.375543] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00000000
[ 1792.375544] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375546] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.432272] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.432276] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
0x00136F23
[ 1792.432277] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06044004
[ 1792.432280] VM fault (0x04, vmid 3) at page 1273635, read from 'TC7'
(0x54433700) (68)</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>