[Bug 91278] Tonga GPU lock/reset fail with Unigine Valley

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Sep 30 02:38:20 PDT 2015


https://bugs.freedesktop.org/show_bug.cgi?id=91278

--- Comment #17 from Andy Furniss <adf.lists at gmail.com> ---
Haven't had time yet to hang with the patches.

Yesterday without I them I hung, rebooted, did the memsleep, then tested the
rest of the day trying to lock valley and unreal but couldn't. For the whole
day, the only logging I got was a few hundred -

Sep 29 18:10:47 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from
'TC6' (0x54433600) (72)
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0: GPU fault detected: 146
0x0be84804
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0017557D
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x08048004
Sep 29 18:10:49 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from
'TC6' (0x54433600) (72)

Last thing I applied the patches to couple of days old llvm and mesa gits.

This morning ran valley from power off boot after a bit of browsing/mail
(yesterday this hung).

Only a quick test which I stopped, looked OK but in dmesg I have >10k of -

[ 1792.292640] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404
[ 1792.292643] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00136F23
[ 1792.292644] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x060C4004
[ 1792.292646] VM fault (0x04, vmid 3) at page 1273635, read from 'TC4'
(0x54433400) (196)
[ 1792.292650] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.292651] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.292652] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292654] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.292658] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404
[ 1792.292659] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.292660] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292661] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.292666] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404
[ 1792.292667] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.292668] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292669] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375515] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404
[ 1792.375518] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00136F23
[ 1792.375519] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06084004
[ 1792.375521] VM fault (0x04, vmid 3) at page 1273635, read from 'TC10'
(0x54433130) (132)
[ 1792.375526] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.375527] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.375528] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375530] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375534] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404
[ 1792.375535] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.375536] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375538] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375542] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404
[ 1792.375543] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.375544] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375546] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.432272] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.432276] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00136F23
[ 1792.432277] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06044004
[ 1792.432280] VM fault (0x04, vmid 3) at page 1273635, read from 'TC7'
(0x54433700) (68)

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/dri-devel/attachments/20150930/1b62911d/attachment.html>


More information about the dri-devel mailing list