[amd-gfx] AMD Carrizo - GPU fault detected: 146 0x0842b714

Mads mads at ab3.no
Sat Jun 18 08:15:45 UTC 2016


Hi!

For a while now I've been having issues with my HP EliteDesk 705 G2 mini 
PC[1].

If I open up e.g. dolphin or konsole when in kde plasma 5.6.4, the 
screen corrupts and locks up, and this appears in dmesg:

juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960, 
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010151F
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053983, 
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960, 
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010151F
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053983, 
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960, 
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960, 
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected: 
146 0x08e2b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010151E
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:   
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053982, 
write from 'SDM0' (0x53444d30) (183)

This didn't happen back with mesa-11.2.2 built against llvm 3.8.0, but 
that starts to be quite a lot of commits ago now, considering the 
development pace mesa's got at the moment.

I tried out mesa and llvm from git and svn around when Bas Nieuwenhuizen 
posted those GL compute shaders for radeonsi patches[2], and I think 
that's when it was the first time I saw the bug.

It seems that the bug appears no matter what kernel I try to use, I've 
been through countless iterations of drm-next-4.7 kernels and 
drm-fixes-4.6 kernels, but it seems to happen no matter what I use. The 
error message pasted above comes from gentoo provided 4.6.2-kernel:

# uname -a
Linux hphtpc 4.6.2-gentoo #2 SMP PREEMPT Mon Jun 13 21:27:32 CEST 2016 
x86_64 AMD PRO A12-8800B R7, 12 Compute Cores 4C+8G AuthenticAMD 
GNU/Linux

Am I at the right mailing list for this kind of bug? How can I debug 
this further?

- Mads

---------
[1] 
http://store.hp.com/us/en/PDPStdView?catalogId=10051&urlLangId=-1&langId=-1&productId=1086676&storeId=10151
[2] 
https://lists.freedesktop.org/archives/mesa-dev/2016-April/111638.html


More information about the amd-gfx mailing list