[amd-gfx] AMD Carrizo - GPU fault detected: 146 0x0842b714
Mads
mads at ab3.no
Sat Jun 18 08:15:45 UTC 2016
Hi!
For a while now I've been having issues with my HP EliteDesk 705 G2 mini
PC[1].
If I open up e.g. dolphin or konsole when in kde plasma 5.6.4, the
screen corrupts and locks up, and this appears in dmesg:
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960,
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010151F
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053983,
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960,
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010151F
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053983,
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960,
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x0842b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00101508
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053960,
write from 'SDM0' (0x53444d30) (183)
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0: GPU fault detected:
146 0x08e2b714
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010151E
juni 17 22:50:42 hphtpc kernel: amdgpu 0000:00:01.0:
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B0B7014
juni 17 22:50:42 hphtpc kernel: VM fault (0x14, vmid 5) at page 1053982,
write from 'SDM0' (0x53444d30) (183)
This didn't happen back with mesa-11.2.2 built against llvm 3.8.0, but
that starts to be quite a lot of commits ago now, considering the
development pace mesa's got at the moment.
I tried out mesa and llvm from git and svn around when Bas Nieuwenhuizen
posted those GL compute shaders for radeonsi patches[2], and I think
that's when it was the first time I saw the bug.
It seems that the bug appears no matter what kernel I try to use, I've
been through countless iterations of drm-next-4.7 kernels and
drm-fixes-4.6 kernels, but it seems to happen no matter what I use. The
error message pasted above comes from gentoo provided 4.6.2-kernel:
# uname -a
Linux hphtpc 4.6.2-gentoo #2 SMP PREEMPT Mon Jun 13 21:27:32 CEST 2016
x86_64 AMD PRO A12-8800B R7, 12 Compute Cores 4C+8G AuthenticAMD
GNU/Linux
Am I at the right mailing list for this kind of bug? How can I debug
this further?
- Mads
---------
[1]
http://store.hp.com/us/en/PDPStdView?catalogId=10051&urlLangId=-1&langId=-1&productId=1086676&storeId=10151
[2]
https://lists.freedesktop.org/archives/mesa-dev/2016-April/111638.html
More information about the amd-gfx
mailing list