page fault GCVM_L2_PROTECTION_FAULT_STATUS on 7900xtx Linux 6.7-rc1 with Mesa 23.3.0-rc3
Abhinav Praveen
abhinav at praveen.org.uk
Tue Nov 14 04:51:56 UTC 2023
Hi,
When I start X/i3 on a 7900xtx with Linux 6.7-rc1 and Mesa 23.3_rc3, my
log is filled with errors like:
[ 649.788816] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788819] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008088ed3dd000 from client 10
[ 649.788820] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.788821] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788822] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.788823] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788824] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788825] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788826] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
A log with all entries since X session startup is attached. I'm
wondering if this is an issue that the amdgpu devs can reproduce or
whether I should create an issue on drm/amd? Also, I am having an issue
with a partially garbled UEFI (but no artifacts after amdgpudrmfb picks
up) and no artifacts in X or in games (This is not an amdgpu issue but I
am just mentioning it incase it is related).
Also drm/amd#2356 persists too. Writing to power1_cap results in:
tee: /sys/class/drm/card0/device/hwmon/hwmon1/power1_cap: Input/output error
--
Abhinav Praveen
-------------- next part --------------
[ 649.746070] elogind-daemon[1941]: New session 3 of user me.
[ 649.761578] gmc_v11_0_process_interrupt: 83 callbacks suppressed
[ 649.761580] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.761584] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008088ed3dc000 from client 10
[ 649.761586] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.761587] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.761588] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.761589] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.761590] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.761590] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.761591] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.767011] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.767014] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008088ed3dd000 from client 10
[ 649.767016] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.767017] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.767018] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.767019] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.767019] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.767020] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.767021] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.769005] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.769008] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008088ed3dd000 from client 10
[ 649.769010] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.769011] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.769012] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.769013] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.769013] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.769014] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.769015] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.788708] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788714] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008088ed3dd000 from client 10
[ 649.788717] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.788719] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788720] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.788721] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788722] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788723] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788724] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.788737] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788739] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x000080898b18e000 from client 10
[ 649.788741] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.788743] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788744] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.788745] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788746] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788747] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788749] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.788754] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788756] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x000080894a77d000 from client 10
[ 649.788758] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.788759] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788760] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.788762] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788763] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788764] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788765] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.788775] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788777] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x000080898b18e000 from client 10
[ 649.788778] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D53
[ 649.788779] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788781] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
[ 649.788782] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788783] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788784] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788785] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.788789] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788791] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x000080894a77d000 from client 10
[ 649.788792] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 649.788793] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 649.788794] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.788795] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
[ 649.788797] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 649.788797] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 649.788798] amdgpu 0000:03:00.0: amdgpu: RW: 0x0
[ 649.788803] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788805] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x000080898b18e000 from client 10
[ 649.788806] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D53
[ 649.788808] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788809] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
[ 649.788810] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788811] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788812] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788812] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
[ 649.788816] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:169 vmid:0 pasid:0, for process pid 0 thread pid 0)
[ 649.788819] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008088ed3dd000 from client 10
[ 649.788820] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041D52
[ 649.788821] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SDMA1 (0xe)
[ 649.788822] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
[ 649.788823] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x1
[ 649.788824] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 649.788825] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 649.788826] amdgpu 0000:03:00.0: amdgpu: RW: 0x1
More information about the amd-gfx
mailing list