[PATCH 2/3] amd/amdgpu: wait no process running in kfd before resuming device

Zhigang Luo Zhigang.Luo at amd.com
Fri Mar 22 19:57:36 UTC 2024


it will cause page fault after device recovered if there is a process running.

Signed-off-by: Zhigang Luo <Zhigang.Luo at amd.com>
Change-Id: Ib1eddb56b69ecd41fe703abd169944154f48b0cd
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
index 70261eb9b0bb..2867e9186e44 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
@@ -4974,6 +4974,8 @@ static int amdgpu_device_reset_sriov(struct amdgpu_device *adev,
 retry:
 	amdgpu_amdkfd_pre_reset(adev);
 
+	amdgpu_amdkfd_wait_no_process_running(adev);
+
 	amdgpu_device_stop_pending_resets(adev);
 
 	if (from_hypervisor)
-- 
2.25.1



More information about the amd-gfx mailing list