[PATCH] drm/amdgpu: clean wptr on wb when gpu recovery
Christian König
ckoenig.leichtzumerken at gmail.com
Fri Feb 28 09:19:40 UTC 2020
On 28.02.20 at 07:31, Yintian Tao wrote:
> TDR randomly fails because of a compute ring test failure.
> If the compute ring wptr & 0x7ff (ring_buf_mask) is 0x100, then
> after the MQD is mapped the compute ring rptr is synced to 0x100.
> The ring test packet size is also 0x100, so after
> amdgpu_ring_commit is invoked the CP does not actually process
> the packet on the ring buffer, because rptr is equal to wptr.
>
> Signed-off-by: Yintian Tao <yttao at amd.com>
Offhand that looks correct to me, but I can't fully judge whether it
might have negative side effects. Patch is Acked-by: Christian König
<christian.koenig at amd.com> for now.
Monk, according to git you modified that function as well. Could this
have any potential negative effect for SRIOV? I don't think so, but
better safe than sorry.
Regards,
Christian.
> ---
> drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 1 +
> drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 1 +
> 2 files changed, 2 insertions(+)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c
> index 44f00ecea322..5df1a6d45457 100644
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c
> @@ -3508,6 +3508,7 @@ static int gfx_v10_0_kcq_init_queue(struct amdgpu_ring *ring)
>
> /* reset ring buffer */
> ring->wptr = 0;
> + atomic64_set((atomic64_t *)&adev->wb.wb[ring->wptr_offs], 0);
> amdgpu_ring_clear_ring(ring);
> } else {
> amdgpu_ring_clear_ring(ring);
> diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c b/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c
> index 4135e4126e82..ac22490e8656 100644
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c
> @@ -3664,6 +3664,7 @@ static int gfx_v9_0_kcq_init_queue(struct amdgpu_ring *ring)
>
> /* reset ring buffer */
> ring->wptr = 0;
> + atomic64_set((atomic64_t *)&adev->wb.wb[ring->wptr_offs], 0);
> amdgpu_ring_clear_ring(ring);
> } else {
> amdgpu_ring_clear_ring(ring);
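
For illustration, the failure mode described in the commit message can be reproduced in a small user-space model. This is only a sketch, not driver code: struct model_ring and all model_* helpers are hypothetical names, and the assumption that the CP seeds its rptr from the stale writeback value when the queue is mapped is taken from the commit message rather than from hardware documentation.

```c
#include <stdbool.h>
#include <stdint.h>

#define RING_BUF_MASK   0x7ffu  /* mask quoted in the commit message */
#define TEST_PKT_DWORDS 0x100u  /* ring test packet size from the commit message */

/* Hypothetical, simplified model of a compute ring. */
struct model_ring {
	uint64_t wptr;     /* driver-side write pointer */
	uint64_t wb_wptr;  /* write pointer copy in the writeback slot */
	uint64_t cp_rptr;  /* read pointer as seen by the CP */
};

/* Reset path: the driver always zeroes its own wptr; the patch
 * additionally zeroes the writeback copy (clear_wb == true). */
static void model_reset(struct model_ring *r, bool clear_wb)
{
	r->wptr = 0;
	if (clear_wb)
		r->wb_wptr = 0;
}

/* Assumption: mapping the queue syncs rptr to the writeback wptr. */
static void model_map_queue(struct model_ring *r)
{
	r->cp_rptr = r->wb_wptr & RING_BUF_MASK;
}

/* Commit: advance wptr by the packet size and publish it. */
static void model_commit(struct model_ring *r, uint32_t ndw)
{
	r->wptr += ndw;
	r->wb_wptr = r->wptr;
}

/* The CP only fetches packets while rptr differs from wptr. */
static bool model_cp_has_work(const struct model_ring *r)
{
	return (r->cp_rptr & RING_BUF_MASK) != (r->wb_wptr & RING_BUF_MASK);
}

/* Returns true if the CP would see the ring test packet after a reset. */
static bool model_ring_test(uint64_t stale_wptr, bool clear_wb)
{
	struct model_ring r = {
		.wptr = stale_wptr,
		.wb_wptr = stale_wptr,
		.cp_rptr = stale_wptr & RING_BUF_MASK,
	};

	model_reset(&r, clear_wb);
	model_map_queue(&r);
	model_commit(&r, TEST_PKT_DWORDS);
	return model_cp_has_work(&r);
}
```

In this model, model_ring_test(0x100, false) returns false: the stale writeback value leaves rptr at 0x100, the test packet moves wptr to 0x100, and the CP sees nothing to do. With the writeback copy cleared, model_ring_test(0x100, true) returns true.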