[PATCH 1/9] drm/amdgpu:cleanup hw fence get&put
Christian König
ckoenig.leichtzumerken at gmail.com
Wed Oct 25 11:38:34 UTC 2017
NAK, we have tried this before.
This way we can accidentally try to signal a fence twice when the fence
value is somehow corrupted.
Christian.
Am 25.10.2017 um 11:22 schrieb Monk Liu:
> In amdgpu_fence_process it shouldn't put the signaled hw fence and set
> RCU pointer to NULL, instead the more reasonable sequence is just leave
> the RCU be there untouched, and:
> 1) Either driver_fini() should put this RCU fence or
> 2) New hw fence emitting should take the RCU slot and put the old hw fence.
>
> mapping between get & put on hw fence is:
>
> Get Put
> fence_emit:fence_init ---> free_job:put job->fence
> ---> run_job:put job->fence //GPU RESET case
>
> fence_emit:rcu_assign_pointer ---> fence_driver_fini:put ring->fence_drv.fences[j]
> ---> fence_emit:put(old) //after fence_wait(old)
>
> run_job:job->fence=fence_get() --> sched_main:put(fence)
> --> job_recovery:put(fence) //for GPU RESET case
>
> sched_main:parent=fence_get() --> sched_fence_free:put(fence->parent)
> sched_job_recovery:parent=fence_get() --> sched_hw_job_reset:put(s_job->s_fence->parent) // for GPU RESET case
>
> Change-Id: I623167c1e3143233f4e1be7a400a9b698b4a1355
> Signed-off-by: Monk Liu <Monk.Liu at amd.com>
> ---
> drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c | 4 ++--
> 1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c
> index f43319c..d7374cf 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c
> @@ -161,6 +161,8 @@ int amdgpu_fence_emit(struct amdgpu_ring *ring, struct dma_fence **f)
> dma_fence_wait(old, false);
> }
>
> + dma_fence_put(old);
> +
> rcu_assign_pointer(*ptr, dma_fence_get(&fence->base));
>
> *f = &fence->base;
> @@ -246,7 +248,6 @@ void amdgpu_fence_process(struct amdgpu_ring *ring)
>
> /* There is always exactly one thread signaling this fence slot */
> fence = rcu_dereference_protected(*ptr, 1);
> - RCU_INIT_POINTER(*ptr, NULL);
>
> if (!fence)
> continue;
> @@ -257,7 +258,6 @@ void amdgpu_fence_process(struct amdgpu_ring *ring)
> else
> BUG();
>
> - dma_fence_put(fence);
> } while (last_seq != seq);
> }
>
More information about the amd-gfx
mailing list