[Intel-gfx] [PATCH 3/4] drm/i915: peel dma-fence-chains wait fences

Venkata Sandeep Dhanalakota venkata.s.dhanalakota at intel.com
Mon Apr 13 01:32:16 UTC 2020


On 20/04/11 11:50, Lionel Landwerlin wrote:
> On 10/04/2020 19:51, Venkata Sandeep Dhanalakota wrote:
> > From: Lionel Landwerlin <lionel.g.landwerlin at intel.com>
> > 
> > To allow faster engine to engine synchronization, peel the layer of
> > dma-fence-chain to expose potential i915 fences so that the
> > i915-request code can emit HW semaphore wait/signal operations in the
> > ring which is faster than waking up the host to submit unblocked
> > workloads after interrupt notification.
> > 
> > Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin at intel.com>
> > ---
> >   .../gpu/drm/i915/gem/i915_gem_execbuffer.c    | 39 +++++++++++++++++--
> >   1 file changed, 35 insertions(+), 4 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c b/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c
> > index 8dd651cdca39..e43b76d7e9fd 100644
> > --- a/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c
> > +++ b/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c
> > @@ -2508,6 +2508,7 @@ await_fence_array(struct i915_execbuffer *eb,
> >   	for (n = 0; n < nfences; n++) {
> >   		struct drm_syncobj *syncobj;
> > +		struct dma_fence_chain *chain;
> >   		unsigned int flags;
> >   		syncobj = ptr_unpack_bits(fences[n].syncobj, &flags, 2);
> > @@ -2515,10 +2516,40 @@ await_fence_array(struct i915_execbuffer *eb,
> >   		if (!fences[n].dma_fence)
> >   			continue;
> > -		err = i915_request_await_dma_fence(eb->request,
> > -						   fences[n].dma_fence);
> > -		if (err < 0)
> > -			return err;
> > +		/*
> > +		 * If we're dealing with a dma-fence-chain, peel the chain by
> > +		 * adding all of the unsignaled fences
> > +		 * (dma_fence_chain_for_each does that for us) the chain
> > +		 * points to.
> > +		 *
> > +		 * This enables us to identify waits on i915 fences and allows
> > +		 * for faster engine-to-engine synchronization using HW
> > +		 * semaphores.
> > +		 */
> > +		chain = to_dma_fence_chain(fences[n].dma_fence);
> > +		if (chain) {
> > +			struct dma_fence *iter;
> > +
> > +			dma_fence_chain_for_each(iter, fences[n].dma_fence) {
> 
> 
> The kbuild bot made me think of an interesting case.
> 
> It is possible to build a chain where the first element isn't a
> dma_fence_chain.
> 
Yes, agreed: a valid fence chain can have a plain dma_fence as its first
element, in which case iter_chain is NULL. I will address this in the next
revision of the patch.

> 
> We should handle this here like this:
> 
> if (iter_chain)
>     err = i915_request_await_dma_fence(eb->request, iter_chain->fence);
> else
>     err = i915_request_await_dma_fence(eb->request, iter);
> if (err < 0) {
>     dma_fence_put(iter);
>     return err;
> }
> 
> 
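
For illustration only, the handling suggested above can be modelled in a
small userspace sketch. Everything in it is a hypothetical stand-in rather
than the kernel API: struct fence, struct fence_chain, to_chain(),
chain_walk(), await_fence() and peel_and_await() are names assumed for this
example. Reference counting and the skipping of signaled fences that the
real iterator performs are deliberately left out.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical userspace stand-ins for the kernel types; not the real API. */
struct fence {
	int id;
	int is_chain;		/* nonzero when embedded in a struct fence_chain */
};

struct fence_chain {
	struct fence base;	/* must stay first so the cast in to_chain() is valid */
	struct fence *prev;	/* older link, or a plain fence at the tail */
	struct fence *fence;	/* fence carried by this link */
};

/* Mirrors the idea of to_dma_fence_chain(): NULL for a non-chain fence. */
static struct fence_chain *to_chain(struct fence *f)
{
	return (f && f->is_chain) ? (struct fence_chain *)f : NULL;
}

/* Step to the previous link; the walk ends after a plain fence. */
static struct fence *chain_walk(struct fence *f)
{
	struct fence_chain *chain = to_chain(f);

	return chain ? chain->prev : NULL;
}

#define MAX_AWAITED 8

struct await_log {
	int ids[MAX_AWAITED];
	size_t count;
};

/* Stand-in for i915_request_await_dma_fence(): only records the fence id. */
static int await_fence(struct await_log *log, struct fence *f)
{
	assert(log->count < MAX_AWAITED);
	log->ids[log->count++] = f->id;
	return 0;
}

/* Peel a chain; fall back to the node itself when it is not a chain link. */
static int peel_and_await(struct await_log *log, struct fence *head)
{
	struct fence *iter;
	int err;

	for (iter = head; iter; iter = chain_walk(iter)) {
		struct fence_chain *iter_chain = to_chain(iter);

		err = await_fence(log, iter_chain ? iter_chain->fence : iter);
		if (err < 0)
			return err;
	}
	return 0;
}
```

With a chain built as link2 -> link1 -> plain fence, the loop awaits the
fence carried by each link and then the plain fence itself, instead of
tripping an assertion on the non-chain tail.
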
> > +				struct dma_fence_chain *iter_chain =
> > +					to_dma_fence_chain(iter);
> > +
> > +				GEM_BUG_ON(!iter_chain);
> > +
> > +				err = i915_request_await_dma_fence(eb->request,
> > +								   iter_chain->fence);
> > +				if (err < 0) {
> > +					dma_fence_put(iter);
> > +					return err;
> > +				}
> > +			}
> > +
> > +		} else {
> > +			err = i915_request_await_dma_fence(eb->request,
> > +							   fences[n].dma_fence);
> > +			if (err < 0)
> > +				return err;
> > +		}
> >   	}
> >   	return 0;
> 
> 

