[PATCH 4/8] drm/xe/oa: Add input fence dependencies

Dixit, Ashutosh ashutosh.dixit at intel.com
Tue Aug 20 01:06:35 UTC 2024


On Thu, 08 Aug 2024 16:25:31 -0700, Matthew Brost wrote:
>

Hi Matt,

> On Thu, Aug 08, 2024 at 10:41:35AM -0700, Ashutosh Dixit wrote:
> > Add input fence dependencies which will make OA configuration wait till
> > these dependencies are met (till input fences signal).
> >
> > Signed-off-by: Ashutosh Dixit <ashutosh.dixit at intel.com>
> > ---
> >  drivers/gpu/drm/xe/xe_oa.c | 24 +++++++++++++++++++-----
> >  1 file changed, 19 insertions(+), 5 deletions(-)
> >
> > diff --git a/drivers/gpu/drm/xe/xe_oa.c b/drivers/gpu/drm/xe/xe_oa.c
> > index ba8f2e9d95b7f..416e031ac454b 100644
> > --- a/drivers/gpu/drm/xe/xe_oa.c
> > +++ b/drivers/gpu/drm/xe/xe_oa.c
> > @@ -578,10 +578,10 @@ static __poll_t xe_oa_poll(struct file *file, poll_table *wait)
> >  }
> >
> >  static int xe_oa_submit_bb(struct xe_oa_stream *stream, struct xe_bb *bb,
> > -			   struct dma_fence **fence)
> > +			   bool add_deps, struct dma_fence **fence)
>
> Bools are highly frowned upon [1] (one of many example of this). How
> about passing in 'stream->num_syncs' or 0?
>
> i.e. s/bool add_deps/u32 num_syncs/
>
> Maybe even add stream->syncs as argument then too.

I don't like this because this function is called from 3 places, 2 of which
set the arg to false and only one to true, even if stream->num_syncs > 0.
So it will not be clear why we are setting num_syncs to 0 from those 2
places and we will have to add comments pointing that out. Whereas it is
clear with the bool and no comments are needed.

But I understand the concerns about bool so I have instead converted it
into an enum in v2. That I think should take care of most of the objections
against bool. Let me know if this is ok or any other ideas regarding this?

Thanks.
--
Ashutosh

>
> Matt
>
> [1] https://patchwork.freedesktop.org/patch/607408/?series=135265&rev=6#comment_1103868

>
> >  {
> >	struct xe_sched_job *job;
> > -	int err = 0;
> > +	int i, err = 0;
> >
> >	/* Kernel configuration is issued on stream->k_exec_q, not stream->exec_q */
> >	job = xe_bb_create_job(stream->k_exec_q, bb);
> > @@ -590,9 +590,23 @@ static int xe_oa_submit_bb(struct xe_oa_stream *stream, struct xe_bb *bb,
> >		goto exit;
> >	}
> >
> > +	if (add_deps) {
> > +		for (i = 0; i < stream->num_syncs && !err; i++)
> > +			err = xe_sync_entry_add_deps(&stream->syncs[i], job);
> > +		if (err) {
> > +			drm_dbg(&stream->oa->xe->drm, "xe_sync_entry_add_deps err %d\n", err);
> > +			goto err_put_job;
> > +		}
> > +	}
> > +
> >	xe_sched_job_arm(job);
> >	*fence = dma_fence_get(&job->drm.s_fence->finished);
> >	xe_sched_job_push(job);
> > +
> > +	return 0;
> > +
> > +err_put_job:
> > +	xe_sched_job_put(job);
> >  exit:
> >	return err;
> >  }
> > @@ -670,7 +684,7 @@ static int xe_oa_modify_ctx_image(struct xe_oa_stream *stream, struct xe_lrc *lr
> >
> >	xe_oa_store_flex(stream, lrc, bb, flex, count);
> >
> > -	err = xe_oa_submit_bb(stream, bb, &fence);
> > +	err = xe_oa_submit_bb(stream, bb, false, &fence);
> >	xe_bb_free(bb, fence);
> >	dma_fence_put(fence);
> >  exit:
> > @@ -691,7 +705,7 @@ static int xe_oa_load_with_lri(struct xe_oa_stream *stream, struct xe_oa_reg *re
> >
> >	write_cs_mi_lri(bb, reg_lri, 1);
> >
> > -	err = xe_oa_submit_bb(stream, bb, &fence);
> > +	err = xe_oa_submit_bb(stream, bb, false, &fence);
> >	xe_bb_free(bb, fence);
> >	dma_fence_put(fence);
> >  exit:
> > @@ -971,7 +985,7 @@ static int xe_oa_emit_oa_config(struct xe_oa_stream *stream, struct xe_oa_config
> >		goto exit;
> >	}
> >
> > -	err = xe_oa_submit_bb(stream, oa_bo->bb, &fence);
> > +	err = xe_oa_submit_bb(stream, oa_bo->bb, true, &fence);
> >	if (err)
> >		goto exit;
> >
> > --
> > 2.41.0
> >


More information about the Intel-xe mailing list