[PATCH v4] drm/xe: Invalidate media_gt TLBs

Matthew Brost matthew.brost at intel.com
Tue Aug 20 17:31:41 UTC 2024


On Tue, Aug 20, 2024 at 10:22:54AM -0700, Matt Roper wrote:
> On Tue, Aug 20, 2024 at 09:01:29AM -0700, Matthew Brost wrote:
> > Testing on LNL has shown media TLBs need to be invalidated via the GuC,
> > update xe_vm_invalidate_vma appropriately.
> > 
> > v2: Fix 2 tile case
> > v3: Include missing local changm
> > 
> > Fixes: 3330361543fc ("drm/xe/lnl: Add LNL platform definition")
> 
> Is this the right Fixes line?  This may have been discovered in testing

I was struggling with the fixes tag. Just picked a LNL one as yes I
discovered this bug on LNL working on SVM.

> on LNL, but presumably the fix applies to MTL as well since we also have
> a standalone media GT there too, and we've had MTL support in the driver
> since before LNL was added.
> 

Forgot about MTL, yes this will apply to MTL too.

> Of course both platforms are under force_probe at the moment (and MTL
> will be forever since it's officially supported by i915 instead of Xe),
> so it probably doesn't really matter too much for backporting purposes
> and such.

Yea this patch will be pain to backport as it is built on top a recent
change to TLB invalidations. Maybe we just drop it as these are under
force probe so not a huge deal if it is not fixed.

Matt

> 
> 
> Matt
> 
> > Signed-off-by: Matthew Brost <matthew.brost at intel.com>
> > Reviewed-by: Himal Prasad Ghimiray <himal.prasad.ghimiray at intel.com>
> > ---
> >  drivers/gpu/drm/xe/xe_vm.c | 37 ++++++++++++++++++++++++-------------
> >  1 file changed, 24 insertions(+), 13 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/xe/xe_vm.c b/drivers/gpu/drm/xe/xe_vm.c
> > index d1bfd0b6e955..4cc13eddb6b3 100644
> > --- a/drivers/gpu/drm/xe/xe_vm.c
> > +++ b/drivers/gpu/drm/xe/xe_vm.c
> > @@ -3158,9 +3158,10 @@ int xe_vm_invalidate_vma(struct xe_vma *vma)
> >  {
> >  	struct xe_device *xe = xe_vma_vm(vma)->xe;
> >  	struct xe_tile *tile;
> > -	struct xe_gt_tlb_invalidation_fence fence[XE_MAX_TILES_PER_DEVICE];
> > -	u32 tile_needs_invalidate = 0;
> > +	struct xe_gt_tlb_invalidation_fence
> > +		fence[XE_MAX_TILES_PER_DEVICE * XE_MAX_GT_PER_TILE];
> >  	u8 id;
> > +	u32 fence_id = 0;
> >  	int ret = 0;
> >  
> >  	xe_assert(xe, !xe_vma_is_null(vma));
> > @@ -3188,27 +3189,37 @@ int xe_vm_invalidate_vma(struct xe_vma *vma)
> >  		if (xe_pt_zap_ptes(tile, vma)) {
> >  			xe_device_wmb(xe);
> >  			xe_gt_tlb_invalidation_fence_init(tile->primary_gt,
> > -							  &fence[id], true);
> > +							  &fence[fence_id],
> > +							  true);
> >  
> > -			/*
> > -			 * FIXME: We potentially need to invalidate multiple
> > -			 * GTs within the tile
> > -			 */
> >  			ret = xe_gt_tlb_invalidation_vma(tile->primary_gt,
> > -							 &fence[id], vma);
> > +							 &fence[fence_id], vma);
> >  			if (ret < 0) {
> > -				xe_gt_tlb_invalidation_fence_fini(&fence[id]);
> > +				xe_gt_tlb_invalidation_fence_fini(&fence[fence_id]);
> >  				goto wait;
> >  			}
> > +			++fence_id;
> >  
> > -			tile_needs_invalidate |= BIT(id);
> > +			if (!tile->media_gt)
> > +				continue;
> > +
> > +			xe_gt_tlb_invalidation_fence_init(tile->media_gt,
> > +							  &fence[fence_id],
> > +							  true);
> > +
> > +			ret = xe_gt_tlb_invalidation_vma(tile->media_gt,
> > +							 &fence[fence_id], vma);
> > +			if (ret < 0) {
> > +				xe_gt_tlb_invalidation_fence_fini(&fence[fence_id]);
> > +				goto wait;
> > +			}
> > +			++fence_id;
> >  		}
> >  	}
> >  
> >  wait:
> > -	for_each_tile(tile, xe, id)
> > -		if (tile_needs_invalidate & BIT(id))
> > -			xe_gt_tlb_invalidation_fence_wait(&fence[id]);
> > +	for (id = 0; id < fence_id; ++id)
> > +		xe_gt_tlb_invalidation_fence_wait(&fence[id]);
> >  
> >  	vma->tile_invalidated = vma->tile_mask;
> >  
> > -- 
> > 2.34.1
> > 
> 
> -- 
> Matt Roper
> Graphics Software Engineer
> Linux GPU Platform Enablement
> Intel Corporation


More information about the Intel-xe mailing list