[Intel-gfx] [PATCH 1/6] drm/i915: Eliminate vmap overhead for cmd parser
Chris Wilson
chris at chris-wilson.co.uk
Thu Oct 1 06:39:35 PDT 2015
On Thu, Oct 01, 2015 at 04:29:48PM +0300, Ville Syrjälä wrote:
> On Thu, Oct 01, 2015 at 02:24:53PM +0100, Chris Wilson wrote:
> > On Thu, Oct 01, 2015 at 03:37:21PM +0300, Ville Syrjälä wrote:
> > > On Thu, Oct 01, 2015 at 12:57:10PM +0100, Chris Wilson wrote:
> > > > - while (cmd < batch_end) {
> > > > - const struct drm_i915_cmd_descriptor *desc;
> > > > - u32 length;
> > > > + k = i;
> > > > + if (k > PAGE_SIZE - out)
> > > > + k = PAGE_SIZE - out;
> > > > + if (k == PAGE_SIZE)
> > > > + copy_page(dst, src);
> > > > + else
> > > > + memcpy(dst + out, src + j, k);
> > > > +
> > > > + out += k;
> > > > + j += k;
> > > > + i -= k;
> > > > + } while (i);
> > > > +
> > > > + cmd = src + in;
> > >
> > > So you're now checking the src batch? What prevents userspace from
> > > overwriting it with eg. NOPS between the time you copied it and the
> > > time you check it?
> >
> > Zilch. I picked src as it was already in the CPU cache, whereas dst will
> > be WC later. To satisfy you and byt, I need to stage the copy into a
> > temporary page, scan it, then copy into dst.
> >
> > The silver lining is that it does remove some lines of code. I'm pretty
> > confident that the double copy should not be noticed if I remember about
> > the cache-trashing of kunmap and carefully manage that.
>
> Yeah, I was thinking that optimally we'd do the copy+scan in cacheline
> units, but maybe that's too small to make it actually efficient?
Yes, the issue there becomes that we need up to 256 bytes (though I
think the largest command is actually ~130bytes) in order to pass a
complete copy of a command to the validator. If we make the temporary
too small we waste time building up partial commands, and the maximum
sane size is one page. __get_free_page() seems a reasonable first
choice, though a kmalloc() from a fresh slab tends to be faster to
allocate.
-Chris
--
Chris Wilson, Intel Open Source Technology Centre
More information about the Intel-gfx
mailing list