[Intel-gfx] [PATCH v2 7/8] drm/i915: Create vgpu specific write MMIO to reduce traps
Daniel Vetter
daniel at ffwll.ch
Wed Oct 22 17:33:29 CEST 2014
On Wed, Oct 22, 2014 at 08:27:50PM +0800, Yu, Zhang wrote:
>
>
> On 10/22/2014 12:40 AM, Daniel Vetter wrote:
> >On Thu, Oct 16, 2014 at 02:24:27PM +0800, Yu Zhang wrote:
> >>In the virtualized environment, forcewake operations are not
> >>necessory for the driver, because mmio accesses will be trapped
> >>and emulated by the host side, and real forcewake operations are
> >>also done in the host. New mmio write handlers are added to directly
> >>call the __raw_i915_write, therefore will reduce many traps and
> >>increase the overall performance for drivers runing in the VM
> >>with Intel GVT-g enhancement.
> >>
> >>Signed-off-by: Yu Zhang <yu.c.zhang at linux.intel.com>
> >>Signed-off-by: Jike Song <jike.song at intel.com>
> >>Signed-off-by: Kevin Tian <kevin.tian at intel.com>
> >>---
> >> drivers/gpu/drm/i915/intel_uncore.c | 20 ++++++++++++++++++++
> >> 1 file changed, 20 insertions(+)
> >>
> >>diff --git a/drivers/gpu/drm/i915/intel_uncore.c b/drivers/gpu/drm/i915/intel_uncore.c
> >>index d5f39f3..ec6d5ce 100644
> >>--- a/drivers/gpu/drm/i915/intel_uncore.c
> >>+++ b/drivers/gpu/drm/i915/intel_uncore.c
> >>@@ -719,6 +719,14 @@ hsw_write##x(struct drm_i915_private *dev_priv, off_t reg, u##x val, bool trace)
> >> REG_WRITE_FOOTER; \
> >> }
> >>
> >>+#define __vgpu_write(x) \
> >>+static void \
> >>+vgpu_write##x(struct drm_i915_private *dev_priv, off_t reg, u##x val, bool trace) { \
> >>+ REG_WRITE_HEADER; \
> >>+ __raw_i915_write##x(dev_priv, reg, val); \
> >>+ REG_WRITE_FOOTER; \
> >>+}
> >>+
> >> static const u32 gen8_shadowed_regs[] = {
> >> FORCEWAKE_MT,
> >> GEN6_RPNSWREQ,
> >>@@ -813,6 +821,10 @@ __gen4_write(8)
> >> __gen4_write(16)
> >> __gen4_write(32)
> >> __gen4_write(64)
> >>+__vgpu_write(8)
> >>+__vgpu_write(16)
> >>+__vgpu_write(32)
> >>+__vgpu_write(64)
> >>
> >> #undef __chv_write
> >> #undef __gen8_write
> >>@@ -820,6 +832,7 @@ __gen4_write(64)
> >> #undef __gen6_write
> >> #undef __gen5_write
> >> #undef __gen4_write
> >>+#undef __vgpu_write
> >> #undef REG_WRITE_FOOTER
> >> #undef REG_WRITE_HEADER
> >>
> >>@@ -950,6 +963,13 @@ void intel_uncore_init(struct drm_device *dev)
> >> dev_priv->uncore.funcs.mmio_readq = gen4_read64;
> >> break;
> >> }
> >>+
> >>+ if (intel_vgpu_active(dev)) {
> >>+ dev_priv->uncore.funcs.mmio_writeb = vgpu_write8;
> >>+ dev_priv->uncore.funcs.mmio_writew = vgpu_write16;
> >>+ dev_priv->uncore.funcs.mmio_writel = vgpu_write32;
> >>+ dev_priv->uncore.funcs.mmio_writeq = vgpu_write64;
> >
> >Someone should write a cool macro which uses prepocessor string
> >concatenation so that we can compress this all to
> >
> > ASSIGN_WRITE_MMIO_VFUNCS(vgpu)
> >
> >Then throw in an ASSIGN_READ_MMIO_VFUNC which looks similarly and this
> >might actually be pretty. Just an idea for some follow-up cleanup.
> >-Daniel
> >
> Thanks Daniel.
> Do you mean something like this:
> #define ASSIGN_WRITE_MMIO_VFUNCS(x) \
> do { \
> dev_priv->uncore.funcs.mmio_writeb = x##_write8; \
> dev_priv->uncore.funcs.mmio_writew = x##_write16; \
> dev_priv->uncore.funcs.mmio_writel = x##_write32; \
> dev_priv->uncore.funcs.mmio_writeq = x##_write64; \
> } while (0)
>
> and then we can use ASSIGN_WRITE_MMIO_VFUNCS(hsw) for hsw and
> ASSIGN_WRITE_MMIO_VFUNCS(vgpu) for vgpu, etc?
Yup. Plus the version for assigning READ vfuncs (on many platforms they
don't match up). Probably best if you do this conversion as a prep patch
before the vgt series so that I can merge it right away.
-Daniel
--
Daniel Vetter
Software Engineer, Intel Corporation
+41 (0) 79 365 57 48 - http://blog.ffwll.ch
More information about the Intel-gfx
mailing list