[PATCH v3] drm/i915: Allocate intel_engine_cs structure only for the enabled engines
Chris Wilson
chris at chris-wilson.co.uk
Wed Jul 27 09:53:26 UTC 2016
On Wed, Jul 27, 2016 at 03:27:35PM +0530, akash.goel at intel.com wrote:
> From: Akash Goel <akash.goel at intel.com>
>
> With the possibility of addition of many more number of rings in future,
> the drm_i915_private structure could bloat as an array, of type
> intel_engine_cs, is embedded inside it.
> struct intel_engine_cs engine[I915_NUM_ENGINES];
> Though this is still fine as generally there is only a single instance of
> drm_i915_private structure used, but not all of the possible rings would be
> enabled or active on most of the platforms. Some memory can be saved by
> allocating intel_engine_cs structure only for the enabled/active engines.
> Currently the engine/ring ID is kept static and dev_priv->engine[] is simply
> indexed using the enums defined in intel_engine_id.
> To save memory and continue using the static engine/ring IDs, 'engine' is
> defined as an array of pointers.
> struct intel_engine_cs *engine[I915_NUM_ENGINES];
> dev_priv->engine[engine_ID] will be NULL for disabled engine instances.
>
> v2:
> - Remove the engine iterator field added in drm_i915_private structure,
> instead pass a local iterator variable to the for_each_engine**
> macros. (Chris)
> - Do away with intel_engine_initialized() and instead directly use the
> NULL pointer check on engine pointer. (Chris)
>
> v3:
> - Remove for_each_engine_id() macro, as the updated macro for_each_engine()
> can be used in place of it. (Chris)
> - Protect the access to Render engine Fault register with a NULL check, as
> engine specific init is done later in Driver load sequence.
What's the impact upon code size? (Always nice to know as the oppposite
change was intended to reduce code size.)
> diff --git a/drivers/gpu/drm/i915/i915_drv.c b/drivers/gpu/drm/i915/i915_drv.c
> index fc84037..0f14bf5 100644
> --- a/drivers/gpu/drm/i915/i915_drv.c
> +++ b/drivers/gpu/drm/i915/i915_drv.c
> @@ -264,16 +264,16 @@ static int i915_getparam(struct drm_device *dev, void *data,
> value = 1;
> break;
> case I915_PARAM_HAS_BSD:
> - value = intel_engine_initialized(&dev_priv->engine[VCS]);
> + value = (dev_priv->engine[VCS] != NULL);
value = !!dev_priv->engine[VCS];
(no stray brackets, and if I'm not allowed to compare against NULL...)
> +#define for_each_engine(engine__, dev_priv__, iter__) \
> + for ((iter__) = 0; \
> + ((iter__) < I915_NUM_ENGINES) && \
> + ((engine__) = (dev_priv__)->engine[(iter__)], true); \
> + (iter__)++) \
> + for_each_if ((engine__))
> +#define for_each_engine_masked(engine__, dev_priv__, mask__, iter__) \
> + for ((iter__) = 0; \
> + ((iter__) < I915_NUM_ENGINES) && \
> + ((engine__) = (dev_priv__)->engine[(iter__)], true); \
> + (iter__)++) \
> + for_each_if ((engine__) && ((mask__) & intel_engine_flag(engine__)))
Looks good. One more pair of brackets than strictly required but meh.
> -static void
> -init_engine_lists(struct intel_engine_cs *engine)
> -{
> - INIT_LIST_HEAD(&engine->active_list);
> - INIT_LIST_HEAD(&engine->request_list);
> -}
The good news is that these should not be required (at least in the near
future).
> - if (engine == &dev_priv->engine[RCS] &&
> + if (engine == dev_priv->engine[RCS] &&
> instp_mode != dev_priv->relative_constants_mode) {
> ret = intel_ring_begin(params->request, 4);
> if (ret)
> @@ -1663,7 +1663,7 @@ static int gen8_emit_flush(struct drm_i915_gem_request *request,
>
> if (invalidate_domains & I915_GEM_GPU_DOMAINS) {
> cmd |= MI_INVALIDATE_TLB;
> - if (engine == &dev_priv->engine[VCS])
> + if (engine == dev_priv->engine[VCS])
> cmd |= MI_INVALIDATE_BSD;
> }
This style just needs to die. if (engine->id == VCS)
-Chris
--
Chris Wilson, Intel Open Source Technology Centre
More information about the Intel-gfx-trybot
mailing list