[Intel-gfx] [PATCH] drm/i915: Introduce concept of a sub-platform
Lucas De Marchi
lucas.demarchi at intel.com
Fri Mar 15 18:40:50 UTC 2019
On Fri, Mar 15, 2019 at 05:31:05PM +0000, Tvrtko Ursulin wrote:
>
>On 15/03/2019 17:12, Lucas De Marchi wrote:
>>On Fri, Mar 15, 2019 at 12:26:33PM +0000, Tvrtko Ursulin wrote:
>>>From: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>>
>>>Concept of a sub-platform already exist in our code (like ULX and ULT
>>>platform variants and similar),implemented via the macros which check a
>>>list of device ids to determine a match.
>>>
>>>With this patch we consolidate device ids checking into a single function
>>>called during early driver load.
>>>
>>>A few low bits in the platform mask are reserved for sub-platform
>>>identification and defined as a per-platform namespace.
>>>
>>>At the same time it future proofs the platform_mask handling by preparing
>>>the code for easy extending, and tidies the very verbose WARN strings
>>>generated when IS_PLATFORM macros are embedded into a WARN type
>>>statements.
>>>
>>>The approach is also beneficial to driver size, with an combined
>>>shrink of
>>>code and strings of around 1.7 kiB.
>>>
>>>v2: Fixed IS_SUBPLATFORM. Updated commit msg.
>>>v3: Chris was right, there is an ordering problem.
>>>
>>>v4:
>>>* Catch-up with new sub-platforms.
>>>* Rebase for RUNTIME_INFO.
>>>* Drop subplatform mask union tricks and convert platform_mask to an
>>> array for extensibility.
>>>
>>>Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>>Suggested-by: Chris Wilson <chris at chris-wilson.co.uk>
>>>Cc: Chris Wilson <chris at chris-wilson.co.uk>
>>>Cc: Jani Nikula <jani.nikula at intel.com>
>>>Cc: Lucas De Marchi <lucas.demarchi at intel.com>
>>>Cc: Jose Souza <jose.souza at intel.com>
>>>---
>>>drivers/gpu/drm/i915/i915_drv.c | 7 +-
>>>drivers/gpu/drm/i915/i915_drv.h | 110 +++++++++++++++--------
>>>drivers/gpu/drm/i915/i915_pci.c | 2 +-
>>>drivers/gpu/drm/i915/intel_device_info.c | 79 ++++++++++++++++
>>>drivers/gpu/drm/i915/intel_device_info.h | 28 +++++-
>>>5 files changed, 179 insertions(+), 47 deletions(-)
>>>
>>>diff --git a/drivers/gpu/drm/i915/i915_drv.c
>>>b/drivers/gpu/drm/i915/i915_drv.c
>>>index 0d743907e7bc..3218350cd225 100644
>>>--- a/drivers/gpu/drm/i915/i915_drv.c
>>>+++ b/drivers/gpu/drm/i915/i915_drv.c
>>>@@ -863,6 +863,8 @@ static int i915_driver_init_early(struct
>>>drm_i915_private *dev_priv)
>>> if (i915_inject_load_failure())
>>> return -ENODEV;
>>>
>>>+ intel_device_info_subplatform_init(dev_priv);
>>>+
>>> spin_lock_init(&dev_priv->irq_lock);
>>> spin_lock_init(&dev_priv->gpu_error.lock);
>>> mutex_init(&dev_priv->backlight_lock);
>>>@@ -1752,10 +1754,11 @@ static void i915_welcome_messages(struct
>>>drm_i915_private *dev_priv)
>>> if (drm_debug & DRM_UT_DRIVER) {
>>> struct drm_printer p = drm_debug_printer("i915 device info:");
>>>
>>>- drm_printf(&p, "pciid=0x%04x rev=0x%02x platform=%s gen=%i\n",
>>>+ drm_printf(&p, "pciid=0x%04x rev=0x%02x platform=%s (%x)
>>>gen=%i\n",
>>> INTEL_DEVID(dev_priv),
>>> INTEL_REVID(dev_priv),
>>> intel_platform_name(INTEL_INFO(dev_priv)->platform),
>>>+
>>>RUNTIME_INFO(dev_priv)->platform_mask[INTEL_INFO(dev_priv)->platform
>>>/ (BITS_PER_TYPE(RUNTIME_INFO(dev_priv)->platform_mask[0]) -
>>>INTEL_SUBPLATFORM_BITS)],
>>
>>bug here, INTEL_SUBPLATFORM_BITS should be outside of []. Bad things
>>will happen for platform=32 /o\
>
>? [32 / (32 - 3)] = [1], for which there is a BUILD_BUG_ON with a
>comment saying to increase size of array.
I think I missed a "(" and thought that would be (32 / 32) - 3
anyway, you are printing confusing information here since you only print
one u32.
>
>>
>>> INTEL_GEN(dev_priv));
>>>
>>> intel_device_info_dump_flags(INTEL_INFO(dev_priv), &p);
>>>@@ -1798,8 +1801,6 @@ i915_driver_create(struct pci_dev *pdev,
>>>const struct pci_device_id *ent)
>>> memcpy(device_info, match_info, sizeof(*device_info));
>>> RUNTIME_INFO(i915)->device_id = pdev->device;
>>>
>>>- BUILD_BUG_ON(INTEL_MAX_PLATFORMS >
>>>- BITS_PER_TYPE(device_info->platform_mask));
>>> BUG_ON(device_info->gen > BITS_PER_TYPE(device_info->gen_mask));
>>>
>>> return i915;
>>>diff --git a/drivers/gpu/drm/i915/i915_drv.h
>>>b/drivers/gpu/drm/i915/i915_drv.h
>>>index dccb6006aabf..34282cf66cb0 100644
>>>--- a/drivers/gpu/drm/i915/i915_drv.h
>>>+++ b/drivers/gpu/drm/i915/i915_drv.h
>>>@@ -2281,7 +2281,46 @@ static inline unsigned int
>>>i915_sg_segment_size(void)
>>>#define IS_REVID(p, since, until) \
>>> (INTEL_REVID(p) >= (since) && INTEL_REVID(p) <= (until))
>>>
>>>-#define IS_PLATFORM(dev_priv, p)
>>>(INTEL_INFO(dev_priv)->platform_mask & BIT(p))
>>>+#define __IS_PLATFORM(dev_priv, p) \
>>>+({ \
>>>+ const unsigned int pbits__ = \
>>>+ BITS_PER_TYPE(RUNTIME_INFO(dev_priv)->platform_mask[0]) - \
>>>+ INTEL_SUBPLATFORM_BITS; \
>>>+ const unsigned int pi__ = (p) / pbits__; \
>>>+ const unsigned int pb__ = (p) % pbits__ + INTEL_SUBPLATFORM_BITS; \
>>>+\
>>>+ BUILD_BUG_ON(!__builtin_constant_p(p)); \
>>>+\
>>>+ (RUNTIME_INFO(dev_priv)->platform_mask[pi__] & BIT(pb__)); \
>>
>>
>>Ugh. That double dword fiddling is way too ugly. IMO it is not buying us
>>anything. Just use a u64 rather than the double dword. Your approach may
>>have a small benefit on ARCH=i386, but has the burden of carrying all
>>this forward. The diff below (only build-tested) is on top of yours,
>>which is basically equivalent to "move to u64 and then add the
>>subplatform part".
>>
>> text data bss dec hex filename
>>1834620 40454 4176 1879250 1cacd2 drivers/gpu/drm/i915/i915.o.yours
>>1834710 40454 4176 1879340 1cad2c drivers/gpu/drm/i915/i915.o
>
>The cost of going u64 would be higher than what you saw if bits above
>were actually used I think. But would have to check the output to be
>sure. It was at least a year ago I think I last played with this.
I challenge that. My butt feeling here is pretty strong that's not
true.
>Benefit of the u32 array approach is that it avoids that even on
>64-bit builds.
my point is on 64-bit builds... It may suffer a little on m32.
Lucas De Marchi
>As it stands v5 of my patch has minimal positive effect on code size
>(sub 1k). Maybe a bit better in non-debug builds. But the main point
>is about the devid checking consolidation.
>
>It is of course open to discussion.
>
>Regards,
>
>Tvrtko
More information about the Intel-gfx
mailing list