[Intel-gfx] [PATCH 3/3] drm/i915: Allow optimized platform checks

Tvrtko Ursulin tvrtko.ursulin at linux.intel.com
Mon Sep 25 08:22:33 UTC 2017


On 20/09/2017 10:56, Tvrtko Ursulin wrote:
> On 20/09/2017 10:39, Jani Nikula wrote:
>> On Wed, 20 Sep 2017, Tvrtko Ursulin <tursulin at ursulin.net> wrote:
>>> From: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>>
>>> If we store the platform as a bitmask, and convert the
>>> IS_PLATFORM macro to use it, we allow the compiler to
>>> merge the IS_PLATFORM(a) || IS_PLATFORM(b) || ... checks
>>> into a single conditional.
>>>
>>> Even with the added BUG_ON this saves almost 1k of text:
>>>
>>>      text           data     bss     dec     hex filename
>>> -1460254          60014    3656 1523924  1740d4 
>>> drivers/gpu/drm/i915/i915.ko
>>> +1459260          60026    3656 1522942  173cfe 
>>> drivers/gpu/drm/i915/i915.ko
>>>
>>> Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>> Cc: Jani Nikula <jani.nikula at intel.com>
>>> ---
>>>   drivers/gpu/drm/i915/i915_drv.c | 6 ++++++
>>>   drivers/gpu/drm/i915/i915_drv.h | 3 ++-
>>>   2 files changed, 8 insertions(+), 1 deletion(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/i915_drv.c 
>>> b/drivers/gpu/drm/i915/i915_drv.c
>>> index b1f96eb1be16..c3bd4b7cb19b 100644
>>> --- a/drivers/gpu/drm/i915/i915_drv.c
>>> +++ b/drivers/gpu/drm/i915/i915_drv.c
>>> @@ -869,6 +869,12 @@ static int i915_driver_init_early(struct 
>>> drm_i915_private *dev_priv,
>>>       memcpy(device_info, match_info, sizeof(*device_info));
>>>       device_info->device_id = dev_priv->drm.pdev->device;
>>> +    BUILD_BUG_ON(sizeof(device_info->platform_mask) * BITS_PER_BYTE <
>>> +             (INTEL_MAX_PLATFORMS - 1));
>>> +    BUG_ON(device_info->platform == 0  >>> +           device_info->platform >= INTEL_MAX_PLATFORMS);
>>> +    device_info->platform_mask = BIT(device_info->platform - 1);
>>
>> Please just lose the -1, pretty please?
>>
>>> +
>>>       BUG_ON(device_info->gen > sizeof(device_info->gen_mask) * 
>>> BITS_PER_BYTE);
>>>       device_info->gen_mask = BIT(device_info->gen - 1);
>>> diff --git a/drivers/gpu/drm/i915/i915_drv.h 
>>> b/drivers/gpu/drm/i915/i915_drv.h
>>> index 950aa109f8cb..81211f23326a 100644
>>> --- a/drivers/gpu/drm/i915/i915_drv.h
>>> +++ b/drivers/gpu/drm/i915/i915_drv.h
>>> @@ -861,6 +861,7 @@ struct intel_device_info {
>>>       u8 ring_mask; /* Rings supported by the HW */
>>>       enum intel_platform platform;
>>> +    u32 platform_mask;
>>>       u32 display_mmio_offset;
>>> @@ -2984,7 +2985,7 @@ intel_info(const struct drm_i915_private 
>>> *dev_priv)
>>>   #define IS_REVID(p, since, until) \
>>>       (INTEL_REVID(p) >= (since) && INTEL_REVID(p) <= (until))
>>> -#define IS_PLATFORM(dev_priv, p) ((dev_priv)->info.platform == (p))
>>> +#define IS_PLATFORM(dev_priv, p) ((dev_priv)->info.platform_mask & 
>>> BIT((p) - 1))
>>
>> What would the result be without platform_mask and just:
>>
>> #define IS_PLATFORM(dev_priv, p) (BIT((dev_priv)->info.platform) & 
>> BIT(p))
> 
> More code I'm afraid. But the problem of 32 platforms limit makes it 
> problematic for me. Because I checked, and going to u64 for the 
> platform_mask grows the code 100-200 bytes over the starting point. We'd 
> keep the single conditional advantage but I don't know, feels like not 
> worth it in that case. It could only last for a couple years before we 
> would need to go to u64.

On the other hand we could take this, and by the time would need to 
grown the mask field, we can a) enjoy the size saving, b) by that time 
new platform code will dwarf the 100-200 bytes going to u64 will add, 
and c) 100-200 bytes might even turn to a gain again, since new or-ed 
IS_platform checks will appear. Thoughts?

Regards,

Tvrtko


More information about the Intel-gfx mailing list