[PATCH v2 4/4] drm/xe/xe2: Add performance tuning for L3 cache flushing

Gustavo Sousa gustavo.sousa at intel.com
Fri Sep 20 20:06:33 UTC 2024


Quoting Matt Roper (2024-09-20 16:26:44-03:00)
>On Fri, Sep 20, 2024 at 02:12:11PM -0300, Gustavo Sousa wrote:
>> A recommended performance tuning for LNL related to L3 cache flushing
>> was recently introduced in Bspec. Implement it.
>> 
>> v2:
>>   - Fix reference to Bspec. (Sai Teja, Tejas)
>>   - Use correct register name for "Tuning: L3 RW flush all Cache". (Sai
>>     Teja)
>>   - Use SCRATCH3_LBCF (with the underscore) for better readability.
>> 
>> Bspec: 72161
>> Cc: Sai Teja Pottumuttu <sai.teja.pottumuttu at intel.com>
>> Cc: Tejas Upadhyay <tejas.upadhyay at intel.com>
>> Signed-off-by: Gustavo Sousa <gustavo.sousa at intel.com>
>> ---
>>  drivers/gpu/drm/xe/regs/xe_gt_regs.h | 5 +++++
>>  drivers/gpu/drm/xe/xe_tuning.c       | 8 ++++++++
>>  2 files changed, 13 insertions(+)
>> 
>> diff --git a/drivers/gpu/drm/xe/regs/xe_gt_regs.h b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
>> index 07315eb72eff..8d8f6a113a86 100644
>> --- a/drivers/gpu/drm/xe/regs/xe_gt_regs.h
>> +++ b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
>> @@ -388,6 +388,9 @@
>>  #define L3SQCREG3                                XE_REG_MCR(0xb108)
>>  #define   COMPPWOVERFETCHEN                        REG_BIT(28)
>>  
>> +#define SCRATCH3_LBCF                                XE_REG_MCR(0xb154)
>> +#define   RWFLUSHALLEN                                REG_BIT(17)
>> +
>>  #define XEHP_L3SQCREG5                                XE_REG_MCR(0xb158)
>>  #define   L3_PWM_TIMER_INIT_VAL_MASK                REG_GENMASK(9, 0)
>>  
>> @@ -405,6 +408,8 @@
>>  
>>  #define XE2LPM_L3SQCREG3                        XE_REG_MCR(0xb608)
>>  
>> +#define XE2LPM_SCRATCH3_LBCF                        XE_REG_MCR(0xb654)
>> +
>>  #define XE2LPM_L3SQCREG5                        XE_REG_MCR(0xb658)
>>  
>>  #define XE2_TDF_CTRL                                XE_REG(0xb418)
>> diff --git a/drivers/gpu/drm/xe/xe_tuning.c b/drivers/gpu/drm/xe/xe_tuning.c
>> index 230369f108ab..39f50b6f94ce 100644
>> --- a/drivers/gpu/drm/xe/xe_tuning.c
>> +++ b/drivers/gpu/drm/xe/xe_tuning.c
>> @@ -75,6 +75,14 @@ static const struct xe_rtp_entry_sr gt_tunings[] = {
>>            XE_RTP_ACTIONS(FIELD_SET(STATELESS_COMPRESSION_CTRL, UNIFIED_COMPRESSION_FORMAT,
>>                                     REG_FIELD_PREP(UNIFIED_COMPRESSION_FORMAT, 0)))
>>          },
>> +        { XE_RTP_NAME("Tuning: L3 RW flush all Cache"),
>> +          XE_RTP_RULES(GRAPHICS_VERSION_RANGE(2004, XE_RTP_END_VERSION_UNDEFINED)),
>
>The bspec tagging on this one is a bit of a mess right now, but when I
>look at the ticket that originally added this to the bspec, there's an
>underlined comment there that says "only for LNL."  We might want to
>check with the hardware guys to clarify if that's still the case and, if
>so, make this one of the special cases where the tuning applies solely
>to this one version.

Okay. I raised a question on the original ticket.

We might as well make this LNL-specific and then come back and use an
open range in necessary when we get more info. Thoughts?

--
Gustavo Sousa

>
>
>Matt
>
>> +          XE_RTP_ACTIONS(SET(SCRATCH3_LBCF, RWFLUSHALLEN))
>> +        },
>> +        { XE_RTP_NAME("Tuning: L3 RW flush all cache - media"),
>> +          XE_RTP_RULES(MEDIA_VERSION_RANGE(2000, XE_RTP_END_VERSION_UNDEFINED)),
>> +          XE_RTP_ACTIONS(SET(XE2LPM_SCRATCH3_LBCF, RWFLUSHALLEN))
>> +        },
>>  
>>          {}
>>  };
>> -- 
>> 2.46.1
>> 
>
>-- 
>Matt Roper
>Graphics Software Engineer
>Linux GPU Platform Enablement
>Intel Corporation


More information about the Intel-xe mailing list