[Intel-gfx] [PATCH v5 04/10] drm/i915/perf: implement active wait for noa configurations

Lionel Landwerlin lionel.g.landwerlin at intel.com
Thu Jun 27 08:52:34 UTC 2019


On 27/06/2019 11:27, Chris Wilson wrote:
> Quoting Lionel Landwerlin (2019-06-27 09:00:39)
>> diff --git a/drivers/gpu/drm/i915/i915_perf.c b/drivers/gpu/drm/i915/i915_perf.c
>> index 2f1dc9be3bfb..b2f5ba87921c 100644
>> --- a/drivers/gpu/drm/i915/i915_perf.c
>> +++ b/drivers/gpu/drm/i915/i915_perf.c
>> @@ -410,6 +410,8 @@ static int alloc_oa_config_buffer(struct drm_i915_private *i915,
>>          size_t config_length = 0;
>>          u32 *cs;
>>   
>> +       lockdep_assert_held(&i915->drm.struct_mutex);
> What inside *this* function depends on the struct_mutex? The noa_wait
> vma is already pinned and I didn't spot anything else added that might
> be contentious.


Thanks, will drop this.


>
>>          if (oa_config->mux_regs_len > 0) {
>>                  config_length += DIV_ROUND_UP(oa_config->mux_regs_len,
>>                                                MI_LOAD_REGISTER_IMM_MAX_REGS) * 4;
>> @@ -425,7 +427,7 @@ static int alloc_oa_config_buffer(struct drm_i915_private *i915,
>>                                                MI_LOAD_REGISTER_IMM_MAX_REGS) * 4;
>>                  config_length += oa_config->flex_regs_len * 8;
>>          }
>> -       config_length += 4; /* MI_BATCH_BUFFER_END */
>> +       config_length += 12; /* MI_BATCH_BUFFER_START into noa_wait loop */
>>          config_length = ALIGN(config_length, I915_GTT_PAGE_SIZE);
>>   
>>          bo = i915_gem_object_create_shmem(i915, config_length);
>> @@ -442,7 +444,12 @@ static int alloc_oa_config_buffer(struct drm_i915_private *i915,
>>          cs = write_cs_mi_lri(cs, oa_config->b_counter_regs, oa_config->b_counter_regs_len);
>>          cs = write_cs_mi_lri(cs, oa_config->flex_regs, oa_config->flex_regs_len);
>>   
>> -       *cs++ = MI_BATCH_BUFFER_END;
>> +
>> +       /* Jump into the NOA wait busy loop. */
>> +       *cs++ = (INTEL_GEN(i915) < 8 ?
>> +                MI_BATCH_BUFFER_START : MI_BATCH_BUFFER_START_GEN8);
>> +       *cs++ = i915->perf.oa.noa_wait->node.start;
> i915_ggtt_offset(i915->perf.oa.noa_wait);
> double checks that you are in the ggtt and so assuming u32 is valid.


Right, will do.


>
> This is operating inside the user context right? (If I remember
> correctly, you plug into the user dispatch) If so, where do you save and
> restore the CS_GPR overwritten?


Duh! In a previous iteration I was saving this in the scratch BO as a 
given location.

Should I alloc my own buffer?


Thanks a lot,


-Lionel


> -Chris
>



More information about the Intel-gfx mailing list