[Intel-gfx] [PATCH v2 3/3] drm/i915/gt: Only kick the signal worker if there's been an update

Karolina Drobnik karolina.drobnik at intel.com
Tue Jul 12 06:29:32 UTC 2022


Hi Rodrigo,

Many thanks for taking another look at the patches.

On 08.07.2022 16:40, Rodrigo Vivi wrote:
> On Fri, Jul 08, 2022 at 04:20:13PM +0200, Karolina Drobnik wrote:
>> From: Chris Wilson <chris at chris-wilson.co.uk>
>>
>> One impact of commit 047a1b877ed4 ("dma-buf & drm/amdgpu: remove
>> dma_resv workaround") is that it stores many, many more fences. Whereas
>> adding an exclusive fence used to remove the shared fence list, that
>> list is now preserved and the write fences included into the list. Not
>> just a single write fence, but now a write/read fence per context. That
>> causes us to have to track more fences than before (albeit half of those
>> are redundant), and we trigger more interrupts for multi-engine
>> workloads.
>>
>> As part of reducing the impact from handling more signaling, we observe
>> we only need to kick the signal worker after adding a fence iff we have
> 
> s/iff/if

This is fine, it means "if, and only if"

>> good cause to believe that there is work to be done in processing the
>> fence i.e. we either need to enable the interrupt or the request is
>> already complete but we don't know if we saw the interrupt and so need
>> to check signaling.
>>
>> References: 047a1b877ed4 ("dma-buf & drm/amdgpu: remove dma_resv workaround")
>> Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
>> Signed-off-by: Karolina Drobnik <karolina.drobnik at intel.com>
>> ---
>>   drivers/gpu/drm/i915/gt/intel_breadcrumbs.c | 3 ++-
>>   1 file changed, 2 insertions(+), 1 deletion(-)
>>
>> diff --git a/drivers/gpu/drm/i915/gt/intel_breadcrumbs.c b/drivers/gpu/drm/i915/gt/intel_breadcrumbs.c
>> index 9dc9dccf7b09..ecc990ec1b95 100644
>> --- a/drivers/gpu/drm/i915/gt/intel_breadcrumbs.c
>> +++ b/drivers/gpu/drm/i915/gt/intel_breadcrumbs.c
>> @@ -399,7 +399,8 @@ static void insert_breadcrumb(struct i915_request *rq)
>>   	 * the request as it may have completed and raised the interrupt as
>>   	 * we were attaching it into the lists.
>>   	 */
>> -	irq_work_queue(&b->irq_work);
>> +	if (!b->irq_armed || __i915_request_is_complete(rq))
> 
> would we need the READ_ONCE(irq_armed) ?
> would we need to use the irq_lock?

I'll rephrase Chris' answer here:

No, it doesn't need either, the workqueuing is unrelated to the 
irq_lock. The worker enables the interrupt if there are any breadcrumbs 
at the end of its task. When queuing the work, we have to consider the 
race conditions:

   - If the worker is running and b->irq_armed at this point, we know the
     irq will remain armed
   - If the worker is running and !b->irq_armed at this point, we will
     kick the worker again -- it doesn't make any difference then if the
     worker is in the process of trying to arm the irq
   - If the worker is not running, b->irq_armed is constant, no race

Ergo, the only race condition is where the worker is trying to arm the 
irq, and we end up running the worker a second time.

The only danger to consider is _not_ running the worker when we need to. 
Once we put the breadcrumb on the signal, it has to be removed at some 
point. Normally this is only performed by the worker, so we have to 
confident that the worker will be run. We know that if the irq is armed 
(after we have attached this breadcrumb) there must be another run of 
the worker.

The other condition then, if the irq is armed, but the breadcrumb is 
already completed, we may not see an interrupt from the gpu as the 
breadcrumb may have completed as we attached it, keeping the worker 
alive, but not noticing the completed breadcrumb in that case, we have 
to simulate the interrupt ourselves and give the worker a kick.

The irq_lock is immaterial in both cases.

>> +		irq_work_queue(&b->irq_work);
>>   }
>>   
>>   bool i915_request_enable_breadcrumb(struct i915_request *rq)
>> -- 
>> 2.25.1
>>


More information about the Intel-gfx mailing list