[Intel-gfx] [PATCH 04/22] drm/i915: Hold a ref to the ring while retiring

Tvrtko Ursulin tvrtko.ursulin at linux.intel.com
Mon Mar 18 10:46:28 UTC 2019


On 18/03/2019 10:37, Chris Wilson wrote:
> Quoting Tvrtko Ursulin (2019-03-18 10:31:57)
>>
>> On 18/03/2019 09:51, Chris Wilson wrote:
>>> As the final request on a ring may hold the reference to this ring (via
>>> retiring the last pinned context), we may find ourselves chasing a
>>> dangling pointer on completion of the list.
>>>
>>> A quick solution is to hold a reference to the ring itself as we retire
>>> along it so that we only free it after we stop dereferencing it.
>>
>> Is there a guilty commit to reference as Fixes: ?
> 
> It only becomes a problem with veng as we gain an immediate free path,
> whereas at the moment, context frees are deferred until they can acquire
> the struct_mutex. We cannot hit this path at the moment, but that we had
> to use the safe iterator implies that we were aware that the ring itself
> could disappear. If you wanted to pin it on something,
> 
> References: b887d6154624 ("drm/i915: Retire requests along rings")
> 
>>> Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
>>> ---
>>>    drivers/gpu/drm/i915/i915_request.c          |  6 +++++-
>>>    drivers/gpu/drm/i915/intel_engine_types.h    |  2 ++
>>>    drivers/gpu/drm/i915/intel_lrc.c             |  4 ++--
>>>    drivers/gpu/drm/i915/intel_ringbuffer.c      |  9 +++++----
>>>    drivers/gpu/drm/i915/intel_ringbuffer.h      | 13 ++++++++++++-
>>>    drivers/gpu/drm/i915/selftests/mock_engine.c |  1 +
>>>    6 files changed, 27 insertions(+), 8 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/i915_request.c b/drivers/gpu/drm/i915/i915_request.c
>>> index 9533a85cb0b3..0a3d94517d0a 100644
>>> --- a/drivers/gpu/drm/i915/i915_request.c
>>> +++ b/drivers/gpu/drm/i915/i915_request.c
>>> @@ -1332,8 +1332,12 @@ void i915_retire_requests(struct drm_i915_private *i915)
>>>        if (!i915->gt.active_requests)
>>>                return;
>>>    
>>> -     list_for_each_entry_safe(ring, tmp, &i915->gt.active_rings, active_link)
>>> +     list_for_each_entry_safe(ring, tmp,
>>> +                              &i915->gt.active_rings, active_link) {
>>> +             intel_ring_get(ring); /* last rq holds reference! */
>>>                ring_retire_requests(ring);
>>> +             intel_ring_put(ring);
>>> +     }
>>
>> Where does it chase a dangling pointer? It used the safe iterator already.
> 
> Inside ring_retire_requests(); the use of _safe here actually implies we
> met this problem already :)

I get it, the issue is during ring->request_list iteration in 
ring_retire_requests. How about move ring pinning in there so it is clearer?

Regards,

Tvrtko


More information about the Intel-gfx mailing list