[Bug 109605] [CI][SHARDS]: igt@gem_mmap_gtt@hang - incomplete/timeout - i915 0000:00:02.0: i915_reset_device timed out, cancelling all in-flight rendering.

bugzilla-daemon at freedesktop.org
Wed Mar 6 16:21:46 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=109605

Martin Peres <martin.peres at free.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |CLOSED

--- Comment #3 from Martin Peres <martin.peres at free.fr> ---
(In reply to Chris Wilson from comment #2)
> commit aeaaa55c7368ea0e7c195baa35dea37b806efb11
> Author: Chris Wilson <chris at chris-wilson.co.uk>
> Date:   Tue Feb 12 13:08:30 2019 +0000
> 
>     drm/i915: Recursive i915_reset_trylock() verboten
>     
>     We cannot nest i915_reset_trylock() as the inner may wait for the
>     I915_RESET_BACKOFF which in turn is waiting upon sync_srcu who is
>     waiting for our outermost lock. As we take the reset srcu around the
>     fence update, we have to defer taking it in i915_gem_fault() until after
>     we acquire the pin on the fence to avoid nesting. This is a little ugly,
>     but still works. If a reset occurs between i915_vma_pin_fence() and the
>     second reset lock, the reset will restore the fence register back to the
>     pinned value before the reset lock allows us to proceed (our mmap won't
>     be revoked as we haven't yet marked it as being a userfault as that
>     requires us to hold the reset lock), so the pagefault is still
>     serialised with the revocation in reset.
>     
>     Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109605
>     Fixes: 2caffbf11762 ("drm/i915: Revoke mmaps and prevent access to fence registers across reset")
>     Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
>     Cc: Mika Kuoppala <mika.kuoppala at intel.com>
>     Reviewed-by: Mika Kuoppala <mika.kuoppala at intel.com>
>     Link: https://patchwork.freedesktop.org/patch/msgid/20190212130831.14425-1-chris at chris-wilson.co.uk

10 runs without any issues, as opposed to multiple failures per run. Seems like
it was the right fix! Thanks!
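
For readers following the quoted commit message, here is a rough user-space sketch of why nesting the reset lock stalls and why deferring the second lock does not. It is only a toy model under stated assumptions, not the i915 code: the class ToyResetLock and the functions nested_order() and deferred_order() are hypothetical names invented for illustration, a "would block" condition is modelled as a False return value, and the reader count stands in for the SRCU read-side sections.

```python
class ToyResetLock:
    """Toy stand-in for the reset lock (hypothetical, not the kernel API)."""

    def __init__(self):
        self.readers = 0      # read-side sections currently held
        self.backoff = False  # stands in for I915_RESET_BACKOFF

    def begin_reset(self):
        # A reset announces itself; new readers must now wait.
        self.backoff = True

    def try_finish_reset(self):
        # The reset can only complete once every reader has left
        # (stands in for the wait on sync_srcu).
        if self.readers:
            return False
        self.backoff = False
        return True

    def trylock(self):
        # False means "would block": a reset is pending.
        if self.backoff:
            return False
        self.readers += 1
        return True

    def unlock(self):
        self.readers -= 1


def nested_order(lock):
    """Old ordering: the inner lock is taken while the outer one is held."""
    assert lock.trylock()                # outer lock in the fault path
    lock.begin_reset()                   # a reset arrives
    inner_ok = lock.trylock()            # inner lock waits for the reset
    reset_ok = lock.try_finish_reset()   # reset waits for the outer lock
    return inner_ok, reset_ok


def deferred_order(lock):
    """New ordering: the fence step drops its lock before the fault takes one."""
    assert lock.trylock()                # lock held around the fence update
    lock.unlock()                        # released before the fault continues
    lock.begin_reset()                   # reset lands between the two steps
    first_try = lock.trylock()           # fault waits for the reset
    reset_ok = lock.try_finish_reset()   # no readers left, reset completes
    second_try = lock.trylock()          # fault now proceeds
    lock.unlock()
    return first_try, reset_ok, second_try
```

In this model nested_order(ToyResetLock()) returns (False, False), meaning neither the inner lock nor the reset can make progress, while deferred_order(ToyResetLock()) returns (False, True, True): the fault waits once, the reset completes, and the fault then proceeds.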
