[Bug 84490] GPU HANG: ecode 0:0x85fffff8, in kwin [3405], reason: Ring hung, action: reset

bugzilla-daemon at freedesktop.org
Mon Sep 29 23:01:55 PDT 2014


https://bugs.freedesktop.org/show_bug.cgi?id=84490

Chris Wilson <chris at chris-wilson.co.uk> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO

--- Comment #1 from Chris Wilson <chris at chris-wilson.co.uk> ---
Should be fixed with

commit c4d69da167fa967749aeb70bc0e94a457e5d00c1
Author: Chris Wilson <chris at chris-wilson.co.uk>
Date:   Mon Sep 8 14:25:41 2014 +0100

    drm/i915: Evict CS TLBs between batches

    Running igt, I was encountering the invalid TLB bug on my 845g, despite
    that it was using the CS workaround. Examining the w/a buffer in the
    error state, showed that the copy from the user batch into the
    workaround itself was suffering from the invalid TLB bug (the first
    cacheline was broken with the first two words reversed). Time to try a
    fresh approach. This extends the workaround to write into each page of
    our scratch buffer in order to overflow the TLB and evict the invalid
    entries. This could be refined to only do so after we update the GTT,
    but for simplicity, we do it before each batch.

    I suspect this supersedes our current workaround, but for safety keep
    doing both.

    v2: The magic number shall be 2.

    This doesn't conclusively prove that it is the mythical TLB bug we've
    been trying to work around for so long. That it requires touching a
    number of pages to prevent the corruption indicates to me that it is TLB
    related, but the corruption (the reversed cacheline) is more subtle than
    a TLB bug, where we would expect it to read the wrong page entirely.

    Oh well, it prevents a reliable hang for me and so probably for others
    as well.

    Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
    Cc: Daniel Vetter <daniel.vetter at ffwll.ch>
    Cc: Ville Syrjälä <ville.syrjala at linux.intel.com>
    Cc: stable at vger.kernel.org
    Reviewed-by: Daniel Vetter <daniel.vetter at ffwll.ch>
    Signed-off-by: Jani Nikula <jani.nikula at intel.com>

I believe.
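
For readers unfamiliar with the eviction idea in the commit message, here is a toy model of it. This is only a sketch under loud assumptions: the ToyTLB class, its 8-entry capacity, its least-recently-used replacement and the page numbers are all hypothetical, and none of it is taken from the i915 driver or from the 845g hardware. The actual patch works on the GPU command streamer, not with CPU code like this. The model only shows why touching enough distinct scratch pages pushes an older, possibly invalid, translation out of a small cache.

```python
from collections import OrderedDict


class ToyTLB:
    """Hypothetical fully associative translation cache with LRU replacement."""

    def __init__(self, entries):
        self.entries = entries      # capacity; an assumed value, not a hardware fact
        self.cache = OrderedDict()  # page number -> cached translation label

    def touch(self, page, translation):
        """Access a page: refresh it on a hit, otherwise cache it and evict the oldest entry if full."""
        if page in self.cache:
            self.cache.move_to_end(page)
            return
        if len(self.cache) >= self.entries:
            self.cache.popitem(last=False)  # drop the least recently used entry
        self.cache[page] = translation


def evict_by_scratch_writes(tlb, scratch_base_page, scratch_pages):
    """Touch every page of a scratch buffer once, one access per page."""
    for i in range(scratch_pages):
        tlb.touch(scratch_base_page + i, "scratch")


tlb = ToyTLB(entries=8)
tlb.touch(0x100, "stale")  # stand-in for an invalid entry left over from an earlier batch
evict_by_scratch_writes(tlb, scratch_base_page=0x200, scratch_pages=8)
stale_survives = 0x100 in tlb.cache  # False in this model: the stale entry was pushed out
```

In this model, touching fewer scratch pages than the cache holds leaves the stale entry in place, which fits the commit's remark that a number of pages must be touched. How many pages the real hardware needs is not something this sketch can say.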
