[Bug 89360] [bdw-u iommu] DMAR error -> GPU hang

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Aug 29 17:17:07 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=89360

Yves-Alexis <corsac at debian.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|CLOSED                      |REOPENED
         Resolution|FIXED                       |---

--- Comment #93 from Yves-Alexis <corsac at debian.org> ---
I just had a chance to test 4.18 and I have to say it's *not* fixed. Maybe it's
a different bug, but in any case I had a “soft” freeze with following message
in dmesg:

Aug 29 19:04:17 scapa kernel: [   26.943249] DMAR: DRHD: handling fault status
reg 3
Aug 29 19:04:17 scapa kernel: [   26.943255] DMAR: [DMA Read] Request device
[00:02.0] fault addr 4600000 [fault reason 23] Unknown
Aug 29 19:04:17 scapa kernel: [   26.943259] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:04:17 scapa kernel: [   26.943262] DMAR: [DMA Read] Request device
[00:02.0] fault addr 4613000 [fault reason 23] Unknown
Aug 29 19:04:17 scapa kernel: [   26.943264] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:04:17 scapa kernel: [   26.943267] DMAR: [DMA Read] Request device
[00:02.0] fault addr 461b000 [fault reason 23] Unknown
Aug 29 19:04:17 scapa kernel: [   26.943269] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:04:24 scapa kernel: [   33.831279] [drm] GPU HANG: ecode
8:0:0x85dffffb, in Xorg [1028], reason: hang on rcs0, action: reset
Aug 29 19:04:24 scapa kernel: [   33.831280] [drm] GPU hangs can indicate a bug
anywhere in the entire gfx stack, including userspace.
Aug 29 19:04:24 scapa kernel: [   33.831281] [drm] Please file a _new_ bug
report on bugs.freedesktop.org against DRI -> DRM/Intel
Aug 29 19:04:24 scapa kernel: [   33.831281] [drm] drm/i915 developers can then
reassign to the right component if it's not a kernel issue.
Aug 29 19:04:24 scapa kernel: [   33.831282] [drm] The gpu crash dump is
required to analyze gpu hangs, so please always attach it.
Aug 29 19:04:24 scapa kernel: [   33.831282] [drm] GPU crash dump saved to
/sys/class/drm/card0/error
Aug 29 19:04:24 scapa kernel: [   33.831298] i915 0000:00:02.0: Resetting rcs0
for hang on rcs0
Aug 29 19:04:24 scapa kernel: [   33.838481] dmar_fault: 53 callbacks
suppressed
Aug 29 19:04:24 scapa kernel: [   33.838482] DMAR: DRHD: handling fault status
reg 3
Aug 29 19:04:24 scapa kernel: [   33.838487] DMAR: [DMA Write] Request device
[00:02.0] fault addr 4641000 [fault reason 23] Unknown
Aug 29 19:04:32 scapa kernel: [   41.824158] i915 0000:00:02.0: Resetting rcs0
for hang on rcs0
Aug 29 19:04:32 scapa kernel: [   41.824723] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:04:32 scapa kernel: [   41.824729] DMAR: [DMA Write] Request device
[00:02.0] fault addr 17f000 [fault reason 23] Unknown
Aug 29 19:04:40 scapa kernel: [   49.813478] i915 0000:00:02.0: Resetting rcs0
for hang on rcs0
Aug 29 19:04:48 scapa kernel: [   57.804899] i915 0000:00:02.0: Resetting rcs0
for hang on rcs0
Aug 29 19:04:56 scapa kernel: [   65.799728] i915 0000:00:02.0: Resetting rcs0
for hang on rcs0
Aug 29 19:04:56 scapa kernel: [   65.882208] wlan0: deauthenticating from
14:0c:76:bf:71:fc by local choice (Reason: 3=DEAUTH_LEAVING)
Aug 29 19:04:56 scapa kernel: [   65.902446] IPv6: ADDRCONF(NETDEV_UP): wlan0:
link is not ready
Aug 29 19:04:57 scapa kernel: [   66.510770] DMAR: DRHD: handling fault status
reg 3
Aug 29 19:04:57 scapa kernel: [   66.510778] DMAR: [DMA Write] Request device
[00:02.0] fault addr fffc6000 [fault reason 23] Unknown
Aug 29 19:04:57 scapa kernel: [   66.510781] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:04:57 scapa kernel: [   66.510784] DMAR: [DMA Write] Request device
[00:02.0] fault addr 4d000 [fault reason 23] Unknown
Aug 29 19:04:57 scapa kernel: [   66.510788] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:04:57 scapa kernel: [   66.510791] DMAR: [DMA Write] Request device
[00:02.0] fault addr 51000 [fault reason 23] Unknown
Aug 29 19:04:57 scapa kernel: [   66.510802] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:02 scapa kernel: [   71.509221] dmar_fault: 6733586 callbacks
suppressed
Aug 29 19:05:02 scapa kernel: [   71.509222] DMAR: DRHD: handling fault status
reg 3
Aug 29 19:05:02 scapa kernel: [   71.509230] DMAR: [DMA Write] Request device
[00:02.0] fault addr 32ff4b000 [fault reason 23] Unknown
Aug 29 19:05:02 scapa kernel: [   71.509233] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:02 scapa kernel: [   71.509236] DMAR: [DMA Write] Request device
[00:02.0] fault addr 32ff53000 [fault reason 23] Unknown
Aug 29 19:05:02 scapa kernel: [   71.509239] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:02 scapa kernel: [   71.509241] DMAR: [DMA Write] Request device
[00:02.0] fault addr 32ff57000 [fault reason 23] Unknown
Aug 29 19:05:02 scapa kernel: [   71.509244] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:07 scapa kernel: [   76.511769] dmar_fault: 6751341 callbacks
suppressed
Aug 29 19:05:07 scapa kernel: [   76.511770] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:07 scapa kernel: [   76.511775] DMAR: [DMA Write] Request device
[00:02.0] fault addr 66e6bc000 [fault reason 23] Unknown
Aug 29 19:05:07 scapa kernel: [   76.511778] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:07 scapa kernel: [   76.511781] DMAR: [DMA Write] Request device
[00:02.0] fault addr 66e6c3000 [fault reason 23] Unknown
Aug 29 19:05:07 scapa kernel: [   76.511784] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:07 scapa kernel: [   76.511787] DMAR: [DMA Write] Request device
[00:02.0] fault addr 66e6c8000 [fault reason 23] Unknown
Aug 29 19:05:07 scapa kernel: [   76.511790] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:12 scapa kernel: [   81.514717] dmar_fault: 6802731 callbacks
suppressed
Aug 29 19:05:12 scapa kernel: [   81.514718] DMAR: DRHD: handling fault status
reg 3
Aug 29 19:05:12 scapa kernel: [   81.514722] DMAR: [DMA Write] Request device
[00:02.0] fault addr 9ade03000 [fault reason 23] Unknown
Aug 29 19:05:12 scapa kernel: [   81.514725] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:12 scapa kernel: [   81.514728] DMAR: [DMA Write] Request device
[00:02.0] fault addr 9ade0a000 [fault reason 23] Unknown
Aug 29 19:05:12 scapa kernel: [   81.514731] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:12 scapa kernel: [   81.514733] DMAR: [DMA Write] Request device
[00:02.0] fault addr 9ade0e000 [fault reason 23] Unknown
Aug 29 19:05:12 scapa kernel: [   81.514736] DMAR: DRHD: handling fault status
reg 2
Aug 29 19:05:12 scapa kernel: [   81.794708] i915 0000:00:02.0: Resetting rcs0
for no progress on rcs0
Aug 29 19:05:20 scapa kernel: [   89.793873] i915 0000:00:02.0: Resetting chip
for hang on rcs0
Aug 29 19:05:20 scapa kernel: [   89.793938] i915 0000:00:02.0: GPU recovery
failed

Unfortunately because of the soft freeze I didn't have a chance to recover
/sys/class/drm/card0/error. But the hang happened pretty soon after boot on my
broadwell CPU, pretty much as soon as I enabled the external screen when logged
on the desktop.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are the QA Contact for the bug.
You are on the CC list for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20180829/d1bd1468/attachment-0001.html>


More information about the intel-gfx-bugs mailing list