[Bug 92235] Graphics frozen / stuck after random time (minutes-hours) messages contain "drm:intel_pipe_update_end [i915]] *ERROR* Atomic update failure on pipe A ..."

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Nov 13 03:57:08 PST 2015


https://bugs.freedesktop.org/show_bug.cgi?id=92235

--- Comment #9 from Jan Bertran <joanbe at gmail.com> ---
First a bit of history about the issue.
One year ago we tested our games in single screen mode under Kubuntu 14.10 and
J1900 system. It worked for weeks without any issue.
Then an update on xf86-video-intel always produced a hang and always in a few 
seconds. We went back to original xf86-video-intel file.

Later we started making dual screen versions of our games.
The window setup keeps being similar. The app window covers whole framebuffer,
single OpenGl context (No compositor) and page flip gets activated as in single 
monitor setup. Then started to happen graphics frozen issues. We have tried
for several releases of Ubuntus (15.04 15.10 kernels 3.10, 4.1 4.2 4.3) and 
Fedora 22 with same result but different behavior and time to hang
(some could be recovered switching to console vt and back to X, others left 
system unusable).
Lately all hangs on J1900 left system totally inaccessible so I switched 
testing to N3150.

About test without OpenGL I'm not sure to understand.
I'm not sure how to test page flip without OpenGL or maybe you suggest 
that depending on the gpu load the behavior can change.
I think so, our game needs less time to hang than a test application I made 
based on glxgears.
Also about timing we setup frame rate to 30fps (60 Hz monitor and 
glXSwapIntervalMESA(2) or glXSwapIntervalSGI(2) )
While almost all applications just sync to VBlank with no divisor, so this 
could be an uncommon case.

The last tests done are under Kubuntu 15.10 with kernels 

linux-image-4.3.0-994-generic_4.3.0-994.201510162200_amd64 and 
libdrm2_2.4.65+git20150922.f3c6740f-0ubuntu0ricotz
and more recently with
linux-image-4.3.0-994-generic_4.3.0-994.201511052100_amd64 and 
libdrm2_2.4.65+git20151026.c745e541-0ubuntu0ricotz

And with latest updates I have a problem: the test application does not hang 
(at least for 3 days) without "drm.debug=0x1e"
but it does hang when drm debug is enabled, our game instead keeps hanging in 1 
day regardless of drm debug.
It seems also that if intel_gpu_top is running on an ssh terminal graphics hang
in much less time.
Maybe system/gpu load influences on some race condition ?
Test app is just glxgears hacked with dummy textures and more gears and window 
setup using all display size, but our game usually is on the limit of 30fps
(sporadically goes down to 20fps).

Next week I will do requested tests of xf86-video-intel with debug enabled, and 
later a test app that just fills framebuffer with single color.
At the moment cat /sys/kernel/debug/dri/0/i915_gem_pageflip gives
either:
No flip due on pipe A (plane A)
No flip due on pipe B (plane B)
No flip due on pipe C (plane C)

or like:

Flip queued on pipe A (plane A)
Flip queued on render ring at seqno 13434, next seqno 13435 [current breadcrumb 
13434], completed? 1
Flip queued on frame 90833, (was ready on frame 0), now 90833
Stall check enabled, 1 prepares
Current scanout address 0x0da59000
New framebuffer address 0x0da59000
MMIO update completed? 1
No flip due on pipe B (plane B)
Flip queued on pipe C (plane C)
Flip queued on render ring at seqno 13434, next seqno 13435 [current breadcrumb 
13434], completed? 1
Flip queued on frame 91314, (was ready on frame 0), now 91314
Stall check enabled, 1 prepares
Current scanout address 0x0da4f000
New framebuffer address 0x0da4f000
MMIO update completed? 1

or

Flip queued on pipe A (plane A)
Flip queued on render ring at seqno 193d8, next seqno 193d9 [current breadcrumb 
193d8], completed? 1
Flip queued on frame 115155, (was ready on frame 0), now 115155
Stall check enabled, 0 prepares
Current scanout address 0x02db0000
New framebuffer address 0x0da59000
MMIO update completed? 0
No flip due on pipe B (plane B)
Flip queued on pipe C (plane C)
Flip not associated with any ring
Flip queued on frame 0, (was ready on frame 0), now 115789
Stall check waiting for page flip ioctl, 0 prepares
Current scanout address 0x02da6000
New framebuffer address 0x00000000
MMIO update completed? 0

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are on the CC list for the bug.
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20151113/a14d579f/attachment-0001.html>


More information about the intel-gfx-bugs mailing list