[Intel-gfx] Incorrect plane programming sequence results into a corrupted display/hard hung system

Runyan, Arthur J arthur.j.runyan at intel.com
Thu Apr 12 19:28:48 UTC 2018


This seems like a typical atomic modeset requirement.

IPC should not impact register programming.
Vblank evasion only works if you have a guarantee on worst case interrupts/delays.  I think locks is part of the guarantee.
Double buffer control should guarantee safe alignment of programming across planes on the same pipe.  Multiple pipes will still require a wait for vblank.

From: Vyas, Tarun
Sent: Thursday, 12 April, 2018 9:57 AM
To: intel-gfx at lists.freedesktop.org
Cc: Runyan, Arthur J <arthur.j.runyan at intel.com>; Shaikh, Azhar <azhar.shaikh at intel.com>; Herbert, Marc <marc.herbert at intel.com>; Ciobanu, Nathan D <nathan.d.ciobanu at intel.com>; Lankhorst, Maarten <maarten.lankhorst at intel.com>
Subject: Incorrect plane programming sequence results into a corrupted display/hard hung system

On KBL platforms, with HW overlay and/or PSR2, a hard hang with corrupted display is observed while running tests that frequently disable/re-enable primary/overlay planes. Details recorded in this FDO bug: https://bugs.freedesktop.org/show_bug.cgi?id=104975.

The issue has been root caused as a race where only partial register updates get latched on the next vblank, specifically, the updates that give the buffer allocation of the current plane before disabling it (again, details captured in the FDO bug above). There have been several optimizations to work around this bug:

1.       Enable Isochronous priority control (IPC)

2.       Increase the vblank evasion time to 250 usec (We have tried 500 usec but that doesn't helps).

3.       Disable DOUBLE_BUFFER_CTL while the updates are done, inside intel_pipe_update_start and intel_pipe_update_end (doesn't helps)

4.       Grab all the required locks before starting the pipe_update.

Per Ville, none of the above optimizations guarantee a *full* update before the vblank. As a result, to fix this issue the right way, the plane programming sequence needs to be altered in the driver as mentioned below:

"Buffer allocation overlap among enabled planes will cause a full frame underrun, and that becomes a hard hange if pkgC or SAGV are enabled.
You need to make sure the plane is disabled before reallocating the buffer it uses.  For a single pipe it is sufficient to initiate the disabling of the plane before the reallocation.  For multiple pipes it can be more complex.
In this case you should be doing something like this to ensure plane 2A turns off before plane 1A steals the buffer


1.         PLANE_CTL_2A -> disabled

2.       PLANE_SURF_2A:  touch to arm double buffer update

3.       PLANE_BUF_CFG_1A -> (0-860)

4.       PLANE_SURF_1A: touch to arm double buffer update
If the planes are on different pipes there needs to be a wait for vblank between step 2 and 3 to ensure the plane 2A disable completed."


Please comment as required.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx/attachments/20180412/63d367a5/attachment.html>


More information about the Intel-gfx mailing list