[PATCH weston 1/1] compositor: Abort on bad page flip timestamps

Mario Kleiner mario.kleiner.de at gmail.com
Thu Nov 6 22:04:16 PST 2014


On 06/11/14 17:42, Pekka Paalanen wrote:
> On Thu, 06 Nov 2014 16:08:56 +0900
> Michel Dänzer <michel at daenzer.net> wrote:
>
>> On 06.11.2014 03:06, Frederic Plourde wrote:
>>> Many features, like animations, hardly depend on page flip timestamps
>>> to work properly, but some DRM drivers do not correctly support page flip
>>> timestamps (or not at all) and in that case, things start to go wrong.
>>>
>>> This patch adds sanity check to weston_output_finish_frame. By solely
>>> verifying that page flip timestamps are monotonically increasing, we
>>> make sure that :
>>>
>>> 1) Underlying driver is not throwing zeroed-out timestamp series at us.
>>> 2) We have not mistakenly jumped backwards because of integer overflow.
>>>
>>> If a pathological case is detected, we gracefully exit Weston
>>> with an appropriate exit code to help developers debug their drivers.
>>
>> That seems a bit harsh. IIRC, zero can be returned for the timestamp
>> intermittently if no accurate timestamp value can be determined, e.g.
>> because the CRTC is disabled. At the very least, I'd recommend
>> double-checking this with Mario Kleiner (Cc'd) and the dri-devel mailing
>> list.
>
> Can that really happen if we are not doing stupid things like
> attempting to flip on a disabled crtc or output?
>

I don't think it could happen for non-stupid regular use and would 
indicate a driver bug.

> Or can it happen, if we schedule a flip, then disable the crtc
> before the flip completes? Or maybe when an output is hot-unplugged?
>

On kernels <= 3.17 disabling vblank irqs will clear the cached timestamp 
to zero, so a waitvblank ioctl() for a pure query of msc/ust could 
return zero as a signal of "invalid/undefined timestamp" on some drivers 
under some circumstances. Basically a -EAGAIN error code.

But that shouldn't ever happen for kms-pageflip completion timestamps, 
because vblank irqs don't get disabled while a pageflip is pending, as 
the pending pageflip keeps the vblank reference count > 0.

There are also some new patches into Linux 3.18rc which should prevent 
returning zero timestamps even for pure waitvblank ioctl() queries if 
vblank irqs get disabled for one reason or the other, cfe.

http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/drivers/gpu/drm/drm_irq.c?id=844b03f27739135fe1fed2fef06da0ffc4c7a081

kms drivers should usually use the drm_send_vblank_event() helper in 
their pageflip completion routine. That helper will ideally send out the 
cached vblank count and high precision timestamps which were computed 
during the most recent vblank interrupt for the crtc that pageflipped. 
There's also some fallback there which, if no crtc is specified (crtc == 
-1), will simply return a msc of zero and the current system time as 
timestamp. The fallback is for gpus with weird flip completion 
behaviour. Currently nouveau uses it for old < NV-50 gpu's where we 
couldn't find a better way to make pageflip completion behave 
properly/reliably due to some hardware weirdness.

> Is zero a special timestamp that simply cannot be produced during
> normal operations, like due to clock wrap-around?
>

The timestamps are CLOCK_MONOTONIC uint32 seconds.microseconds since 
bootup, so a wraparound would ony happen after 2^32 seconds or 136 
years, so normal operation shouldn't cause a zero timestamp.

So i think observing zero pageflip timestamps would be a sign that the 
kms-driver needs fixing.

-mario

>
> Thanks,
> pq
>



More information about the dri-devel mailing list