[Intel-gfx] [PATCH v2 1/3] drm/i915: Skip register reads in i915_get_crtc_scanoutpos()
Mario Kleiner
mario.kleiner.de at gmail.com
Thu Sep 26 18:12:06 CEST 2013
On 25.09.13 10:14, Ville Syrjälä wrote:
> On Wed, Sep 25, 2013 at 04:35:56AM +0200, Mario Kleiner wrote:
>> On 23.09.13 13:48, ville.syrjala at linux.intel.com wrote:
>>> From: Ville Syrjälä <ville.syrjala at linux.intel.com>
>>>
>>> We have all the information we need in the mode structure, so going and
>>> reading it from the hardware is pointless, and slower.
>>>
>>> We never populated ->get_vblank_timestamp() in the UMS case, and as that
>>> is the only way we'd ever call ->get_scanout_position(), we can
>>> completely ignore UMS in i915_get_crtc_scanoutpos().
>>>
>>> Also reorganize intel_irq_init() a bit to clarify the KMS vs. UMS
>>> situation.
>>>
>>> v2: Drop UMS code
>>>
>>> Cc: Mario Kleiner <mario.kleiner at tuebingen.mpg.de>
>>> Signed-off-by: Ville Syrjälä <ville.syrjala at linux.intel.com>
>>> ---
>>> drivers/gpu/drm/i915/i915_irq.c | 43 ++++++++++++++++-------------------------
>>> 1 file changed, 17 insertions(+), 26 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/i915_irq.c b/drivers/gpu/drm/i915/i915_irq.c
>>> index b356dc1..058f099 100644
>>> --- a/drivers/gpu/drm/i915/i915_irq.c
>>> +++ b/drivers/gpu/drm/i915/i915_irq.c
>>> @@ -570,24 +570,29 @@ static u32 gm45_get_vblank_counter(struct drm_device *dev, int pipe)
>>> static int i915_get_crtc_scanoutpos(struct drm_device *dev, int pipe,
>>> int *vpos, int *hpos)
>>> {
>>> - drm_i915_private_t *dev_priv = (drm_i915_private_t *) dev->dev_private;
>>> - u32 vbl = 0, position = 0;
>>> + struct drm_i915_private *dev_priv = dev->dev_private;
>>> + struct drm_crtc *crtc = dev_priv->pipe_to_crtc_mapping[pipe];
>>> + struct intel_crtc *intel_crtc = to_intel_crtc(crtc);
>>> + const struct drm_display_mode *mode = &intel_crtc->config.adjusted_mode;
>>> + u32 position;
>>> int vbl_start, vbl_end, htotal, vtotal;
>>> bool in_vbl = true;
>>> int ret = 0;
>>> - enum transcoder cpu_transcoder = intel_pipe_to_cpu_transcoder(dev_priv,
>>> - pipe);
>>>
>>> - if (!i915_pipe_enabled(dev, pipe)) {
>>> + if (!intel_crtc->active) {
>>> DRM_DEBUG_DRIVER("trying to get scanoutpos for disabled "
>>> "pipe %c\n", pipe_name(pipe));
>>> return 0;
>>> }
>>>
>>> - /* Get vtotal. */
>>> - vtotal = 1 + ((I915_READ(VTOTAL(cpu_transcoder)) >> 16) & 0x1fff);
>>> + htotal = mode->crtc_htotal;
>>> + vtotal = mode->crtc_vtotal;
>>> + vbl_start = mode->crtc_vblank_start;
>>> + vbl_end = mode->crtc_vblank_end;
>>>
>>> - if (INTEL_INFO(dev)->gen >= 4) {
>>> + ret |= DRM_SCANOUTPOS_VALID | DRM_SCANOUTPOS_ACCURATE;
>>> +
>>> + if (IS_G4X(dev) || INTEL_INFO(dev)->gen >= 5) {
>>> /* No obvious pixelcount register. Only query vertical
>>> * scanout position from Display scan line register.
>>> */
>>> @@ -605,29 +610,16 @@ static int i915_get_crtc_scanoutpos(struct drm_device *dev, int pipe,
>>> */
>>> position = (I915_READ(PIPEFRAMEPIXEL(pipe)) & PIPE_PIXEL_MASK) >> PIPE_PIXEL_SHIFT;
>>>
>>> - htotal = 1 + ((I915_READ(HTOTAL(cpu_transcoder)) >> 16) & 0x1fff);
>>> *vpos = position / htotal;
>>> *hpos = position - (*vpos * htotal);
>>> }
>>>
>>> - /* Query vblank area. */
>>> - vbl = I915_READ(VBLANK(cpu_transcoder));
>>> -
>>> - /* Test position against vblank region. */
>>> - vbl_start = vbl & 0x1fff;
>>> - vbl_end = (vbl >> 16) & 0x1fff;
>>> -
>>> - if ((*vpos < vbl_start) || (*vpos > vbl_end))
>>> - in_vbl = false;
>>> + in_vbl = *vpos >= vbl_start && *vpos < vbl_end;
>>
>> I think this should be a <= instead of < in *vpos < vbl_end, if it is
>> meant to be equal to the line it replaces (not > is <=), unless the
>> original comparison was off-by-one?
>
> Yeah, I think the original was wrong, in more ways than one. It forgot
> to add +1 to vbl_start/end, and then it did the comparison wrong as
> well.
>
Ah ok, that's possible. Then you have my blessing :).
On the Intel side i only had and have sporadic access to an old Intel
GMA-950 (Gen-3?) when writing that function, so i could only really test
one half of the code-path in that function. Also that card only has a
VGA output, which limits my actual measurements to use of a photo-diode
attached to a CRT monitor. That means i can only verify the accuracy of
timestamping down to about 0.2 msecs variability and 0.5 msecs bias due
to the limitations/noise of the measurement setup (depending how close i
get the photo-diode to the corner of the monitor, how dark it is, etc.).
So i know that the jitter in the timestamps is very low, less than 1
usec standard deviation iirc, and that the absolute error wrt. reality
is lower than 0.2 msecs, but i wouldn't be able to detect absolute
errors of a few scanlines.
-mario
>>
>> > + in_vbl = *vpos >= vbl_start && *vpos <= vbl_end;
>>
>> Other than that, it looks good.
>>
>> Reviewed-by: mario.kleiner.de at gmail.com
>>
>>>
>>> /* Inside "upper part" of vblank area? Apply corrective offset: */
>>> if (in_vbl && (*vpos >= vbl_start))
>>> *vpos = *vpos - vtotal;
>>>
>>> - /* Readouts valid? */
>>> - if (vbl > 0)
>>> - ret |= DRM_SCANOUTPOS_VALID | DRM_SCANOUTPOS_ACCURATE;
>>> -
>>> /* In vblank? */
>>> if (in_vbl)
>>> ret |= DRM_SCANOUTPOS_INVBL;
>>> @@ -3148,11 +3140,10 @@ void intel_irq_init(struct drm_device *dev)
>>> dev->driver->get_vblank_counter = gm45_get_vblank_counter;
>>> }
>>>
>>> - if (drm_core_check_feature(dev, DRIVER_MODESET))
>>> + if (drm_core_check_feature(dev, DRIVER_MODESET)) {
>>> dev->driver->get_vblank_timestamp = i915_get_vblank_timestamp;
>>> - else
>>> - dev->driver->get_vblank_timestamp = NULL;
>>> - dev->driver->get_scanout_position = i915_get_crtc_scanoutpos;
>>> + dev->driver->get_scanout_position = i915_get_crtc_scanoutpos;
>>> + }
>>>
>>> if (IS_VALLEYVIEW(dev)) {
>>> dev->driver->irq_handler = valleyview_irq_handler;
>>>
>
More information about the Intel-gfx
mailing list