[Intel-gfx] [PATCH 3/3] drm/i915: Improve the accuracy of get_scanout_pos on CTG+
Ville Syrjälä
ville.syrjala at linux.intel.com
Wed Sep 25 10:11:30 CEST 2013
On Wed, Sep 25, 2013 at 05:12:54AM +0200, Mario Kleiner wrote:
>
>
> On 23.09.13 12:02, ville.syrjala at linux.intel.com wrote:
> > From: Ville Syrjälä <ville.syrjala at linux.intel.com>
> >
> > The DSL register increments at the start of horizontal sync, so it
> > manages to miss the entire active portion of the current line.
> >
> > Improve the get_scanoutpos accuracy a bit when the scanout position is
> > close to the start or end of vblank. We can do that by double checking
> > the DSL value against the vblank status bit from ISR.
> >
> > Cc: Mario Kleiner <mario.kleiner at tuebingen.mpg.de>
> > Signed-off-by: Ville Syrjälä <ville.syrjala at linux.intel.com>
> > ---
> > drivers/gpu/drm/i915/i915_irq.c | 53 +++++++++++++++++++++++++++++++++++++++++
> > 1 file changed, 53 insertions(+)
> >
> > diff --git a/drivers/gpu/drm/i915/i915_irq.c b/drivers/gpu/drm/i915/i915_irq.c
> > index 4f74f0c..14b42d9 100644
> > --- a/drivers/gpu/drm/i915/i915_irq.c
> > +++ b/drivers/gpu/drm/i915/i915_irq.c
> > @@ -567,6 +567,47 @@ static u32 gm45_get_vblank_counter(struct drm_device *dev, int pipe)
> > return I915_READ(reg);
> > }
> >
> > +static bool g4x_pipe_in_vblank(struct drm_device *dev, enum pipe pipe)
> > +{
> > + struct drm_i915_private *dev_priv = dev->dev_private;
> > + uint32_t status;
> > +
> > + if (IS_VALLEYVIEW(dev)) {
> > + status = pipe == PIPE_A ?
> > + I915_DISPLAY_PIPE_A_VBLANK_INTERRUPT :
> > + I915_DISPLAY_PIPE_B_VBLANK_INTERRUPT;
> > +
> > + return I915_READ(VLV_ISR) & status;
> > + } else if (IS_G4X(dev)) {
> > + status = pipe == PIPE_A ?
> > + I915_DISPLAY_PIPE_A_VBLANK_INTERRUPT :
> > + I915_DISPLAY_PIPE_B_VBLANK_INTERRUPT;
> > +
> > + return I915_READ(ISR) & status;
> > + } else if (INTEL_INFO(dev)->gen < 7) {
> > + status = pipe == PIPE_A ?
> > + DE_PIPEA_VBLANK :
> > + DE_PIPEB_VBLANK;
> > +
> > + return I915_READ(DEISR) & status;
> > + } else {
> > + switch (pipe) {
> > + default:
> > + case PIPE_A:
> > + status = DE_PIPEA_VBLANK_IVB;
> > + break;
> > + case PIPE_B:
> > + status = DE_PIPEB_VBLANK_IVB;
> > + break;
> > + case PIPE_C:
> > + status = DE_PIPEC_VBLANK_IVB;
> > + break;
> > + }
> > +
> > + return I915_READ(DEISR) & status;
> > + }
> > +}
> > +
> > static int i915_get_crtc_scanoutpos(struct drm_device *dev, int pipe,
> > int *vpos, int *hpos)
> > {
> > @@ -616,6 +657,18 @@ static int i915_get_crtc_scanoutpos(struct drm_device *dev, int pipe,
> > * scanout position from Display scan line register.
> > */
> > position = I915_READ(PIPEDSL(pipe)) & 0x1fff;
> > +
> > + /*
> > + * The scanline counter increments at the leading edge
> > + * of hsync, ie. it completely misses the active portion
> > + * of the line. Fix up the counter at both edges of vblank
> > + * to get a more accurate picture whether we're in vblank
> > + * or not.
> > + */
> > + in_vbl = g4x_pipe_in_vblank(dev, pipe);
> > + if ((in_vbl && position == vbl_start - 1) ||
> > + (!in_vbl && position == vbl_end - 1))
> > + position = (position + 1) % vtotal;
> > } else {
> > /* Have access to pixelcount since start of frame.
> > * We can split this into vertical and horizontal
> >
>
> This one i don't know. I think i can't follow the logic, but i don't
> know enough about the way the intel hw counts.
>
> Do you mean the counter increments when the scanline is over, instead of
> when it begins?
Let me draw a picture of the scanline (not to scale):
|XXXXXXXXXXXXX|-----|___________|---|
horiz. active horiz. sync
^ ^
| |
first pixel this is where the
of the line scanline counter increments
> With this correction by +1 at the edges of vblank, the scanlines at
> vbl_start and vbl_end would be reported twice, for two successive
> scanline durations, that seems a bit weird and asymmetric to the rest of
> the scanline positions. Wouldn't it make more sense to simply always add
> 1 for a smaller overall error, given that hblank is shorter than the
> active scanout part of a scanline?
Since the counter increments too late, drm_handle_vblank()
may get the wrong idea ie. something like this may happen:
1. vblank irq triggered
2. drm_handle_vblank() gets called
3. i915_get_crtc_scanoutpos() returns vbl_start-1 as the scanline
4. delta_ns calculation gets confused and tries to correct for it
Now, the correction you do for delta_ns should handle this, but
I don't like having such kludges in common code, and we can handle
it in the driver as I've demonstrated. But yeah, I suppose it can
make the error slightly less stable.
For some other uses (atomic page flip stuff) of the scanline position,
I definitely want this correction since I need accurate information
whether the position has passed vblank start.
> Also it adds back one lock protected, therefore potentially slow,
> register read into the time critical code.
I don't think a single register read should be _that_ slow even
with all the extra junk we do. And of course we can fix that problem.
--
Ville Syrjälä
Intel OTC
More information about the Intel-gfx
mailing list