[PATCH 2/2] drm/vkms: Use a simpler composition function

Pekka Paalanen pekka.paalanen at haloniitty.fi
Fri Feb 2 10:24:29 UTC 2024


On Fri, 2 Feb 2024 10:26:01 +0100
Miquel Raynal <miquel.raynal at bootlin.com> wrote:

> Hi Pekka,
> 
> pekka.paalanen at haloniitty.fi wrote on Fri, 2 Feb 2024 10:55:22 +0200:
> 
> > On Thu, 01 Feb 2024 18:31:32 +0100
> > Louis Chauvet <louis.chauvet at bootlin.com> wrote:
> >   
> > > Change the composition algorithm to iterate over pixels instead of lines.
> > > It allows a simpler management of rotation and pixel access for complex formats.
> > > 
> > > This new algorithm allows read_pixel function to have access to x/y
> > > coordinates and make it possible to read the correct thing in a block
> > > when block_w and block_h are not 1.
> > > The iteration pixel-by-pixel in the same method also allows a simpler
> > > management of rotation with drm_rect_* helpers. This way it's not needed
> > > anymore to have misterious switch-case distributed in multiple places.    
> > 
> > Hi,
> > 
> > there was a very good reason to write this code using lines:
> > performance. Before lines, it was indeed operating on individual pixels.
> > 
> > Please, include performance measurements before and after this series
> > to quantify the impact on the previously already supported pixel
> > formats, particularly the 32-bit-per-pixel RGB variants.
> > 
> > VKMS will be used more and more in CI for userspace projects, and
> > performance actually matters there.
> > 
> > I'm worrying that this performance degradation here is significant. I
> > believe it is possible to keep blending with lines, if you add new line
> > getters for reading from rotated, sub-sampled etc. images. That way you
> > don't have to regress the most common formats' performance.  
> 
> While I understand performance is important and should be taken into
> account seriously, I cannot understand how broken testing could be
> considered better. Fast but inaccurate will always be significantly
> less attractive to my eyes.
> 
> I am in favor of making this working first, and then improving the code
> for faster results. Maybe the line-driven approach can be dedicated to
> "simpler" formats where more complex corner cases do not happen. But
> for now I don't see the point in comparing performances between broken
> and (hopefully) non broken implementations.

Hi,

usually that is a good idea, but in this case you are changing the
fundamental design of the algorithm. That makes it easy to add new
things (e.g. YUV support) that will be even harder to make faster
later. If you then later improve performance, that is another
re-design. You would be making maintainers review two rewrites instead
of one or less.

I suspect it might be less effort for the author as well to not ditch
the line-based based algorithm as the first step.


Thanks,
pq
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20240202/b0c96218/attachment.sig>


More information about the dri-devel mailing list