[Pixman] [PATCH 0/4] ARM: NEON: Pipelined bilinear scanline functions
Taekyun Kim
podain77 at gmail.com
Thu Sep 22 08:16:23 PDT 2011
I update this in my branch
http://cgit.freedesktop.org/~podain/pixman/?h=neon_bilinear
<http://cgit.freedesktop.org/%7Epodain/pixman/?h=neon_bilinear>
Fixed cache preload of src scanlines to work correctly and added cache
preload
for mask and dst scanlines. Some mistakes in comments are corrected.
Thanks,
Taekyun Kim
On 09/21/2011 05:38 PM, Taekyun Kim wrote:
> From: Taekyun Kim<tkq.kim at samsung.com>
>
> Hi, all
>
> Bilinear functions in pixman-arm-neon-asm-bilinear.S have lots of room
> for improvement. With some clean-up to bring tail/head scheme for them
> and did instruction scheduling for mostly used over_8888_8888. Passed
> make check.
>
> I'm not sure that the scheduling is done nicely, but at least I've got
> speed up on both cortex-a8 and a9 devices.
>
> Performance of before/after on cortex-a8 @ 1GHz
>
> << 2000 x 2000 with scale factor close to 1.x>>
> before : 39.71 Mpix/s
> after : 60.39 Mpix/s
>
> Performance of before/after on cortex-a9 @ 1.2GHz
>
> << 2000 x 2000 with scale factor close to 1.x>>
> before : 43.31 Mpix/s
> after : 65.83 Mpix/s
>
> I will do optimization of other functions too based on the perfstat
> results of popular cairo traces.
>
> --
> Best Regards,
> Taekyun Kim
>
> Taekyun Kim (4):
> ARM: NEON: Some cleanup of bilinear scanline functions
> ARM: NEON: Bilinear macro template for instruction scheduling
> ARM: NEON: Replace old bilinear scanline generator with new template
> ARM: NEON: Instruction scheduling of bilinear over_8888_8888
>
> pixman/pixman-arm-neon-asm-bilinear.S | 766 ++++++++++++++++++++++++++-------
> 1 files changed, 605 insertions(+), 161 deletions(-)
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/pixman/attachments/20110923/500c6676/attachment.htm>
More information about the Pixman
mailing list