[Pixman] [PATCH 0/5] ARM: NEON: Pipelined scaled bilinear functions
Taekyun Kim
podain77 at gmail.com
Tue Oct 11 06:24:42 PDT 2011
From: Taekyun Kim <tkq.kim at samsung.com>
Hi, all
Current bilinear NEON codes in pixman-arm-neon-asm-bilinear.S have
still lots of room for improvement. head/tail_head/tail scheme is
applied and I wrote pipelined over_8888_8888 and over_8888_8_8888
bilinear functions. Passed make check and got some speed up. Some
portion of speed up came from adding missed cache prefetch in the
first patch.
Measured on cortex-a8 running @ 1.0GHz
- over_8888_8888 -
before : 35.73 Mpix/s
after : 61.81 Mpix/s
- over_8888_8_8888 -
before : 30.57 Mpix/s
after : 51.79 Mpix/s
--
Best Regards,
Taekyun Kim
Taekyun Kim (5):
ARM: NEON: Some cleanup of bilinear scanline functions
ARM: NEON: Bilinear macro template for instruction scheduling
ARM: NEON: Replace old bilinear scanline generator with new template
ARM: NEON: Instruction scheduling of bilinear over_8888_8888
ARM: NEON: Instruction scheduling of bilinear over_8888_8_8888
pixman/pixman-arm-neon-asm-bilinear.S | 922 +++++++++++++++++++++++++++------
1 files changed, 760 insertions(+), 162 deletions(-)
More information about the Pixman
mailing list