[Pixman] [PATCH] Add support for aarch64 neon optimization
Mizuki Asakura
ed6e117f at gmail.com
Wed Mar 30 11:47:01 UTC 2016
Since aarch64 has different neon syntax from aarch32 and has no
support of (older) arm-simd,
there are no SIMD accelerations for pixman on aarch64.
We need new implementations.
Added: https://bugs.freedesktop.org/show_bug.cgi?id=94758
The patch is too large (yes, 10k lines of asm are added), I cannot
attach the patch to this mail.
Please find proposed patch from the above bug ticket, or just check:
https://bugs.freedesktop.org/attachment.cgi?id=122634
Benchmark results are attached above ticket.
Typical result is:
normal:
src_n_8_x888 = L1: 38.33 L2: 40.58 M: 39.91 ( 11.87%)
HT: 31.31 VT: 30.42 R: 29.14 RT: 18.14 ( 171Kops/s)
src_n_8_8888 = L1: 38.37 L2: 40.61 M: 39.92 ( 11.87%)
HT: 31.30 VT: 30.41 R: 29.14 RT: 18.11 ( 171Kops/s)
neon:
src_n_8_x888 = L1: 344.76 L2: 348.59 M:275.93 ( 80.42%)
HT:116.32 VT:109.72 R: 92.61 RT: 40.25 ( 348Kops/s)
src_n_8_8888 = L1: 346.17 L2: 348.63 M:276.15 ( 80.48%)
HT:116.43 VT:109.72 R: 92.48 RT: 40.28 ( 348Kops/s)
I've only tested the code with Qualcomm DragonBoard 410c (Cortex-A53 *
4, 1.2GHz).
Can anyone test it on other aarch64 platform ?
Signed-off-by: Mizuki Asakura <ed6e117f at gmail.com>
More information about the Pixman
mailing list