[Pixman] [PATCH] Add support for aarch64 neon optimization

Mizuki Asakura ed6e117f at gmail.com
Wed Mar 30 11:47:01 UTC 2016


Since aarch64 has different neon syntax from aarch32 and has no
support of (older) arm-simd,
there are no SIMD accelerations for pixman on aarch64.

We need new implementations.


Added: https://bugs.freedesktop.org/show_bug.cgi?id=94758


The patch is too large (yes, 10k lines of asm are added), I cannot
attach the patch to this mail.
Please find proposed patch from the above bug ticket, or just check:
https://bugs.freedesktop.org/attachment.cgi?id=122634


Benchmark results are attached above ticket.
Typical result is:

normal:
            src_n_8_x888 =  L1:  38.33  L2:  40.58  M: 39.91 ( 11.87%)
 HT: 31.31  VT: 30.42  R: 29.14  RT: 18.14 ( 171Kops/s)
            src_n_8_8888 =  L1:  38.37  L2:  40.61  M: 39.92 ( 11.87%)
 HT: 31.30  VT: 30.41  R: 29.14  RT: 18.11 ( 171Kops/s)

neon:
            src_n_8_x888 =  L1: 344.76  L2: 348.59  M:275.93 ( 80.42%)
 HT:116.32  VT:109.72  R: 92.61  RT: 40.25 ( 348Kops/s)
            src_n_8_8888 =  L1: 346.17  L2: 348.63  M:276.15 ( 80.48%)
 HT:116.43  VT:109.72  R: 92.48  RT: 40.28 ( 348Kops/s)


I've only tested the code with Qualcomm DragonBoard 410c (Cortex-A53 *
4, 1.2GHz).
Can anyone test it on other aarch64 platform ?

Signed-off-by: Mizuki Asakura <ed6e117f at gmail.com>


More information about the Pixman mailing list