[Pixman] [PATCH 05/12] vmx: implement fast path vmx_composite_copy_area
Oded Gabbay
oded.gabbay at gmail.com
Wed Jul 15 05:33:35 PDT 2015
On Tue, Jul 14, 2015 at 11:41 AM, Siarhei Siamashka
<siarhei.siamashka at gmail.com> wrote:
> It is a good idea to have at least one benchmark result in the commit
> message. Or provide a convincing explanation why this particular code
> is beneficial. The "no changes were observed" commit message does not
> do a good job justifying the need for this patch.
>
> If none of the traces of cairo applications can show any improvements,
> then we can at least use "lowlevel-blt-bench src_8888_8888". I get
> the following results on my Playstation3:
>
> == before ==
>
> src_8888_8888 = L1: 437.68 L2: 277.55 M:159.34 (242.58%)
> HT: 90.31 VT: 50.77 R: 50.67 RT: 17.64 ( 148Kops/s)
>
> == after ==
>
> src_8888_8888 = L1: 850.60 L2: 453.91 M:174.26 (265.31%)
> HT:105.68 VT: 54.17 R: 54.88 RT: 18.72 ( 154Kops/s)
>
>
> Assuming that the commit message gets updated,
> Acked-by: Siarhei Siamashka <siarhei.siamashka at gmail.com>
I went and tested the "lowlevel-blt-bench src_8888_8888" with this
patch and to my surprise, the fast path made the results much worse
(Except from L1):
reference memcpy speed = 25058.0MB/s (6264.5MP/s for 32bpp fills)
Before After Change
L1 6475.97 7430.21 +14.74%
L2 6019.82 4284.12 -28.83%
M 3004.82 2901.99 -3.42%
HT 1666.21 1278.17 -23.29%
VT 1719.05 1481.61 -13.81%
R 759.46 581.2 -23.47%
RT 218.86 181.07 -17.27%
Kops/s 1464 1292 -11.75%
So I thought maybe I will see the improvement only in ppc, so I run it
on POWER7 ppc 64 bit:
reference memcpy speed = 10651.0MB/s (2662.7MP/s for 32bpp fills)
Before After Change
L1 4165.37 4228.5 +1.52%
L2 4337.16 4241.26 -2.21%
M 1678.75 1607.69 -4.23%
HT 886.92 808.52 -8.84%
VT 781.87 742.73 -5.01%
R 483.37 445.87 -7.76%
RT 175.08 165.24 -5.62%
Kops/s 1193 1135 -4.86%
Nope, don't see the difference there as well.
How did you check it on your machine ? Did you check with only
blt+copy area patches, or with other patches applied as well ?
Currently, because both these patches don't show improvement in
lowlevel-blt-bench AND in cairo, I'm willing to drop them from this
patch-set until further investigation.
Oded
More information about the Pixman
mailing list