[Pixman] [PATCH] MIPS: DSPr2: Added over_n_8888_8888_ca and over_n_8888_0565_ca fast paths.
Lukic, Nemanja
nlukic at mips.com
Mon Mar 12 14:20:10 PDT 2012
Hi Soren,
I usually select cairo-perf-trace that utilize optimized fast path the most.
In this case, xfce4-terminal-a1 proved to be that one. I use oprofile to check CPU utilization. Here is oprofile log I got for the xfce4-terminal-a1:
CPU: MIPS 74K, speed 0 MHz (estimated)
Counted CYCLES events (Cycles) with a unit mask of 0x00 (No unit mask) count 40000
samples % image name app name symbol name
2658517 50.3337 no-vmlinux no-vmlinux /no-vmlinux
1216517 23.0323 libpixman-1.so libpixman-1.so pixman_composite_over_n_8888_8888_ca_asm_mips
270995 5.1308 libc-2.11.2.so libc-2.11.2.so memset
165057 3.1250 libm-2.11.2.so libm-2.11.2.so floor
139880 2.6483 libpixman-1.so libpixman-1.so pixman_fill_buff32_mips_dsp
136303 2.5806 libpixman-1.so libpixman-1.so fetch_scanline_a8
61821 1.1705 libc-2.11.2.so libc-2.11.2.so memcpy
...
All other traces don't utilize this fast-path that much (this is what my oprofile runs on the test system showed).
If you know some more suitable trace (or system configuration I need to have, like fonts installed, etc), please let me know, and I'll re-run the benchmarks and update the commit.
Thanks,
Nemanja Lukic
-----Original Message-----
From: Siarhei Siamashka [mailto:siarhei.siamashka at gmail.com]
Sent: Monday, March 12, 2012 10:05 PM
To: Søren Sandmann
Cc: Lukic, Nemanja; pixman at lists.freedesktop.org; nemanja.lukic at rt-rk.com
Subject: Re: [Pixman] [PATCH] MIPS: DSPr2: Added over_n_8888_8888_ca and over_n_8888_0565_ca fast paths.
On Mon, Mar 12, 2012 at 10:48 PM, Søren Sandmann <sandmann at cs.au.dk> wrote:
> Nemanja Lukic <nlukic at mips.com> writes:
>
>> [ # ] backend test min(s) median(s) stddev. count
>> [ # ] image: pixman 0.25.3
>> [ 0] image xfce4-terminal-a1 138.223 139.070 0.33% 6/6
>> [ # ] image16: pixman 0.25.3
>> [ 0] image16 xfce4-terminal-a1 132.763 132.939 0.06% 5/6
>
> I'm curious why you chose this particular benchmark? The main path that
> xfce4-terminal-a1 exercises is over_n_1_8888 and add_1_1. As far as I
> can tell it doesn't actually hit the two fast paths that you added,
> which makes it suspicious where the speed-up is coming from.
I think it may actually depend on what fonts are installed in the
system and I vaguely remember encountering this at least once. If the
suitable bitmap fonts are missing, then the benchmark might fallback
to some other font and exercise different fast paths.
--
Best regards,
Siarhei Siamashka
More information about the Pixman
mailing list