[poppler] KMP for search
Johannes Buchner
buchner.johannes at gmx.at
Tue Jun 15 22:48:39 PDT 2010
Hi
On Tue, 15 Jun 2010 19:05:25 +0100
Albert Astals Cid <aacid at kde.org> wrote:
> > I implemented the KMP search algorithm in [1] but I feel it doesn't
> > get attention. Please apply [2] and test it.
>
> You sent a mail yesterday and already complain you are being ignored?
No, I added the patch the day before, and since only 3 adresses are
notified about it, I thought it would be worth mentioning on the
mailing list aswell.
> > [1] https://bugs.freedesktop.org/show_bug.cgi?id=8821
> > [2] https://bugs.freedesktop.org/attachment.cgi?id=36255
> >
> > To be honest, the gain is small;
>
> If the gain is small i'd prefer to stay with our tested code than to
> switch to something totally new.
I understand where you're coming from; I've seen a improvement of factor
3 for certain cases, and I've never seen a lower speed compared to the
naive implementation. It shows that the factors of O(n*m) are comparable
to the factors of the new O(n). So it has a gain.
I tested the KMP code with several instances before integrating it into
poppler, and afterwards using the glib-demo program. The code is not
"totally new" in the sense that KMP is old, and I kept to
implementations I found on the web. The change [1] is small (~10
lines changed plus two new functions), although the additional
indentation level makes it look bigger.
https://bugs.freedesktop.org/review?bug=8821&attachment=36292
So, yes, I think it should go in.
> > I find it odd that the search
> > function is page-based and runs per block (sorry, I'm new to
> > poppler). It strikes me that no wrapped words can ever be found.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: not available
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20100616/0ec58f4d/attachment.pgp>
More information about the poppler
mailing list