Issue with libhyphen

Bert Frees bertfrees at gmail.com
Mon Mar 24 06:44:31 PDT 2014


Dear László and others,

We think we may have found a bug in libhyphen. It could be that it is just
a limitation of the algorithm, but anyway it's an issue for us.

The problem is that some patterns in a dictionary are ignored in some
cases, namely when the match string of that pattern is a part of the match
string of another pattern, and more specifically when it's not just a
prefix.

Let me clarify that with an example. When a dictionary consists of these
two patterns, the word `foobar' is not hyphenated because the first pattern
is ignored:

oo1b
foob

The second, longer pattern doesn't even have to match, as the second
example shows (the first pattern is still ignored):

oo1b
foobz

I have a patch that solves part of the problem:

https://github.com/bertfrees/libhyphen-nar/blob/adc2b74a19469e4dc93777fcdb82e36e566a0472/src/patches/bug.patch

With this patch the given examples will be handled correctly, but in other
situations it will still fail, such as here:

oo1b
foobaz

Have I indeed found a bug here, and does my patch make sense, or am I just
expecting too much and are we hitting the limits of the algorithm?

Thanks for considering,
Bert Frees
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/libreoffice/attachments/20140324/08446ce5/attachment.html>


More information about the LibreOffice mailing list