libexttextcat data garbled in Hungarian

Mark Robson markxr at
Fri Oct 25 13:10:58 CEST 2013


The data files for libexttextcat in this directory:

contain a garbled Hungarian version. It is almost ISO-8859-1, but some
characters are destroyed because that encoding doesn't contain all
Hungarian letters.
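
To illustrate which letters are affected, here is a minimal Python sketch
that lists the characters ISO-8859-1 cannot encode. The helper name
not_in_latin1 and the HUNGARIAN_LETTERS string are made up for this
example; neither is part of libexttextcat.

```python
# Accented letters of the Hungarian alphabet (lower and upper case).
# Assumption: this string is only an illustration, not libexttextcat data.
HUNGARIAN_LETTERS = "áéíóöőúüűÁÉÍÓÖŐÚÜŰ"


def not_in_latin1(text):
    """Return the characters of text that ISO-8859-1 cannot encode."""
    missing = []
    for ch in text:
        try:
            ch.encode("iso-8859-1")
        except UnicodeEncodeError:
            # No code point for this character in ISO-8859-1.
            missing.append(ch)
    return "".join(missing)
```

Calling not_in_latin1(HUNGARIAN_LETTERS) returns the double-acute letters
ő, ű, Ő and Ű, which are the ones a conversion to ISO-8859-1 would lose.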

It is easy to pick up a good UTF-8 version from

and see the difference.

It's not clear whether this prevents libexttextcat from classifying
Hungarian text correctly, but it may stop detection working for UTF-8
input, because most of the other files are in UTF-8.
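
One way to find data files that are not in UTF-8 is to try decoding each
one strictly. This is only a sketch: is_valid_utf8 and non_utf8_files are
hypothetical helpers, and the paths passed in are whatever files you want
to check. Note that a pure-ASCII file also passes, since ASCII is a subset
of UTF-8.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def non_utf8_files(paths):
    """Return the subset of paths whose contents are not valid UTF-8."""
    bad = []
    for path in paths:
        # Read raw bytes so no implicit decoding happens on open.
        with open(path, "rb") as handle:
            if not is_valid_utf8(handle.read()):
                bad.append(path)
    return bad
```

A file holding ISO-8859-1 bytes for accented letters will normally fail
this check, because a lone byte such as 0xE9 is not a valid UTF-8 sequence.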



More information about the LibreOffice mailing list