[Libreoffice-bugs] [Bug 117573] Extend LightProof to Support More Grammar Checkers

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Tue May 29 16:01:54 UTC 2018


https://bugs.documentfoundation.org/show_bug.cgi?id=117573

László Németh <nemeth at numbertext.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |nemeth at numbertext.org

--- Comment #3 from László Németh <nemeth at numbertext.org> ---
@Keith, many thanks for the nice collection of the interesting projects.

Recent Lightproof usage in LibreOffice is based on separated instances of
English, Hungarian, Brazilian-Portuguese and Russian Lightproof modules,
sometimes with language specific Python codes, so using Lightproof as a
prototype is a natural thing, too, also for a new language or for an
improved/alternative version of a language module.

My next plan here is to create a simple API with easy accessible multilingual
data, (for example in module extras/ of LibreOffice source tree) to give a
minimal punctuation and typical unambiguous grammar mistake checker for every
supported languages. (Or, separate Lightproof in a library, as recent
libnumbertext integration: http://www.numbertext.org,
https://bugs.documentfoundation.org/show_bug.cgi?id=117171). This could handle
most of the worst/ugliest/avoided/taboo mistakes, avoiding unintentional false
alarms or https://en.wikipedia.org/wiki/Linguistic_discrimination.

Offering deep learning grammar checkers or extenstions/options is a nice idea.
Also spell checking could be improved this way. My only fear is that there is
no deep learning to learn professional proofreading, because of combination of
incomplete/inaccurate training data and often incredible complex rules.
[A funny story: I just consulted several orthography books and grammar teachers
to fix the bad evaluation of the elementary level dictation of my son. The
teacher and me (as the author of the Hungarian spelling dictionary) couldn't
recognize the applicable case of orthography of special geographical proper
names at once. In fact, the problem was here the uncommon text of the
dictation, chosen by the untrained teacher.] For deep learning, we must select
the working, and skip the not working automation, and unfortunately, this is
not an easy task (see http://libreoffice.hu/grammar-checking-in-libreoffice/).

But you are right, the recent English grammar checker has got very limited
features. Maybe the fastest method to improve English grammar checking is to
use optionally an online API, like http://www.afterthedeadline.com/api.slp or
other freely available services.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20180529/7501de57/attachment.html>


More information about the Libreoffice-bugs mailing list