[Libreoffice-bugs] [Bug 131487] Words whose characters span multiple languages should not undergo spell checking

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Thu Jun 25 10:01:58 UTC 2020


https://bugs.documentfoundation.org/show_bug.cgi?id=131487

sergio.callegari at gmail.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |UNCONFIRMED
     Ever confirmed|1                           |0

--- Comment #5 from sergio.callegari at gmail.com ---
@Heiko Tietze Rather than introducing new /letters/ to be used as word
separators, I really think that it would be better to introduce a filter
preventing words to undergo spellchecking, based on character/font properties
inconsistencies. This would make both "quell'albero" (all Italian) and
"quell'internship" (Italian + English) be both treated as a single word, but
the latter not to undergo spellcheck, because "quell'"is marked as Italian and
"internship" as English".

In fact, making the apostrophe a separator would mean getting "quell'albero" as
the word "quell" followed by the word "albero", but "quell" is certainly not an
Italian word (and neither is "quell'" alone).

I think that this would also help other languages, where one may have local and
foreign words written with characters running together with no separators at
all.

@Sophie Yes, when using foreign words in Italian it is frequent to have
combinations that require the apostrophe between them. I am no linguistics
expert, but I think that this has at least a couple of reasons: one is that the
 apostrophe indicates an elision which is a phonetic phenomenon, so it is a
case where orthography follows phonetics; secondly Italian belongs to that set
of languages where there is no official body in charge of establishing rules,
so the grammar is determined by how the language is itself actually used.
Practically, Italian has accepted this kind of language intermixing since the
times when it was frequent to find Latin terms in technical or polished talk
and is currently very flexible when it comes to the intermixing of English
terms.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20200625/364061ae/attachment.htm>


More information about the Libreoffice-bugs mailing list