[Libreoffice-bugs] [Bug 131557] New: Erroneous word count (for French at least)
bugzilla-daemon at bugs.documentfoundation.org
bugzilla-daemon at bugs.documentfoundation.org
Wed Mar 25 08:20:49 UTC 2020
https://bugs.documentfoundation.org/show_bug.cgi?id=131557
Bug ID: 131557
Summary: Erroneous word count (for French at least)
Product: LibreOffice
Version: unspecified
Hardware: All
OS: All
Status: UNCONFIRMED
Severity: normal
Priority: medium
Component: Writer
Assignee: libreoffice-bugs at lists.freedesktop.org
Reporter: phdebar at protonmail.com
Word count is wrong for French language (at least), it counts many more words
than there are. I gather that Writer counts words by counting runs of
whitespace separating them.
French typographic rules (and, hence, normal use) call for certain common
punctuation (most notably quote marks, semicolon, colon, interrogation and
exclamation marks, dashes) to be separated from words by white space. This
wrongly inflates the word count.
Such punctuation should not be counted as words.
So, for French language at least, a count of punctuation marks surrounded by
white space should be substracted from the actual word count. (Adding theses
punctuation marks as white space to the counting regex (I guess?) would also
mess word count with gender neutral writing, with words such as
"développeur·se", "développeur(se)", "développeur/se" or ""développeu-se".)
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20200325/8c698cd7/attachment.htm>
More information about the Libreoffice-bugs
mailing list