[Libreoffice-bugs] [Bug 71877] Word Count Wrong for ZWSP delimited text in SEA langauges ( Thai, Lao, Khmer, and Burmese)
bugzilla-daemon at bugs.documentfoundation.org
bugzilla-daemon at bugs.documentfoundation.org
Wed Oct 25 10:48:57 UTC 2017
https://bugs.documentfoundation.org/show_bug.cgi?id=71877
--- Comment #12 from Robert M Campbell <robert.rcampbell at gmail.com> ---
Sorry, life, travels, and ever expanding projects seem to eat up time. I've
just now reviewed this bug (tested with 5.4.2.2 (x64)) and...
Still works in the same manner as previous (so still not providing correct word
counts).
Basically, without any zero-width-spaces, the word counts seem spot on. It's
just when working with text that has zero-width-spaces (ZWSP).
I'm not exactly sure where this happens in the code. My programming skills in
the no web sphere is not super high, but I am willing to look into it, if
someone can kind of guide me where I should start looking.
What I don't know, and this my play a major factor in things, is if all users
use zero width spaces to delimit words (in the case of Thai, Lao, Khmer - this
seems to be the case, but I'm not a linguist/language expert, though I can read
at varying levels in each listed language). It may be that sometimes users may
insert ZWSPs specifically for cases where in English we'd use a hyphen to do
the same (line breaking).
Anyways, point me where I can help, and I'm glad to do what I can.
Thanks!
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20171025/65f2fa7e/attachment.html>
More information about the Libreoffice-bugs
mailing list