[Libreoffice-bugs] [Bug 95159] Thai word wrapping of กลับ and การกระทำ
bugzilla-daemon at bugs.documentfoundation.org
Fri Dec 4 12:12:06 UTC 2020
https://bugs.documentfoundation.org/show_bug.cgi?id=95159
Samphan Raruenrom <untsamphan at gmail.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Severity|minor |trivial
--- Comment #8 from Samphan Raruenrom <untsamphan at gmail.com> ---
LibreOffice relies on ICU to break Thai words. ICU uses a greedy,
dictionary-based longest-matching algorithm for Thai word segmentation, so it
stops at the first possible segmentation it finds, e.g. "การก|ระ|ทำ" (การก
happens to be a valid Thai word).
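
To illustrate the failure mode described above, here is a minimal sketch of
greedy longest matching in Python. It is not ICU's actual code: the function
name and the tiny TOY_DICT are assumptions made up for this example, and a real
Thai dictionary is far larger.

```python
# Toy dictionary (an assumption for illustration only, not ICU's word list).
TOY_DICT = {"การ", "การก", "กระ", "กระทำ", "ระ", "ทำ"}


def greedy_longest_match(text, dictionary):
    """Segment text by always taking the longest dictionary word at the
    current position, never reconsidering an earlier choice."""
    max_len = max(len(word) for word in dictionary)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink.
        for n in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + n]
            if candidate in dictionary:
                tokens.append(candidate)
                i += n
                break
        else:
            # No dictionary word starts here: emit one character and move on.
            tokens.append(text[i])
            i += 1
    return tokens


print("|".join(greedy_longest_match("การกระทำ", TOY_DICT)))
# prints: การก|ระ|ทำ
```

Because การก is one character longer than การ, the greedy step commits to it and
the rest of the string can only be split as ระ|ทำ.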
To fix this issue, one would need to implement a slightly better
maximal-matching algorithm in ICU. Even that would still fail in some other,
rarer cases.
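
As a rough sketch of what maximal matching means here, the Python below picks
the segmentation with the fewest dictionary words, using dynamic programming.
This is only one common formulation and not a proposed ICU patch: the function
name and the toy WORDS set are hypothetical, and ties between equally short
segmentations are broken arbitrarily, which is one way such an algorithm can
still choose wrongly.

```python
# Toy dictionary (an assumption for illustration only, not ICU's word list).
WORDS = {"การ", "การก", "กระ", "กระทำ", "ระ", "ทำ"}


def maximal_match(text, dictionary):
    """Return the segmentation of text that uses the fewest dictionary
    words, or None if text cannot be covered by dictionary words."""
    max_len = max(len(word) for word in dictionary)
    n = len(text)
    # best[i] holds the fewest-word segmentation of text[:i], if any.
    best = [None] * (n + 1)
    best[0] = []
    for i in range(1, n + 1):
        for j in range(max(0, i - max_len), i):
            if best[j] is None:
                continue
            piece = text[j:i]
            if piece in dictionary:
                candidate = best[j] + [piece]
                if best[i] is None or len(candidate) < len(best[i]):
                    best[i] = candidate
    return best[n]


print("|".join(maximal_match("การกระทำ", WORDS)))
# prints: การ|กระทำ
```

With this toy dictionary the two-word split การ|กระทำ wins over the three-word
split การก|ระ|ทำ, because the whole string is considered before committing.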
BTW, this kind of problem occurs very rarely in real-world Thai text, because
the ambiguous sequences are rare and only matter when they fall at the end of
a line. I have never heard a Thai speaker complain about this.