[Libreoffice-bugs] [Bug 95159] Thai word wrapping of กลับ and การกระทำ

bugzilla-daemon at bugs.documentfoundation.org
Fri Dec 4 12:12:06 UTC 2020


https://bugs.documentfoundation.org/show_bug.cgi?id=95159

Samphan Raruenrom <untsamphan at gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|minor                       |trivial

--- Comment #8 from Samphan Raruenrom <untsamphan at gmail.com> ---
LibreOffice relies on ICU to break Thai words. ICU uses a greedy,
dictionary-based longest-matching algorithm for Thai word segmentation, i.e.
it stops as soon as it finds the first possible segmentation, here
"การก|ระ|ทำ" (การก happens to be a valid Thai word).
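
To make the failure concrete, here is a minimal sketch of greedy longest
matching. It is not ICU's code: the function name and the five-entry toy
dictionary are assumptions chosen only to reproduce the segmentation quoted
above.

```python
# Hypothetical toy dictionary; a real Thai word list is far larger.
TOY_DICTIONARY = {"การ", "การก", "กระทำ", "ระ", "ทำ"}


def greedy_longest_match(text, dictionary):
    """Split text left to right, always taking the longest dictionary word."""
    max_len = max(len(word) for word in dictionary)
    words = []
    pos = 0
    while pos < len(text):
        # Try the longest candidate first and shrink until one matches.
        for end in range(min(len(text), pos + max_len), pos, -1):
            if text[pos:end] in dictionary:
                break
        else:
            # Nothing matched: emit a single character and move on.
            end = pos + 1
        words.append(text[pos:end])
        pos = end
    return words


print("|".join(greedy_longest_match("การกระทำ", TOY_DICTIONARY)))
# prints การก|ระ|ทำ
```

Because การก is longer than การ, the greedy step commits to it and never
reconsiders, even though การ|กระทำ covers the same text with fewer words.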

To fix this issue, one would need to implement a slightly better
maximal-matching algorithm in ICU. Even that would still fail in some other,
rarer cases.
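
As a rough illustration of what maximal matching means here, the sketch below
picks the segmentation with the fewest dictionary words using dynamic
programming. This is one common formulation and only an assumption about what
such a fix might look like, not a patch for ICU; the function name and the toy
dictionary are hypothetical.

```python
# Hypothetical toy dictionary; a real Thai word list is far larger.
TOY_WORDS = {"การ", "การก", "กระทำ", "ระ", "ทำ"}


def maximal_match(text, dictionary):
    """Return the segmentation with the fewest dictionary words, or None."""
    n = len(text)
    unreachable = float("inf")
    best = [unreachable] * (n + 1)  # best[i]: fewest words covering text[:i]
    back = [None] * (n + 1)         # back[i]: start index of the last word
    best[0] = 0
    for end in range(1, n + 1):
        for start in range(end):
            if best[start] + 1 < best[end] and text[start:end] in dictionary:
                best[end] = best[start] + 1
                back[end] = start
    if best[n] == unreachable:
        return None  # the text cannot be covered by dictionary words alone
    words = []
    pos = n
    while pos > 0:
        words.append(text[back[pos]:pos])
        pos = back[pos]
    return words[::-1]


print("|".join(maximal_match("การกระทำ", TOY_WORDS)))
# prints การ|กระทำ
```

Minimizing the word count resolves this example, but two segmentations can
tie on word count, which is one way the rarer failures can still occur.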

BTW, this kind of problem is very rare in real-world Thai text, because an
ambiguous sequence is uncommon and must also fall at the end of a line to
matter. I have never heard a Thai speaker complain about this.
