[Libreoffice-bugs] [Bug 52020] New: : ICU breakiterator not working with Khmer and Hunspell

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Jul 13 01:39:53 CEST 2012


https://bugs.freedesktop.org/show_bug.cgi?id=52020

             Bug #: 52020
           Summary: : ICU breakiterator not working with Khmer and
                    Hunspell
    Classification: Unclassified
           Product: LibreOffice
           Version: 3.6.0.0.beta2
          Platform: Other
        OS/Version: All
            Status: UNCONFIRMED
 Status Whiteboard: BSA
          Severity: normal
          Priority: medium
         Component: Libreoffice
        AssignedTo: libreoffice-bugs at lists.freedesktop.org
        ReportedBy: sungkhum at gmail.com


Created attachment 64144
  --> https://bugs.freedesktop.org/attachment.cgi?id=64144
Screenshot of "misspelled" Khmer words that should be treated as two words

Problem description: While ICU automatic line-breaking now works for Khmer in
LibreOffice 3.6, Hunspell does not seem to be using the same word-breaking data
and only sees one long line of text (Khmer does not have traditional "spaces"
between words, like Thai). 

Steps to reproduce:
1. Type ឲ្យគេ (should be automatically broken by ICU into ឲ្យ|គេ)
2. If you have the SBBIC spelling checker installed
http://extensions.libreoffice.org/extension-center/khmer-spelling-checker-sbbic-version
and CTL enabled, you will see that ឲ្យគេ is treated as one word, rather than
two, and is therefore misspelled.
3. You might need a font to correctly display Khmer (download one here:
http://www.sbbic.org/2011/01/19/khmer-sbbic-unicode-system-font/ )

Current behavior: No Khmer words are automatically broken for Hunspell, so we
have to continue manually putting zero-width spaces between words to spell
check (even though line-breaking is now automatic)

Expected behavior: Khmer words should be automatically broken for Hunspell to
check.

Platform (if different from the browser): 

Browser: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.11 (KHTML, like
Gecko) Chrome/20.0.1132.47 Safari/536.11

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Libreoffice-bugs mailing list