[HarfBuzz] Fwd: UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation

Roozbeh Pournader roozbeh at google.com
Wed Sep 30 19:01:39 PDT 2015


FYI.
---------- Forwarded message ----------
From: <announcements at unicode.org>
Date: Sep 30, 2015 5:13 PM
Subject: UAX #29, Unicode Text Segmentation, update to improve Mongolian
word segmentation
To: <announcements at unicode.org>
Cc:

*Unicode Standard Annex #29, Unicode Text Segmentation*, will be updated
for Unicode 9.0. A draft of the proposed update
<http://www.unicode.org/review/pri306/> is available for general public
review and comment.

The Word_Break classification of U+202F NARROW NO-BREAK SPACE (NNBSP) is
revised to correct the text segmentation behavior of U+202F for Mongolian
usage. For further background on this issue and possible ways to address
it, see PRI #308 <http://www.unicode.org/review/pri308/>, *Property Change
for U+202F NARROW NO-BREAK SPACE (NNBSP)*.

In this revision, the formerly empty Prepend class of the
Grapheme_Cluster_Break property is redefined to consist of all prefixed
format control characters and a few other characters with certain
Indic_Syllabic_Category property values.

The corresponding property value changes will be incorporated in the UCD
data files for Unicode 9.0.

http://blog.unicode.org/2015/09/uax-29-unicode-text-segmentation-update.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/harfbuzz/attachments/20150930/1880d67b/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mongolian-word-ordu.jpg
Type: image/jpeg
Size: 17827 bytes
Desc: not available
URL: <http://lists.freedesktop.org/archives/harfbuzz/attachments/20150930/1880d67b/attachment-0001.jpg>


More information about the HarfBuzz mailing list