[Harfbuzz-indic] Fwd: Re: [HarfBuzz] A problem about indic shape (for khmer)

Behdad Esfahbod behdad at behdad.org
Wed May 9 05:15:08 PDT 2012


On 05/09/2012 01:22 PM, Olivier BERTEN wrote:
> I sent it to Danh Hong which is considered by many as the reference for
> khmer unicode...
> 
> He told me he would answer soon so I guess his answer is almost ready ;-)

Thanks.  In the mean time, Jonathan was kind enough to compile a list.  Here
it is:

U+0AC9    => 0AC5 0ABE
U+0B57    => no decomp, -> RIGHT
U+0F77    => 0FB2 0F81
U+0F79    => 0FB3 0F81
U+17BE    => 17C1 17BE
U+17BF    => 17C1 17BF
U+17C0    => 17C1 17C0
U+17C4    => 17C1 17C4
U+17C5    => 17C1 17C5
U+1925    => 1920 1923
U+1926    => 1920 1924
U+1B3C    => 1B42 1B3C (?) Balinese
U+1C29    => no decomp, -> LEFT
U+A9C0    => no decomp, -> RIGHT
U+1112E    => 11127 11131
U+1112F    => 11127 11132
U+111BF    => no decomp, -> ABOVE

behdad




> Olivier
> 
> Le 09/05/12 12:26, Behdad Esfahbod a écrit :
>> I noticed that I myself forget to use the Indic list for Indic discussions.
>> Is there anyone on this list who doesn't want to be on the main HarfBuzz list?
>>  I'm thinking about merging them back in.
>>
>> In the mean time, I posted this message to the list three weeks ago and got no
>> feedback.  Can people on this list help?
>>
>> Thanks,
>> behdad
>>
>> -------- Original Message --------
>> Subject: Re: [HarfBuzz] A problem about indic shape (for khmer)
>> Date: Thu, 19 Apr 2012 15:41:54 -0400
>> From: Behdad Esfahbod <behdad at behdad.org>
>> To: datao zhang <dataozhang at hotmail.com>
>> CC: harfbuzz at lists.freedesktop.org
>>
>> So, looks like these are all split matras (according to
>> IndicMatraCategory.txt) that do not have a decomposition:
>>
>> U+0AC9
>> U+0B57
>> U+0F77
>> U+0F79
>> U+17BE
>> U+17BF
>> U+17C0
>> U+17C4
>> U+17C5
>> U+1925
>> U+1926
>> U+1B3C
>> U+1C29
>> U+A9C0
>> U+1112E
>> U+1112F
>> U+111BF
>>
>> Can people familiar with these characters please clarify what the desired
>> behavior of these characters is?
>>
>> Thanks,
>> behdad
>>
>> On 04/19/2012 02:36 PM, Behdad Esfahbod wrote:
>>> On 04/19/2012 02:24 PM, datao zhang wrote:
>>>
>>>> The following Matra can’t be decomposed:
>>>> 0x17C0
>>>> 0x17C4
>>>> 0x17C5
>>>> 0x17BE
>>>> 0x17BF
>>> None of these are decomposable in Unicode, so I have to to adjust the shaper
>>> for that.  Looks like for Khmer all split matras add U+17C1 to the left.  I
>>> have a couple ideas how to implement this now.  Will take a look.
>>>
>>> Thanks,
>>> behdad
>> _______________________________________________
>> HarfBuzz-Indic mailing list
>> HarfBuzz-Indic at lists.freedesktop.org
>> http://lists.freedesktop.org/mailman/listinfo/harfbuzz-indic
> 
> _______________________________________________
> HarfBuzz-Indic mailing list
> HarfBuzz-Indic at lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/harfbuzz-indic


More information about the HarfBuzz-Indic mailing list