I was wondering if myanmar , etc should be included in indic engine, or wether it would be better to fork the indic engine rather than overloading it with divergent rules?<br><br><br>On Thursday, 24 May 2012, Behdad Esfahbod <<a href="mailto:behdad@behdad.org">behdad@behdad.org</a>> wrote:<br>
> On 05/23/2012 06:48 PM, Andrew Cunningham wrote:<br>>> I think what Ed is saying is that Tai Tham follows a similar model to Myanmar<br>>> rather than a pure Indic model, where you have a distinct medials vs subjoined<br>
>> consonants wher subjoined consonants require a virama and medials don't<br>><br>> I see. Thanks for the clarification.<br>><br>>> Par of a fundamental change between myanar in unicode 4.1 and 5.1<br>
><br>> Good to know. I'll give HB a run on my Myanmar corpus and see if I can fix a<br>> few high-impact issues.<br>><br>>> Will look at my sources to confirm for Tai Tham.<br>><br>> Thanks,<br>
> b<br>><br>>> A.<br>>><br>>> On Thursday, 24 May 2012, Behdad Esfahbod <<a href="mailto:behdad@behdad.org">behdad@behdad.org</a><br>>> <mailto:<a href="mailto:behdad@behdad.org">behdad@behdad.org</a>>> wrote:<br>
>>> Hi Thep,<br>>>><br>>>> Humm, the message from Ed hat you are replying to never made it to me or to<br>>>> the list. Replies inline.<br>>>><br>>>><br>>>> On 05/23/2012 06:53 AM, Theppitak Karoonboonyanan wrote:<br>
>>>> Hi, Ed, Behdad,<br>>>>><br>>>>> On Sun, May 20, 2012 at 3:45 AM, Ed Trager <<a href="mailto:ed.trager@gmail.com">ed.trager@gmail.com</a><br>>> <mailto:<a href="mailto:ed.trager@gmail.com">ed.trager@gmail.com</a>>> wrote:<br>
>>>>> On Fri, May 18, 2012 at 5:48 PM, Behdad Esfahbod <<a href="mailto:behdad@behdad.org">behdad@behdad.org</a><br>>> <mailto:<a href="mailto:behdad@behdad.org">behdad@behdad.org</a>>> wrote:<br>
>>>>>> On 05/18/2012 04:02 PM, Ed Trager wrote:<br>>>>>>>><br>>>>>>>> In Tai Tham, U+1A6E VOWEL SIGN E needs to be shifted all the way to<br>>>>>>>> the left so that the final visual appearance would be:<br>
>>>>>><br>>>>>>> Are you sure? Without U+1A60 TAI THAM SIGN SAKOT before the subjoined<br>>>>>>> consonant? Reading Unicode suggests that you need that sign betwee PA<br>
>> and LA.<br>>>>>><br>>>>>> For most subjoined consonants, yes, that's true. But note in<br>>>>>> particular that U+1A56 MEDIAL LA and U+1A57 MEDIAL LA TANG LAI were<br>
>>>>> encoded separately. In the case of these two "LA" signs, I believe<br>>>>>> there are two reasons justifying the separate encoding:<br>>>>>><br>>>>>> (1) These are variant forms of the same subjoined letter LA:<br>
>>>>> apparently, there is no other good way to do it other than encoding<br>>>>>> both.<br>>>>>><br>>>>>> (2) Both of these LA signs can be part of triple consonant clusters,<br>
>>>>> i.e. "KLW" appears in the common word Thai / Tai word for banana,<br>>>>>> กล้วย, "klwy" . In Tai Tham, both the L and the W appear as<br>>>>>> below-base stacked forms (and actually the "y" is also a subjoined<br>
>>>>> form, but it's kind of hanging off the right side of the whole stack).<br>>>><br>>>> I'm not questioning the separate encoding. I don't care :-). What I'm saying<br>
>>> is that you need a SAKOT before them for them to be considered part of the<br>>>> same syllable according to the Indic OpenType spec and my implementation.<br>>>> Now, if you think Unicode intended these to subjoin without a SAKOT, then I<br>
>>> like you to point me to documentation about that.<br>>>><br>>>> If that is the case, we would need changes to the Indic machine. Not<br>>>> impossible, but I first want to make sure that it is indeed the case.<br>
>>><br>>>> behdad<br>>>><br>>>><br>>>><br>>>>>> There are some other separately-encoded subjoining consonant signs:<br>>>>>> U+1A5B, U+1A5C, U+1A5D, U+1A5E.<br>
>>>><br>>>>> Please also count U+1A55 (MEDIAL RA) in the rule, although it's not a<br>>>>> subjoined form.<br>>>>><br>>>>> Regards,<br>>>>> -Thep.<br>
>>> _______________________________________________<br>>>> HarfBuzz mailing list<br>>>> <a href="mailto:HarfBuzz@lists.freedesktop.org">HarfBuzz@lists.freedesktop.org</a> <mailto:<a href="mailto:HarfBuzz@lists.freedesktop.org">HarfBuzz@lists.freedesktop.org</a>><br>
>>> <a href="http://lists.freedesktop.org/mailman/listinfo/harfbuzz">http://lists.freedesktop.org/mailman/listinfo/harfbuzz</a><br>>>><br>>><br>>> --<br>>> Andrew Cunningham<br>>> Senior Project Manager, Research and Development<br>
>> Vicnet<br>>> State Library of Victoria<br>>> Australia<br>>><br>>> <a href="mailto:andrewc@vicnet.net.au">andrewc@vicnet.net.au</a> <mailto:<a href="mailto:andrewc@vicnet.net.au">andrewc@vicnet.net.au</a>><br>
>> <a href="mailto:lang.support@gmail.com">lang.support@gmail.com</a> <mailto:<a href="mailto:lang.support@gmail.com">lang.support@gmail.com</a>><br>><br><br>-- <br>Andrew Cunningham<br>Senior Project Manager, Research and Development<br>
Vicnet<br>State Library of Victoria<br>Australia<br><br><a href="mailto:andrewc@vicnet.net.au">andrewc@vicnet.net.au</a><br><a href="mailto:lang.support@gmail.com">lang.support@gmail.com</a><br>