[HarfBuzz] Contextual shaping of Malayalam post(pre)/below base forms

Behdad Esfahbod behdad at behdad.org
Wed Jun 26 16:09:54 PDT 2013


On 13-06-26 06:41 PM, Richard Wordingham wrote:
> On Wed, 26 Jun 2013 11:09:46 -0400
> Behdad Esfahbod <behdad at behdad.org> wrote:
> 
>> On 13-06-18 06:46 PM, Richard Wordingham wrote:
>>> lookup pref_st2
>>>     | y1 xx r3 | -- No sequence indices for this context!
>>>     | xx r3 |
>>
>> This wouldn't work.  You should match y1 in backtrack sequence, not
>> input sequence.  Otherwise this is what happens, if you have a
>> sequence y1 xx r3:
>>
>>   - HB tries applying the lookup at position zero, matches the first
>> rule, stops,
>>
>>   - It then advances position by one, trying to apply the rule at
>> position one, now the second rule matches and ligature is formed.
>>
>> For a rule to stop a later one from applying they should apply to the
>> same input sequence at the same position, ie, both should have "xx
>> r3" as their input, and the first one should have more context in
>> terms of backtrack / lookahead...  It can have a longer input
>> sequence too, but they should start the same.
> 
> Applying the advice above, I tried
> 
> lookup pref_st2
>    y1 | xx r3 |
>       | xx r3 |
>          0 pref_lkp1
> end lookup
> 
> It worked!  That is, <YA, VIRAMA, RA> was not affected by shaping, but
> was rendered as three glyphs, as intended.
> 
> However, I have two follow-up remarks, for the actual failure mechanism
> is not as described by Behdad.  However, it does seem prudent to start
> the input context at the virama.
> 
> Firstly, I don't understand the advancing rule.  Given an input
> sequence y1 xx r3 u2 and the contexts | y1 xx r3 | and | xx r3 |, what
> I had expected is:
> 
> A1) The shaper tries applying the look up at y1, matches the first rule,
> stops.
> 
> A2) In accordance with the paragraph, "Once the substitutions are
> complete, the client should move to the glyph position immediately
> following the matched input sequence and resume the lookup process from
> there", the lookup then advances to uu, finds no match, and does not
> form the ligature.
> 
> Am I misunderstanding the specification?  Does the specification comply
> with Uniscribe?  (I've not been having much success with my experiments
> with Uniscribe, so I can't yet answer such questions myself.)

You aren't misunderstanding the spec.  I think you are correct and this is a
HarfBuzz bug.  Can you send your previous font to me to test with Uniscribe?
Or test it yourself?  That said, I can't nail it down in the code why this is
not working as intended.

Humm.  Ok, now I know.  In the particular sequence y1 is the base, right?  If
yes, then it is NOT marked for the pref feature.  <pref> is applied to all
characters after the base.  I believe that's why your original design wasn't
working.  Still I'm interested to double check that Uniscribe does the same.


> I haven't yet worked out how HarfBuzz distinguishes a pre-base <VIRAMA,
> RA> ligature formed by the <pref> feature from a post-base <VIRAMA, RA>
> ligature formed by the <pstf> feature,

It doesn't.

-- 
behdad
http://behdad.org/



More information about the HarfBuzz mailing list