[HarfBuzz] Dotted Circles in Tai Tham
Behdad Esfahbod
behdad at behdad.org
Thu Feb 26 15:22:30 PST 2015
On 15-02-26 11:04 AM, Richard Wordingham wrote:
>> What would immensely help is to gather sequences that you (and
>> others) think should be considered one syllable. We can then add
>> these to Roozbeh's indic repository as test data (with the USE
>> grammar). That will be extremely valuable, and I'm willing to set up
>> the code to run the tests.
> I take it you're looking for a regular expression. Would this be a
> regular expression for strings of symbols, rather than traces? (Traces
> are defined from strings by allowing certain pairs of 'letters' to
> commute - fully decomposed character strings under canonical
> equivalence are the example that interests us. The theory gets messy
> with Kleene star.) I notice USE seems, from the Buginese and some
> (all?) of the Tibetan overrides, to be working by matching NFD strings
> against the patterns. May I assume a suitable permutation of the
> non-zero canonical combining classes?
>
> Alternatively, are you just looking for a probing test set of real
> words?
Real or fictional words. Whatever you think should be considered a syllable
for these purposes.
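
For anyone preparing such test sequences, the "commuting" of marks raised in
the quoted question can be illustrated with a minimal sketch. It assumes
Python's standard unicodedata module and uses two generic combining marks
(U+0301 and U+0323) purely as an illustration; the helper names are
hypothetical and this is not how HarfBuzz or USE performs its matching.

```python
import unicodedata

def canonically_equivalent(a, b):
    """Return True if a and b have the same NFD form (hypothetical helper)."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)

def combining_classes(s):
    """List the canonical combining class of each code point in s."""
    return [unicodedata.combining(ch) for ch in s]

# Marks with different non-zero combining classes commute:
# U+0301 has class 230 and U+0323 has class 220.
s1 = "a\u0301\u0323"
s2 = "a\u0323\u0301"

# Marks with the same combining class do not commute:
# U+0300 and U+0301 both have class 230.
t1 = "a\u0301\u0300"
t2 = "a\u0300\u0301"

if __name__ == "__main__":
    print(canonically_equivalent(s1, s2))
    print(canonically_equivalent(t1, t2))
    print(combining_classes(unicodedata.normalize("NFD", s1)))
```

Normalizing both a test sequence and the input to NFD before comparing them
is one way to treat canonically equivalent spellings as a single test case.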
--
behdad
http://behdad.org/