[HarfBuzz] Hackfest report
Behdad Esfahbod
behdad at behdad.org
Mon May 28 17:29:05 PDT 2012
On 05/28/2012 07:55 PM, Shriramana Sharma wrote:
> Sorry. You're right of course. I had forgotten that junk sequences were also
> included in the input. BTW I would like to see some of these badly rendered
> sequences if possible.
I'm trying to get the test data, and then the testing infrastructure, out as
soon as possible. If that takes longer than a week or so, I'll just share the
data for the few languages we are currently discussing with the list, so we
can keep the discussion going.
behdad
>
> On May 28, 2012 11:52 PM, "Behdad Esfahbod" <behdad at behdad.org> wrote:
>
> On 05/28/2012 02:13 PM, Shriramana Sharma wrote:
> > On Mon, May 28, 2012 at 10:45 PM, Behdad Esfahbod <behdad at behdad.org>
> > wrote:
> >>
> >> I ran these through. Tamil is at 0.87%, which is really nice. There are
> >> 806 failures.
> >
> > Frankly for Tamil 806 failures is high. Tamil is perhaps the simplest
> > major Indic script.
>
> I find that statement hard to believe. I didn't tell you out of how many
> cases! 806 out of a million words is not very high to me. See my original
> report. Many of the failure cases are peculiar sequences that we disagree
> with Uniscribe on. Anyway, I'll look into it more closely. I think I should
> go ahead and make the numbers frequency-adjusted first.
>
> behdad
>
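For readers following the thread, here is a rough sketch of what a "frequency-adjusted" failure rate could look like, as opposed to a plain count of failing words. It assumes the test data is a word list with per-word occurrence counts; the function name, the data layout, and the toy numbers are all hypothetical and are not taken from the actual test setup.

```python
from collections import Counter


def failure_rates(word_counts, failed_words):
    """Return (unweighted, frequency_weighted) failure rates as fractions.

    word_counts:  mapping of word -> number of occurrences in the corpus
                  (hypothetical layout, assumed for this sketch).
    failed_words: iterable of words whose rendering differed from the
                  reference output.
    """
    # Ignore duplicates and any failed word that is not in the corpus.
    failed = {w for w in failed_words if w in word_counts}

    total_types = len(word_counts)             # number of distinct words
    total_tokens = sum(word_counts.values())   # number of occurrences

    # Every distinct word counts once.
    unweighted = len(failed) / total_types
    # Every word counts as often as it occurs in the corpus.
    weighted = sum(word_counts[w] for w in failed) / total_tokens
    return unweighted, weighted


# Toy data only: placeholder words and counts, not real results.
counts = Counter({"w1": 900, "w2": 90, "w3": 9, "w4": 1})
failed = {"w3", "w4"}
unweighted, weighted = failure_rates(counts, failed)
```

With this toy data, half of the distinct words fail, but they account for only 10 of 1000 occurrences, so failures confined to rare or junk sequences weigh far less once the rate is adjusted for frequency.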