[HarfBuzz] Hackfest report

Behdad Esfahbod behdad at behdad.org
Mon May 28 17:29:05 PDT 2012


On 05/28/2012 07:55 PM, Shriramana Sharma wrote:
> Sorry. You're right of course. I had forgotten that junk sequences were also
> included in the input. BTW I would like to see some of these badly rendered
> sequences if possible.

I'm trying to get the test data, and then the testing infrastructure, out as
soon as possible.  If that takes longer than a week or so, I'll just share with
the list the data for the few languages we are currently discussing so we can
keep the discussion going.

behdad


> Sent from my Android phone
> 
> On May 28, 2012 11:52 PM, "Behdad Esfahbod" <behdad at behdad.org> wrote:
> 
>     On 05/28/2012 02:13 PM, Shriramana Sharma wrote:
>     > On Mon, May 28, 2012 at 10:45 PM, Behdad Esfahbod <behdad at behdad.org> wrote:
>     >>
>     >> I ran these through.  Tamil is at 0.87%, which is really nice.  There are
>     >> 806 failures.
>     >
>     > Frankly for Tamil 806 failures is high. Tamil is perhaps the simplest
>     > major Indic script.
> 
>     I find that statement hard to believe.  I didn't tell you out of how many
>     cases!  806 out of a million words is not very high to me.  See my original
>     report.  Many of the failure cases are peculiar sequences that we disagree
>     with Uniscribe on.  Anyway, I'll look into it more closely.  I think I should
>     go ahead and make the numbers frequency-adjusted first.
> 
>     behdad
> 


