[FriBidi] Invalid UTF-8 for Arabic

Yoann Roman yroman-fribidi at altalang.com
Tue Mar 10 06:20:58 PDT 2009


>> Behdad, Thanks for the response.
> 
> No problem at all, but please note that I'm behNaM, behDaD's brother.

Sorry for the name mishap. I was looking at Behdad's message while 
replying to yours.

>> Your output is what I get from fribidi 0.10.9, but that doesn't do
>> Arabic joining. Other than that BOM mark, the Windows output from
>> 0.19.1 is correct and matches what other non-Fribidi-based, bidi
>> programs do.
> 
> I didn't notice you were getting the shaping on the string too.  So
> the Windows output is correct, except for the BOM.
> 
> Seems the BOM mark appears in the place of the first character of the
> LAM+ALEF ligature.  This might be a bug in the ligature replacement of
> the shaping function.
> 
> Yep, it's in the CVS HEAD, lib/fribidi-arabic.c
> fribidi_shape_arabic_ligature(), which replaces the first char of
> ligature with FRIBIDI_CHAR_FILL, which is ZWNBSP/BOM.
> 
> Behdad, why don't you use U+FFFF for this purpose?  And why don't
> fribidi_shape_arabic() or fribidi_shape_arabic_ligature() clean up
> these CHAR_FILLs?
> 
> Behdad, I can fix the problem in CVS if you tell me what the best
> way to fix this is.

Glad to know it's not a problem just on my end. Thanks for the research.
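
Until this is settled in CVS, one possible stopgap on the caller's side
is to drop the fill characters from the shaped UTF-32 buffer before
converting it back to UTF-8. The following is only a minimal sketch: it
assumes the fill value is U+FEFF, as described above, and the names
FILL_CHAR and strip_fill_chars() are hypothetical helpers, not part of
the FriBidi API.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumption: the fill character is U+FEFF (ZWNBSP/BOM), per the
 * description of FRIBIDI_CHAR_FILL quoted above. */
#define FILL_CHAR 0xFEFFu

/* Hypothetical helper, not a FriBidi function.
 * Compacts buf in place, dropping every FILL_CHAR code point,
 * and returns the new length in code points. */
size_t strip_fill_chars(uint32_t *buf, size_t len)
{
    size_t out = 0;

    assert(buf != NULL || len == 0);

    for (size_t i = 0; i < len; i++) {
        if (buf[i] != FILL_CHAR)
            buf[out++] = buf[i];   /* keep everything except the fill */
    }
    return out;
}
```

Note that this also removes a genuine BOM at the start of the text, and
any position maps or embedding-level arrays kept alongside the string
would need to be compacted the same way to stay in sync.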

-- 
Yoann Roman


