[Fribidi-discuss] Record Separator in BiDi

Behdad Esfahbod behdad at bamdad.org
Tue Aug 13 11:21:02 EST 2002


Hi all,

Please have a look at "UAX#13: Unicode Newline Guidelines":
http://www.unicode.org/unicode/reports/tr13/

fribidi identifies the PS (Paragraph separator) and SS (Segment
separator) character types.  BTW, it is still the application's
duty to break the text into paragraphs at each character of type
PS (Unicode recommends U+2029 for that, but U+000A has the same
type).  The next generation fribidi (interface ver 3) can break
the text into paragraphs and lines itself... if it ever
happens...
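
A minimal sketch of that application-side splitting, written in Python
rather than against fribidi itself.  It assumes Python's standard
unicodedata module, whose bidirectional() function reports class "B" for
the paragraph separators (called PS above) and "S" for segment
separators.  split_paragraphs is a hypothetical helper name, not part of
any fribidi interface, and treating CR LF as a single separator is a
choice made here for illustration.

```python
import unicodedata


def split_paragraphs(text):
    """Split text after each character whose bidi class is 'B'.

    Hypothetical helper for illustration; each separator stays at the
    end of the paragraph it terminates.
    """
    paragraphs = []
    start = 0
    i = 0
    n = len(text)
    while i < n:
        if unicodedata.bidirectional(text[i]) == "B":
            end = i + 1
            # Treat CR LF as one separator instead of two.
            if text[i] == "\r" and end < n and text[end] == "\n":
                end += 1
            paragraphs.append(text[start:end])
            start = end
            i = end
        else:
            i += 1
    # Trailing text with no final separator forms the last paragraph.
    if start < n:
        paragraphs.append(text[start:])
    return paragraphs
```

Each returned piece could then be handed to the bidi algorithm on its
own.  Under the same lookup, RS (U+001E) is of class "B" and US (U+001F)
is of class "S", so an RS in the input would end a paragraph here just
as U+2029 or U+000A does.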

behdad

On Tue, 13 Aug 2002, Nadav Har'El wrote:

> On Mon, Aug 12, 2002, Ilya Konstantinov wrote about "[Fribidi-discuss] Record Separator in BiDi":
> >...
> > According to the Unicode code charts, RS is 0x1E and US is 0x1F -- both
> > displayed as "Missing-Character" squares in Mozilla (I guess it simply
> > passes them to DrawString, instead of treating them as zero-width
> > control characters).
> >...
> 
> These are, by the way, the ASCII RS and US characters - they are not a Unicode
> invention. However, RS obviously had nothing to do with bidi in the original
> standard, so in my opinion it is somewhat strange to attach such new meanings
> to them now.
> 
> ASCII had RS (Record Separator), FS (File Separator), US (Unit Separator),
> and GS (Group Separator). I have never seen any application that actually
> attaches any sort of meaning to any of these control-characters when they
> are in a text file.
> 
> But I guess it wouldn't hurt to attach meaning to these characters, after
> they have been just lying there, unused, in ASCII, for the last 35 years :)
> 
> > All in all, those are very important and convenient characters to
> > prohibit unwanted BiDi behavior (e.g. on webpages, when two unrelated
> > fields get mixed together because of BiDi processing).
> 
> Doesn't a newer Unicode solution exist? Checking
> http://www.unicode.org/unicode/reports/tr9/, indeed I don't see any.
> So maybe you have a point that such characters are indeed needed.
> 
> 

-- 
Behdad Esfahbod		22 Mordad 1381, 2002 Aug 13 
http://behdad.org/	[Finger for Geek Code]

#define is_persian_leap(y) ((((y)-474)%2820+2820)%2820*31%128<31)




