[utf-8] UTF-16 support ?

Noah Levitt nlevitt at columbia.edu
Tue Jun 15 10:10:34 PDT 2004


On Mon, Jun 14, 2004 at 10:46:11 -1000, Christin LIVINE wrote:
> 
> Why not start pushing adoption of UTF-16 instead of UTF-8 on Linux ?

UTF-8 is better. I think the two biggest advantages on linux
are that ascii text is valid UTF-8, and that C strings (char *) 
terminated by a null byte work with UTF-8. UTF-8 is also
more efficient for storing English text than UTF-16, is not
limited to U+10FFFF like UTF-16, and doesn't require nasty
surrogate pairs like UTF-16.

Noah



More information about the utf-8 mailing list