[fdo] UTF-16 support ?

Behdad Esfahbod behdad at cs.toronto.edu
Tue Jun 15 10:37:49 PDT 2004


On Tue, 15 Jun 2004, Scott James Remnant wrote:

> UTF-8 can encode all of the UCS-4 code points, however there is
> significant overhead in the later planes turning a 4-byte UCS-4 sequence
> into 6 or 7 byte character sequences.

This is not true.  UTF-8 encodes all valid Unicode characters in
at most 4 octets.

--behdad
  behdad.org



More information about the freedesktop mailing list