On Tue, 15 Jun 2004, Scott James Remnant wrote: > UTF-8 can encode all of the UCS-4 code points, however there is > significant overhead in the later planes turning a 4-byte UCS-4 sequence > into 6 or 7 byte character sequences. This is not true. UTF-8 encodes all valid Unicode characters in at most 4 octets. --behdad behdad.org