Hello Jeroen,

On Monday 17 May 2004 10:42, Kransen, J. wrote:
> Anyone tell me if I'm wrong, but I doubt if UTF-8 covers Chinese. As I see
> it, it only contains 8 bits per character, instead of ASCII's 7 bits, and
> for that reason I assume it only covers 2* the number of chars. Maybe you
> can try UTF-16 instead?

You are wrong: UTF-8 does cover Chinese. It is a variable-length encoding of
Unicode. ASCII characters take 1 byte each. For a 'high' character there is
no separate escape character; instead, the high bits of the first byte
indicate how many bytes the sequence has, and every following byte starts
with the bits 10. A 'high' character takes 2 to 4 bytes in total, and most
Chinese characters take 3. So UTF-8 is a good choice for western languages,
with normally only a few 'high' characters in the document. Plain text that
is mostly Chinese is smaller in UTF-16, where most of those characters take
2 bytes.

> Jeroen

Regards,

Cees.
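
P.S. If you want to check the byte counts yourself, here is a minimal Python
sketch. It is only my own illustration: the sample characters and the helper
names utf8_length and sequence_length are made up for this mail and are not
part of any library. It encodes a few characters as UTF-8 and reads the
sequence length back from the high bits of the first byte.

```python
def utf8_length(ch: str) -> int:
    """Number of bytes the character occupies when encoded as UTF-8."""
    return len(ch.encode("utf-8"))


def sequence_length(lead: int) -> int:
    """Length of a UTF-8 sequence, read from the high bits of its first byte."""
    if lead < 0x80:            # 0xxxxxxx: ASCII, 1 byte
        return 1
    if lead >> 5 == 0b110:     # 110xxxxx: 2-byte sequence
        return 2
    if lead >> 4 == 0b1110:    # 1110xxxx: 3-byte sequence
        return 3
    if lead >> 3 == 0b11110:   # 11110xxx: 4-byte sequence
        return 4
    raise ValueError("not a valid UTF-8 lead byte")


# Hypothetical samples: ASCII, accented Latin, Chinese, and an emoji.
samples = ["A", "é", "中", "😀"]

for ch in samples:
    encoded = ch.encode("utf-8")
    hex_bytes = " ".join(f"{b:02x}" for b in encoded)
    print(f"U+{ord(ch):04X}: {utf8_length(ch)} byte(s), {hex_bytes}")
```

The Chinese sample comes out as 3 bytes and the accented Latin one as 2,
which is why a mostly western document stays close to its ASCII size.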