On Thu, Nov 18, 2004 at 12:17:01PM +0100, Dawid Kuroczko wrote: > Correct me if I am wrong, but I think that UTF-8 is almost identical > to ISO-8859-1 in binary form to ISO-8859-1. I mean, UTF-8 is > ISO-8859-1 plus multibyte characters from other charsets.
No, UTF-8 and ISO-8859-1 are different from characters 128 and up. Even though code point 160 represents "non breaking space" in both, they are represented differently in terms of bytes. One is 0xA0 and the other is 0xC2 0xA0. -Dom ---------------------------(end of broadcast)--------------------------- TIP 3: if posting/reading through Usenet, please send an appropriate subscribe-nomail command to [EMAIL PROTECTED] so that your message can get through to the mailing list cleanly