<[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED]
> I think I understand the basic principles of Unicode; what confuses me is
> the use that different applications make of it.
>
> For example, I got that EN DASH out of a web page which states
> <?xml version="1.0" encoding="ISO-8859-1"?> at the beginning. That's why I
> went for that encoding. But if the browser can properly decode that
> character using that encoding, how come other applications can't?

Browsers tend to guess a lot about what the author intended. In particular,
they fudge the difference between ISO-8859-1 and Windows-1252. ISO-8859-1
has no printable characters in the byte range 0x80-0x9F, while Windows-1252
puts EN DASH at 0x96, so a browser that quietly treats the page as
Windows-1252 shows the dash and a stricter application does not.

http://en.wikipedia.org/wiki/Windows-1252
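To see the difference for yourself, here is a minimal sketch, assuming Python 3 and assuming the page really contains the single byte 0x96 where the dash appears (the sample bytes and variable names are made up for illustration). It decodes the same bytes with both codecs and reports what the fourth character turns out to be.

```python
import unicodedata

# Hypothetical sample: the byte 0x96 between two numbers, as a page saved
# as Windows-1252 but labelled ISO-8859-1 might contain.
raw = b"10 \x96 20"

# Python's iso-8859-1 codec maps each byte to the code point of the same
# value, so 0x96 becomes U+0096, a C1 control character.
as_latin1 = raw.decode("iso-8859-1")

# Windows-1252 maps 0x96 to U+2013 EN DASH.
as_cp1252 = raw.decode("cp1252")

for label, text in (("iso-8859-1", as_latin1), ("cp1252", as_cp1252)):
    ch = text[3]
    print(label, "U+%04X" % ord(ch), unicodedata.category(ch))
```

The first line printed should show a control character (category Cc) and the second a dash (category Pd). If that matches what you see, decoding the page as cp1252 rather than as the declared ISO-8859-1 is probably what the browser is doing behind your back.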