pjmcle...@gmail.com: > not sure why utf-8 gives an error when thats the most wide all caracters > inclusive right?/
Not all sequences of bytes are legal in UTF-8. For example, >>> b'\x80'.decode("utf-8") Traceback (most recent call last): File "<stdin>", line 1, in <module> UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte Not all sequences of bytes are legal in ASCII, either. However, all sequences of bytes are legal in Latin-1 (among others). Of course, decoding with Latin-1 gives you gibberish unless the data really is Latin-1. But you'll never get a UnicodeDecodeError. Marko -- https://mail.python.org/mailman/listinfo/python-list