Ezio Melotti added the comment: 0xed is the start byte of a 3 bytes sequence (i.e. a BMP char), and it should be followed by two continuation bytes.
For some reason the traceback you pasted is missing the last part, that might provide some insight. It could be one of these: >>> b'\xed'.decode('utf-8') # not enough continuation bytes UnicodeDecodeError: 'utf-8' codec can't decode byte 0xed in position 0: unexpected end of data >>> b'\xed\x7f'.decode('utf-8') # not a valid continuation byte UnicodeDecodeError: 'utf-8' codec can't decode byte 0xed in position 0: invalid continuation byte ---------- _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue13153> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com