On Wed, Dec 11, 2013 at 6:01 PM, Ned Batchelder <n...@nedbatchelder.com> wrote: >> I've also been wondering if ISO-8859-1 is just an octet-oriented codec, >> so it'll read about anything. There are clearly non-7-bit-ASCII >> characters in the file that look like line noise in an mrxvt. > > > Both ISO-8859-1 and Windows-1255 are octet-oriented, I don't see why one > would raise an exception when the other didn't. Unless the exception isn't > on the decode, but instead on your attempt to output the result. Can you > show the full traceback you're seeing?
There are gaps in CP 1255 (see http://en.wikipedia.org/wiki/Code_page_1255), so I presume the file contains one or more of those octets that don't map to anything at all. -- https://mail.python.org/mailman/listinfo/python-list