On 5/10/2015 11:53 AM, Somelauw . wrote:
In Python 3, decoding "€" with unicode-escape returns 'â\x82¬' which in my opinion doesn't make sense.
Agreed. I think this is a bug in that it should raise an exception instead. Decoding a string only makes sense for rot-13
The € already is decoded; if it were encoded it would look like this: '\u20ac'. So why is it doing this?
$ python3 -S Python 3.3.3 (default, Nov 27 2013, 17:12:35) [GCC 4.8.2] on linux >>> import codecs >>> codecs.decode('€', 'unicode-escape') 'â\x82¬' >>> codecs.encode('€', 'unicode-escape') b'\\u20ac'
-- Terry Jan Reedy -- https://mail.python.org/mailman/listinfo/python-list