> I agree, but the problem is much subtle. I have coverted a text from > iso-8859-1 to utf-8 and the codecs have translated \x27 ( the iso > apostrophe ) to \xe28099 in utf-8 ( or u'2019' in unicode code point > notation )
What software did you use to make that so? The Python codec certainly never would do such a thing. Are you sure it was latin-1 and \x27, and not windows-1252 and \x92? Regards, Martin -- http://mail.python.org/mailman/listinfo/python-list