> > What software did you use to make that so? The Python codec certainly > never would do such a thing. > > Are you sure it was latin-1 and \x27, and not windows-1252 and \x92? > > Regards, > Martin
you're right...the source of text are html pages and obviously webmasters have poor knowledge of encodings, so the meta declared the encoding as ISO-8859-1 but the real encoding is Windows-1252 and yes it uses \x92 as apostrophe, so the problem isn't Python -- http://mail.python.org/mailman/listinfo/python-list