Ezio Melotti <ezio.melo...@gmail.com> added the comment: Actually I'd like to have some kind of convention mainly when the user writes the encoding as a string, e.g. s.encode('utf-8'). Indeed, if the encoding comes from a webpage or somewhere else it makes sense to have some flexibility.
I think that 'utf-8' is the most widely used name for the UTF-8 codec and it's not even mentioned in the table of the standard encodings. So someone will use 'utf-8', someone else 'utf_8' and some users could even pick one of the aliases, like 'U8'. Probably is enough to add 'utf-8', 'iso-8859-1' and similar as "preferred form" and explain why and how the codec names are normalized and what are the valid aliases. Regarding the ambiguity of 'UTF', it is not the only one, there's also 'LATIN' among the aliases of ISO-8859-1. ---------- _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue5902> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com