STINNER Victor <victor.stin...@haypocalc.com> added the comment:

> No, they should be rejected. Allowing them in some specific
> places might cause them to leak somewhere else and cause problems,
> so I'd rather stick with that range and reject all the chars
> >U+10FFFF everywhere.

That's why I added a (debug) check to reject them. I don't think that your UTF-8 encoder supports such characters, for example. All functions assume that the maximum character is U+10FFFF.

If they should be rejected, a solution is to modify strxfrm() to return a list of integers (code points) instead of a string.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue13441>
_______________________________________
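
To illustrate the strxfrm() suggestion in the comment above, here is a minimal sketch of why a list of integers could carry values that a string cannot. The helper names (as_str_key, as_int_key) and the sample values are hypothetical and are not part of any patch on this issue; the only assumption about Python itself is that chr() raises ValueError for values above U+10FFFF.

```python
MAX_CODE_POINT = 0x10FFFF  # highest valid Unicode code point


def as_str_key(values):
    """Build a str sort key; chr() raises ValueError above U+10FFFF."""
    return "".join(chr(v) for v in values)


def as_int_key(values):
    """Build a list-of-int sort key; out-of-range values are kept as-is."""
    return list(values)


# Hypothetical transformed output containing one value above U+10FFFF.
raw = [0x61, MAX_CODE_POINT + 1]

try:
    as_str_key(raw)
    rejected = False
except ValueError:
    rejected = True

# Lists of integers compare element by element, so they still work as
# sort keys even when a value is not a valid code point.
keys = sorted([as_int_key(raw), as_int_key([0x61, 0x62])])
```

With this approach the string type never has to hold an invalid character, and the keys remain usable for ordering.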