Ezio Melotti added the comment:

> If the problem is that PyUnicode_FromUnicode() rejects character
> outside range [U+0000; U+10ffff],

But this used to return two valid characters:

>>> str(array('u', b'asdf'))
"array('u', '獡晤')"

so I think it still should -- unless the operation was already nonsensical and/or there's no way to do the same thing on 3.3+ due to the change introduced by PEP 393.

> it would be better to use the byte string '\xff' * sizeof_PY_UNICODE.

What for?

> U+66647361 may become valid in a future version of Unicode,

It won't.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue17223>
_______________________________________
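
For readers following the arithmetic, here is a minimal sketch of how the four bytes b'asdf' decode under the two Py_UNICODE widths discussed in this comment. It assumes a little-endian host, which the '獡晤' output quoted above implies. The variable names are illustrative only and are not part of CPython or the array module.

```python
import struct

data = b"asdf"  # the four bytes from the report: 0x61 0x73 0x64 0x66

# Assumption: little-endian byte order, matching the output quoted in the comment.

# 4-byte Py_UNICODE: the buffer is a single 32-bit code unit.
(wide_unit,) = struct.unpack("<I", data)

# 2-byte Py_UNICODE: the same buffer is two 16-bit code units.
narrow_units = struct.unpack("<2H", data)
narrow_text = "".join(chr(u) for u in narrow_units)

# The Unicode codespace ends at U+10FFFF.
MAX_CODE_POINT = 0x10FFFF

print(hex(wide_unit))                  # the value referred to as U+66647361
print([hex(u) for u in narrow_units])  # the two code units behind '獡晤'
print(wide_unit > MAX_CODE_POINT)      # True: outside the codespace
```

Read as two 16-bit units, the bytes give U+7361 and U+6664, both assigned characters. Read as one 32-bit unit, they give 0x66647361, which lies far above U+10FFFF, and chr() rejects it with ValueError.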