On Sun, Oct 20, 2013 at 3:49 AM, Roy Smith <r...@panix.com> wrote:
> So, yesterday, I tracked down an uncaught exception stack in our logs to a 
> user whose username included the unicode character 'SMILING FACE WITH 
> SUNGLASSES' (U+1F60E).  It turns out, that's perfectly fine as a user name, 
> except that in one obscure error code path, we try to str() it during some 
> error processing.

How is that a problem? Surely you have to deal with non-ASCII
characters all the time - how is that particular one a problem? I'm
looking at its UTF-8 and UTF-16 representations and not seeing
anything strange, unless it's the \x0e in UTF-16 - but, again, you
must surely have had to deal with
non-ASCII-encoded-whichever-way-you-do-it.

Or are you saying that that particular error code path did NOT handle
non-ASCII characters? If so, that's a strong argument for moving to
Python 3, to get full Unicode support in _all_ branches.

ChrisA
-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to