On Fri, Jun 3, 2011 at 1:52 PM, Chris Angelico <ros...@gmail.com> wrote:
> However, Unicode planes 0-2 have all
> the defined printable characters

PS. I'm fully aware that there are ranges defined in plane 14 / E. They're non-printing characters, and unlikely to be part of a text string, although it is possible. So you can't shortcut things and treat Unicode as 18-bit numbers; it has to be at least 20-bit to reach plane 14 (and 21-bit to cover the whole code space up to U+10FFFF). It doesn't have to be 32-bit unless that's really convenient.

Chris Angelico
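
For anyone who wants to check the bit counts, here is a minimal sketch, assuming Python 3. The helper name `bits_needed` is made up for illustration; it just takes the last code point of a plane and asks how many bits that number occupies. The last line looks up U+E0001 (LANGUAGE TAG) as one example of a plane 14 character that is a non-printing format character.

```python
import unicodedata

def bits_needed(plane):
    """Bits required to hold the highest code point in the given plane.

    Plane N spans N*0x10000 .. N*0x10000 + 0xFFFF.
    (Hypothetical helper, for illustration only.)
    """
    top = plane * 0x10000 + 0xFFFF
    return top.bit_length()

for label, plane in [("planes 0-2", 2), ("plane 14 / E", 14), ("plane 16 (last)", 16)]:
    top = plane * 0x10000 + 0xFFFF
    print(f"{label}: up to U+{top:04X}, needs {bits_needed(plane)} bits")

# One plane 14 character: U+E0001 LANGUAGE TAG, general category Cf (format).
print(unicodedata.category(chr(0xE0001)))
```

So planes 0-2 fit in 18 bits, plane 14 pushes that to 20, and the private-use planes 15 and 16 are what take the full range to 21.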