On Sun, Aug 19, 2012 at 8:19 PM, <wxjmfa...@gmail.com> wrote:
> This is precisely the weak point of this flexible
> representation. It uses latin-1 and latin-1 is for
> most users simply unusable.

No, it uses Unicode, and as an optimization, attempts to store the code points in fewer than four bytes each for most strings. The fact that the one-byte storage format happens to look like latin-1 is rather coincidental.

ChrisA
--
http://mail.python.org/mailman/listinfo/python-list
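
For anyone who wants to see the optimization rather than take it on faith, here is a rough sketch. It assumes CPython 3.3 or later, where sys.getsizeof() reports the storage of a str object; other implementations may behave differently or not support the call at all. The helper name bytes_per_code_point is made up for this illustration. It measures how much a string grows per added character, which should come out as 1, 2, or 4 bytes depending on the widest code point in the string.

```python
import sys


def bytes_per_code_point(ch, n=1000):
    """Estimate the per-character storage CPython uses for a string of `ch`.

    Hypothetical helper for illustration; assumes CPython 3.3+.
    """
    # Differencing two lengths cancels the fixed object header.
    return (sys.getsizeof(ch * (2 * n)) - sys.getsizeof(ch * n)) // n


samples = {
    "a": "ASCII",
    "\u00e9": "U+00E9, below U+0100",
    "\u20ac": "U+20AC, below U+10000",
    "\U0001F600": "U+1F600, above U+FFFF",
}

widths = {ch: bytes_per_code_point(ch) for ch in samples}
for ch, label in samples.items():
    print(f"{label}: {widths[ch]} byte(s) per code point")

# A string stored one byte per code point is still a full Unicode string:
# appending a wider character simply yields a new string with wider storage.
mixed = "\u00e9" * 10 + "\u20ac"
assert len(mixed) == 11 and mixed[-1] == "\u20ac"
```

The narrow storage is purely an internal detail: the euro sign is not in latin-1, yet it concatenates onto the one-byte string without any encoding step or error.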