[issue4730] cPickle corrupts high-unicode strings

2008-12-27 Thread Alexandre Vassalotti
Alexandre Vassalotti added the comment:

Fixed in r67934. Backported to 2.6 in r67936. Thanks!

--
nosy: +alexandre.vassalotti
resolution: -> fixed
status: open -> closed

[issue4730] cPickle corrupts high-unicode strings

2008-12-23 Thread Antoine Pitrou
Changes by Antoine Pitrou:

--
nosy: +pitrou

[issue4730] cPickle corrupts high-unicode strings

2008-12-23 Thread Nathaniel Smith
New submission from Nathaniel Smith:

cPickle.dump by default does not properly encode unicode characters outside the BMP -- it throws away the high bits:

>>> cPickle.loads(cPickle.dumps(u"\U00012345"))
u'\u2345'

The problem is in dump, not load:

>>> pickle.dumps(u"\U00012345") # works
'V\\U0
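
For readers without a Python 2 interpreter, here is a minimal sketch of the symptom in Python 3. It does not run the original cPickle code. It assumes that "throws away the high bits" means keeping only the low 16 bits of the code point, which matches the reported result u'\u2345', and then checks that the modern pickle module round-trips the same character through the text protocol (protocol 0). The variable names are illustrative only.

```python
import pickle

# One code point outside the Basic Multilingual Plane (above U+FFFF).
s = "\U00012345"

# Assumption: the reported corruption corresponds to keeping only the
# low 16 bits of the code point (0x12345 -> 0x2345).
truncated = chr(ord(s) & 0xFFFF)

# Protocol 0 is the text protocol, the default used in the report.
data = pickle.dumps(s, protocol=0)
restored = pickle.loads(data)

print(repr(truncated))   # the value the buggy dump produced after loading
print(restored == s)     # whether the current pickle round-trips correctly
```

A round-trip check like this, using a character above U+FFFF, is the kind of regression test that would catch the behaviour described in the report.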