Hi all I have a weird problem with unicode conversion whilst running in the django environment. I'm not convinced django is the cause, but hopefully someone will have seen something like this and can point me in the right direction..
The original code was an XML-RPC call from one django server to another, and the problem occurred when attempting to extract the parameters from the utf-8 encoded XML, but fortunately I can reproduce it much more simply. The basic process is that a unicode string is encoded to a utf-8 bytestring, and put into an XML document. It is then extracted from the XML by the other server and then re-interpreted as a unicode string, which should look like this: > $ python Python 2.5.2 (r252:60911, Apr 25 2008, 17:25:09) [GCC 4.2.1 20070719 [FreeBSD]] on freebsd7 Type "help", "copyright", "credits" or "license" for more information. >>> u'£' u'\xa3' >>> u'£'.encode('utf-8') '\xc2\xa3' >>> unicode(u'£'.encode('utf-8'), 'utf-8') u'\xa3' >>> print unicode(val.encode('utf-8'), 'utf-8') £ As you can see, exactly as expected. However, when running under 'manage.py shell', the behaviour is quite different: Python 2.5.2 (r252:60911, Apr 25 2008, 17:25:09) [GCC 4.2.1 20070719 [FreeBSD]] on freebsd7 Type "help", "copyright", "credits" or "license" for more information. (InteractiveConsole) >>> u'£' u'\xc2\xa3' >>> u'£'.encode('utf-8') '\xc3\x82\xc2\xa3' >>> unicode(u'£'.encode('utf-8'), 'utf-8') u'\xc2\xa3' >>> print unicode(u'£'.encode('utf-8'), 'utf-8') £ Any pointers? Cheers Tom -- You received this message because you are subscribed to the Google Groups "Django users" group. To post to this group, send email to django-us...@googlegroups.com. To unsubscribe from this group, send email to django-users+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/django-users?hl=en.