Hi all

I have a weird problem with unicode conversion whilst running in the
django environment. I'm not convinced django is the cause, but
hopefully someone will have seen something like this and can point me
in the right direction..

The original code was an XML-RPC call from one django server to
another, and the problem occurred when attempting to extract the
parameters from the utf-8 encoded XML, but fortunately I can reproduce
it much more simply. The basic process is that a unicode string is
encoded to a utf-8 bytestring, and put into an XML document. It is
then extracted from the XML by the other server and then
re-interpreted as a unicode string, which should look like this:

> $ python
Python 2.5.2 (r252:60911, Apr 25 2008, 17:25:09)
[GCC 4.2.1 20070719  [FreeBSD]] on freebsd7
Type "help", "copyright", "credits" or "license" for more information.
>>> u'£'
u'\xa3'
>>> u'£'.encode('utf-8')
'\xc2\xa3'
>>> unicode(u'£'.encode('utf-8'), 'utf-8')
u'\xa3'
>>> print unicode(val.encode('utf-8'), 'utf-8')
£

As you can see, exactly as expected. However, when running under
'manage.py shell', the behaviour is quite different:

Python 2.5.2 (r252:60911, Apr 25 2008, 17:25:09)
[GCC 4.2.1 20070719  [FreeBSD]] on freebsd7
Type "help", "copyright", "credits" or "license" for more information.
(InteractiveConsole)
>>> u'£'
u'\xc2\xa3'
>>> u'£'.encode('utf-8')
'\xc3\x82\xc2\xa3'
>>> unicode(u'£'.encode('utf-8'), 'utf-8')
u'\xc2\xa3'
>>> print unicode(u'£'.encode('utf-8'), 'utf-8')
£

Any pointers?

Cheers

Tom

-- 
You received this message because you are subscribed to the Google Groups 
"Django users" group.
To post to this group, send email to django-us...@googlegroups.com.
To unsubscribe from this group, send email to 
django-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/django-users?hl=en.

Reply via email to