On 30. januar 2008 10:21, Berteun Damman wrote: >On Wed, 30 Jan 2008 09:57:55 +0100, <[EMAIL PROTECTED]> ><[EMAIL PROTECTED]> wrote: >> How can I convert a string read from a database containing unicode >> literals, such as "Fr\u00f8ya" to the latin-1 equivalent, "Frøya"? >> >> I have tried variations around >> "Fr\u00f8ya".decode('latin-1') >> but to no avail. > >Assuming you use Unicode-strings, the following should work: > u"Fr\u00f8ya".encode('latin-1')
I'm afraid that the string read from the database is a non-unicode string, thus me not using u"..." above. But it may contain unicode literals from the (Python-based) system that populated the table, and I'd like to get them back to proper unicode strings again, so that I can display them correctly to the user. >That is, for some string s, s.decode('encoding') converts the >non-unicode string s with encoding to a unicode string u. Whereas >for some unicode string u, u.encode('encoding') converts the unicode >string u into a non-unicode string with the specified encoding. > >You can use s.encode() on a non-unicode string, but it will first try to >decode it (which might give an DecodeError if there are non-ASCII >characters present) and it will then encode it. Any suggestions on how that would look, given the example above? .david -- http://mail.python.org/mailman/listinfo/python-list