Re: Could not decode to UTF-8 column 'cpostal'

2011-06-13 Thread Ian Clelland
On Sun, Jun 12, 2011 at 11:27 PM, Cal Leeming [Simplicity Media Ltd] < cal.leem...@simplicitymedialtd.co.uk> wrote: > I could be wrong, but I'm fairly sure that \xc3\xa3 is a 4 byte unicode > sequence (which is essentially UTF-16). No, it's a two-byte sequence -- the UTF-8 encoding of code point

Re: Could not decode to UTF-8 column 'cpostal'

2011-06-12 Thread Cal Leeming [Simplicity Media Ltd]
I could be wrong, but I'm fairly sure that \xc3\xa3 is a 4 byte unicode sequence (which is essentially UTF-16). I'm not hot on understanding how unicode works, but I've had to deal with it hundreds of times due to the nature of the webapps our company writes (most data mining and indexing adapters

Could not decode to UTF-8 column 'cpostal'

2011-06-12 Thread alexandre...@gmail.com
Hi, I just converted a mssql database to sqlite3. I used the folloing text_factory con.text_factory = lambda x: x.decode('iso-8859-1').encode('utf-8') that just converts from latin1 to utf-8 where i browse the tables in python shell it seems everything is fine and well converted but when in admi