On 6/28/2019 4:25 PM, Tobiah wrote:
A guy comes in and enters his last name as RÖnngren.

So what did the browser really give me; is it encoded
in some way, like latin-1?  Does it depend on whether
the name was cut and pasted from a Word doc. etc?
Should I handle these internally as unicode?  Right
now my database tables are latin-1 and things seem
to usually work, but not always.

Unless you want to restrict your app to people whose names are in, or convertible to, latin-1 (western Europe), you should use utf-8 or let the database
encode for you.
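
To illustrate why latin-1 storage "usually works, but not always", here is a minimal Python 3 sketch. The helper name fits_latin1 and the sample names are my own and purely illustrative; nothing here is specific to any browser or database.

```python
# Sample names written with escapes so the source file encoding does not matter.
name = "R\u00f6nngren"          # Roenngren with o-umlaut, fits in latin-1
other = "Dvo\u0159\u00e1k"      # contains r-caron (U+0159), not in latin-1


def fits_latin1(text):
    """Return True if every character of text can be stored as latin-1."""
    try:
        text.encode("latin-1")
        return True
    except UnicodeEncodeError:
        return False


# The same text becomes different bytes depending on the encoding.
latin1_bytes = name.encode("latin-1")   # b'R\xf6nngren'
utf8_bytes = name.encode("utf-8")       # b'R\xc3\xb6nngren'

# utf-8 can encode any str, so the round trip is always lossless.
assert utf8_bytes.decode("utf-8") == name
assert other.encode("utf-8").decode("utf-8") == other

print(fits_latin1(name), fits_latin1(other))
```

Keeping text as str (unicode) inside the program and encoding only at the boundaries avoids most of these surprises.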

Also, what do people do when searching for a record.
Is there some way to get 'Ronngren' to match the other
possible foreign spellings?

I have seen a program that converts all latin-1 chars to ascii for matching.
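
I do not know how that program did it, but one common approach can be sketched with the standard unicodedata module: decompose each character, drop the combining marks, and compare the folded forms. The function name ascii_fold is hypothetical, and this is only a rough matching key, not a proper transliteration.

```python
import unicodedata


def ascii_fold(text):
    """Return a lowercase ASCII-only search key for text.

    Accented letters are decomposed (NFKD) and their combining marks
    are dropped. Characters with no ASCII decomposition are discarded.
    """
    decomposed = unicodedata.normalize("NFKD", text)
    without_marks = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    # Drop anything still outside ASCII, then fold case for comparison.
    return without_marks.encode("ascii", "ignore").decode("ascii").casefold()


# Compare folded keys rather than the stored spellings.
stored = "R\u00d6nngren"      # as entered, with a capital O-umlaut
query = "Ronngren"
print(ascii_fold(stored) == ascii_fold(query))
```

Store the original spelling for display and search on the folded key only. Letters that do not decompose, such as the Scandinavian o-slash or the German sharp s, are simply dropped by this sketch, so they would need an explicit replacement table.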

--
Terry Jan Reedy


--
https://mail.python.org/mailman/listinfo/python-list
