On 6/28/2019 4:25 PM, Tobiah wrote:
A guy comes in and enters his last name as RÖnngren.
So what did the browser really give me; is it encoded
in some way, like latin-1? Does it depend on whether
the name was cut and pasted from a Word doc. etc?
Should I handle these internally as unicode? Right
now my database tables are latin-1 and things seem
to usually work, but not always.
Unless you want to restrict your app to people with or converible to
latin-1 (western Europe) names, you should use utf-8 or let the database
encode for you.
Also, what do people do when searching for a record.
Is there some way to get 'Ronngren' to match the other
possible foreign spellings?
I have seen a program that converts all latin-1 chars to ascii for matching.
--
Terry Jan Reedy
--
https://mail.python.org/mailman/listinfo/python-list