[Serge] > def search_key(s): > de_str = unicodedata.normalize("NFD", s) > return ''.join(cp for cp in de_str if not > unicodedata.category(cp).startswith('M'))
Lovely bit of code - thanks for posting it! You might want to use "NFKD" to normalize things like LATIN SMALL LIGATURE FI and subscript/superscript characters as well as diacritics. -- Richie Hindle [EMAIL PROTECTED] -- http://mail.python.org/mailman/listinfo/python-list