On Wed, Jul 19, 2017 at 3:01 AM, Marko Rauhamaa <ma...@pacujo.net> wrote:
> Chris Angelico <ros...@gmail.com>:
>
>> what you're more likely to want is "match the letter á", and you don't
>> care whether it's represented as U+0061 U+0301 or as U+00E1. That's
>> where Unicode normalization comes in.
>
> Yes. Also, not every letter can be normalized to a single codepoint so
> NFC is not a way out. For example,
>
>    re.match("^[q̈]$", "q̈")
>
> returns None regardless of normalization.
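
For anyone following along, the quoted behaviour can be reproduced with a short sketch like the one below. It assumes CPython 3 and the standard `re` and `unicodedata` modules; the helper name `matches_as_class` is made up for illustration and is not from the original post.

```python
import re
import unicodedata

# "q" followed by U+0308 COMBINING DIAERESIS. Unicode has no precomposed
# code point for this letter, so it stays two code points in every form.
Q_DIAERESIS = "q\u0308"

# "a" followed by U+0301 COMBINING ACUTE ACCENT. NFC composes it to U+00E1.
A_ACUTE = "a\u0301"

FORMS = ("NFC", "NFD", "NFKC", "NFKD")


def matches_as_class(text, form):
    """Hypothetical helper: normalize text, then test it against ^[text]$."""
    normalized = unicodedata.normalize(form, text)
    # A character class matches exactly one code point, so this can only
    # succeed when the normalized text is a single code point long.
    pattern = "^[" + normalized + "]$"
    return re.match(pattern, normalized) is not None


for form in FORMS:
    print(form, "q+diaeresis:", matches_as_class(Q_DIAERESIS, form),
          "a+acute:", matches_as_class(A_ACUTE, form))
```

Under these assumptions, the q-with-diaeresis case fails in all four normalization forms, while the á case succeeds only in the composed forms (NFC and NFKC), where it collapses to the single code point U+00E1.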

In what language or context would you actually want to do this?

ChrisA
-- 
https://mail.python.org/mailman/listinfo/python-list