On Wed, Jul 19, 2017 at 3:01 AM, Marko Rauhamaa <ma...@pacujo.net> wrote:
> Chris Angelico <ros...@gmail.com>:
>
>> what you're more likely to want is "match the letter á", and you don't
>> care whether it's represented as U+0061 U+0301 or as U+00E1. That's
>> where Unicode normalization comes in.
>
> Yes. Also, not every letter can be normalized to a single codepoint so
> NFC is not a way out. For example,
>
>     re.match("^[q̈]$", "q̈")
>
> returns None regardless of normalization.

In what language or context would you actually want to do this?

ChrisA
-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to