Re: u32_normalize UNINORM_NFKC on 0xD800

Simon Josefsson Fri, 27 May 2011 11:13:39 -0700

Bruno Haible <br...@clisp.org> writes:

> Simon Josefsson wrote:
>> I'm calculating this IDNA2008 property
>> 
>>    toNFKC(toCaseFold(toNFKC(cp))) != cp
>> 
>> for all code points.
>
> It makes no sense to consider non-character code points here. Citing again
> the Unicode standard, chapter 3 [1], section 3.8:
>
>   "High-surrogate and low-surrogate code units are used only in the context
>    of the UTF-16 character encoding form."


It seems Mark Davis believes toNFKC should be defined for all code points:

http://www.alvestrand.no/pipermail/idna-update/2011-May/007106.html

The issue turned out to be irrelevant for me, so I don't care strongly
either way.

/Simon

Re: u32_normalize UNINORM_NFKC on 0xD800

Reply via email to