Bruno Haible <br...@clisp.org> writes: > Simon Josefsson wrote: >> I'm calculating this IDNA2008 property >> >> toNFKC(toCaseFold(toNFKC(cp))) != cp >> >> for all code points. > > It makes no sense to consider non-character code points here. Citing again > the Unicode standard, chapter 3 [1], section 3.8: > > "High-surrogate and low-surrogate code units are used only in the context > of the UTF-16 character encoding form."
It seems Mark Davis believes toNFKC should be defined for all code points: http://www.alvestrand.no/pipermail/idna-update/2011-May/007106.html The issue turned out to be irrelevant for me, so I don't care strongly either way. /Simon