On Sun, Mar 5, 2023 at 7:30 AM Jeff Davis <pg...@j-davis.com> wrote:
> Sorting by codepoint should be encoding-independent (i.e. decode to
> codepoint first); but the C collation is just strcmp, which is
> encoding-dependent. So is UCS_BASIC wrong today?

It's created for UTF-8 only, and UTF-8 byte sequences sort in the same order as the code points they encode when compared as sequences of unsigned char by memcmp(), strcmp() etc. Seems right?
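
For anyone who wants to check that property, here is a rough sketch in Python rather than C. It relies on Python bytes objects comparing as unsigned byte sequences (much like memcmp(), with a shorter prefix sorting first) and on Python str comparing by code point. The helper name and the sample strings are made up for illustration; nothing here is taken from the PostgreSQL source.

```python
def utf8_order_matches_codepoint_order(a: str, b: str) -> bool:
    """Return True if comparing the UTF-8 bytes of a and b gives the
    same result (<, ==, >) as comparing their code points."""
    ba, bb = a.encode("utf-8"), b.encode("utf-8")
    # bytes compare lexicographically as unsigned values, like memcmp().
    byte_cmp = (ba > bb) - (ba < bb)
    # Python str compares code point by code point.
    cp_cmp = (a > b) - (a < b)
    return byte_cmp == cp_cmp


# Hypothetical samples covering 1-, 2-, 3- and 4-byte UTF-8 sequences.
samples = ["a", "z", "ab", "a\u00e9", "\u00e9", "\u07ff", "\u0800",
           "\uff5e", "\U00010000", "\U0010ffff"]
assert all(utf8_order_matches_codepoint_order(x, y)
           for x in samples for y in samples)

# Contrast: the same is not true for every encoding. In UTF-16 a
# surrogate pair (lead byte 0xD8..0xDB) sorts below U+FF5E (0xFF 0x5E).
lo, hi = "\uff5e", "\U00010000"
assert lo < hi                                          # code point order
assert lo.encode("utf-8") < hi.encode("utf-8")          # preserved
assert lo.encode("utf-16-be") > hi.encode("utf-16-be")  # reversed
```

The last three assertions are the encoding-dependence from the quoted question: a bytewise comparison agrees with code point order under UTF-8 but not under UTF-16, which is presumably why it matters that UCS_BASIC exists for UTF-8 only.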