Re: Add standard collation UNICODE

Thomas Munro Sat, 04 Mar 2023 11:28:04 -0800

On Sun, Mar 5, 2023 at 7:30 AM Jeff Davis <[email protected]> wrote:
> Sorting by codepoint should be encoding-independent (i.e. decode to
> codepoint first); but the C collation is just strcmp, which is
> encoding-dependent. So is UCS_BASIC wrong today?


It's created for UTF-8 only, and UTF-8 sorts the same way as the
encoded code points, when interpreted as a sequence of unsigned char
by memcmp(), strcmp() etc.  Seems right?

Re: Add standard collation UNICODE

Reply via email to