Greg Stark <st...@enterprisedb.com> wrote:
> Peter Eisentraut <pete...@gmx.net> wrote:

>> SELECT U&'\00E9', char_length(U&'\00E9');
>>  ?column? | char_length
>> ----------+-------------
>>  é        |           1
>> (1 row)
>>
>> SELECT U&'\0065\0301', char_length(U&'\0065\0301');
>>  ?column? | char_length
>> ----------+-------------
>>  é        |           2
>> (1 row)
>
> What's really at issue is "what is a string?". That is, is it a
> sequence of characters or a sequence of code points?

Doesn't the SQL standard refer to them as "character string
literals"? The function is called character_length or char_length.

I'm curious -- can every multi-code-point character be normalized to a
single-code-point character?

-Kevin
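
On the closing question, a minimal sketch in Python rather than SQL, using the standard library's unicodedata.normalize. It assumes NFC (canonical composition) is the normalization meant, and it picks q + U+0301 purely as an illustrative sequence that appears to have no precomposed code point. The helper name nfc_code_points is hypothetical.

```python
import unicodedata


def nfc_code_points(s: str) -> int:
    """Count code points after NFC normalization (hypothetical helper)."""
    return len(unicodedata.normalize("NFC", s))


# 'e' + COMBINING ACUTE ACCENT, the second case from the quoted example.
e_acute_decomposed = "\u0065\u0301"

# NFC composes it into the single code point U+00E9.
e_acute_composed = unicodedata.normalize("NFC", e_acute_decomposed)

# 'q' + COMBINING ACUTE ACCENT: assumed to have no precomposed form,
# so NFC leaves it as two code points.
q_acute = "q\u0301"

print(len(e_acute_decomposed), nfc_code_points(e_acute_decomposed))
print(len(q_acute), nfc_code_points(q_acute))
```

If that assumption holds, the answer would be no: normalization shortens some sequences but cannot guarantee one code point per displayed character.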
--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers