Re: Add CASEFOLD() function.

Joe Conway Mon, 16 Dec 2024 13:27:28 -0800

On 12/16/24 12:49, Jeff Davis wrote:

One question I have is whether we want this function to normalize the
output.


I believe most usecases would want the output normalized, because
normalization differences (e.g. "a" U+0061 followed by "combining
acute" U+0301 vs "a with acute" U+00E1) are more minor than differences
in case.

Of course, a user could wrap it with the normalize() function, but
that's verbose and easy to forget. I'm also not sure that it can be
made as fast as a combined function that does both.

Perhaps a one arg version that always casefolds and a two arg versionthat accepts nfc, nfd, none (or something similar)?

And a follow-up question: if it does normalize, the second parameter
would be the requested normal form. But to accept the keyword forms
(NFC, NFD in gram.y) rather than the string forms ('NFC', 'NFD') then
we'd need to also need to add CASEFOLD to gram.y (like NORMALIZE). Is
that a reasonable thing to do?

SQL 2023 seems to include the NORMALIZE syntax, but the only casefolding considered is UPPER and LOWER. As such, I think it ought to be afunction but not part of the grammar.


--
Joe Conway
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com

Re: Add CASEFOLD() function.

Reply via email to