Re: [BUGS] \w doesn't match non-ASCII letters

2004-06-14 Thread Tom Lane
Markus Bertheau <[EMAIL PROTECTED]> writes: > Is there something planned to support UTF-8 in regexps? It'd be relatively easy to use the functions here if we were convinced that pg_mb2wchar() generated exactly the same wide-character encoding as the C library is expecting for the current LC_CTYPE

Re: [BUGS] \w doesn't match non-ASCII letters

2004-06-14 Thread Markus Bertheau
Ð ÐÐÐ, 14.06.2004, Ð 17:25, Tom Lane ÐÐÑÐÑ: > Peter Eisentraut <[EMAIL PROTECTED]> writes: > > Markus Bertheau wrote: > >> oocms=# select 'Ñ' ~ '^\\w$'; > >> ?column? > >> -- > >> f > >> (1 ÑÑ) > > > What locale are you using for LC_COLLATE? If it's C or POSIX, you need > > to change

Re: [BUGS] \w doesn't match non-ASCII letters

2004-06-14 Thread Tom Lane
Peter Eisentraut <[EMAIL PROTECTED]> writes: > Markus Bertheau wrote: >> oocms=# select 'ф' ~ '^\\w$'; >> ?column? >> -- >> f >> (1 запись) > What locale are you using for LC_COLLATE? If it's C or POSIX, you need > to change it and re-initdb. Another likely cause of trouble is that the

Re: [BUGS] \w doesn't match non-ASCII letters

2004-06-14 Thread Peter Eisentraut
Markus Bertheau wrote: > oocms=# select 'Ñ' ~ '^\\w$'; > ?column? > -- > f > (1 ÑÑ) What locale are you using for LC_COLLATE? If it's C or POSIX, you need to change it and re-initdb. ---(end of broadcast)--- TIP 5: Have you checked

[BUGS] \w doesn't match non-ASCII letters

2004-06-14 Thread Markus Bertheau
oocms=# select 'Ñ' ~ '^\\w$'; ?column? -- f (1 ÑÑ) or oocms=# select 'Ã' ~ '^\\w$'; ?column? -- f (1 ÑÑ) both should return true, as does oocms=# select 'n' ~ '^\\w$'; ?column? -- t (1 ÑÑ) Thanks. -- Markus Bertheau <[EMAIL PROTECTED]> ---