Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-18 Thread Martin Flahault
Very interesting discussion indeed. It seems that "PostgreSQL: The world's most advanced open source database" cannot work properly on "Mac OS X: the world's most advanced operating system" and FreeBSD. Don't you think postgresql.org should remove from their download page the links to FreeBSD

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-17 Thread Martijn van Oosterhout
On Sat, Jan 16, 2010 at 09:10:53PM +, Greg Stark wrote: > Switching to ICU means trading our current inconsistency from platform > to platform for a different inconsistency which would be better in > some cases and worse in others. Or, you can have the cake and eat it too. That is, aim for the

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-16 Thread Greg Stark
On Thu, Jan 14, 2010 at 6:15 AM, Craig Ringer wrote: >>> Out of interest: Why not? >> >> There's plenty of discussion in the archives about it, but basically >> ICU would represent a pretty enormous dependency and would lock us in >> to having no other backend encoding but UTF8. > > Thanks. You're

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Craig Ringer
Scott Marlowe wrote: > On Wed, Jan 13, 2010 at 11:15 PM, Craig Ringer > wrote: >> Perhaps someone who wants to use Mac OS X and Pg for their product will >> come forward with some compat wrapper functions for the localizable >> libc/posix functions, so Pg can just be built against the wrapper and

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Tom Lane
Craig Ringer writes: > Tom Lane wrote: >> The state of OS X's POSIX-spec locale support is pretty pitiful, but on >> the whole I'd say if you need better UTF8 locale support you could use >> another OS. > Alas, people will want to run Pg on it anyway, especially when bundling > with an app. It'd

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Scott Marlowe
On Wed, Jan 13, 2010 at 11:15 PM, Craig Ringer wrote: > Perhaps someone who wants to use Mac OS X and Pg for their product will > come forward with some compat wrapper functions for the localizable > libc/posix functions, so Pg can just be built against the wrapper and > the rest of us need not ca

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Craig Ringer
Tom Lane wrote: > Craig Ringer writes: >> Martijn van Oosterhout wrote: >>> Yes, that's the basic idea. Mac OS X apparently provides ICU underneath >>> for programs that would like true unicode collation, but there is >>> little chance that postgresql will ever use this. > >> Out of interest: Why

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Tom Lane
Craig Ringer writes: > Martijn van Oosterhout wrote: >> Yes, that's the basic idea. Mac OS X apparently provides ICU underneath >> for programs that would like true unicode collation, but there is >> little chance that postgresql will ever use this. > Out of interest: Why not? There's plenty of

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Craig Ringer
Martijn van Oosterhout wrote: >> in a UTF8 text file and use the "sort" command on it, you will have the same >> wrong output as with PostgreSQL : > > Yes, that's the basic idea. Mac OS X apparently provides ICU underneath > for programs that would like true unicode collation, but there is > lit

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Craig Ringer
On 13/01/2010 11:15 PM, Martin Flahault wrote: It seems there is a problem with the collating order on BSD systems with diacritics using UTF8. If you put this text : a A à é e E in a UTF8 text file and use the "sort" command on it, you will have the same wrong output as with PostgreSQL : A E a

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Martijn van Oosterhout
On Wed, Jan 13, 2010 at 04:15:06PM +0100, Martin Flahault wrote: [postgres] > newbase=# select * from t1 order by contenu; > contenu > - > A > E > a > e Postgresql outputs whatever the C library does on the underlying system. The quality of this varies wildly. > à > As with others
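
To illustrate the point above: the order reported in this thread (A, E, a, e, à) is what a plain code-point comparison produces, which is consistent with a C library whose UTF-8 locale does no real collation. The sketch below, in Python, reproduces that order and contrasts it with a toy accent-aware sort key. The function names are hypothetical, and the key is only an approximation for this six-letter sample; it is not what PostgreSQL, the OS X libc, or ICU actually implements, and it is not the Unicode Collation Algorithm.

```python
import unicodedata

# The six values from the thread's test table (precomposed à and é).
WORDS = ["a", "e", "\u00e0", "\u00e9", "A", "E"]


def codepoint_sorted(items):
    """Sort by raw code point, as a locale with no collation rules would."""
    return sorted(items)


def accent_aware_key(text):
    """Hypothetical three-level key: base letters, then accents, then case.

    Assumption: this only approximates dictionary order for simple
    Latin text; real collation needs a proper locale library.
    """
    decomposed = unicodedata.normalize("NFD", text)
    # Level 1: drop combining marks so 'à' compares equal to 'a'.
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Level 2: the decomposed form puts unaccented before accented.
    # Level 3: swapcase() makes lowercase sort before uppercase.
    return (base.casefold(), decomposed.casefold(), text.swapcase())


def accent_aware_sorted(items):
    """Sort using the toy accent-aware key."""
    return sorted(items, key=accent_aware_key)


if __name__ == "__main__":
    print(codepoint_sorted(WORDS))
    print(accent_aware_sorted(WORDS))
```

The first call groups all uppercase letters first and pushes accented letters to the end; the second keeps each accented letter next to its base letter.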

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Martin Flahault
Here is an example: postgres=# create database newbase; CREATE DATABASE postgres=# \c newbase; psql (8.4.2) You are now connected to database "newbase". newbase=# create table t1 (contenu text); CREATE TABLE newbase=# insert into t1 values ('a'), ('e'), ('à'), ('é'), ('A'), ('E'); INSERT 0 6 ne

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Craig Ringer
On 12/01/2010 7:36 PM, Martin Flahault wrote: Hi, We are a software publisher searching for a new DBMS for our software. We have more than one hundred installed servers, running Mac OS and a Primebase database. We have spent some time evaluating PostgreSQL and we can't get correct outputs with

Re: [GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-13 Thread Maximilian Tyrtania
On 12.01.2010 at 12:36, Martin Flahault wrote: > We have spent some time evaluating PostgreSQL and we can't get correct > outputs with the ORDER BY command. > LC_COLLATE and LC_CTYPE are set to fr_FR.UTF-8. > > It seems there is a known problem with the collating order of text including > diac

[GENERAL] Collate order on Mac OS X, text with diacritics in UTF-8

2010-01-12 Thread Martin Flahault
Hi, We are a software publisher searching for a new DBMS for our software. We have more than one hundred installed servers, running Mac OS and a Primebase database. We have spent some time evaluating PostgreSQL and we can't get correct outputs with the ORDER BY command. LC_COLLATE and LC_CTYPE