On Mon, Dec 20, 2010 at 02:10:39PM -0500, Tom Lane wrote: > David Fetter <da...@fetter.org> writes: > > On Mon, Dec 20, 2010 at 08:01:42PM +0100, Martijn van Oosterhout wrote: > >> I think you mean Unicode is not a superset of all character sets. I've > >> heard this before but never found what's missing. [citation needed]? > > > Windows-1252, ISO-2022-JP-2 and EUC-TW are such encodings. > > [citation needed]? Exactly what characters are missing, and why would > the Unicode people have chosen to leave them out? It's not like they've > not heard of those encodings, I'm sure. > > regards, tom lane >
Here is an interesting description of some of the gotchas: http://en.wikipedia.org/wiki/Windows-1252 Regards, Ken -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers