2010/12/20 Martijn van Oosterhout <klep...@svana.org>:
> On Mon, Dec 20, 2010 at 09:03:56AM +0900, Itagaki Takahiro wrote:
>
>> UTF-8 is not a superset of all encodings.
>
> I think you mean Unicode is not a superset of all character sets. I've
> heard this before but never found what's missing. [citation needed]?

From
<URL:http://en.wikipedia.org/wiki/Japanese_language_and_computers#Character_encodings>:

"Unicode is supposed to solve all encoding problems in all languages of
the world. [..] There are still controversies. For Japanese, the kanji
characters have been unified with Chinese, that is a character
considered to be the same in both Japanese and Chinese have been given
one and the same code number in Unicode, even if they look a little
different. This process, called Han unification, has caused
controversy."

For examples (my browser doesn't show any differences though, probably
because I don't have the corresponding fonts):
<URL:http://en.wikipedia.org/wiki/Han_unification#Examples_of_language_dependent_characters>

Nicolas