Re: UTF-8 locales

2000-11-19 Thread Bernd Eckenfels
On Sat, Nov 18, 2000 at 08:01:11PM -0600, David Starner wrote: > Which includes the Chinese and Japenese, who need the characters found > in the Supplementary Ideographic Planes, which means 4 byte characters. Afaik UTF8 is not able to encode 32bit unicode? I thought this is because the "living" l

Re: UTF-8 locales

2000-11-19 Thread Tom Emerson
Bernd Eckenfels writes: > Afaik UTF8 is not able to encode 32bit unicode? I thought this is because > the "living" languages are all restricted to 16bit? Hmm... i might be wrong. > Does that mean Java does not support asian languages with its 16bit Unicode? UTF-8 can be used encode UCS-4. > As I

Re: UTF-8 locales

2000-11-19 Thread David Starner
On Sun, Nov 19, 2000 at 10:50:54PM +0100, Bernd Eckenfels wrote: > On Sat, Nov 18, 2000 at 08:01:11PM -0600, David Starner wrote: > > Which includes the Chinese and Japenese, who need the characters found > > in the Supplementary Ideographic Planes, which means 4 byte characters. > > Afaik UTF8 is

Re: UTF-8 locales

2000-11-19 Thread Tomohiro KUBOTA
Hi, At Sun, 19 Nov 2000 22:50:54 +0100, Bernd Eckenfels <[EMAIL PROTECTED]> wrote: > Afaik UTF8 is not able to encode 32bit unicode? Strictly speaking, there is no 32bit unicode. UCS-4 character set has 31bit code space, not 32bit. UTF-8 can encode the whole UCS-4. > I thought this is becaus