Re: UTF-8 locales

2000-11-20 Thread Anthony Fok
On Mon, Nov 20, 2000 at 11:15:57AM +0900, Tomohiro KUBOTA wrote: > > I thought this is because > > the "living" languages are all restricted to 16bit? Hmm... i might be wrong. > > Taiwan CNS 11643 character set has about 47000 ideograms. > Recently, Japan came to have a new standard JIS X 0213. T

Re: UTF-8 locales

2000-11-20 Thread Tomohiro KUBOTA
Hi, At Mon, 20 Nov 2000 01:11:02 -0700, Anthony Fok <[EMAIL PROTECTED]> wrote: > To add to that list, China has the new GB18030-2000 standard > (locale zh_CN.GB18030) which also contains many characters beyond Unicode. Interesting. I will have to mention it in my "Introduction to I18N" document

Re: UTF-8 locales

2000-11-20 Thread Roger So
On Mon, Nov 20, 2000 at 07:25:11PM +0900, Tomohiro KUBOTA wrote: > > BTW, I think GB18030 would be a _character set_, not _encoding_. > If so, we won't have zh_CN.GB18030 locale. In fact it is both, AFAICT; GB18030 defines the set of characters, and the way to encode them. Just like GBK. > Exam

Re: UTF-8 locales

2000-11-20 Thread David Starner
On Thu, Nov 16, 2000 at 08:21:26PM +0900, Tomohiro KUBOTA wrote: > I will agree with developers who dare to hard-code UTF-8 instead of > wchar_t, if they abolish the support of 8bit (or 7bit) encoding by the > softwares which they develop. I mean, if they need their (European- > language speakers

Re: UTF-8 locales

2000-11-20 Thread Tomohiro KUBOTA
Hi, At Mon, 20 Nov 2000 11:28:25 -0600, David Starner <[EMAIL PROTECTED]> wrote: > As for the reason I don't use wchat_t, not all the world works in C. > Most other languages have roll-your-own support for multi-byte character > sets or provide Unicode support. This is true that languages other

Re: UTF-8 locales

2000-11-20 Thread Theppitak Karoonboonayanan
On Tue, Nov 21, 2000 at 10:00:41AM +0900, Tomohiro KUBOTA wrote: > For conversion from number of characters to number of columns, you > will need to use wcwidth() or wcswidth(). I'm interested in this. I used to work the th_TH locale for glibc, and I'd like to know how to describe this conversio