> However, I don't think this actually affects your comments, except that
> I'd guess that the half digits mentioned by Hong don't have the same
> term "case" used with them that the letters of various alphabets do.
I am not sure if we mean the same thing. The regular ascii "0123456789"
are called "half-width-digit" in china, because they take about half
of the width of any chinese character to display on the screen or
paper. There are another set of "012..." in chinese encoding that
denotes digits look the same width as chinese characters, full-width.
The full width characters mainly used for formatting. It has nothing
to do width the lowercase/uppercase in roman language. I believe Unicode
has many "font" characters.
> Is this Uppercase?
> Is this Lowercase?
I believe the Unicode already defines character categories, such as
L, Lu, Ll, Lo. I prefer we just use unicode term instead of extending
"ctype.h". The Perl 5 regex already support them.
Hong