On Wed, Jul 17, 2002 at 12:13:47PM -0400, Dan Sugalski wrote:
> I thought Java used UTF-16. It's a variable-width encoding, so it 
> should be fine. (Though I bet a lot of folks will be rather surprised 
> when it happens...)
UTF-16 isn't technically a variable-width encoding, since
surrogate codes are still considered single characters - even
though they only have meaning when combined in pairs.  It's much
the same as multiple combining characters coming together to represent
a single abstract entity that is also not really a "character"; the
chief difference is that surrogates don't mean anything at all on their own.

-- 
Mark REED                    | CNN Internet Technology
1 CNN Center Rm SW0831G      | [EMAIL PROTECTED]
Atlanta, GA 30348      USA   | +1 404 827 4754 
--
There are no rules for March.  March is spring, sort of, usually.  March
means maybe, but don't bet on it.

Reply via email to