On Wed, Jul 17, 2002 at 12:13:47PM -0400, Dan Sugalski wrote: > I thought Java used UTF-16. It's a variable-width encoding, so it > should be fine. (Though I bet a lot of folks will be rather surprised > when it happens...) UTF-16 isn't technically a variable-width encoding, since surrogate codes are still considered single characters - even though they only have meaning when combined in pairs. It's much the same as multiple combining characters coming together to represent a single abstract entity that is also not really a "character"; the chief difference is that surrogates don't mean anything at all on their own.
-- Mark REED | CNN Internet Technology 1 CNN Center Rm SW0831G | [EMAIL PROTECTED] Atlanta, GA 30348 USA | +1 404 827 4754 -- There are no rules for March. March is spring, sort of, usually. March means maybe, but don't bet on it.