At 4:17 PM +0100 7/17/02, Nicholas Clark wrote: >On Wed, Jul 17, 2002 at 12:32:43AM -0400, Mark J. Reed wrote: >> On Tue, Jul 16, 2002 at 05:42:18PM -0400, Michael G Schwern wrote: >> > I don't know how Java and Python handle Unicode. >> Java has always been 100% Unicode from the ground up; it's in the spec. >> The fundamental char type is a 16-bit value, you can use any "letterlike" > >My understanding was that Unicode has now escaped the base plane (or whatever >it's called) and now has started using code points >65536. How does Java >cope with this?
I thought Java used UTF-16. It's a variable-width encoding, so it should be fine. (Though I bet a lot of folks will be rather surprised when it happens...) -- Dan --------------------------------------"it's like this"------------------- Dan Sugalski even samurai [EMAIL PROTECTED] have teddy bears and even teddy bears get drunk