> > They have a UTF-16/UCS-2 internal representation, same as MSEgui which > > works > > very well and is fast and handy BTW. > > And len, slicing, etc. work as expected. > Note that if you need characters beyond $ffff you have to compile it > with wide unicode support, and in that case every character will use 4 > bytes. > That's IMHO a faulty system. It requires you to choose between an incomplete solution or making strings a horrible memory hog. But maybe that doesn't matter for mere scripting languages (though I wonder then why they didn't chose UTF-32 directly)
Surrogates are not nice, but they were invented for a reason. _______________________________________________ fpc-pascal maillist - fpc-pascal@lists.freepascal.org http://lists.freepascal.org/mailman/listinfo/fpc-pascal