> > They have a UTF-16/UCS-2 internal representation, same as MSEgui, which
> > works very well and is fast and handy BTW.
> 
> And len, slicing, etc. work as expected.
> Note that if you need characters beyond $ffff you have to compile it
> with wide unicode support, and in that case every character will use 4
> bytes.
> 
That's IMHO a faulty system. It forces you to choose between an incomplete
solution and making strings a horrible memory hog. But maybe that doesn't
matter for mere scripting languages (though I wonder then why they didn't
choose UTF-32 directly).

Surrogates are not nice, but they were invented for a reason.
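
To make the trade-off concrete, here is a minimal sketch in Python (used
purely for illustration; the function name to_surrogate_pair is my own, not
from any library) of how a character beyond $ffff is split into a UTF-16
surrogate pair, and how the storage compares with UTF-32:

```python
def to_surrogate_pair(code_point):
    """Split a code point above 0xFFFF into a UTF-16 (high, low) surrogate pair."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point does not need (or cannot use) surrogates")
    offset = code_point - 0x10000      # 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits go into the high surrogate
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits go into the low surrogate
    return high, low


# U+1D11E (MUSICAL SYMBOL G CLEF) lies outside the 16-bit range.
high, low = to_surrogate_pair(0x1D11E)
print(hex(high), hex(low))

# Storage cost in bytes: 16-bit code units vs. fixed 4 bytes per character.
for ch in ("A", chr(0x1D11E)):
    print(len(ch.encode("utf-16-le")), len(ch.encode("utf-32-le")))
```

With UTF-16 only the rare characters beyond $ffff pay for 4 bytes; with a
fixed 4-byte representation every character does, including plain ASCII.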

_______________________________________________
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal
