Brent 'Dax' Royal-Gordon <[EMAIL PROTECTED]> wrote: > I really see no reason to store strings as UTF-{8,16,32} and waste CPU > cycles on decoding it when we can do a lossless conversion to a format > that's both more compact (in the most common cases) and faster.
The default format now isn't UTF8. It's a series of fixed sized entries of either uint_8, uint_16, or uint_32. These reflect most common encodings which are: char*, USC-2, and UCS-4/UTF-32 (or possibly other 32-bit encodings). This should cover "common" cases. No cycles are wasted for storing "straight" encodings. leo