On 10/01/2012 01:02 PM, Noel Grandin wrote:
That was something I was thinking about the other day - given than the
bulk of our strings are pure 7-bit ASCII, it might be a worthwhile
optimisation to store a bit that says "this string is 7-bit ASCII", and
then store the string as a sequence of bytes.

cf. <https://wiki.documentfoundation.org/Development/LibreOffice4#General_changes_2>: "replace rtl::OUString with a UTF-8 string for better space efficiency, and Unicode coverage."

The latest Java VM does this trick internally - it pretends that String
is stored with an array of 16-bit values, but actually it stores them as
UTF-8.

Java's modified UTF-8, presumably.  (Me the nitpicker :)

Stephan
_______________________________________________
LibreOffice mailing list
LibreOffice@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/libreoffice

Reply via email to