On 10/01/2012 01:02 PM, Noel Grandin wrote:
That was something I was thinking about the other day - given than the bulk of our strings are pure 7-bit ASCII, it might be a worthwhile optimisation to store a bit that says "this string is 7-bit ASCII", and then store the string as a sequence of bytes.
cf. <https://wiki.documentfoundation.org/Development/LibreOffice4#General_changes_2>: "replace rtl::OUString with a UTF-8 string for better space efficiency, and Unicode coverage."
The latest Java VM does this trick internally - it pretends that String is stored with an array of 16-bit values, but actually it stores them as UTF-8.
Java's modified UTF-8, presumably. (Me the nitpicker :) Stephan _______________________________________________ LibreOffice mailing list LibreOffice@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice