Hi there, On Tue, 2012-05-01 at 16:43 +0100, Caolán McNamara wrote: > The .doc has some paragraphs in it that are around the 64k length mark. > It doesn't *look* like that's the case because they end in vast quantity > of whitespace and writer implements the word-compatibility thing to let > empty spaces disappear off the right side of the page rather than wrap.
Fun :-) > Turns out to be a good test case for our break iterators, couple of > problems, squashed together as attachment Good stuff; > and assume that a space doesn't combine with anything in any exotic way > to form a single grapheme > http://cgit.freedesktop.org/libreoffice/core/commit/?id=ae716b07f7218fadf0143de1946cc9e0e2c08744 The idea of 'space' as a grapheme is somewhat counter-intuitive in the abstract :-) but - anyhow, a great set of fixes ! Can you give an idea of the performance win / speedup ? Thanks, Michael. -- michael.me...@suse.com <><, Pseudo Engineer, itinerant idiot _______________________________________________ LibreOffice mailing list LibreOffice@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice