After spending a number of hours learning more about Unicode than I ever wanted to know, I've fixed the bug where the word counter in Writer counts an opening quote mark (Unicode code point U+201C) as an extra word.
Turns out that the opening quote lives in Unicode block 40, which was not given an associated script type in breakiteratorImpl.cxx. This means that its script type was defaulting to "WEAK" rather than "LATIN", and "WEAK" is taken as a word break.

Thought I should get in fast and submit this before John LeMoyne Castle's heroic efforts fix all the word counter problems :-P

Patch attached. It's a very minimal change, so it's probably safe to push into the 3.3 branch as well as master.

Cheers,
Mattias
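P.S. For anyone curious, here is a rough standalone sketch of the failure mode. It is not the code from breakiteratorImpl.cxx: the names (Script, BlockRange, scriptOf, countWords, tableBeforeFix, tableAfterFix) are made up for illustration, the word-break rule is heavily simplified, and I'm assuming that block 40 corresponds to General Punctuation (U+2000..U+206F), which is where U+201C sits.

```cpp
#include <string>
#include <vector>

// Hypothetical script classes; the real code distinguishes more than two.
enum class Script { Latin, Weak };

// Hypothetical table entry: a Unicode block's code point range and its script.
struct BlockRange { char32_t first; char32_t last; Script script; };

// Illustrative table before the fix: no entry covers U+201C.
std::vector<BlockRange> tableBeforeFix() {
    return {
        {0x0000, 0x007F, Script::Latin},  // Basic Latin
        {0x0080, 0x00FF, Script::Latin},  // Latin-1 Supplement
    };
}

// Illustrative table after the fix: General Punctuation is mapped as well.
std::vector<BlockRange> tableAfterFix() {
    std::vector<BlockRange> table = tableBeforeFix();
    table.push_back({0x2000, 0x206F, Script::Latin});  // assumed to be "block 40"
    return table;
}

// Any code point whose block has no entry falls through to Weak.
Script scriptOf(char32_t c, const std::vector<BlockRange>& table) {
    for (const BlockRange& r : table) {
        if (c >= r.first && c <= r.last) return r.script;
    }
    return Script::Weak;
}

// Simplified counter: a space ends a token, and a change of script inside
// a token also starts a new word.
int countWords(const std::u32string& text, const std::vector<BlockRange>& table) {
    int words = 0;
    bool inToken = false;
    Script prev = Script::Weak;
    for (char32_t c : text) {
        if (c == U' ') {
            inToken = false;
            continue;
        }
        Script s = scriptOf(c, table);
        if (!inToken || s != prev) ++words;
        inToken = true;
        prev = s;
    }
    return words;
}
```

With these tables, a string that starts with U+201C followed by "Hello world" should count as three words before the fix and two after, since the quote no longer changes script relative to the "H" that follows it.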