On 4 November 2010 23:38, Mattias Johnsson <m.t.johns...@gmail.com> wrote:
> After spending a number of hours learning more about unicode than I
> ever wanted to know, I've fixed the bug where the word counter in
> Writer counts an opening quote mark (unicode symbol 0x201C) as an extra
> word.
>
> Turns out that the opening quote lives in unicode block 40, which was
> not given an associated script type in breakiteratorImpl.cxx. This
> means that its
> script type was defaulting to "WEAK" rather than "LATIN", and "WEAK"
> is taken as a word break.
>
> Thought I should get in fast and submit this before John LeMoyne
> Castle's heroic efforts fix all the word counter problems :-P
>
> Patch attached. It's a very minimal change, so it's probably safe to
> push into the 3.3 branch as well as master.
>
> Cheers,
> Mattias

And this time with the patch actually attached...
From 9f694f9585c98f486abfd8b5d126ece128a58999 Mon Sep 17 00:00:00 2001
From: Mattias Johnsson <m.t.johns...@gmail.com>
Date: Thu, 4 Nov 2010 23:25:02 +1100
Subject: [PATCH] An opening quote should not be counted as a word by word count tool

---
 .../source/breakiterator/breakiteratorImpl.cxx     |    1 +
 1 files changed, 1 insertions(+), 0 deletions(-)

diff --git a/i18npool/source/breakiterator/breakiteratorImpl.cxx b/i18npool/source/breakiterator/breakiteratorImpl.cxx
index ec21299..c47f7be 100644
--- a/i18npool/source/breakiterator/breakiteratorImpl.cxx
+++ b/i18npool/source/breakiterator/breakiteratorImpl.cxx
@@ -459,6 +459,7 @@ static UBlock2Script scriptList[] = {
     {UBLOCK_CHEROKEE, UBLOCK_RUNIC, ScriptType::LATIN},
     {UBLOCK_KHMER, UBLOCK_MONGOLIAN, ScriptType::COMPLEX},
     {UBLOCK_LATIN_EXTENDED_ADDITIONAL, UBLOCK_GREEK_EXTENDED, ScriptType::LATIN},
+    {UBLOCK_GENERAL_PUNCTUATION, UBLOCK_GENERAL_PUNCTUATION, ScriptType::LATIN},
     {UBLOCK_CJK_RADICALS_SUPPLEMENT, UBLOCK_HANGUL_SYLLABLES, ScriptType::ASIAN},
     {UBLOCK_CJK_COMPATIBILITY_IDEOGRAPHS, UBLOCK_CJK_COMPATIBILITY_IDEOGRAPHS, ScriptType::ASIAN},
     {UBLOCK_ARABIC_PRESENTATION_FORMS_A, UBLOCK_ARABIC_PRESENTATION_FORMS_A, ScriptType::COMPLEX},
-- 
1.7.1

_______________________________________________
LibreOffice mailing list
LibreOffice@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/libreoffice

Reply via email to