[Libreoffice] [PATCH] Fix bug where an opening quote mark is treated as a word

Mattias Johnsson m.t.johnsson at gmail.com
Thu Nov 4 05:38:51 PDT 2010


After spending a number of hours learning more about unicode than I
ever wanted to know, I've fixed the bug where the word counter in
Writer counts an opening quote mark (unicode symbol 0x201C) as an extra
word.

Turns out that the opening quote lives in unicode block 40, which was
not given an associated script type in breakiteratorImpl.cxx. This
means that its
script type was defaulting to "WEAK" rather than "LATIN", and "WEAK"
is taken as a word break.

Thought I should get in fast and submit this before John LeMoyne
Castle's heroic efforts fix all the word counter problems :-P

Patch attached. It's a very minimal change, so it's probably safe to
push into the 3.3 branch as well as master.

Cheers,
Mattias


More information about the LibreOffice mailing list