<html>
<head>
<base href="https://bugs.documentfoundation.org/">
</head>
<body><span class="vcard"><a class="email" href="mailto:ming.v.hua@qq.com" title="Ming Hua <ming.v.hua@qq.com>"> <span class="fn">Ming Hua</span></a>
</span> changed
<a class="bz_bug_link
bz_status_NEW "
title="NEW - Word Count problem with symbols in Chinese mixed with English text"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=114760">bug 114760</a>
<br>
<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>What</th>
<th>Removed</th>
<th>Added</th>
</tr>
<tr>
<td style="text-align:right;">CC</td>
<td>
</td>
<td>ming.v.hua@qq.com
</td>
</tr></table>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW - Word Count problem with symbols in Chinese mixed with English text"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=114760#c6">Comment # 6</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW - Word Count problem with symbols in Chinese mixed with English text"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=114760">bug 114760</a>
from <span class="vcard"><a class="email" href="mailto:ming.v.hua@qq.com" title="Ming Hua <ming.v.hua@qq.com>"> <span class="fn">Ming Hua</span></a>
</span></b>
<pre>In my opinion, there are multiple issues here, some illustrated by the example
from the bug submitter, some not. Maybe I should file separate bugs.
1. Exclude Chinese punctuations and symbols from the "Words" count. Or
alternatively, exclude all Chinese characters and symbols from the "Words"
count, as "words" (词/詞) is a rather vague concept in Chinese anyway, and
counting each Chinese character as a word would never be correct.
2. Recognize full-width space (U+3000) in the "Characters excluding spaces"
count;
3. Provide Asian character count excluding punctuations and symbols, as that
number is sometimes preferred.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>