<html>
<head>
<base href="https://bugs.documentfoundation.org/">
</head>
<body><span class="vcard"><a class="email" href="mailto:taylorleebiz@gmail.com" title="Taylor <taylorleebiz@gmail.com>"> <span class="fn">Taylor</span></a>
</span> changed
<a class="bz_bug_link
bz_status_ASSIGNED "
title="ASSIGNED - CLEAN is neither ODF nor OOXML conform"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=97706">bug 97706</a>
<br>
<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>What</th>
<th>Removed</th>
<th>Added</th>
</tr>
<tr>
<td style="text-align:right;">Status</td>
<td>NEW
</td>
<td>ASSIGNED
</td>
</tr>
<tr>
<td style="text-align:right;">CC</td>
<td>
</td>
<td>taylorleebiz@gmail.com
</td>
</tr>
<tr>
<td style="text-align:right;">Assignee</td>
<td>libreoffice-bugs@lists.freedesktop.org
</td>
<td>taylorleebiz@gmail.com
</td>
</tr></table>
<p>
<div>
<b><a class="bz_bug_link
bz_status_ASSIGNED "
title="ASSIGNED - CLEAN is neither ODF nor OOXML conform"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=97706#c3">Comment # 3</a>
on <a class="bz_bug_link
bz_status_ASSIGNED "
title="ASSIGNED - CLEAN is neither ODF nor OOXML conform"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=97706">bug 97706</a>
from <span class="vcard"><a class="email" href="mailto:taylorleebiz@gmail.com" title="Taylor <taylorleebiz@gmail.com>"> <span class="fn">Taylor</span></a>
</span></b>
<pre>Using <a href="http://www.unicode.org/Public/UNIDATA/UnicodeData.txt">http://www.unicode.org/Public/UNIDATA/UnicodeData.txt</a> as reference, we
can grep for ';(Cc|Cn);'.
The matching entries (all of category Cc, since unassigned Cn code points have
no entries in that file) are:
0x00 to 0x1F (inclusive)
0x7F to 0x9F (inclusive)
Using the above info, we can update the CLEAN isPrintable() function.
I also added CLEAN.OOXML using a new isPrintable_OOXML() function that returns
true only if c > 0x1f.
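As a rough illustration only (Python, not the actual C++ patch): the names
is_printable, is_printable_ooxml and clean below are hypothetical stand-ins,
and the bodies assume nothing beyond the ranges and the threshold given above.

```python
def is_printable(c: int) -> bool:
    """Assumed ODF-style check: reject 0x00-0x1F and 0x7F-0x9F (inclusive)."""
    return c not in range(0x00, 0x20) and c not in range(0x7F, 0xA0)


def is_printable_ooxml(c: int) -> bool:
    """Assumed OOXML-style check: keep only code points above 0x1F."""
    return c > 0x1F


def clean(text: str, printable=is_printable) -> str:
    """Drop every character whose code point fails the given predicate."""
    return "".join(ch for ch in text if printable(ord(ch)))
```

With these definitions, clean("a\x07b\x7fc") drops both control characters,
while clean("a\x07b\x7fc", is_printable_ooxml) drops only \x07 and keeps \x7f.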
To verify, I used the following fods:
core/sc/qa/unit/data/functions/text/fods/clean.fods
I found that the CHAR function is not appropriate to use with CLEAN: CHAR
returns 0xfffd for the range 0x80 to 0xFF inclusive, and Err:502 for 0x100 and
above. After changing CHAR to UNICHAR, the range 0x80 to 0x9F can be tested
successfully.
I'll attempt to use gerrit to start the patch process.</pre>
</div>
</p>
</body>
</html>