<html>
    <head>
      <base href="https://bugs.documentfoundation.org/">
    </head>
    <body><span class="vcard"><a class="email" href="mailto:taylorleebiz@gmail.com" title="Taylor <taylorleebiz@gmail.com>"> <span class="fn">Taylor</span></a>
</span> changed
          <a class="bz_bug_link 
          bz_status_ASSIGNED "
   title="ASSIGNED - CLEAN is neither ODF nor OOXML conform"
   href="https://bugs.documentfoundation.org/show_bug.cgi?id=97706">bug 97706</a>
          <br>
             <table border="1" cellspacing="0" cellpadding="8">
          <tr>
            <th>What</th>
            <th>Removed</th>
            <th>Added</th>
          </tr>

         <tr>
           <td style="text-align:right;">Status</td>
           <td>NEW
           </td>
           <td>ASSIGNED
           </td>
         </tr>

         <tr>
           <td style="text-align:right;">CC</td>
           <td>
                
           </td>
           <td>taylorleebiz@gmail.com
           </td>
         </tr>

         <tr>
           <td style="text-align:right;">Assignee</td>
           <td>libreoffice-bugs@lists.freedesktop.org
           </td>
           <td>taylorleebiz@gmail.com
           </td>
         </tr></table>
      <p>
        <div>
            <b><a class="bz_bug_link 
          bz_status_ASSIGNED "
   title="ASSIGNED - CLEAN is neither ODF nor OOXML conform"
   href="https://bugs.documentfoundation.org/show_bug.cgi?id=97706#c3">Comment # 3</a>
              on <a class="bz_bug_link 
          bz_status_ASSIGNED "
   title="ASSIGNED - CLEAN is neither ODF nor OOXML conform"
   href="https://bugs.documentfoundation.org/show_bug.cgi?id=97706">bug 97706</a>
              from <span class="vcard"><a class="email" href="mailto:taylorleebiz@gmail.com" title="Taylor <taylorleebiz@gmail.com>"> <span class="fn">Taylor</span></a>
</span></b>
        <pre>Using <a href="http://www.unicode.org/Public/UNIDATA/UnicodeData.txt">http://www.unicode.org/Public/UNIDATA/UnicodeData.txt</a> as reference, we
can grep for ';(Cc|Cn);'.

The matches are all Cc (Cn code points are unassigned, so they have no entries
in the file):
0x00 to 0x1F (inclusive)
0x7F to 0x9F (inclusive)
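
The same filter can be sketched in Python instead of grep. This is only an
illustration: the function name is hypothetical, and it assumes input lines in
the semicolon-separated UnicodeData.txt layout (field 0 is the hex code point,
field 2 is the General_Category).

```python
def control_code_points(lines):
    """Return code points whose General_Category field is Cc or Cn."""
    found = []
    for line in lines:
        fields = line.strip().split(";")
        # Field 0 is the hex code point, field 2 the General_Category.
        if len(fields) > 2 and fields[2] in ("Cc", "Cn"):
            found.append(int(fields[0], 16))
    return found
```

With a local copy of the file, this could be called as
control_code_points(open("UnicodeData.txt", encoding="utf-8")).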

Using the above info, we can update the CLEAN isPrintable() function.

I also added CLEAN.OOXML using a new isPrintable_OOXML() function that returns
true only if c > 0x1f.
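
As a rough sketch of the two checks (in Python, not the actual LibreOffice C++
code; all names below are hypothetical), assuming the ranges listed above:

```python
# Hypothetical Python restatement of the two checks described in this comment.

def is_printable(c: int) -> bool:
    """ODF-style check: reject Cc code points 0x00-0x1F and 0x7F-0x9F."""
    return not (c in range(0x00, 0x20) or c in range(0x7F, 0xA0))


def is_printable_ooxml(c: int) -> bool:
    """OOXML-style check: accept only code points above 0x1F."""
    return c > 0x1F


def clean(text: str, printable=is_printable) -> str:
    """Drop every character whose code point fails the given check."""
    return "".join(ch for ch in text if printable(ord(ch)))
```

Under these assumptions, clean("a\x7fb") drops the 0x7F character, while
clean("a\x7fb", is_printable_ooxml) keeps it.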

Verifying with the following fods file, I found that the CHAR function is not
appropriate for testing CLEAN: CHAR returns 0xFFFD for arguments in the range
0x80 to 0xFF (inclusive) and Err:502 for 0x100 and above.

core/sc/qa/unit/data/functions/text/fods/clean.fods

After changing CHAR to UNICHAR, the range 0x80 to 0x9F can be tested successfully.

I'll attempt to use Gerrit to start the patch process.</pre>
        </div>
      </p>


    </body>
</html>