<html>
    <head>
      <base href="https://bugs.freedesktop.org/" />
    </head>
    <body><span class="vcard"><a class="email" href="mailto:carlosgc@gnome.org" title="Carlos Garcia Campos <carlosgc@gnome.org>"> <span class="fn">Carlos Garcia Campos</span></a>
</span> changed
              <a class="bz_bug_link 
          bz_status_RESOLVED  bz_closed"
   title="RESOLVED WONTFIX - [patch] Added poppler_page_get_raw_text function"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=33270">bug 33270</a>
        <br>
             <table border="1" cellspacing="0" cellpadding="8">
          <tr>
            <th>What</th>
            <th>Removed</th>
            <th>Added</th>
          </tr>

         <tr>
           <td style="text-align:right;">Status</td>
           <td>NEW
           </td>
           <td>RESOLVED
           </td>
         </tr>

         <tr>
           <td style="text-align:right;">Resolution</td>
           <td>---
           </td>
           <td>WONTFIX
           </td>
         </tr></table>
      <p>
        <div>
            <b><a class="bz_bug_link 
          bz_status_RESOLVED  bz_closed"
   title="RESOLVED WONTFIX - [patch] Added poppler_page_get_raw_text function"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=33270#c2">Comment # 2</a>
              on <a class="bz_bug_link 
          bz_status_RESOLVED  bz_closed"
   title="RESOLVED WONTFIX - [patch] Added poppler_page_get_raw_text function"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=33270">bug 33270</a>
              from <span class="vcard"><a class="email" href="mailto:carlosgc@gnome.org" title="Carlos Garcia Campos <carlosgc@gnome.org>"> <span class="fn">Carlos Garcia Campos</span></a>
</span></b>
        <pre>The patch doesn't return the raw text, it builds the text using the wordlist
returned by TextPage, so the order will depend on how the text output dev is
created. This looks like a workaround for the mismatch between the number of
characters returned by poppler_page_get_text and the number of areas returned
by poppler_page_get_text_layout that happens in many documents. I've just fixed
that problem in current git master, by using the same method in both get_text
and get_text_layout/get_text_attributes. So, I'm going to close this, feel free
to reopne it if you still think we really need a function to get the "raw"
text.</pre>
        </div>
      </p>
      <hr>
      <span>You are receiving this mail because:</span>
      
      <ul>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>