[Poppler-bugs] [Bug 33270] [patch] Added poppler_page_get_raw_text function

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Jun 25 04:13:58 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=33270

Carlos Garcia Campos <carlosgc at gnome.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |WONTFIX

--- Comment #2 from Carlos Garcia Campos <carlosgc at gnome.org> ---
The patch doesn't return the raw text, it builds the text using the wordlist
returned by TextPage, so the order will depend on how the text output dev is
created. This looks like a workaround for the mismatch between the number of
characters returned by poppler_page_get_text and the number of areas returned
by poppler_page_get_text_layout that happens in many documents. I've just fixed
that problem in current git master, by using the same method in both get_text
and get_text_layout/get_text_attributes. So, I'm going to close this, feel free
to reopne it if you still think we really need a function to get the "raw"
text.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20130625/993e3fab/attachment-0001.html>


More information about the Poppler-bugs mailing list