[Poppler-bugs] [Bug 106406] pdftotext cannot extract text correctly from specific pdf

bugzilla-daemon at freedesktop.org
Sun May 6 15:14:25 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=106406

Albert Astals Cid <aacid at kde.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|REOPENED                    |NEEDINFO

--- Comment #7 from Albert Astals Cid <aacid at kde.org> ---
Sorry, I don't really have time to explain to you how PDF files work.

I've not been able to find any other PDF viewer (not even the one from the
people who invented PDF itself) that can extract the text from that
file, so I'd say the file itself is broken.

If you can extract the text with some PDF tool or, even better, provide a
patch, I'll be happy to review it.
