[Poppler-bugs] [Bug 97234] Output text/html is unreadable for some PDFs

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Aug 7 22:41:25 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=97234

--- Comment #4 from Albert Astals Cid <aacid at kde.org> ---
PDF is a display format. The fact that you can "see" text doesn't mean you can
"extract" it: PDF creators need to do things correctly for text extraction to
work, and lots of them are broken as hell.

To convince me this is a bug on our side, you'll have to show me a PDF tool
that can extract text from the files (without using OCR, of course).
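
One possible way to make that comparison concrete is sketched below. It is an illustration only, not part of poppler: it runs poppler's pdftotext command on a file and scores how readable the result looks. The helper names (readable_ratio, extract_with_pdftotext) and the scoring heuristic are assumptions made up for this example. The same score could be computed on the output of any other non-OCR extractor to compare the two. Note that a high score does not prove the text is correct, since a broken font mapping can also produce valid but wrong letters.

```python
import shutil
import subprocess
import unicodedata


def readable_ratio(text: str) -> float:
    """Fraction of non-whitespace characters that are letters, digits or
    punctuation. Hypothetical heuristic: private-use, control and
    replacement characters (typical of broken extraction) lower the score."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    ok = sum(1 for c in chars if unicodedata.category(c)[0] in ("L", "N", "P"))
    return ok / len(chars)


def extract_with_pdftotext(path: str) -> str:
    """Run poppler's pdftotext and return the extracted text.
    Passing "-" as the output file makes pdftotext write to stdout."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found on PATH")
    result = subprocess.run(
        ["pdftotext", path, "-"],
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8", errors="replace")


if __name__ == "__main__":
    import sys

    # Usage: python check_extract.py some.pdf
    if len(sys.argv) == 2:
        text = extract_with_pdftotext(sys.argv[1])
        print(f"readable ratio: {readable_ratio(text):.2f}")
```

If another tool scores clearly higher on the same file without OCR, that would be the kind of evidence asked for above.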
