[Poppler-bugs] [Bug 97234] Output text/html is unreadable for some PDFs

Sun Aug 7 22:41:25 UTC 2016

https://bugs.freedesktop.org/show_bug.cgi?id=97234

--- Comment #4 from Albert Astals Cid <aacid at kde.org> ---
PDF is a display format, the fact that you can "see" text doesn't mean you can
"extract" the text, pdf creators need to do stuff correctly for text extraction
to work, lots of them are broken as hell.

To convince me this is a bug in our side you'll have to how me a PDF tool that
can extract text from the files (that is not using OCR of course).

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20160807/499f77f7/attachment.html>