[Poppler-bugs] [Bug 97276] Can't extract text/html from PDF

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Aug 16 21:43:54 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=97276

--- Comment #5 from clark at electrobeat.dk ---
Is it possible to detect whether a PDF is broken and returns unreadable
text like this?

I just need to check PDF files and mark broken files where the extracted text is
garbage:

+HDGDXGLR$S6
FR-HVSHU$JHQWRIW
)\UUHEDNNHQ- JHUVSULV
’HQPDUN

.XQGHQU. 
0RPVQU. 
5HNYLVLWLRQVQU. 
’HUHVUHI. 
2UGUHQU. 
:HEEHVWLOOLQJVQU.
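
One rough way to flag output like the sample above, sketched here in Python. This is only a heuristic under stated assumptions, not something Poppler provides: the function name `looks_garbled` and the 0.5 threshold are hypothetical, the text is assumed to have been extracted already (for example with `pdftotext`), and the document is assumed to be ordinary mixed-case Latin-script prose. In the sample above nearly every letter is uppercase, whereas normal prose is mostly lowercase, so the ratio of lowercase letters is used as the signal.

```python
def looks_garbled(text: str, min_lower_ratio: float = 0.5) -> bool:
    """Heuristically flag extracted text that is probably garbage.

    Assumption: real content is mixed-case Latin-script prose, so most
    letters should be lowercase. Hypothetical helper, not a Poppler API.
    """
    # Consider only alphabetic characters; digits and punctuation are ignored.
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        # No letters at all (empty or symbols only): treat as unusable.
        return True
    lower_count = sum(1 for ch in letters if ch.islower())
    return lower_count / len(letters) < min_lower_ratio


if __name__ == "__main__":
    sample = "+HDGDXGLR$S6\nFR-HVSHU$JHQWRIW"
    print(looks_garbled(sample))
    print(looks_garbled("Customer no. 12345, Denmark"))
```

A check like this would also flag legitimate all-caps documents and says nothing useful about non-Latin scripts, so the threshold would need tuning against known-good files before relying on it.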
