<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_RESOLVED bz_closed"
title="RESOLVED FIXED - Can't extract text/html from PDF"
href="https://bugs.freedesktop.org/show_bug.cgi?id=97276#c5">Comment # 5</a>
on <a class="bz_bug_link
bz_status_RESOLVED bz_closed"
title="RESOLVED FIXED - Can't extract text/html from PDF"
href="https://bugs.freedesktop.org/show_bug.cgi?id=97276">bug 97276</a>
from <span class="vcard"><a class="email" href="mailto:clark@electrobeat.dk" title="clark@electrobeat.dk">clark@electrobeat.dk</a>
</span></b>
<pre>Is it possible to detect/check if a PDF is broken and return something
unreadable like this?
I just need to check PDF files an mark broken files where the extracted text is
garbage
+HDGDXGLR$S6
FR-HVSHU$JHQWRIW
)\UUHEDNNHQ- JHUVSULV
’HQPDUN
.XQGHQU.
0RPVQU.
5HNYLVLWLRQVQU.
’HUHVUHI.
2UGUHQU.
:HEEHVWLOOLQJVQU.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>