<html>
    <head>
      <base href="https://bugs.freedesktop.org/">
    </head>
    <body>
      <p>
        <div>
            <b><a class="bz_bug_link 
          bz_status_RESOLVED  bz_closed"
   title="RESOLVED FIXED - Can't extract text/html from PDF"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=97276#c5">Comment # 5</a>
              on <a class="bz_bug_link 
          bz_status_RESOLVED  bz_closed"
   title="RESOLVED FIXED - Can't extract text/html from PDF"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=97276">bug 97276</a>
              from <span class="vcard"><a class="email" href="mailto:clark@electrobeat.dk" title="clark@electrobeat.dk">clark@electrobeat.dk</a>
</span></b>
        <pre>Is it possible to detect/check if a PDF is broken and return something
unreadable like this?

I just need to check PDF files an mark broken files where the extracted text is
garbage

+HDGDXGLR$S6
FR-HVSHU$JHQWRIW
)\UUHEDNNHQ- JHUVSULV
’HQPDUN

.XQGHQU. 
0RPVQU. 
5HNYLVLWLRQVQU. 
’HUHVUHI. 
2UGUHQU. 
:HEEHVWLOOLQJVQU.</pre>
        </div>
      </p>


      <hr>
      <span>You are receiving this mail because:</span>

      <ul>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>