<html>
    <head>
      <base href="https://bugs.freedesktop.org/" />
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Priority</th>
          <td>medium
          </td>
        </tr>

        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW --- - Converting some PDFs results in images being converted in to 1000s of PNGs"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=77932">77932</a>
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>poppler-bugs@lists.freedesktop.org
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>Converting some PDFs results in images being converted in to 1000s of PNGs
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>normal
          </td>
        </tr>

        <tr>
          <th>Classification</th>
          <td>Unclassified
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Linux (All)
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>gareth@mysociety.org
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>x86-64 (AMD64)
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>unspecified
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>pdftohtml
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>poppler
          </td>
        </tr></table>
      <p>
        <div>
        <pre>When converting this PDF:

PDF:
<a href="https://www.whatdotheyknow.com/request/127122/response/315253/attach/2/FOI%2012%2001605%20Resp%201%20PDF.pdf">https://www.whatdotheyknow.com/request/127122/response/315253/attach/2/FOI%2012%2001605%20Resp%201%20PDF.pdf</a>

The HTML output includes 2690 PNG images. It looks like it's scanned the logo
in the PDF line by line.

HTML: <a href="http://pastebin.com/raw.php?i=YCL9Mmpx">http://pastebin.com/raw.php?i=YCL9Mmpx</a>

The command run was:

pdftohtml -nodrm -zoom 1.0 -stdout -enc UTF-8 -noframes tmp/FOI\ 12\ 01605\
Resp\ 1\ PDF.pdf > tmp/all-the-pngs.html

Some version info:

$ pdftohtml -v
pdftohtml version 0.18.4
Copyright 2005-2011 The Poppler Developers - <a href="http://poppler.freedesktop.org">http://poppler.freedesktop.org</a>
Copyright 1999-2003 Gueorgui Ovtcharov and Rainer Dorsch
Copyright 1996-2004 Glyph & Cog, LLC

$ lsb_release -a
No LSB modules are available.
Distributor ID:    Ubuntu
Description:    Ubuntu 12.04.4 LTS
Release:    12.04
Codename:    precise</pre>
        </div>
      </p>
      <hr>
      <span>You are receiving this mail because:</span>
      
      <ul>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>