[Poppler-bugs] [Bug 55037] New: -xml does not render all images despite -c rendering correctly

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Sep 17 16:51:08 PDT 2012


https://bugs.freedesktop.org/show_bug.cgi?id=55037

             Bug #: 55037
           Summary: -xml does not render all images despite -c rendering
                    correctly
    Classification: Unclassified
           Product: poppler
           Version: unspecified
          Platform: x86-64 (AMD64)
        OS/Version: Linux (All)
            Status: NEW
          Severity: critical
          Priority: medium
         Component: pdftohtml
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: jazz at funkynerd.com


I've been trying to incorporate pdftohtml into my frontend renderer and have
had some success with some documents.  Other more complex documents though are
having problems.

My test document is the Nikon D3s brochure:

wget
http://imaging.nikon.com/products/imaging/lineup/digitalcamera/slr/d3s/pdf/d3s_16p.pdf

Rendering with the following produces a pretty accurate representation of the
document:

pdftohtml -c d3s_16p.pdf

However, when I output to XML using -xml some of the images that worked
previously are not output.  They are not extracted or even included in the XML
output.

Also, the images that are extracted are included with the wrong dimensions so
the resulting page looks very out of whack.  

All of the text is rendered correctly though.

Tried latest version from git with same results.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list