[Poppler-bugs] [Bug 25625] New: pdfimages does not extract inline jpeg images as jpeg

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Dec 13 15:11:34 PST 2009


http://bugs.freedesktop.org/show_bug.cgi?id=25625

           Summary: pdfimages does not extract inline jpeg images as jpeg
           Product: poppler
           Version: unspecified
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: general
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: laur.olb1 at gmail.com


pdfimages does not seem to be able to extract an inline jpeg image, instead it
will output the image as an uncompressed ppm file. This was tested with
poppler-utils 0.12.0 on Ubuntu 9.10.

How to duplicate:

Create an inline jpeg in a pdf file. This can be done with sam2p
http://code.google.com/p/sam2p/ using the command "sam2p image.jpg image.pdf".
Try to extract the jpeg back out with "pdfimages -j image.pdf image". This will
create image-000.ppm instead of the expected image-000.jpg. If you instead
create a pdf using XObjects, such as "sam2p -pdf:2 image.jpg image.pdf", the
jpeg is correctly extracted, md5sums of image.jpg and image-000.jpg match.
Note: this bug is present in the latest xpdf package as well.


-- 
Configure bugmail: http://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list