[poppler] Retrieve all objects from a PDF file

Josh Richardson jric at chegg.com
Mon Oct 31 11:12:13 PDT 2011


What kinds of objects are you interested in?  I have a version of
pdftohtml which I believe is not yet merged into the master repo that
extracts images and fonts.

--josh

On 10/31/11 9:16 AM, "Nedim Srndic" <nedim.sh at gmail.com> wrote:

>Dear list, 
>
>I am using the Poppler library (in the src/poppler folder, no bindings,
>version 7 from the Ubuntu 10.10 repos) and would like to retrieve all
>objects from a PDF file. Currently, I am running a loop on XRef and
>getting all the non-null objects from it, but it doesn't seem to
>retrieve objects from object streams. What solution would you propose
>for this problem?
>
>Thanks, 
>Nedim Srndic
>
>_______________________________________________
>poppler mailing list
>poppler at lists.freedesktop.org
>http://lists.freedesktop.org/mailman/listinfo/poppler
>



More information about the poppler mailing list