[poppler] queries via poppler API

Leonard Rosenthol leonardr at pdfsages.com
Fri Jun 23 09:17:20 PDT 2006


At 11:22 AM 6/23/2006, Jack Tanner wrote:
>How easy should it be to use libpoppler to run reports about PDFs in 
>one's library (rather than using libpoppler to render the PDFs)?

         Depends on what the queries are ;).


>  For example, I'd like to be able to run queries like these:
>- What PDFs consist entirely of images and no text?

         That can't be done at the libpoppler level.  You would have 
to create a custom OutputDev and run that - low level Xpdf.


>- Who are the authors of the PDFs?

         ASSUMING that info is in the PDF - you can get that from libpoppler.


>- Reset the 'author' of the PDFs in this folder to the string 
>following the first heading.

         Poppler has no support for PDF writing/modification.


>- Given the support for annotations discussed earlier, what PDFs 
>have annotations?

         Won't know till the annotation support is implementated.


         In the meantime, there are other tools/libraries out there 
that you could use to get that information (and more).


Leonard

---------------------------------------------------------------------------
Leonard Rosenthol                            <mailto:leonardr at pdfsages.com>
Chief Technical Officer                      <http://www.pdfsages.com>
PDF Sages, Inc.                              215-938-7080 (voice)
                                              215-938-0880 (fax)



More information about the poppler mailing list