[poppler] queries via poppler API
Leonard Rosenthol
leonardr at pdfsages.com
Fri Jun 23 09:17:20 PDT 2006
At 11:22 AM 6/23/2006, Jack Tanner wrote:
>How easy should it be to use libpoppler to run reports about PDFs in
>one's library (rather than using libpoppler to render the PDFs)?
Depends on what the queries are ;).
> For example, I'd like to be able to run queries like these:
>- What PDFs consist entirely of images and no text?
That can't be done at the libpoppler level. You would have
to create a custom OutputDev and run that - low level Xpdf.
>- Who are the authors of the PDFs?
ASSUMING that info is in the PDF - you can get that from libpoppler.
>- Reset the 'author' of the PDFs in this folder to the string
>following the first heading.
Poppler has no support for PDF writing/modification.
>- Given the support for annotations discussed earlier, what PDFs
>have annotations?
Won't know till the annotation support is implementated.
In the meantime, there are other tools/libraries out there
that you could use to get that information (and more).
Leonard
---------------------------------------------------------------------------
Leonard Rosenthol <mailto:leonardr at pdfsages.com>
Chief Technical Officer <http://www.pdfsages.com>
PDF Sages, Inc. 215-938-7080 (voice)
215-938-0880 (fax)
More information about the poppler
mailing list