[poppler] Using the poppler utils in my software

Albert Astals Cid aacid at kde.org
Mon Jul 16 13:28:32 PDT 2012


El Dimarts, 17 de juliol de 2012, a les 02:06:38, Alec Taylor va escriure:
> Actually you probably want to use pdftoxml, 

He clearly said he doesn't want to invoke a binary

> including in the
> poppler-utils. If that is giving you too many mistakes, maybe checkout
> an OCR engine such as Tesseract.
> 
> Reparsing the XML shouldn't give you too much trouble; if it is, ask
> me and I'll give you access to the poppler repo I modified ~8 months
> ago which included my algorithms for extracting logical structure
> information from PDFs by post-processing the XML and regenerating XML
> containing the modified markup.

Changes you did not contribute upstream because we eat babies?

Cheers,
  Albert

> 
> On Tue, Jul 17, 2012 at 1:32 AM, Jean-Philippe Green
> 
> <jeanphilippe.green at gmail.com> wrote:
> > Hello. I asked about this on the IRC with no luck, so I'll try this
> > instead.
> > 
> > How can I use the poppler utils (such as pdftotext) in my software without
> > executing a platform dependent executable? Is it included somewhere in the
> > library?
> > 
> > 
> > If you want to know, I'm trying to do a software that reads a pdf-schedule
> > from the company where I work and make an icalendar that can be used in
> > google calendar and more. They can't provide a calender file so I feel I
> > need to do this. I want all of my co-workers to be able to use it too, so
> > I
> > need to do it platform independent.
> > 
> > Thank you!
> > 
> > _______________________________________________
> > poppler mailing list
> > poppler at lists.freedesktop.org
> > http://lists.freedesktop.org/mailman/listinfo/poppler
> 
> _______________________________________________
> poppler mailing list
> poppler at lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/poppler


More information about the poppler mailing list