[poppler] Reverse-engineering an XML file generated by pdftohtml -xml back into the PDF?

Josh Richardson jric at chegg.com
Tue Nov 15 13:26:04 PST 2011


Sure, my bad for attempting humor.  :-)  My point is that I hope someone
will take up the cause to make Poppler such a library, because this (and
the XPDF) community have put so much effort into making Poppler the best
open-source parser and rendering engine out there -- it would be great to
be able to leverage Poppler for easy PDF file manipulation as well.  I've
had less success using some of those "other" solutions on the variety of
files that Poppler can handle, including Adobe's own products.

If I get a chance, I may start delving into some of that.  Anyone think
I'm crazy?  I'd love to know.

I believe someone mentioned a pdftopdf utility for Poppler.  That's a
start!  It would be best if it were built on a library foundation.

--josh

On 11/15/11 1:14 PM, "Leonard Rosenthol" <lrosenth at adobe.com> wrote:

>On 11/15/11 12:32 PM, "Josh Richardson" <jric at chegg.com> wrote:
>
>>*Whistfully*:  If only there were a PDF library to make such things
>>simple....
>
>There are a whole bunch of them, ranging from open source to commercial in
>languages from Java to Python to C++ and more...
>
>Leonard
>
>



More information about the poppler mailing list