[poppler] Improve PDF to HTML conversion?

Gilles codecomplete at free.fr
Sun Apr 26 11:39:05 UTC 2020


Hello,

The archives 
<https://www.google.com/search?q=improve+pdf+to+html+site:lists.freedesktop.org> 
didn't return much about how to improve the performance of pdftohtml, 
and the wiki <http://freedesktop.org/Software/poppler> is 503.

Calibre relies on poppler (it doesn't say which version it uses) to 
convert PDF to HTML, edits the HTML, and builds an EPUB.

I notice tables and insets are stumbling blocks:

https://ibb.co/SvDhHNC
https://ibb.co/X4YzYk6
https://ibb.co/09L1r99
https://ibb.co/zb1nrgT <https://ibb.co/zb1nrgT>

Is there a recent article/post that explains how pdftohtml can be 
tweaked for better results?

Thank you.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler/attachments/20200426/d66cb704/attachment.htm>


More information about the poppler mailing list