[openchrome-users] PDF text and image extraction

Tue Feb 14 18:49:17 UTC 2023

Hello,

I am wondering if Poppler will be of use to me.

The task to be solved:
Given a pdf document, I need to extract text, tables and images. One
possibility is to convert the pdf to another format such as a MS-Word docx
file. The issue to generate an automatic process that begins with inputting
a pdf and then extracting the required parts.

So, Poppler is usable here, please inform me as to:
a) Which language is optimal for this - as I might need openCV, C++ or
Python are my best choices.
b) Where do I find out what functionality Poppler provides?

Thank you.

~ Martin Goldberg, Ph. D.
Tinski Tech Inc.
http://tinskitech.com
Phone: 917-612-7498
LinkedIn: Martin Goldberg | LinkedIn
<https://www.linkedin.com/in/martin-goldberg-zw/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/openchrome-users/attachments/20230214/d53e0d9e/attachment.htm>