From martinsvision247 at gmail.com Tue Feb 14 18:49:17 2023 From: martinsvision247 at gmail.com (Martin Goldberg) Date: Tue, 14 Feb 2023 13:49:17 -0500 Subject: [openchrome-users] PDF text and image extraction Message-ID: Hello, I am wondering if Poppler will be of use to me. The task to be solved: Given a pdf document, I need to extract text, tables and images. One possibility is to convert the pdf to another format such as a MS-Word docx file. The issue to generate an automatic process that begins with inputting a pdf and then extracting the required parts. So, Poppler is usable here, please inform me as to: a) Which language is optimal for this - as I might need openCV, C++ or Python are my best choices. b) Where do I find out what functionality Poppler provides? Thank you. ~ Martin Goldberg, Ph. D. Tinski Tech Inc. http://tinskitech.com Phone: 917-612-7498 LinkedIn: Martin Goldberg | LinkedIn -------------- next part -------------- An HTML attachment was scrubbed... URL: