[poppler] pdftotext font information

obsidian . obsidian9993 at gmail.com
Thu May 3 10:10:43 UTC 2018


I'm using "pdftotext -bbox file.pdf" to convert a pdf file into html.

Here's a sample line from the output:
    <word xMin="359.852025" yMin="462.548936" xMax="365.689478"
yMax="467.681498">foo</word>

Is there a way to get font information for every word like:
- font family, e.g. Verdana
- style, i.e. none, bold, italic
- size, e.g. font size 9

I'm using pdftotext version 0.55.0 on Windows.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler/attachments/20180503/341f4bc9/attachment.html>


More information about the poppler mailing list