[poppler] pdftohtml doesn't find text in pdf, pdftotext does?

Wolfgang Schwarz wo at umsu.de
Tue May 20 07:17:33 PDT 2014

Previous message: [poppler] Bogus Memory Allocation Size
Next message: [poppler] Branch 'poppler-0.26' - CMakeLists.txt configure.ac cpp/Doxyfile NEWS qt4/src qt5/src
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Hello,

I've noticed that pdftohtml misses the text content for some pdf
files, while pdftotext can extract the text. Here is an example:
http://www.umsu.de/temp/1966percepts.pdf#. (The latest version
I've tried it with is poppler-0.26.0, compiled from source.)

Is this a known problem? Are there any workarounds?

Best,
Wolfgang

Previous message: [poppler] Bogus Memory Allocation Size
Next message: [poppler] Branch 'poppler-0.26' - CMakeLists.txt configure.ac cpp/Doxyfile NEWS qt4/src qt5/src
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

More information about the poppler mailing list