[poppler] pdftotext raw

Massimo Redaelli mredaelli at lari.digital
Fri May 17 13:45:40 UTC 2019


Understood, thanks!

Last thing: could you point me to a couple of those badly-produced
PDFs, so I can compare the output?

Thanks!

M.

On Fri, May 17, 2019 at 11:00 AM Leonard Rosenthol <lrosenth at adobe.com> wrote:
> If all the PDFs that you are trying to process are coming from modern, well written, products then you are probably fine.  However, poorly made PDF creators will produce PDFs that will end up resulting in garbage from your extraction process.
>
> Leonard


More information about the poppler mailing list