[poppler] pdftotext raw
Albert Astals Cid
aacid at kde.org
Thu May 16 18:08:23 UTC 2019
El dijous, 16 de maig de 2019, a les 17:00:27 CEST, Massimo Redaelli va escriure:
> Hey all.
>
> Question regarding pdftotext.
>
> The help says that `raw` is not recommended anymore, but for all PDFs
> I tried it actually gives better results than the default mode, by
> which I mean that paragraphs are not interrupted by extraneous text,
> like headers or boxes.
> (I do have to handle hyphenated words, but that looks easy.)
>
> Is the option going to be deprecated, or can we count on it being
> there for the foreseeable future?
> Are there reasons not to use it?
The man page explains the reason not to use it.
Cheers,
Albert
>
> Thanks!
>
>
More information about the poppler
mailing list