[poppler] pdftotext raw

Albert Astals Cid aacid at kde.org
Thu May 16 18:08:23 UTC 2019


On Thursday, 16 May 2019, at 17:00:27 CEST, Massimo Redaelli wrote:
> Hey all.
> 
> Question regarding pdftotext.
> 
> The help says that `raw` is not recommended anymore, but for all PDFs
> I tried it actually gives better results than the default mode, by
> which I mean that paragraphs are not interrupted by extraneous text,
> like headers or boxes.
> (I do have to handle hyphenated words, but that looks easy.)
> 
> Is the option going to be deprecated, or can we count on it being
> there for the foreseeable future?
> Are there reasons not to use it?

The man page explains the reason not to use it.

Cheers,
  Albert

> 
> Thanks!
> 
> 
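
For readers who want to try the hyphenation handling mentioned in the question, here is a minimal sketch and not anything from the poppler project itself. It assumes the pdftotext binary is on the PATH, that passing "-" as the output file sends the text to stdout, and that a hyphen directly before a line break always marks a split word. The helper names extract_raw_text and dehyphenate, and the file name in the usage note, are made up for illustration.

```python
import re
import subprocess


def extract_raw_text(pdf_path):
    """Run pdftotext in raw mode and return the extracted text.

    Assumes pdftotext is installed; "-" as the output file means stdout.
    """
    result = subprocess.run(
        ["pdftotext", "-raw", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def dehyphenate(text):
    """Join words that were split by a hyphen at the end of a line.

    Naive assumption: every "letter, hyphen, newline, letter" sequence is a
    split word. Genuine compounds broken at a line end (e.g. "well-" /
    "known") will be merged too, so the hyphen is lost in those cases.
    """
    return re.sub(r"(\w)-\r?\n(\w)", r"\1\2", text)
```

Usage would look like dehyphenate(extract_raw_text("example.pdf")), where example.pdf is a placeholder. A dictionary check on the joined word would be one way to avoid merging real compounds, at the cost of extra complexity.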