[poppler] pdftotext raw

Massimo Redaelli mredaelli at lari.digital
Fri May 17 07:13:47 UTC 2019


On Thu, May 16, 2019, 8:08 PM Albert Astals Cid <aacid at kde.org> wrote:

> > Are there reasons not to use it?
>
> The man page explains the reason not to use it.


Yes, I should have asked: what are the downsides/upsides of following
the content stream order?

But I guess I'm mainly asking:

> Is the option going to be deprecated, or can we count on it being
> there for the foreseeable future?
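
The hyphenation cleanup mentioned in the original message could be sketched
roughly like this in Python. This is only an illustration under stated
assumptions: the helper names `extract_raw` and `dehyphenate` are
hypothetical, the `pdftotext` binary is assumed to be on the PATH, and the
rule "hyphen + newline + lowercase letter means a split word" is a heuristic,
not something poppler itself provides.

```python
import re
import subprocess


def extract_raw(pdf_path: str) -> str:
    """Run pdftotext in raw (content stream order) mode and return the text."""
    # "-" as the output file asks pdftotext to write to stdout.
    result = subprocess.run(
        ["pdftotext", "-raw", pdf_path, "-"],
        capture_output=True,
        text=True,
        encoding="utf-8",
        check=True,
    )
    return result.stdout


def dehyphenate(text: str) -> str:
    """Join words that were split by a hyphen at the end of a line."""
    # Assumption: a hyphen followed by a newline and a lowercase letter
    # is a line-break hyphen, not part of a real compound word.
    return re.sub(r"(\w)-\n([a-z])", r"\1\2", text)
```

Usage would be something like `dehyphenate(extract_raw("input.pdf"))`, where
`input.pdf` is a placeholder path. Note that the heuristic also merges genuine
compounds that happen to break at the hyphen (for example "well-" / "known"
becomes "wellknown"), so a dictionary check may be needed for cleaner output.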


M.

On Thu, May 16, 2019 at 6:08 PM Albert Astals Cid <aacid at kde.org> wrote:
>
> On Thursday, 16 May 2019, at 17:00:27 CEST, Massimo Redaelli wrote:
> > Hey all.
> >
> > Question regarding pdftotext.
> >
> > The help says that `raw` is not recommended anymore, but for all PDFs
> > I tried it actually gives better results than the default mode, by
> > which I mean that paragraphs are not interrupted by extraneous text,
> > like headers or boxes.
> > (I do have to handle hyphenated words, but that looks easy.)
> >
> > Is the option going to be deprecated, or can we count on it being
> > there for the foreseeable future?
> > Are there reasons not to use it?
>
> The man page explains the reason not to use it.
>
> Cheers,
>   Albert
>
> >
> > Thanks!



-- 
M.

