[poppler] Why poppler, which supports tagged PDFs, doesn't recognize some of the tags as a whole?
Albert Astals Cid
aacid at kde.org
Fri Jun 25 23:23:44 UTC 2021
El dijous, 24 de juny de 2021, a les 15:47:38 (CEST), Germán Poo-Caamaño va escriure:
> On Thu, 2021-06-24 at 10:44 +0200, Albert Astals Cid wrote:
> > El dijous, 24 de juny de 2021, a les 7:48:45 (CEST), Denis Bitouzé va
> > escriure:
> > > Hi,
> > >
> > > the joined `test.pdf` file is properly tagged as you can check it
> > > by
> > > loading it at:
> > >
> > > ┌────
> > > │ https://www.ngpdf.com/loadFile
> > > └────
> > >
> > > and then looking at:
> > >
> > > ┌────
> > > │ https://www.ngpdf.com/editor/editFile
> > > └────
> > >
> > > You can see each line of the code:
> > >
> > > ┌────
> > > │ \pdfdict_new:n {l_my_action_dict}
> > > │ \pdfdict_put:nnn {l_my_action_dict}{Type}{/Action}
> > > │ \pdfdict_put:nnn {l_my_action_dict}{S}{/URI}
> > > │ \pdfdict_put:nnn {l_my_action_dict}{URI}{(
> > > https://www.latex-project.org)}
> > > └────
> > >
> > > is a single tag.
> > >
> > > Nevertheless this code, if copied e.g. from Evince 3.38.1, is
> > > pasted not
> > > as it is and but as:
> >
> > That would be a question for the Evince developers (some of them are
> > on this is i guess so you may still get an answer).
> >
> > The fact that poppler has facilities to "see" the contents of tagged
> > pdf doesn't mean that evince is using them.
>
> I am unsure what the report or question is about. Is it about
> presenting/seeing each tag separately or copying/pasting the test in
> the tags?
>
> If the later, that corresponds to poppler-glib.
You think so?
I'm not convinced i mean poppler-glib already exports the Tags contents, it's just that evince always uses the "raw text" contents and never the "Tagged text" contents.
Or you would expect poppler-glib to chose which text to give you?
Cheers,
Albert
>
>
More information about the poppler
mailing list