[poppler] Extract title from pdf file.
Leonard Rosenthol
lrosenth at adobe.com
Thu Nov 10 07:43:10 PST 2011
On 11/10/11 10:00 AM, "Peter A. Kerzum" <kerzum at yandex-team.ru> wrote:
>On Thursday 10 November 2011 14:36:39 Leonard Rosenthol wrote:
>> EXCEPT that Poppler (and by extension, pdftoxml) does NOT process the
>> tagging & structure of the PDF :(.
>
>This is not true, you can at least get Outline textx with poppler
The Outline elements have NOTHING to do with PDF Tagging & Structure NOR
with any logical semantics about the document or its content! They are
simply navigation elements and can be used to navigate a user to either a
specific spot in the current PDF, some other PDF, or perform ANY ACTION
supported by PDF.
As noted previously, semantic tagging & structure is described in ISO
32000-1:2008, 14.7 - 14.9. And is not supported by Poppler.
Leonard
More information about the poppler
mailing list