[poppler] Extract title from pdf file.

Leonard Rosenthol lrosenth at adobe.com
Thu Nov 10 07:43:10 PST 2011


On 11/10/11 10:00 AM, "Peter A. Kerzum" <kerzum at yandex-team.ru> wrote:

>On Thursday 10 November 2011 14:36:39 Leonard Rosenthol wrote:
>> EXCEPT that Poppler (and by extension, pdftoxml) does NOT process the
>> tagging & structure of the PDF :(.
>
>This is not true, you can at least get Outline textx with poppler


The Outline elements have NOTHING to do with PDF Tagging & Structure NOR
with any logical semantics about the document or its content!  They are
simply navigation elements and can be used to navigate a user to either a
specific spot in the current PDF, some other PDF, or perform ANY ACTION
supported by PDF.

As noted previously, semantic tagging & structure is described in ISO
32000-1:2008, 14.7 - 14.9.  And is not supported by Poppler.

Leonard



More information about the poppler mailing list