[poppler] Extract title from pdf file.

Alec Taylor alec.taylor6 at gmail.com
Wed Nov 9 19:44:00 PST 2011


Running pdftohtml -xml, analysing XML, processing information back into PDF

On Thu, Nov 10, 2011 at 2:01 PM, Leonard Rosenthol <lrosenth at adobe.com> wrote:
> On 11/9/11 10:02 AM, "Alec Taylor" <alec.taylor6 at gmail.com> wrote:
>>>Are you also submitting patches to read & process any tags & structure in
>>> the PDF?  If the PDF is already tagged, then it will have any
>>> headers/footers already identified accordingly.  You should be using
>>>this
>>> when present.
>>
>>Yes, I am using the RapidXML library, which I specifically chose for
>>speed and that it is header only.
>
> What does an XML library have to do with processing PDF structure &
> tagging (ISO 32000-1:2008, 14.7-14.9)???
>
>
> Leonard
>
>


More information about the poppler mailing list