[Poppler-bugs] [Bug 50646] New: pdftotext -htmlmeta outputs incomplete metadata

bugzilla-daemon at freedesktop.org
Sun Jun 3 11:15:33 PDT 2012


https://bugs.freedesktop.org/show_bug.cgi?id=50646

             Bug #: 50646
           Summary: pdftotext -htmlmeta outputs incomplete metadata
    Classification: Unclassified
           Product: poppler
           Version: unspecified
          Platform: Other
        OS/Version: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: utils
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: s.j.turner at uqconnect.net


This bug was originally reported at
https://bugs.launchpad.net/ubuntu/+source/poppler/+bug/993292

The output of pdftotext -htmlmeta is missing metadata from the PDF catalog,
whereas pdfinfo prints all the values it knows.

For example, the pdfinfo output for one file:

Title: Titel
Author: Word
Creator: WordToPDF 2.4 build 127
Producer: AFPL Ghostscript 8.54
CreationDate: Fri Jul 2 09:14:02 2007
ModDate: Fri Jul 2 09:14:02 2007
Tagged: no
Pages: 6
Encrypted: no
Page size: 595 x 842 pts (A4)
File size: 104664 bytes
Optimized: no
PDF version: 1.3

In contrast, the head section of the pdftotext -htmlmeta output for the same
file has an empty CreationDate and no ModDate entry at all:

<head>
<title>Titel</title>
<meta name="Author" content="Word"/>
<meta name="Creator" content="WordToPDF 2.4 build 127"/>
<meta name="Producer" content="AFPL Ghostscript 8.54"/>
<meta name="CreationDate" content=""/>
</head>
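
The difference between the two outputs can be checked mechanically. Below is a
minimal sketch in Python that works only on the sample text quoted above; it
does not call pdfinfo or pdftotext. The helper names (parse_pdfinfo,
HeadParser, find_discrepancies) are hypothetical, and the pdfinfo sample is
limited to the six fields that look like document metadata, which is an
assumption about what -htmlmeta is expected to carry over.

```python
from html.parser import HTMLParser

# Metadata lines copied from the pdfinfo output in this report.
PDFINFO_OUTPUT = """\
Title: Titel
Author: Word
Creator: WordToPDF 2.4 build 127
Producer: AFPL Ghostscript 8.54
CreationDate: Fri Jul 2 09:14:02 2007
ModDate: Fri Jul 2 09:14:02 2007
"""

# Head section copied from the pdftotext -htmlmeta output in this report.
HTMLMETA_HEAD = """\
<head>
<title>Titel</title>
<meta name="Author" content="Word"/>
<meta name="Creator" content="WordToPDF 2.4 build 127"/>
<meta name="Producer" content="AFPL Ghostscript 8.54"/>
<meta name="CreationDate" content=""/>
</head>
"""


def parse_pdfinfo(text):
    """Split each 'Key: value' line on the first colon only."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


class HeadParser(HTMLParser):
    """Collect <title> text and <meta name=... content=...> pairs."""

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # Self-closing <meta .../> tags also arrive here by default.
        if tag == "title":
            self._in_title = True
            self.fields["Title"] = ""
        elif tag == "meta":
            attr_map = dict(attrs)
            if "name" in attr_map:
                self.fields[attr_map["name"]] = attr_map.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.fields["Title"] += data


def find_discrepancies(pdfinfo_text, head_html):
    """Return {field: 'missing' | 'empty'} for fields pdfinfo has but the head lacks."""
    expected = parse_pdfinfo(pdfinfo_text)
    parser = HeadParser()
    parser.feed(head_html)
    problems = {}
    for key, value in expected.items():
        if key not in parser.fields:
            problems[key] = "missing"
        elif value and not parser.fields[key].strip():
            problems[key] = "empty"
    return problems


if __name__ == "__main__":
    for field, problem in find_discrepancies(PDFINFO_OUTPUT, HTMLMETA_HEAD).items():
        print(f"{field}: {problem}")
```

On the sample above, this flags CreationDate as empty and ModDate as missing,
which matches what can be seen by reading the two outputs side by side.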

