[Poppler-bugs] [Bug 50002] New: pdftohtml writes invalid HTML

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed May 16 06:05:59 PDT 2012


https://bugs.freedesktop.org/show_bug.cgi?id=50002

             Bug #: 50002
           Summary: pdftohtml writes invalid HTML
    Classification: Unclassified
           Product: poppler
           Version: unspecified
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: pdftohtml
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: solahcin at gmail.com


Created attachment 61713
  --> https://bugs.freedesktop.org/attachment.cgi?id=61713
Fixes the issues identified above in utils/HtmlOutputDev.cc

Patch against r0.20.0 is attached.

The element names output by pdftohtml take upper case, which is  not valid to
the DTD and so rejected by epubcheck and other tools downstream.

The <hr> and <frame> elements are missing closing tags or abbreviated empty tag
notation (<hr/>, <frame/>).

These errors are fixed by patch.txt applied to utils/HtmlOutputDev.cc.

Tested on CentOS Linux server against source built from r0.20.0 tarball.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list