[Poppler-bugs] [Bug 42864] New: pdftohtml background images also contain text

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Nov 12 16:29:26 PST 2011


https://bugs.freedesktop.org/show_bug.cgi?id=42864

             Bug #: 42864
           Summary: pdftohtml background images also contain text
    Classification: Unclassified
           Product: poppler
           Version: unspecified
          Platform: All
        OS/Version: Windows (All)
            Status: NEW
          Severity: major
          Priority: medium
         Component: pdftohtml
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: craigwhitcombe at gmail.com


Created attachment 53470
  --> https://bugs.freedesktop.org/attachment.cgi?id=53470
2 directories containing pdfs with their output

With some documents pdftohtml -c is generating pages that have a background
image containing text.
The raw html itself also contains the same text, in nearly exactly the same
place.
This causes the preview to be illegible.

I have attached 2 pdfs that cause this behaviour and the output after running
them through pdftohtml.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list