[Poppler-bugs] [Bug 101807] New: pdftohtml: fakebold and dropshadow duplicated text

bugzilla-daemon at freedesktop.org
Sun Jul 16 19:26:25 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=101807

            Bug ID: 101807
           Summary: pdftohtml: fakebold and dropshadow duplicated text
           Product: poppler
           Version: unspecified
          Hardware: Other
                OS: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: pdftohtml
          Assignee: poppler-bugs at lists.freedesktop.org
          Reporter: jason at inspiresomeone.us

Running pdftohtml on the PDF attached to bug #101770
(https://bugs.freedesktop.org/attachment.cgi?id=132659) produces
duplicated and jumbled characters in the output.

Some PDFs draw the same text multiple times to emulate bold text or drop
shadows. The main TextOutputDev goes to a lot of trouble to remove this
duplicated text. pdftohtml should do the same.
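
To illustrate the kind of duplicate removal described above, here is a minimal
sketch in Python. It is not Poppler's code: the Glyph record, the
drop_duplicate_glyphs function and the max_delta threshold are all hypothetical
names and values chosen for the example. The assumption is that a glyph counts
as a duplicate when an already-kept glyph has the same character and lies
within a small fraction of the font size in both x and y.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Glyph:
    """One drawn character (hypothetical record, not a Poppler type)."""
    char: str
    x: float          # horizontal position in points
    y: float          # vertical position in points
    font_size: float  # font size in points


def drop_duplicate_glyphs(glyphs, max_delta=0.2):
    """Return glyphs with fake-bold / drop-shadow repeats removed.

    max_delta is an assumed tolerance, expressed as a fraction of the
    font size; a real implementation would need to tune it.
    """
    kept = []
    for g in glyphs:
        tol = max_delta * g.font_size
        # A repeat is the same character drawn almost on top of a kept one.
        is_dup = any(
            k.char == g.char
            and abs(k.x - g.x) <= tol
            and abs(k.y - g.y) <= tol
            for k in kept
        )
        if not is_dup:
            kept.append(g)
    return kept


if __name__ == "__main__":
    # "Hi" drawn twice with a 0.5pt horizontal offset to fake bold.
    fake_bold = [
        Glyph("H", 100.0, 700.0, 12.0),
        Glyph("i", 108.0, 700.0, 12.0),
        Glyph("H", 100.5, 700.0, 12.0),
        Glyph("i", 108.5, 700.0, 12.0),
    ]
    print("".join(g.char for g in drop_duplicate_glyphs(fake_bold)))
```

The tolerance has to stay well below a normal character advance, otherwise
genuinely repeated letters (such as the "ll" in "hello") would be dropped too.
The linear scan over kept glyphs is quadratic, which is acceptable for a sketch
but would want a spatial index or a per-line search on large pages.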
