[Poppler-bugs] [Bug 96932] Improper text extraction from this pdf

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Aug 5 15:19:18 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=96932

Jason Crain <jason at aquaticape.us> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |RESOLVED
         Resolution|---                         |NOTOURBUG

--- Comment #4 from Jason Crain <jason at aquaticape.us> ---
I've taken a closer look at this document and I don't see a way to fix the text
extraction.  The document is using embedded TrueType fonts with identity
mapping / UTF-16 encoding, but the encoding is nonsensical and there's no
ToUnicode map.  It's up to the PDF creator to fix their broken file generation.
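
To illustrate why a missing ToUnicode map blocks extraction, here is a minimal sketch. It assumes the font's strings are sequences of two-byte codes (as with an identity mapping), and it is not Poppler's actual code: the names split_two_byte_codes and extract_text, and the dict standing in for a ToUnicode map, are hypothetical. Real extractors try further fallbacks before giving up, which this sketch omits.

```python
from typing import Dict, List, Optional

REPLACEMENT = "\ufffd"  # U+FFFD, used here to mark codes with no known Unicode value


def split_two_byte_codes(data: bytes) -> List[int]:
    """Split a shown string into big-endian two-byte character codes."""
    if len(data) % 2 != 0:
        raise ValueError("expected an even number of bytes for two-byte codes")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


def extract_text(data: bytes, to_unicode: Optional[Dict[int, str]]) -> str:
    """Map character codes to text using a ToUnicode-style table, if one exists.

    to_unicode is a hypothetical stand-in for a parsed ToUnicode map:
    character code -> Unicode string. None means the font has no such map.
    """
    codes = split_two_byte_codes(data)
    if to_unicode is None:
        # Without a map the codes only select glyphs in the embedded font;
        # nothing in them says which characters those glyphs represent.
        return REPLACEMENT * len(codes)
    return "".join(to_unicode.get(code, REPLACEMENT) for code in codes)


if __name__ == "__main__":
    shown = bytes.fromhex("00030004")           # two codes: 3 and 4
    print(extract_text(shown, {3: "H", 4: "i"}))  # map present
    print(extract_text(shown, None))              # map missing
```

With the map present the two codes resolve to readable text; with no map, every code comes back as a replacement character, which is roughly the situation described above.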
