[Poppler-bugs] [Bug 96932] Improper text extraction from this pdf
bugzilla-daemon at freedesktop.org
Fri Aug 5 15:19:18 UTC 2016
https://bugs.freedesktop.org/show_bug.cgi?id=96932
Jason Crain <jason at aquaticape.us> changed:
              What|Removed                     |Added
----------------------------------------------------------------------------
            Status|NEEDINFO                    |RESOLVED
        Resolution|---                         |NOTOURBUG
--- Comment #4 from Jason Crain <jason at aquaticape.us> ---
I've taken a closer look at this document and I don't see a way to fix the text
extraction. The document is using embedded TrueType fonts with identity
mapping / UTF-16 encoding, but the encoding is nonsensical and there's no
ToUnicode map. It's up to the PDF creator to fix their broken file generation.
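
To illustrate why a missing ToUnicode map blocks extraction, here is a minimal
Python sketch. It is not Poppler code: the function name, the 2-byte code
layout, and the sample values are all assumptions made up for the example.
With a code-to-Unicode table, each character code can be looked up directly.
Without one, the only fallback in this sketch is to read the codes as
UTF-16BE, which gives garbage when the codes are really arbitrary glyph
indices.

```python
from typing import Dict, Optional


def decode_text(codes: bytes, to_unicode: Optional[Dict[int, str]]) -> str:
    """Map 2-byte character codes to text (hypothetical helper).

    With a ToUnicode-style table, each code is looked up directly.
    Without one, the codes are read as UTF-16BE, which is only correct
    if the producer really used Unicode values as character codes.
    """
    if len(codes) % 2:
        raise ValueError("expected an even number of bytes")
    values = [
        int.from_bytes(codes[i:i + 2], "big")
        for i in range(0, len(codes), 2)
    ]
    if to_unicode is not None:
        # Unknown codes become U+FFFD (replacement character).
        return "".join(to_unicode.get(v, "\ufffd") for v in values)
    return codes.decode("utf-16-be", errors="replace")


# Made-up glyph indices, as a subsetted font might assign them.
codes = bytes([0x00, 0x03, 0x00, 0x01, 0x00, 0x02])
table = {1: "D", 2: "F", 3: "P"}

with_map = decode_text(codes, table)    # readable text
without_map = decode_text(codes, None)  # control characters, not text
```

In this sketch the same bytes yield readable text only when the table is
present, so the fix has to come from whoever generates the file.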