[Poppler-bugs] [Bug 43488] pdf file with Arabic text comtent does not transformed well!

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Mar 20 16:02:47 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=43488

Jason Crain <jason at aquaticape.us> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |NOTABUG

--- Comment #8 from Jason Crain <jason at aquaticape.us> ---
I would not consider this a bug in poppler, pdftohtml, or pdftotext.  The
document uses glyph IDs instead of a real character encoding and does not embed
fonts.  Since glyph IDs are only meaningful for one particular font, this means
that this document can only be viewed correctly if you have the correct font
installed (Microsoft's Arial font, in this case).  And since it doesn't use a
real character encoding, poppler can't get the text out of the document and
pdftohtml and pdftotext will not work.  Note that Adobe Reader and other PDF
viewers can't get the text either.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20160320/b15351d8/attachment.html>


More information about the Poppler-bugs mailing list