[Poppler-bugs] [Bug 32522] Some letters are in wrong order in the output of pdftotext
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Mon Dec 20 16:50:34 PST 2010
https://bugs.freedesktop.org/show_bug.cgi?id=32522
--- Comment #4 from Behdad Esfahbod <freedesktop at behdad.org> 2010-12-20 16:50:34 PST ---
(In reply to comment #3)
> Behdad, you may be interested in this bug. I have no idea how RTL text
> extraction is supposed to work.
This is a poppler bug. I mentioned this in my design doc back in 2007, but
never followed up. Adrian, maybe you can look into those?
http://lists.freedesktop.org/archives/poppler/2007-September/002897.html
Specifically:
"""
o Instead of reversing the glyphs and then extracting text
from them and append, it extracts text first and then
reverse. So if a glyph maps to two or more characters,
those come out backward in the extracted text, which is
wrong.
"""
--
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
More information about the Poppler-bugs
mailing list