[Poppler-bugs] [Bug 32522] Some letters are in wrong order in the output of pdftotext

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Dec 20 16:50:34 PST 2010


https://bugs.freedesktop.org/show_bug.cgi?id=32522

--- Comment #4 from Behdad Esfahbod <freedesktop at behdad.org> 2010-12-20 16:50:34 PST ---
(In reply to comment #3)
> Behdad, you may be interested in this bug. I have no idea how RTL text
> extraction is supposed to work.

This is a poppler bug.  I mentioned this in my design doc back in 2007, but
never followed up.  Adrian, maybe you can look into those?

http://lists.freedesktop.org/archives/poppler/2007-September/002897.html

Specifically:

"""
    o Instead of reversing the glyphs and then extracting text
      from them and append, it extracts text first and then
      reverse.  So if a glyph maps to two or more characters,
      those come out backward in the extracted text, which is
      wrong.
"""

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list