[Poppler-bugs] [Bug 55977] handling of rtl text inversion is too naive

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Nov 23 02:01:58 PST 2015


https://bugs.freedesktop.org/show_bug.cgi?id=55977

--- Comment #59 from Khaled Hosny <khaledhosny at eglug.org> ---
Created attachment 120046
  --> https://bugs.freedesktop.org/attachment.cgi?id=120046&action=edit
Fix finding Arabic Presentation Forms ligatures

PDF text containing Arabic Presentation forms ligatures is still not
found after the previous commit.

This because the ligatures are decomposed in logical order after
normalisation, while the whole string is in visual order.  For example
the RTL text ABCD in visual order will be DCBA, and assuming B is a
ligature, it will be decomposed to B1B2 so the string after
normalization will be DCB1B2A while we are expecting it to be DCB2B1A.

This patch reverses the order of the decomposition of RTL characters to
work around this issue.

This is to be applied on top of the previous patch.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20151123/8945ed57/attachment.html>


More information about the Poppler-bugs mailing list