[Poppler-bugs] [Bug 66693] Greek support package - some characters output as symbols not letters

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Fri Aug 16 04:31:49 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=66693

--- Comment #8 from Jason Crain <jason at aquaticape.us> ---
Created attachment 84138
  --> https://bugs.freedesktop.org/attachment.cgi?id=84138&action=edit
Normalize more characters in font Unicode map

The previous patches were to fix searches.  This one should fix pdftotext
output.  For this document, poppler is just using the Unicode values from the
ToUnicode CMap, but acroread modifies or decomposes some of these characters,
so I guess poppler should too.

Attached patch moves the normalization pass to run after the CMap is read and
additionally normalizes some characters that I noticed acroread changes.  Some
greek letters, OE ligatures, and presentation forms blocks.

And all three patches should be applied, preferrably in this order:

Add Unicode mappings to gen-unicode-tables.py
Regenerate UnicodeDecompTables.h from gen-unicode-tables.py
Normalize more characters in font Unicode map

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20130816/bc2c518b/attachment.html>


More information about the Poppler-bugs mailing list