[Poppler-bugs] [Bug 91058] Unicode strings saved as literal strings

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Jul 23 06:02:46 PDT 2015


https://bugs.freedesktop.org/show_bug.cgi?id=91058

--- Comment #4 from Marek Kasik <mkasik at redhat.com> ---
The problem is actually in the PDF specification. It doesn't specify how to
deal with non-ASCII text in forms. All simple fonts (chapter 5.5) use 8-bit
codes, which is not enough, so they cannot be used in general (this includes
all 14 base fonts).
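
To illustrate the 8-bit limit, here is a rough sketch in Python. It only checks
whether a string can be represented with one byte per character. The helper
name fits_in_8bit is made up for this example, and Python's cp1252 codec is
used as an approximate stand-in for a simple-font encoding such as
WinAnsiEncoding; that mapping is an assumption, not something Poppler does.

```python
def fits_in_8bit(text: str, encoding: str = "cp1252") -> bool:
    """Return True if every character of `text` has a single-byte code.

    Assumption: `encoding` is a Python codec that only approximates the
    encoding of a simple PDF font; it is not the font's real encoding.
    """
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        # At least one character has no code in the single-byte encoding.
        return False


if __name__ == "__main__":
    for sample in ("cafe", "café", "Příliš", "日本語"):
        print(sample, fits_in_8bit(sample))
```

Text for which this returns False cannot be shown with a simple font that uses
that encoding, whatever the string syntax used to store it.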

One possibility here seems to be to use a CID font.

Or try to go beyond the 8-bit constraint and check whether the font you use
has the glyphs for the Unicode characters you use when rendering text.

Btw, this answer summarises the font problems that arise when changing text in
a PDF quite well: http://stackoverflow.com/a/15973614.

Btw2, Adobe Reader stores text that includes non-ASCII characters as Unicode
hexadecimal strings and ASCII-only text as normal strings. But it also changes
the font in the PDF to a CID font in the non-ASCII case.
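
The string-storage half of that behaviour could be sketched roughly as below.
This is only an illustration in Python, not Poppler's or Adobe Reader's code,
and the function name pdf_string is hypothetical. It assumes the Unicode case
is written as UTF-16BE with a leading FE FF byte order mark, which is how PDF
text strings mark Unicode content. It does not touch the font, and it does not
escape control characters.

```python
def pdf_string(text: str) -> str:
    """Serialise `text` as a PDF string object (illustrative sketch only)."""
    if text.isascii():
        # ASCII-only text: literal string, with backslash and parentheses
        # escaped so the delimiters stay balanced.
        escaped = (
            text.replace("\\", "\\\\")
                .replace("(", "\\(")
                .replace(")", "\\)")
        )
        return "(" + escaped + ")"
    # Non-ASCII text: UTF-16BE with a byte order mark, as a hex string.
    data = b"\xfe\xff" + text.encode("utf-16-be")
    return "<" + data.hex().upper() + ">"


if __name__ == "__main__":
    print(pdf_string("Name (optional)"))
    print(pdf_string("é"))
```

Writing the hexadecimal form is the easy part. Without a font that has the
needed glyphs, the field still won't render correctly.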
