[poppler] About parseCharName in GfxFont.cc

Leonard Rosenthol lrosenth at adobe.com
Fri Aug 24 07:48:25 PDT 2012


What makes you think that the ToUnicode table for that font is bad?   It may not be what you expect, but that doesn't make it bad. For all you know, that is the information in the original font…

Leonard

From: 王璐 <coolwanglu at gmail.com<mailto:coolwanglu at gmail.com>>
To: "mpsuzuki at hiroshima-u.ac.jp<mailto:mpsuzuki at hiroshima-u.ac.jp>" <mpsuzuki at hiroshima-u.ac.jp<mailto:mpsuzuki at hiroshima-u.ac.jp>>
Cc: "poppler at lists.freedesktop.org<mailto:poppler at lists.freedesktop.org>" <poppler at lists.freedesktop.org<mailto:poppler at lists.freedesktop.org>>
Subject: Re: [poppler] About parseCharName in GfxFont.cc

I tried to send the files through attachment, but got rejected from the mailling list

The pdf can be found at http://dl.dropbox.com/u/75853179/med-9.pdf

Please check the 'LEKSJON' on the top left corner, without ToUnicode map you  should get the correct characters.

btw, if you try to extract fonts using fontforge, it won't apply ToUnicode for non-ttf fonts.


- Lu

On Fri, Aug 24, 2012 at 9:33 AM, 王璐 <coolwanglu at gmail.com<mailto:coolwanglu at gmail.com>> wrote:
I've attached a problematic pdf, notice the 'LEKSJON' in the top left corner, if you copy the text out, you'll get LeKSjoN
So in the ToUnicode map for that font, both 'E' and 'e' are mapped to 'e'

I've extracted the font as 'f2.cff' attached. The font itself is ok.
I've also attached a file showing the font->getToUnicode(), the format for each line is

GlyphID Unicode [Unicode...] # CharCode

You can see problem at lines of 0x45 and 0x65.

Thanks

- Lu Wang



On Fri, Aug 24, 2012 at 9:21 AM, suzuki toshiya <mpsuzuki at hiroshima-u.ac.jp<mailto:mpsuzuki at hiroshima-u.ac.jp>> wrote:
王璐 wrote:
   Usually this is done by ToUnicode map, but I've many bad mapping for
Type 1 font, where Type 1 font itself provides good mappings.

Could you give some concrete examples?

Regards,
mpsuzuki


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20120824/5ad5b4e6/attachment.html>


More information about the poppler mailing list