[poppler] About parseCharName in GfxFont.cc

王璐 coolwanglu at gmail.com
Fri Aug 24 00:43:00 PDT 2012


I tried to send the files as attachments, but they were rejected by the
mailing list.

The pdf can be found at http://dl.dropbox.com/u/75853179/med-9.pdf

Please check the 'LEKSJON' in the top left corner; without the ToUnicode map
you should get the correct characters.

By the way, if you try to extract the fonts using fontforge, it won't apply
ToUnicode for non-TTF fonts.
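
For anyone who wants to look for this kind of problem in a dump like the one
described in the quoted message below ("GlyphID Unicode [Unicode...] # CharCode"),
here is a minimal sketch that reports char codes sharing the same Unicode
target. It assumes every number in the dump is hexadecimal (with or without a
0x prefix), which may not match the real file. The helper names and the sample
data, including the glyph IDs, are made up for illustration.

```python
from collections import defaultdict


def parse_dump(text):
    """Parse lines of the form 'GlyphID Unicode [Unicode...] # CharCode'.

    Assumption: all numbers are hexadecimal, with or without a 0x prefix.
    Returns a dict mapping char code -> tuple of Unicode code points.
    """
    mapping = {}
    for line in text.splitlines():
        body, sep, comment = line.partition("#")
        fields = body.split()
        if not sep or len(fields) < 2 or not comment.strip():
            continue  # skip blank or malformed lines
        char_code = int(comment.strip(), 16)
        # fields[0] is the glyph ID; the rest are Unicode code points
        mapping[char_code] = tuple(int(f, 16) for f in fields[1:])
    return mapping


def find_collisions(mapping):
    """Return {unicode tuple: sorted char codes} for targets used more than once."""
    by_target = defaultdict(list)
    for code, target in mapping.items():
        by_target[target].append(code)
    return {t: sorted(c) for t, c in by_target.items() if len(c) > 1}


# Hypothetical sample: 0x45 ('E') and 0x65 ('e') both map to U+0065 ('e').
SAMPLE = """\
36 0x4c # 0x4c
38 0x65 # 0x45
70 0x65 # 0x65
44 0x4b # 0x4b
"""

collisions = find_collisions(parse_dump(SAMPLE))
for target, codes in collisions.items():
    text = "".join(chr(u) for u in target)
    print(repr(text), "<-", [hex(c) for c in codes])
```

A collision is not always an error (two codes can legitimately map to the same
character), so the output is only a list of candidates to check by hand.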


- Lu

On Fri, Aug 24, 2012 at 9:33 AM, 王璐 <coolwanglu at gmail.com> wrote:

> I've attached a problematic PDF. Notice the 'LEKSJON' in the top left
> corner: if you copy the text out, you'll get 'LeKSjoN'.
> So in the ToUnicode map for that font, both 'E' and 'e' are mapped to 'e'.
>
> I've extracted the font and attached it as 'f2.cff'. The font itself is OK.
> I've also attached a file showing the output of font->getToUnicode(); the
> format of each line is
>
> GlyphID Unicode [Unicode...] # CharCode
>
> You can see the problem at the lines for 0x45 and 0x65.
>
> Thanks
>
> - Lu Wang
>
>
>
> On Fri, Aug 24, 2012 at 9:21 AM, suzuki toshiya <
> mpsuzuki at hiroshima-u.ac.jp> wrote:
>
>> 王璐 wrote:
>>
>>>    Usually this is done by the ToUnicode map, but I've seen many bad
>>> mappings for Type 1 fonts, where the Type 1 font itself provides good
>>> mappings.
>>>
>>
>> Could you give some concrete examples?
>>
>> Regards,
>> mpsuzuki
>>
>
>