[poppler] About parseCharName in GfxFont.cc
suzuki toshiya
mpsuzuki at hiroshima-u.ac.jp
Fri Aug 24 01:19:53 PDT 2012
Thanks, it's interesting... When I posted my previous message,
I was thinking the issue would be some CJK issue, but it was
ASCII issue! And, the problematic PDF is generated by Adobe InDesign.
Regards,
mpsuzuki
王璐 wrote:
> I tried to send the files through attachment, but got rejected from the
> mailling list
>
> The pdf can be found at http://dl.dropbox.com/u/75853179/med-9.pdf
>
> Please check the 'LEKSJON' on the top left corner, without ToUnicode map
> you should get the correct characters.
>
> btw, if you try to extract fonts using fontforge, it won't apply ToUnicode
> for non-ttf fonts.
>
>
> - Lu
>
> On Fri, Aug 24, 2012 at 9:33 AM, 王璐 <coolwanglu at gmail.com> wrote:
>
>> I've attached a problematic pdf, notice the 'LEKSJON' in the top left
>> corner, if you copy the text out, you'll get LeKSjoN
>> So in the ToUnicode map for that font, both 'E' and 'e' are mapped to 'e'
>>
>> I've extracted the font as 'f2.cff' attached. The font itself is ok.
>> I've also attached a file showing the font->getToUnicode(), the format for
>> each line is
>>
>> GlyphID Unicode [Unicode...] # CharCode
>>
>> You can see problem at lines of 0x45 and 0x65.
>>
>> Thanks
>>
>> - Lu Wang
>>
>>
>>
>> On Fri, Aug 24, 2012 at 9:21 AM, suzuki toshiya <
>> mpsuzuki at hiroshima-u.ac.jp> wrote:
>>
>>> 王璐 wrote:
>>>
>>>> Usually this is done by ToUnicode map, but I've many bad mapping for
>>>> Type 1 font, where Type 1 font itself provides good mappings.
>>>>
>>> Could you give some concrete examples?
>>>
>>> Regards,
>>> mpsuzuki
>>>
>>
>
More information about the poppler
mailing list