[poppler] How does Poppler extract Type1C embedded fonts with custom encoding?
Olaf Drümmer
olaflist at callassoftware.com
Wed Jun 5 13:36:56 PDT 2013
Hi,
The file in the Stack Overflow question does not seem to contain enough information to establish a useful encoding, unless a ToUnicode table is also present. I can't check, as I do not have that file at hand.
Type1C per se should not be any different from plain Type1 for text extraction, as you would not look inside the font. One of three things has to hold: the glyph names are defined (i.e. they are listed in the Adobe Glyph List), the encoding is a known encoding, or a ToUnicode table is in place.
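
To make the three cases concrete, here is a minimal sketch in Python of how an extractor might resolve a single character code. It is not Poppler's actual code. The function name, its parameters and the two tiny lookup tables are made up for illustration; a real extractor would load the complete Adobe Glyph List and the full standard encodings, and would also handle names of the form uniXXXX.

```python
# Hypothetical, heavily abridged tables (assumptions for this sketch only).
AGL_SUBSET = {"A": "A", "B": "B", "space": " ", "fi": "\ufb01", "Euro": "\u20ac"}
WIN_ANSI_SUBSET = {0x20: "space", 0x41: "A", 0x42: "B", 0x80: "Euro"}


def code_to_text(code, to_unicode=None, differences=None, base_encoding=None):
    """Map one character code to text, or return None if nothing applies.

    to_unicode    -- dict: code -> string, standing in for a ToUnicode CMap
    differences   -- dict: code -> glyph name, standing in for /Differences
    base_encoding -- dict: code -> glyph name, standing in for a known encoding
    """
    # Case 1: a ToUnicode table is in place; it answers the question directly.
    if to_unicode and code in to_unicode:
        return to_unicode[code]

    # Case 2: find a glyph name. Custom /Differences entries override
    # the known base encoding for the codes they cover.
    name = None
    if differences and code in differences:
        name = differences[code]
    elif base_encoding and code in base_encoding:
        name = base_encoding[code]
    if name is None:
        return None

    # Case 3: the glyph name only helps if it is a defined name.
    # An arbitrary name such as "g17" carries no text meaning.
    return AGL_SUBSET.get(name)


if __name__ == "__main__":
    print(code_to_text(0x41, base_encoding=WIN_ANSI_SUBSET))
    print(code_to_text(1, differences={1: "g17"}))
```

The second call shows the situation I suspect for the Stack Overflow file: a custom encoding whose glyph names are not defined names, with no ToUnicode table, leaves nothing to map the code to.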
Olaf
On 5 Jun 2013, at 17:05, Michael Scheerer wrote:
> Hello!
>
> After examining the Poppler code, I'm wondering how applications using Poppler (Evince, etc.) extract the text content of embedded Type1C fonts with custom encodings.
> I mean such stuff:
>
> http://stackoverflow.com/questions/12703387/pdf-font-encoding
>
> Can anyone help me?
>
> Thanks in advance,
>
> Michael