[poppler] Font info not getting properly into html when using pdftohtml

Albert Astals Cid aacid at kde.org
Tue Feb 8 11:00:27 PST 2011


A Dimarts, 8 de febrer de 2011, Sushant Sinha va escriure:
> I have attached a pdf document which is a mix of english and hindi
> languages. For Hindi it uses Aryan2 font. When I use pdftohtml on this
> doc, I do not get any font information in the html file. When I use the
> "-xml" or the "-c" Aryan2 font is still outputted as Times. So there is
> some problem with embedded fonts.
> 
> I have attached the pdf doc for your analysis.
> 
> $ pdffonts 2211.pdf
> 
> name                                 type              emb sub uni
> object ID
> ------------------------------------ ----------------- --- --- ---
> ---------
> CFFEEL+TimesNewRoman                 TrueType          yes yes no   1852
> 0
> CFFEGM+TimesNewRoman,Bold            TrueType          yes yes no
> 1854  0
> CFFFEJ+TimesNewRoman,Italic          TrueType          yes yes no
> 93  0
> CFFFHI+SymbolMT                      CID TrueType      yes yes yes
> 94  0
> CFFGDG+Aryan2-Bold                   TrueType          yes yes no
> 95  0
> CFFGEI+Aryan2-Normal                 TrueType          yes yes no
> 97  0
> CFFGEH+Aryan2-Normal                 CID TrueType      yes yes yes
> 96  0
> CFFGII+Tahoma,Bold                   TrueType          yes yes no
> 98  0
> CFFGLJ+Tahoma                        TrueType          yes yes no
> 99  0
> 
> 
> Can someone tell me why is this happening?

Most probably a bug.

Unless you are willing to develop a patch to fix the bug yourself please file 
a bug at bugs.freedesktop.org so this is not forgotten.

Albert

> 
> -Sushant.


More information about the poppler mailing list