[poppler] Font info not getting properly into html when using pdftohtml
Albert Astals Cid
aacid at kde.org
Tue Feb 8 11:00:27 PST 2011
A Dimarts, 8 de febrer de 2011, Sushant Sinha va escriure:
> I have attached a pdf document which is a mix of english and hindi
> languages. For Hindi it uses Aryan2 font. When I use pdftohtml on this
> doc, I do not get any font information in the html file. When I use the
> "-xml" or the "-c" Aryan2 font is still outputted as Times. So there is
> some problem with embedded fonts.
>
> I have attached the pdf doc for your analysis.
>
> $ pdffonts 2211.pdf
>
> name type emb sub uni
> object ID
> ------------------------------------ ----------------- --- --- ---
> ---------
> CFFEEL+TimesNewRoman TrueType yes yes no 1852
> 0
> CFFEGM+TimesNewRoman,Bold TrueType yes yes no
> 1854 0
> CFFFEJ+TimesNewRoman,Italic TrueType yes yes no
> 93 0
> CFFFHI+SymbolMT CID TrueType yes yes yes
> 94 0
> CFFGDG+Aryan2-Bold TrueType yes yes no
> 95 0
> CFFGEI+Aryan2-Normal TrueType yes yes no
> 97 0
> CFFGEH+Aryan2-Normal CID TrueType yes yes yes
> 96 0
> CFFGII+Tahoma,Bold TrueType yes yes no
> 98 0
> CFFGLJ+Tahoma TrueType yes yes no
> 99 0
>
>
> Can someone tell me why is this happening?
Most probably a bug.
Unless you are willing to develop a patch to fix the bug yourself please file
a bug at bugs.freedesktop.org so this is not forgotten.
Albert
>
> -Sushant.
More information about the poppler
mailing list