[poppler] pdftotext font information

Adam Reichold adam.reichold at t-online.de
Fri Jun 1 08:31:26 UTC 2018


Hello,

I am not completely sure what the whole HtmlFont::pos machinery is
about, but I created the attached patch based on the assumption that it
is meant to remove the style suffixes to get the font family name, i.e.
"Courier" instead of "Courier-Bold". Please give it a try.

Best regards,
Adam

Am 31.05.2018 um 23:31 schrieb obsidian .:
> +1
> 
> On Fri, May 25, 2018 at 10:46 AM, <thomas.karthe at daimler.com> wrote:
> 
>> Hello all,
>>
>>
>>
>> I am using pdftohtml (poppler-0.64.0) with options -xml -fontfullname to
>> extract text with font information.
>>
>> This works perfect, when all fonts have different size.
>>
>>
>>
>> If there are two fonts with same size but of different family, only the
>> first font is reported.
>>
>> Without option -fontfullname, all fonts are reported with family "Times".
>>
>>
>>
>> Any chance to get this fixed?
>>
>>
>>
>> Kind regards
>>
>> Thomas
>>
>>
>>
>> If you are not the addressee, please inform us immediately that you have
>> received this e-mail by mistake, and delete it. We thank you for your
>> support.
>>
>>
>> _______________________________________________
>> poppler mailing list
>> poppler at lists.freedesktop.org
>> https://lists.freedesktop.org/mailman/listinfo/poppler
>>
>>
> 
> 
> 
> _______________________________________________
> poppler mailing list
> poppler at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/poppler
> 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: replace-htmlfont-pos-v1.diff
Type: text/x-patch
Size: 9262 bytes
Desc: not available
URL: <https://lists.freedesktop.org/archives/poppler/attachments/20180601/6342fea2/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 488 bytes
Desc: OpenPGP digital signature
URL: <https://lists.freedesktop.org/archives/poppler/attachments/20180601/6342fea2/attachment.sig>


More information about the poppler mailing list