[Poppler-bugs] [Bug 102911] New: Newer versions of pdftotext don't extract bold & underlined text
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Wed Sep 20 23:36:35 UTC 2017
https://bugs.freedesktop.org/show_bug.cgi?id=102911
Bug ID: 102911
Summary: Newer versions of pdftotext don't extract bold &
underlined text
Product: poppler
Version: unspecified
Hardware: x86-64 (AMD64)
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: utils
Assignee: poppler-bugs at lists.freedesktop.org
Reporter: oamasood at gmail.com
Created attachment 134391
--> https://bugs.freedesktop.org/attachment.cgi?id=134391&action=edit
Example PDF
The current version of pdftotext (0.59.0) doesn't extract the bolded &
underlined text out of attached pdf when -raw is used. For example, notice that
'Equipment Group 202A' is missing from the pdftotext -raw output. Confirmed
behavior on Mac, Ubuntu 14, Ubuntu 16, and Alpine Linux.
On the other hand, we tried with version 0.24 (or version 3.03, which doesn't
show The Popper Developers in the -v output, it only shows "Copyright 1996-2011
Glyph & Cog, LLC"), and those versions do have 'Equipment Group 202A' and
generally produce better output.
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20170920/2da530af/attachment.html>
More information about the Poppler-bugs
mailing list