[Poppler-bugs] [Bug 20331] New: Patch for pdftotext to accept cropping options like pdftoppm

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Feb 26 08:09:58 PST 2009


http://bugs.freedesktop.org/show_bug.cgi?id=20331

           Summary: Patch for pdftotext to accept cropping options like
                    pdftoppm
           Product: poppler
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: general
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: jan at jockusch.de


Created an attachment (id=23324)
 --> (http://bugs.freedesktop.org/attachment.cgi?id=23324)
The cropping patch for pdftotext, based on version 0.8.7, works on current
version as well.

pdftotext has problems properly extracting text from multi-column PDFs.

To solve this problem, I added support for the -x, -y, -W, -H, and -r options
to pdftotext, taking the corresponding code sections from pdftoppm.

This way, I can crop out parts of a page and process columns separately.

I would greatly appreciate it if this minor change would become available to
all users.

If you need me to reformat or improve the patch in any way, feel free to
contact me.

I made the patch based on version 0.8.7, which I know is not the current one.
But the patch succeeds error-free with only a few lines of offset in the
current version as well.

Thanks for considering the patch!


-- 
Configure bugmail: http://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list