[Poppler-bugs] [Bug 107419] New: Allow page ranges in pdftohtml

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Jul 29 16:16:23 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=107419

            Bug ID: 107419
           Summary: Allow page ranges in pdftohtml
           Product: poppler
           Version: unspecified
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: utils
          Assignee: poppler-bugs at lists.freedesktop.org
          Reporter: ulatekh at yahoo.com

Created attachment 140875
  --> https://bugs.freedesktop.org/attachment.cgi?id=140875&action=edit
Patch to add functionality

I'm using pdftohtml to extract information from PDFs and organize the results
into a database, so I had a chance to dig through the code.

The patch adds a "-pg" command-line option to pdftohtml that lets the user
specify noncontiguous ranges of pages.

I don't know what the policy is on using Boost inside poppler; if it's a
problem, I can hand-write a simple integer interval set instead.
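
For illustration, a hand-written integer interval set could look roughly like
the sketch below. It is not taken from the attached patch: the names PageSet,
parsePositive and parsePageRanges are hypothetical, and the "1-3,7,10-12"
argument syntax is only an assumption about what "-pg" might accept.

```cpp
#include <algorithm>
#include <iterator>
#include <map>
#include <sstream>
#include <string>
#include <utility>

// Hypothetical stand-in for a Boost interval set (not the patch's code).
// Holds disjoint, non-adjacent closed ranges [first, last], keyed by first page.
// Assumes page numbers are small positive ints, so first - 1 and last + 1
// cannot overflow.
class PageSet {
public:
    // Add [first, last], merging with any overlapping or adjacent range.
    void add(int first, int last) {
        if (first > last) std::swap(first, last);
        auto it = ranges_.upper_bound(first);
        // Step back if the preceding range overlaps or touches the new one.
        if (it != ranges_.begin()) {
            auto prev = std::prev(it);
            if (prev->second >= first - 1) it = prev;
        }
        // Absorb every range that overlaps or touches [first, last].
        while (it != ranges_.end() && it->first <= last + 1) {
            first = std::min(first, it->first);
            last = std::max(last, it->second);
            it = ranges_.erase(it);
        }
        ranges_[first] = last;
    }

    // True if the page lies inside one of the stored ranges.
    bool contains(int page) const {
        auto it = ranges_.upper_bound(page);
        if (it == ranges_.begin()) return false;
        return std::prev(it)->second >= page;
    }

    const std::map<int, int>& ranges() const { return ranges_; }

private:
    std::map<int, int> ranges_;
};

// Parse a decimal page number >= 1. At most 9 digits, so std::stoi cannot overflow.
static bool parsePositive(const std::string& s, int& out) {
    if (s.empty() || s.size() > 9) return false;
    for (char c : s) {
        if (c < '0' || c > '9') return false;
    }
    out = std::stoi(s);
    return out >= 1;
}

// Parse an assumed syntax such as "1-3,7,10-12" into `out`.
// Returns false on malformed input; `out` may then be partially filled.
bool parsePageRanges(const std::string& spec, PageSet& out) {
    std::istringstream in(spec);
    std::string token;
    bool any = false;
    while (std::getline(in, token, ',')) {
        int first = 0;
        int last = 0;
        const auto dash = token.find('-');
        if (dash == std::string::npos) {
            if (!parsePositive(token, first)) return false;
            last = first;
        } else if (!parsePositive(token.substr(0, dash), first) ||
                   !parsePositive(token.substr(dash + 1), last) ||
                   first > last) {
            return false;
        }
        out.add(first, last);
        any = true;
    }
    return any;
}
```

With this sketch, a conversion loop could call contains(page) for each page
in the document and skip the pages that were not requested. Keeping the ranges
in a std::map keyed by first page makes both add() and contains() logarithmic
in the number of stored ranges.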

The "-pg" command-line option may be useful in other utilities, e.g.
pdfseparate.
