[Poppler-bugs] [Bug 94517] New: page.text() does not take page orientation into account

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Mar 12 20:42:43 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=94517

            Bug ID: 94517
           Summary: page.text() does not take page orientation into
                    account
           Product: poppler
           Version: unspecified
          Hardware: Other
                OS: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: cpp frontend
          Assignee: poppler-bugs at lists.freedesktop.org
          Reporter: jeroen.ooms at stat.ucla.edu

See also: https://lists.freedesktop.org/archives/poppler/2016-March/011755.html

When extracting text from a landscape pdf file using the cpp
interface, text at the far right of the page does not get extracted .I
think the problem is that page.text() always assumes portrait
orientation and hence underestimates the width of the page:

  p->text()
  p->text(p->page_rect())

Is this expected? What is the best way to extract all text from the
page, irrespective of size and orientation?

An example landscape pdf is here:
https://github.com/ropensci/pdftools/files/161587/waurika_news_democrat.pdf

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20160312/fc88cd0c/attachment.html>


More information about the Poppler-bugs mailing list