[poppler] page.text() does not take page orientation into account?

Albert Astals Cid aacid at kde.org
Tue Mar 8 23:49:46 UTC 2016


El Tuesday 08 March 2016, a les 23:34:28, Jeroen Ooms va escriure:
> When extracting text from a landscape pdf file using the cpp
> interface, text at the far right of the page does not get extracted .I
> think the problem is that page.text() always assumes portrait
> orientation and hence underestimates the width of the page:
> 
>   p->text()
>   p->text(p->page_rect())
> 
> Is this expected? What is the best way to extract all text from the
> page, irrespective of size and orientation?

Sounds like a bug.

Which frontend are you using?

Cheers,
  Albert

> 
> An example landscape pdf is here:
> https://github.com/ropensci/pdftools/files/161587/waurika_news_democrat.pdf
> _______________________________________________
> poppler mailing list
> poppler at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/poppler



More information about the poppler mailing list