[Poppler-bugs] [Bug 103798] libpoppler cannot recreate pdftotext output, because physical_layout is not handled correctly

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Nov 18 17:35:29 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=103798

--- Comment #7 from dummydummy at gmx.fr ---
Unfortunately the preceding patch does not resolve the problem on its own.

Comparing main() of pdftotext.cc with the page::text() in
.../gcc/poppler-page.cpp there are 2 other differences:

(1) doc->displayPage(&td, d->index + 1, 72, 72, 0, FALSE, TRUE, false); vs
doc->displayPages(textOut, firstPage, lastPage, resolution, resolution, 0,
TRUE, FALSE, gFalse);
i.e. page::text () sets GBool useMediaBox = false, GBool crop = true and in
pdftotext it is the other way round!?

I have yet to investigate what this means by reading the source...

(2) the page::text() in .../gcc/poppler-page.cpp subsequently calls
td.getText() while pdftotext does not... Yet more source code to wade
through...

to be continued...

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20171118/635f6d27/attachment-0001.html>


More information about the Poppler-bugs mailing list