[Poppler-bugs] [Bug 62266] [PATCH] try to detect line breaks in the PDF and insert them in raw mode for pdftotext

bugzilla-daemon at freedesktop.org
Tue Mar 19 17:01:43 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=62266

--- Comment #7 from Andrew Gallant <jamslam at gmail.com> ---
Ah, dang. I did not realize "stream" was jargon in the PDF world.

However, isn't there still some wiggle room for processing? For example, the
current code inserts a newline whenever the next word is detected not to be on
the same line as the current word (or if the next word is to the left of the
current word). I see my change as being in the same spirit as that kind of
processing, i.e., there actually *is* some assumption of reading order in "raw"
mode.
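
To make the heuristic I am describing concrete, here is a rough Python sketch
of it. This is only an illustration of the behaviour as I understand it, not
Poppler's actual C++ code: the names `Word`, `same_line` and `raw_text` are
hypothetical, and the "same line" test (vertical overlap of more than half the
smaller word height) is an assumption made for the example.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """A word with its bounding box (hypothetical structure for illustration)."""
    text: str
    x_min: float
    x_max: float
    y_min: float
    y_max: float


def same_line(a: Word, b: Word) -> bool:
    # Assumed criterion: the vertical extents overlap by more than half
    # of the smaller word's height.
    overlap = min(a.y_max, b.y_max) - max(a.y_min, b.y_min)
    smaller = min(a.y_max - a.y_min, b.y_max - b.y_min)
    return overlap > 0.5 * smaller


def raw_text(words: list[Word]) -> str:
    """Join words in the order given, inserting a newline when the next
    word is on a different line or starts to the left of the current one."""
    out = []
    for i, word in enumerate(words):
        out.append(word.text)
        if i + 1 < len(words):
            nxt = words[i + 1]
            if not same_line(word, nxt) or nxt.x_min < word.x_min:
                out.append("\n")
            else:
                out.append(" ")
    return "".join(out)
```

Even though the words are emitted in the order they were given, the newline
decision depends on their positions, which is the "assumption of reading
order" I mean.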
