<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [PATCH] try to detect line breaks in the PDF and insert them in raw mode for pdftotext"
href="https://bugs.freedesktop.org/show_bug.cgi?id=62266#c7">Comment # 7</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [PATCH] try to detect line breaks in the PDF and insert them in raw mode for pdftotext"
href="https://bugs.freedesktop.org/show_bug.cgi?id=62266">bug 62266</a>
from <span class="vcard"><a class="email" href="mailto:jamslam@gmail.com" title="Andrew Gallant &lt;jamslam@gmail.com&gt;"> <span class="fn">Andrew Gallant</span></a>
</span></b>
<pre>Ah, dang. I did not realize "stream" was jargon in the PDF world.
However, isn't there still some wiggle room for processing? For example, the
current code inserts a new line whenever the next word is detected not to be on
the same line as the current word (or if the next word is to the left of the
current word). I see my change as the same kind of processing. That is, there
actually *is* some assumption of reading order in "raw" mode.</pre>
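<pre>To make the rule above concrete, here is a minimal sketch in Python. It is
an illustration only, not poppler's actual code: the Word fields, the
same_line tolerance, and every function name are assumptions made for this
example. It emits a new line when the next word is not on the same line as the
current word, or when the next word starts to the left of the current word.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical word record; not a poppler type."""
    text: str
    x: float       # left edge of the word
    y: float       # baseline position
    height: float  # font height, used as a line tolerance


def same_line(cur: Word, nxt: Word, tolerance: float = 0.5) -> bool:
    # Assumed heuristic: baselines within half a font height share a line.
    return tolerance * cur.height >= abs(nxt.y - cur.y)


def needs_break(cur: Word, nxt: Word) -> bool:
    # Break if the next word is on another line, or to the left of this one.
    return (not same_line(cur, nxt)) or cur.x > nxt.x


def raw_text(words: list) -> str:
    # Walk the words in content order and choose a separator for each pair.
    parts = []
    for i, word in enumerate(words):
        parts.append(word.text)
        if i + 1 == len(words):
            break
        parts.append("\n" if needs_break(word, words[i + 1]) else " ")
    return "".join(parts)
```

Even this small rule assumes a left-to-right, top-to-bottom reading order,
which is the point made above about "raw" mode.</pre>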
</div>
</p>
</body>
</html>