[Poppler-bugs] [Bug 62266] [PATCH] try to detect line breaks in the PDF and insert them in raw mode for pdftotext

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Mar 26 16:22:30 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=62266

--- Comment #17 from Andrew Gallant <jamslam at gmail.com> ---
>> > From the man page
>> >        -raw   Keep the text in content stream order.  This is a hack
>> > which
>> > often "undoes" column formatting, etc.  Use of raw mode is no longer
>> > recommended.
>>
>> From the description, it seems as if I am using raw mode exactly as it
>> was intended.
>
> Ok, let's be clear, what is "reading order" for you?

The order in which one reads the text in the PDF. This seems
consistent with the description of raw mode: it "often 'undoes' column
formatting."

- Andrew

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20130326/5fa0d6b4/attachment.html>


More information about the Poppler-bugs mailing list