[Poppler-bugs] [Bug 62266] [PATCH] try to detect line breaks in the PDF and insert them in raw mode for pdftotext

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Thu Mar 21 15:44:41 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=62266

--- Comment #8 from Albert Astals Cid <aacid at kde.org> ---
No, there is no assumption about reading order. The only assumption is that if
two characters are far enough apart from each other, there is a space between
them. That is needed because in a PDF you don't have to put in space characters
if you don't want to, and extracting text in raw order is one thing, while
extracting all the text as a single string with no spaces in between is another :D
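
To illustrate the gap heuristic described above, here is a minimal sketch in Python. It is not Poppler's actual code: the Glyph type, the join_with_inferred_spaces function and the min_gap_ratio threshold are all hypothetical names and values chosen for the example. Glyphs are kept in the order they are given (raw order), and a space is emitted only when the horizontal gap to the previous glyph is large enough.

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    """Hypothetical glyph record: one character and its horizontal placement."""
    char: str
    x: float          # left edge of the glyph
    width: float      # advance width of the glyph
    font_size: float  # used to scale the gap threshold


def join_with_inferred_spaces(glyphs, min_gap_ratio=0.1):
    """Concatenate glyphs in the given (raw) order, inserting a space
    whenever the gap to the previous glyph exceeds
    min_gap_ratio * font_size. The threshold is an assumed value."""
    out = []
    prev = None
    for g in glyphs:
        if prev is not None:
            gap = g.x - (prev.x + prev.width)
            if gap > min_gap_ratio * prev.font_size:
                out.append(" ")
        out.append(g.char)
        prev = g
    return "".join(out)


# The PDF contains no space character here, only a wider gap after "i".
glyphs = [
    Glyph("H", 0.0, 7.0, 10.0),
    Glyph("i", 7.0, 3.0, 10.0),
    Glyph("y", 13.0, 5.0, 10.0),
    Glyph("o", 18.0, 5.0, 10.0),
]
print(join_with_inferred_spaces(glyphs))  # Hi yo
```

Nothing in the sketch reorders the glyphs, so it makes no assumption about reading order. It only decides whether a space belongs between two consecutive characters.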
