[poppler] Graphical caracter filtering

stefano.rubino stefano.rubino at laposte.net
Fri Jun 21 07:06:54 PDT 2013


Hello Ross,

2013/6/21, Ross Moore :
> Can you not do some post-processing in a text editor afterwards?
=> My initial used was to be able to correctly parse pdf document
(without any textual/html/xml/ ... conversion)


> Are the pieces located together, or intermingled with the elements
> from the rows of the matrix?
=> the elements seem to be processed independtly
Obsiously, I've the same problem with caracters like : the left rectangle corner, and the right rectangle corner
wich appended together in the same line appear to be a uniq case (kind of : "head squeezed rectangle"; For instance used for form to get the phone number digits)

Une messagerie gratuite, garantie à vie et des services en plus, ça vous tente ?
Je crée ma boîte mail www.laposte.net
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20130621/55789b97/attachment.html>


More information about the poppler mailing list