[poppler] paper on extracting text from PostScript

John Barstow jbowtie at amathaine.com
Sun Oct 12 17:41:45 PDT 2008


I recently came across this paper, which states the algorithm involved can
be applied to PDF. I thought it might be worth looking at to see if any of
the work could be used in improving the pdf-to-text output of poppler.

http://www.cs.waikato.ac.nz/~ihw/papers/98NM-Reed-IHW-Extract-Text.pdf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.freedesktop.org/archives/poppler/attachments/20081013/694112de/attachment.html 


More information about the poppler mailing list