[poppler] Comparing geometric layout information across "pages"

Glad Deschrijver glad.deschrijver at gmail.com
Tue Oct 11 00:31:16 PDT 2011


On Tuesday 11 October 2011, Alec Taylor wrote:
> Good afternoon,
> 
> Do you have any recommendations and/or sample code for comparing textual
> and geometric layout information across pages?
> 
> Basically I'm trying to recognise patterns within documents, e.g., page
> numbers, headers and footers, titles, column information, etc., using the
> capabilities of the Poppler PDF library.

I'm not sure it will help you much, but you can have a look at DiffPDF, which
uses poppler to compare two PDF files page by page (both textually and
visually):
http://www.qtrac.eu/diffpdf.html
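
For the header/footer/page-number part of the question, here is a rough
sketch of one possible approach. It is not how DiffPDF works, just an
illustration. It assumes you have already extracted, for every page, a list
of (text, x, y) tuples giving each text fragment and the top-left corner of
its bounding box in points (for example from the word boxes that
`pdftotext -bbox` or `Poppler::Page::textList()` report). The function
names, the grid size and the 60% threshold are all hypothetical choices.

```python
import re
from collections import Counter


def normalise(text):
    # Collapse digit runs so that page numbers such as "3" and "12",
    # or "Page 3" and "Page 12", compare as equal.
    return re.sub(r"\d+", "#", text.strip())


def repeated_elements(pages, grid=5.0, min_fraction=0.6):
    """Return the (text, grid_x, grid_y) keys that recur across pages.

    pages is a list of pages; each page is a list of (text, x, y) tuples.
    Positions are snapped to a coarse grid so that small offsets between
    pages are ignored. Fragments that fall near a grid boundary can still
    be split into two cells; this is a known limitation of the sketch.
    """
    counts = Counter()
    for page in pages:
        # Use a set so that a key is counted at most once per page.
        seen = {
            (normalise(text), round(x / grid), round(y / grid))
            for text, x, y in page
        }
        counts.update(seen)
    threshold = min_fraction * len(pages)
    return {key for key, n in counts.items() if n >= threshold}


if __name__ == "__main__":
    # Made-up example data: a running header, a body line, a page number.
    pages = [
        [("My Report", 72.0, 36.0), ("Introduction", 72.0, 100.0), ("1", 300.0, 760.0)],
        [("My Report", 72.4, 36.2), ("Methods", 72.0, 100.0), ("2", 300.5, 760.1)],
        [("My Report", 71.8, 35.9), ("Results", 72.0, 100.0), ("3", 299.6, 759.8)],
    ]
    for key in sorted(repeated_elements(pages)):
        print(key)
```

Anything that shows up at (nearly) the same position on most pages is a
candidate header, footer or page number; the rest is probably body text.
Column detection would need something more, e.g. looking at the
distribution of the x coordinates of line starts.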

Best regards,
Glad

-- 
 Everything that is really great and inspiring is created by
 the individual who can labor in freedom.
      -- Albert Einstein, Out of My Later Years (1950)


