[poppler] Finding a way to sort the Pdf Text Blocks, find the number of columns in a page.

Albert Astals Cid aacid at kde.org
Tue Jul 27 12:36:56 PDT 2010


A Dimarts, 27 de juliol de 2010, srinivas adicherla va escriure:
> Hi all,
> 
>         I used the poppler_page_get_selection_
> region() to find the line rectangles of each and every line in a page.
> From that I find the blocks, then I find the columns of the page. From the
> number of columns of the page, Iam able to sort the blocks. So that the
> selection is very good.
> 
> Right now in poppler the selection is bit a problem. After doing all these
> its almost look like Adobe Reader's Selection.
> 
> Please give me suggestions on improving this.

Carlos? The qt frontends don't expose the selection method so i think it's up 
to you for the moment.

> 
> I attached two files with this mail.
> 
> getcol.c is able to sort the blocks in single/multicolumn pdfs.
> getcolumn.c is based on the above sorting used to do the selection.
> 
> 
> *I sent patch about getting the PDF ID from the document before. Albert
> said it was ok. But he asked carlos ?
> 
> Please give me the status about it. *

Carlos?

Albert

> 
> 
> Thanks


More information about the poppler mailing list