[poppler] Finding a way to sort the Pdf Text Blocks, find the number of columns in a page.
Albert Astals Cid
aacid at kde.org
Tue Jul 27 12:36:56 PDT 2010
A Dimarts, 27 de juliol de 2010, srinivas adicherla va escriure:
> Hi all,
>
> I used the poppler_page_get_selection_
> region() to find the line rectangles of each and every line in a page.
> From that I find the blocks, then I find the columns of the page. From the
> number of columns of the page, Iam able to sort the blocks. So that the
> selection is very good.
>
> Right now in poppler the selection is bit a problem. After doing all these
> its almost look like Adobe Reader's Selection.
>
> Please give me suggestions on improving this.
Carlos? The qt frontends don't expose the selection method so i think it's up
to you for the moment.
>
> I attached two files with this mail.
>
> getcol.c is able to sort the blocks in single/multicolumn pdfs.
> getcolumn.c is based on the above sorting used to do the selection.
>
>
> *I sent patch about getting the PDF ID from the document before. Albert
> said it was ok. But he asked carlos ?
>
> Please give me the status about it. *
Carlos?
Albert
>
>
> Thanks
More information about the poppler
mailing list