[poppler] recent defect with page.get_text

Albert Astals Cid aacid at kde.org
Sun Sep 18 07:09:30 PDT 2011


Please do not email me, email the list.

A Diumenge, 18 de setembre de 2011, vàreu escriure:
> On 09/18/2011 02:41 PM, Albert Astals Cid wrote:
> > A Diumenge, 18 de setembre de
> 
>       2011, alex bodnaru vàreu escriure:
> 
> hello friends,
> 
> > Hi
> 
> thanks a lot albert for considering my problem.

I am not considering your problem, I am complaining about the lack of 
information in your original mail ;-)

> 
> i'm using poppler through python (that invokes glib interface).
> 
> a recent change (probably together with get_text separation) broke the glib
> interface.
> 
> > what does recent mean? 0.16.7? 0.17.x? git master?
> 
> 0.16.7.

So 0.16.7 does not work, which is the version you know it works?

Albert

P.S: Would it be possible for you not to send HTML email?

> 
> > Albert
> 
> thanks again,
> alex
> 
> i can't load the entire page text with get_text (see the glib demo) of one
> pdf i have, but pdftotext does output the entire text.
> 
> my pdf is attached. i apology for the language, but i promise it's a non
> offending cadastre report. please see that not all text lines are being
> output by get_text.
> 
> could you help?
> 
> thanks in advance,
> 
> alex
> 
>       _______________________________________________
> 
>       > poppler mailing list
>       > 
>       > poppler at lists.freedesktop.org
>       > 
>       > http://lists.freedesktop.org/mailman/listinfo/poppler


More information about the poppler mailing list