[poppler] recent defect with page.get_text

Albert Astals Cid aacid at kde.org
Sun Sep 18 04:41:06 PDT 2011


A Diumenge, 18 de setembre de 2011, alex bodnaru vàreu escriure:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> 
> hello friends,

Hi

> 
> i'm using poppler through python (that invokes glib interface).
> 
> a recent change (probably together with get_text separation) broke the glib
> interface.

what does recent mean? 0.16.7? 0.17.x? git master?

Albert

> 
> i can't load the entire page text with get_text (see the glib demo) of one
> pdf i have, but pdftotext does output the entire text.
> 
> my pdf is attached. i apology for the language, but i promise it's a non
> offending cadastre report. please see that not all text lines are being
> output by get_text.
> 
> could you help?
> 
> thanks in advance,
> 
> alex
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.11 (GNU/Linux)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
> 
> iJwEAQECAAYFAk510yUACgkQ2nA3WyrfyeNZwgQAllcOWabyWx5GYdG8FVPXipce
> Vy9/ZxT3eOiMqKpblSzsL+JZcJuuZsbEfOEePkVakGoCeYVVGXya+wQJ78ax2Ewv
> haoVQ9jhdzR4eIeOOCxfMpcuaRtKHL4D1ptpMfewGHDVkfLNNi8l9d0HCtH6R60j
> Mn2+FZ3RWFVedqpBQ3U=
> =MXUH
> -----END PGP SIGNATURE-----


More information about the poppler mailing list