[poppler] Differing number of items returned from get_text{,_layout} for glyphs over page edge

Peter Waller peter at scraperwiki.com
Thu Oct 31 18:20:03 CET 2013


Hi All,

I've attached a PDF containing a single phrase whose last letter overlaps
the page bounding box. Unless I'm mistaken, poppler_page_get_text_layout
returns 18 glyphs for it, while poppler_page_get_text returns only 17.

Can anyone else confirm? I'm running poppler 0.24.1.

Is this a bug? Can I safely filter out layout rectangles which are off-page?
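
To make the second question concrete, here is a minimal sketch of such an
off-page filter. It is an illustration under stated assumptions, not a
confirmed workaround: Rect is a hypothetical stand-in with the same x1, y1,
x2, y2 fields as PopplerRectangle so that the code builds without poppler,
the helper names rect_on_page and filter_on_page are made up, and the page is
assumed to span [0, width] x [0, height]. Whether dropping these rectangles
really restores a one-to-one match with the characters from
poppler_page_get_text is exactly what is being asked, so the sketch does not
assume it.

```c
#include <stddef.h>

/* Hypothetical stand-in for PopplerRectangle (same field names),
 * so this sketch compiles without the poppler headers. */
typedef struct {
    double x1, y1, x2, y2;
} Rect;

/* Returns 1 if the rectangle lies entirely within the page,
 * assumed to span [0, width] x [0, height]; 0 otherwise.
 * Corners are normalised first, so the check does not depend on
 * which corner is stored as (x1, y1). */
static int rect_on_page(const Rect *r, double width, double height)
{
    double left   = r->x1 < r->x2 ? r->x1 : r->x2;
    double right  = r->x1 < r->x2 ? r->x2 : r->x1;
    double bottom = r->y1 < r->y2 ? r->y1 : r->y2;
    double top    = r->y1 < r->y2 ? r->y2 : r->y1;

    return left >= 0.0 && right <= width &&
           bottom >= 0.0 && top <= height;
}

/* Compacts the array in place, keeping only on-page rectangles in
 * their original order. Returns the number of rectangles kept. */
static size_t filter_on_page(Rect *rects, size_t n,
                             double width, double height)
{
    size_t kept = 0;
    for (size_t i = 0; i < n; i++) {
        if (rect_on_page(&rects[i], width, height))
            rects[kept++] = rects[i];
    }
    return kept;
}
```

With real data, the array would come from poppler_page_get_text_layout and
the page dimensions from poppler_page_get_size. Note that a glyph which only
partly overlaps the page edge, as in the attached PDF, is dropped by this
check because it tests for full containment.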

Thanks in advance,

- Peter
-------------- next part --------------
A non-text attachment was scrubbed...
Name: overlap.pdf
Type: application/pdf
Size: 6243 bytes
Desc: not available
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20131031/96422b18/attachment.pdf>