[poppler] Differing number of items returned from get_text{, layout} for glyphs over page edge

Carlos Garcia Campos carlosgc at gnome.org
Sat Nov 2 14:22:27 CET 2013


Peter Waller <peter at scraperwiki.com> writes:

> On 2 November 2013 11:46, Carlos Garcia Campos <carlosgc at gnome.org> wrote:
>
>> Yes, I confirm it.
>>
>
> I've made a bug here: https://bugs.freedesktop.org/show_bug.cgi?id=71160

Thanks, I have attached a patch.

>
>> So, we have at least two possibilities:
>> [snip]
>>
>>
> My own preference for my use case is to not discard information. It would
> be great if the solution could ensure that all glyphs are returned, even if
> they go over the edge of the page or are off the page.

I don't think we should return characters that are not inside the
page. What is your use case exactly?

In evince we use the layout information to implement caret navigation,
for example, it doesn't make sense to move the caret outside the
page. In the case of selections, you can pass a bigger selection
rectangle to get the text off the page.

> Thanks in advance,
>
> - Peter

Regards, 
-- 
Carlos Garcia Campos
PGP key: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x523E6462
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: not available
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20131102/90cb425e/attachment.pgp>


More information about the poppler mailing list