<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [TAGGEDPDF] Expose the structure tree and attributes in poppler-glib"
href="https://bugs.freedesktop.org/show_bug.cgi?id=64821#c41">Comment # 41</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [TAGGEDPDF] Expose the structure tree and attributes in poppler-glib"
href="https://bugs.freedesktop.org/show_bug.cgi?id=64821">bug 64821</a>
from <span class="vcard"><a class="email" href="mailto:aperez@igalia.com" title="Adrian Perez de Castro <aperez@igalia.com>"> <span class="fn">Adrian Perez de Castro</span></a>
</span></b>
<pre>(In reply to <a href="show_bug.cgi?id=64821#c40">comment #40</a>)
<span class="quote">> Comment on <span class=""><a href="attachment.cgi?id=89919" name="attach_89919" title="[PATCH v11 07/11] glib: Expose inline attributes of structure elements">attachment 89919</a> <a href="attachment.cgi?id=89919&action=edit" title="[PATCH v11 07/11] glib: Expose inline attributes of structure elements">[details]</a></span> <a href='page.cgi?id=splinter.html&bug=64821&attachment=89919'>[review]</a> [review]
> [PATCH v11 07/11] glib: Expose inline attributes of structure elements
>
> Review of <span class=""><a href="attachment.cgi?id=89919" name="attach_89919" title="[PATCH v11 07/11] glib: Expose inline attributes of structure elements">attachment 89919</a> <a href="attachment.cgi?id=89919&action=edit" title="[PATCH v11 07/11] glib: Expose inline attributes of structure elements">[details]</a></span> <a href='page.cgi?id=splinter.html&bug=64821&attachment=89919'>[review]</a> [review]:
> -----------------------------------------------------------------
>
> My main concern is that we already have both a way to provide font
> information and text attributes, but I think none of them fit in this API.
> We have PopplerFontInfo, but it's actually an iterator, and we have
> PopplerTextAttributes, that contains the offsets of the text in the page
> text.</span >
Yes, I looked into PopplerTextAttributes first but it is not a good fit
for this kind of access :-(
<span class="quote">> ::: glib/poppler-structure-element.cc
> @@ +718,5 @@
> > + * <informalexample><programlisting>
> > + * GList *text_spans = poppler_structure_element_get_text_spans (element);
> > + * /<!-- -->* Use the text spans *<!-- -->/
> > + * g_list_free_full (text_spans,
> > + * (GDestroyNotify) poppler_text_span_free);
>
> We usually provide a convenient method to do this, see
> poppler_page_free_link_mapping, poppler_page_free_text_attributes, etc.</span >
I have added a poppler_structure_element_free_text_spans() function
<span class="quote">> @@ +743,5 @@
> > + {
> > + PopplerTextSpan *span = g_slice_new0 (PopplerTextSpan);
> > + span->text = _poppler_goo_string_to_utf8 (i->getText ());
> > +
> > + GfxRGB rgb = i->getColor();
>
> Is this copying the struct? should it return a const reference instead or a
> pointer?</span >
In the last version of the patch for Poppler core the TextSpan::getColor()
method returns a reference.
<span class="quote">> @@ +761,5 @@
> > + span->flags |= POPPLER_TEXT_SPAN_SERIF_FONT;
> > + if (i->getFont ()->isItalic ())
> > + span->flags |= POPPLER_TEXT_SPAN_ITALIC;
> > + if (i->getFont ()->isBold ())
> > + span->flags |= POPPLER_TEXT_SPAN_BOLD;
>
> Why not doing this in the core too and here simply do i->getFlags()?</span >
Adding TextSpan::getFlags() in core seems redundant to me because
TextSpan provides access to the GfxFont. Also, to be able to do a
plain “span->flags = i->getFlags()” would need to either manually
type the same bits for the flags in the poppler-glib header (not
nice if it changes in the future), or to include the core header
(not an option), or to check each flag from core and set it with
the corresponding poppler-glib value... IMHO it is not worth the
effort to have a TextSpan::getFlags() in core.
<span class="quote">> @@ +775,5 @@
> > + span->flags |= POPPLER_TEXT_SPAN_BOLD;
> > + default:
> > + break;
> > + }
> > + }
>
> I would move this block to a helper function that creates a PopplerTextSpan
> from a given TextSpan</span >
Moved.
<span class="quote">> @@ +796,5 @@
> > + */
> > +PopplerTextSpan *
> > +poppler_text_span_copy (PopplerTextSpan *poppler_text_span)
> > +{
> > + PopplerTextSpan *new_span = g_slice_new0 (PopplerTextSpan);
>
> You can use g_slice_dup and then you only need to manually copy the heap
> allocated members.</span >
Ah, nice. I didn't know about g_slice_dup(), it is quite handy :)
<span class="quote">> @@ +938,5 @@
> > + return poppler_text_span->font_name;
> > +}
> > +
> > +/**
> > + * poppler_text_span_get_link_target:
>
> What is a link target? a named destination?</span >
This is removed now, it does not make sense as links are exposed as
a PopplerStructureElement object.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are on the CC list for the bug.</li>
</ul>
</body>
</html>