<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body><span class="vcard"><a class="email" href="mailto:aperez@igalia.com" title="Adrian Perez de Castro <aperez@igalia.com>"> <span class="fn">Adrian Perez de Castro</span></a>
</span> changed
<a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [TAGGEDPDF] Expose the structure tree and attributes in poppler-glib"
href="https://bugs.freedesktop.org/show_bug.cgi?id=64821">bug 64821</a>
<br>
<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>What</th>
<th>Removed</th>
<th>Added</th>
</tr>
<tr>
<td style="text-align:right;">Attachment #79993 is obsolete</td>
<td>
</td>
<td>1
</td>
</tr></table>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [TAGGEDPDF] Expose the structure tree and attributes in poppler-glib"
href="https://bugs.freedesktop.org/show_bug.cgi?id=64821#c3">Comment # 3</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW --- - [TAGGEDPDF] Expose the structure tree and attributes in poppler-glib"
href="https://bugs.freedesktop.org/show_bug.cgi?id=64821">bug 64821</a>
from <span class="vcard"><a class="email" href="mailto:aperez@igalia.com" title="Adrian Perez de Castro <aperez@igalia.com>"> <span class="fn">Adrian Perez de Castro</span></a>
</span></b>
<pre>Created <span class=""><a href="attachment.cgi?id=80416" name="attach_80416" title="[PATCH v3 5/6] Tagged-PDF: Expose the structure tree in poppler-glib">attachment 80416</a> <a href="attachment.cgi?id=80416&action=edit" title="[PATCH v3 5/6] Tagged-PDF: Expose the structure tree in poppler-glib">[details]</a></span> <a href='page.cgi?id=splinter.html&bug=64821&attachment=80416'>[review]</a>
[PATCH v3 5/6] Tagged-PDF: Expose the structure tree in poppler-glib
Attached updated version of the 5/6 patch, with the following additions
on top of the previous version:
* Changes and API additions to handle object reference structure elements:
- poppler_structure_element_is_reference()
- poppler_structure_element_get_reference_type()
* API additions to get PopplerLinkMapping structures from object reference
structure elements:
- poppler_structure_element_get_reference_link()
...and to search for the reference and obtain the PopplerLinkMapping
from a POPPLER_STRUCTURE_ELEMENT_LINK element:
- poppler_structure_element_find_link()
* New poppler_structure_element_get_page() function. Obtains the number of
the page with the content described by the structure element.
* New poppler_structure_element_get_id() function. Returns the identifier
of a structure element (or NULL if not defined).
* New poppler_structure_element_get_title() function: Returns the title
of a structure element (or NULL if not defined).
* New poppler_structure_element_get_abbreviation() function: for
POPPLER_STRUCTURE_ELEMENT_SPAN elements which contain an abbreviation,
the function returns the expanded form of the abbreviation (or NULL
if not defined or the element is not an abbreviation).
* New poppler_structure_element_get_alt_text() function: Returns the
alternate text for an element (or NULL if not defined).
* New poppler_structure_element_get_actual_text() function: Returns
the actual text, that is, the textual representation of a text-like
graphic element (or NULL if not defined).
* Function poppler_structure_element_get_language() no longer has
an argument to specify whether it should find the language by looking
up recursively in the structure tree. According to the PDF spec, the
language must always be inherited from parent elements.
My plan is to update this patch further to add new functions to obtain
form fields from the structure tree, in the same way the link mappings
are obtained. Tentatively, I would be adding:
* Definition of POPPLER_STRUCTURE_REFERENCE_FORM_FIELD.
* poppler_structure_element_get_reference_form_field(), to be used in an
object reference structure element, returning a PopplerFormFieldMapping*.
* poppler_structure_element_find_form_field(), to be used in an element
of type POPPLER_STRUCTURE_FORM, returning a PopplerFormFieldMapping*.</pre>
</div>
</p>
</body>
</html>