<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - evince: Invalid UTF-8 encoded text in name"
href="https://bugs.freedesktop.org/show_bug.cgi?id=97144">97144</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>evince: Invalid UTF-8 encoded text in name
</td>
</tr>
<tr>
<th>Product</th>
<td>poppler
</td>
</tr>
<tr>
<th>Version</th>
<td>unspecified
</td>
</tr>
<tr>
<th>Hardware</th>
<td>Other
</td>
</tr>
<tr>
<th>OS</th>
<td>All
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>normal
</td>
</tr>
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Component</th>
<td>glib frontend
</td>
</tr>
<tr>
<th>Assignee</th>
<td>poppler-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Reporter</th>
<td>jason@aquaticape.us
</td>
</tr></table>
<p>
<div>
<pre>Created <span class=""><a href="attachment.cgi?id=125435" name="attach_125435" title="riedinfo_kw_27_2016.pdf">attachment 125435</a> <a href="attachment.cgi?id=125435&action=edit" title="riedinfo_kw_27_2016.pdf">[details]</a></span>
riedinfo_kw_27_2016.pdf
evince has problems with searching in this PDF. Searching for the letter 'a'
in this PDF fills the terminal with "Invalid UTF-8 encoded text in name"
warning messages or with an older version of glib it crashes.
The cause is that the PDF has embedded null characters and the glib frontend
does not deal well with that. poppler_page_get_text returns a shortened
string, the length does not match the length from poppler_page_get_text_layout,
and when evince tries to display search results it reads outside the buffer and
tries to parse random junk as UTF8.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>