[Poppler-bugs] [Bug 97144] New: evince: Invalid UTF-8 encoded text in name

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Jul 30 08:37:50 UTC 2016


https://bugs.freedesktop.org/show_bug.cgi?id=97144

            Bug ID: 97144
           Summary: evince: Invalid UTF-8 encoded text in name
           Product: poppler
           Version: unspecified
          Hardware: Other
                OS: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: glib frontend
          Assignee: poppler-bugs at lists.freedesktop.org
          Reporter: jason at aquaticape.us

Created attachment 125435
  --> https://bugs.freedesktop.org/attachment.cgi?id=125435&action=edit
riedinfo_kw_27_2016.pdf

evince has problems with searching in this PDF.  Searching for the letter 'a'
in this PDF fills the terminal with "Invalid UTF-8 encoded text in name"
warning messages or with an older version of glib it crashes.

The cause is that the PDF has embedded null characters and the glib frontend
does not deal well with that.  poppler_page_get_text returns a shortened
string, the length does not match the length from poppler_page_get_text_layout,
and when evince tries to display search results it reads outside the buffer and
tries to parse random junk as UTF8.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20160730/183001ec/attachment.html>


More information about the Poppler-bugs mailing list