[Poppler-bugs] [Bug 97144] evince: nulls in PDF text cause Invalid UTF-8 encoded text in name warning
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Sat Jul 30 19:41:32 UTC 2016
https://bugs.freedesktop.org/show_bug.cgi?id=97144
--- Comment #3 from Jason Crain <jason at aquaticape.us> ---
(In reply to Albert Astals Cid from comment #2)
> Should this maybe be done at the glib frontend level?
>
> I've no idea/strong opinion, just random question :D
My reasons for putting it in TextOutputDev.cc are that it was easy, it produced
a small improvement for pdftotext in a few (broken) PDFs I have, and that I
don't think anyone much cares about keeping null chars.
It could go in glib/poppler-page.c but it's a little more work because
poppler_page_get_text, poppler_page_get_text_layout, and
poppler_page_get_text_attributes all need to be kept in sync so they return the
same lengths.
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler-bugs/attachments/20160730/458b2811/attachment.html>
More information about the Poppler-bugs
mailing list