[Poppler-bugs] [Bug 83061] pdftotext -htmlmeta should quote text content

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Aug 25 14:26:00 PDT 2014


https://bugs.freedesktop.org/show_bug.cgi?id=83061

--- Comment #2 from Jean-Francois Dockes <jf at dockes.org> ---
Created attachment 105255
  --> https://bugs.freedesktop.org/attachment.cgi?id=105255&action=edit
Pdf document with a title property, and a text body, containing HTML special
characters

The HTML special characters, should be replaced with character entities in the
HTML output (< should become < etc.) but they are not. As a result, some
pieces of text disappear in the display (e.g. <un tag>), or bad HTML syntax
results in unpredictable behaviour.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20140825/a5eff64d/attachment.html>


More information about the Poppler-bugs mailing list