[poppler] XML syntax error in PdfToText tool

Paweł Leń pawel.len at gmail.com
Thu Nov 14 05:04:49 PST 2013


Hello,

I have error when running:
pdftotext -bbox -htmlmeta 'myfile.pdf' 'tempFile.xml'

The output xml have <title> tag on the begining of document (meta section),
error appears when title contains "&" character. Title field has no CDATA
and it is not quoted so it causes error in my xmllib parser. Can I (or You
:) ) fix it somehow?

Beast regards


*--*

*Paweł Leń*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20131114/cd3117ad/attachment.html>


More information about the poppler mailing list