[Poppler-bugs] [Bug 23075] New: pdfinfo can produce invalid UTF-8
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Sat Aug 1 06:34:38 PDT 2009
http://bugs.freedesktop.org/show_bug.cgi?id=23075
Summary: pdfinfo can produce invalid UTF-8
Product: poppler
Version: unspecified
Platform: All
OS/Version: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Component: general
AssignedTo: poppler-bugs at lists.freedesktop.org
ReportedBy: ubanus at users.sf.net
(Tested with poppler 0.10.6.)
pdfinfo does not properly encode Unicode characters outside the BMP:
$ locale charmap
UTF-8
$ wget -q 'http://bugs.debian.org/cgi-bin/bugreport.cgi?msg=5;att=1;bug=525309'
-O utf16nonbmp.pdf
$ pdfinfo utf16nonbmp.pdf | iconv -f UTF-8 -t UTF-32 >/dev/null
iconv: illegal input sequence at position 16
--
Configure bugmail: http://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
More information about the Poppler-bugs
mailing list