[Poppler-bugs] [Bug 23075] New: pdfinfo can produce invalid UTF-8

bugzilla-daemon at freedesktop.org
Sat Aug 1 06:34:38 PDT 2009


http://bugs.freedesktop.org/show_bug.cgi?id=23075

           Summary: pdfinfo can produce invalid UTF-8
           Product: poppler
           Version: unspecified
          Platform: All
        OS/Version: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: general
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: ubanus at users.sf.net


(Tested with poppler 0.10.6.)

pdfinfo does not properly encode Unicode characters outside the Basic
Multilingual Plane (BMP); in a UTF-8 locale its output is not valid UTF-8:

$ locale charmap
UTF-8

$ wget -q 'http://bugs.debian.org/cgi-bin/bugreport.cgi?msg=5;att=1;bug=525309' \
    -O utf16nonbmp.pdf

$ pdfinfo utf16nonbmp.pdf | iconv -f UTF-8 -t UTF-32 >/dev/null
iconv: illegal input sequence at position 16
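
For reference, here is a minimal sketch of what correct handling of a non-BMP
character involves. It is not poppler's code, and this report does not establish
which bytes pdfinfo actually emits. It assumes the string is stored as UTF-16BE
code units, where a non-BMP character appears as a surrogate pair that has to be
combined into one code point before being written as a 4-byte UTF-8 sequence.
The function names are hypothetical.

```python
def encode_utf8(cp: int) -> bytes:
    """Encode one integer code point as UTF-8 bytes (no validity checks)."""
    if cp < 0x80:
        return bytes([cp])
    if cp < 0x800:
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


def utf16be_to_utf8(data: bytes) -> bytes:
    """Convert UTF-16BE bytes to UTF-8, combining surrogate pairs."""
    if len(data) % 2:
        raise ValueError("UTF-16 data must have an even number of bytes")
    units = [int.from_bytes(data[i:i + 2], "big")
             for i in range(0, len(data), 2)]
    out = bytearray()
    i = 0
    while i < len(units):
        u = units[i]
        if 0xD800 <= u <= 0xDBFF:
            # High surrogate: it must be followed by a low surrogate.
            if i + 1 >= len(units) or not 0xDC00 <= units[i + 1] <= 0xDFFF:
                raise ValueError("unpaired high surrogate")
            cp = 0x10000 + ((u - 0xD800) << 10) + (units[i + 1] - 0xDC00)
            i += 2
        elif 0xDC00 <= u <= 0xDFFF:
            raise ValueError("unpaired low surrogate")
        else:
            cp = u
            i += 1
        out += encode_utf8(cp)
    return bytes(out)


# U+1D11E (MUSICAL SYMBOL G CLEF) is the surrogate pair D834 DD1E in UTF-16BE.
print(utf16be_to_utf8(b"\xd8\x34\xdd\x1e").hex())  # f09d849e
```

Encoding each half of the pair on its own, for example
encode_utf8(0xD834) + encode_utf8(0xDD1E), yields two 3-byte sequences that
strict UTF-8 decoders reject. That is one way output like the above could
arise, but it is only a guess about the cause here.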



