[Poppler-bugs] [Bug 34300] New: Text obtained from some pdfs with cyrillic encoding is unreadable

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Feb 15 09:11:39 PST 2011


https://bugs.freedesktop.org/show_bug.cgi?id=34300

           Summary: Text obtained from some pdfs with cyrillic encoding is
                    unreadable
           Product: poppler
           Version: unspecified
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: general
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: jose.aliste at gmail.com


Running pdftotext on the file
http://zelmanov.ptep-online.com/ctan/lshort_russian.pdf produces unreadable
text.

The cause seems to be related to the encoding. If I use iconv to convert the
text, assuming it comes from CP1251, it becomes readable.
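
To illustrate the kind of conversion meant here, below is a minimal Python
sketch. It rests on an assumption that the report does not confirm: that the
unreadable output consists of characters whose code points are really CP1251
byte values that were read as Latin-1. The function name recover_cp1251 and
the sample word are hypothetical and are not taken from pdftotext or from the
PDF above.

```python
def recover_cp1251(garbled: str) -> str:
    """Reinterpret garbled text as CP1251.

    Assumption: every character in `garbled` is a Latin-1 code point that
    stands for one original CP1251 byte. Characters outside Latin-1 raise
    UnicodeEncodeError, which signals that the assumption does not hold.
    """
    raw = garbled.encode("latin-1")   # back to the underlying byte values
    return raw.decode("cp1251")       # read those bytes as Cyrillic


# Build a garbled sample the same way the assumed failure would:
# Cyrillic text stored as CP1251 bytes, then misread as Latin-1.
sample = "Привет".encode("cp1251").decode("latin-1")

print(sample)                  # mojibake made of accented Latin letters
print(recover_cp1251(sample))  # the original Cyrillic word
```

If the round trip fails on real pdftotext output, the garbling probably has a
different origin and this sketch does not apply.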
