[Poppler-bugs] [Bug 34300] New: Text obtained from some pdfs with cyrillic encoding is unreadable
bugzilla-daemon at freedesktop.org
Tue Feb 15 09:11:39 PST 2011
https://bugs.freedesktop.org/show_bug.cgi?id=34300
Summary: Text obtained from some pdfs with cyrillic encoding is
unreadable
Product: poppler
Version: unspecified
Platform: Other
OS/Version: All
Status: NEW
Severity: normal
Priority: medium
Component: general
AssignedTo: poppler-bugs at lists.freedesktop.org
ReportedBy: jose.aliste at gmail.com
If you run pdftotext on the file
http://zelmanov.ptep-online.com/ctan/lshort_russian.pdf you get unreadable
text.
The cause seems to be related to encoding. If I use iconv to convert the
text, assuming it comes from CP1251, it becomes readable.
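
The report does not give the exact iconv invocation, so the following Python sketch only illustrates the kind of re-decoding described. It assumes the garbled output consists of CP1251 bytes that were each misread as a Latin-1 character; the helper name repair_cp1251_mojibake and the sample word are hypothetical and not taken from the PDF.

```python
def repair_cp1251_mojibake(garbled: str) -> str:
    """Reinterpret text whose characters are really CP1251 bytes read as Latin-1.

    Assumption: every character in `garbled` is in the Latin-1 range, so it
    maps back to exactly one original byte.
    """
    raw = garbled.encode("latin-1")  # recover the original byte values
    return raw.decode("cp1251")      # decode those bytes as Cyrillic


# Simulate the suspected failure: Cyrillic text stored as CP1251 bytes,
# then interpreted one byte per character as Latin-1.
original = "Привет"
garbled = original.encode("cp1251").decode("latin-1")

repaired = repair_cp1251_mojibake(garbled)
print(garbled)
print(repaired)
```

If the real output was garbled in some other way (for example, a different intermediate charset), the encode step would need to change accordingly.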