[Poppler-bugs] [Bug 32206] New: pdftotext from poppler-0.12.4windows(KDE4) can not generated text file because missing cjk font map

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Dec 7 18:01:52 PST 2010


https://bugs.freedesktop.org/show_bug.cgi?id=32206

           Summary: pdftotext from poppler-0.12.4windows(KDE4) can not
                    generated text file because missing cjk font map
           Product: poppler
           Version: unspecified
          Platform: x86 (IA32)
        OS/Version: Windows (All)
            Status: NEW
          Severity: major
          Priority: medium
         Component: general
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: dangbinghoo at gmail.com


Created an attachment (id=40889)
 --> (https://bugs.freedesktop.org/attachment.cgi?id=40889)
testing pdfs and my result

I use pdftotext command from package poppler for KDE4 on windows, pdftotext can 
not extract text because of miss cjk font map:

$ pdftotext -raw CrystalReport.pdf

Error: Missing language pack for 'Adobe-GB1' mapping
Error: Missing language pack for 'Adobe-GB1' mapping
Error: Unknown font tag 'TT2'
Error (1471): No font in show
Error: Unknown font tag 'TT2'
Error (1483): No font in show
Error (1497): No font in show
Error (1535): No font in show
Error (1567): No font in show/space
Error (1624): No font in show/space
Error (1636): No font in show
Error (1655): No font in show
Error (1673): No font in show
Error (1682): No font in show
Error (1717): No font in show
Error (1735): No font in show
Error (1747): No font in show/space
Error (1758): No font in show/space
Error (1769): No font in show
Error (1774): No font in show
Error (1783): No font in show
Error (1811): No font in show
Error (1823): No font in show
Error (1835): No font in show
Error (1872): No font in show/space
Error (1884): No font in show
Error (1938): No font in show/space
Error (1971): No font in show/space
Error (1988): No font in show
Error (2052): No font in show
Error (2081): No font in show/space
Error (2103): No font in show
Error (2139): No font in show/space
Error (2166): No font in show/space
Error (2175): No font in show
Error (2206): No font in show
Error (2242): No font in show/space
Error (2286): No font in show/space
Error (2330): No font in show/space
Error (2382): No font in show/space
Error (2394): No font in show
Error (2407): No font in show/space
Error (2419): No font in show
Error (2429): No font in show
Error (2442): No font in show/space
Error (2464): No font in show/space
Error (2484): No font in show
Error (2512): No font in show/space
Error (2550): No font in show/space
Error (2582): No font in show/space
Error (2602): No font in show/space
Error (2620): No font in show
Error (2623): No font in show
Error (2630): No font in show/space
Error (2633): No font in show/space
Error (2637): No font in show
Error (2642): No font in show
Error (2648): No font in show
Error (2648): No font in show
Error (2656): No font in show
Error (2661): No font in show/space
Error (2666): No font in show/space
Error (2675): No font in show/space
Error (2677): No font in show
Error (2683): No font in show
Error (2686): No font in show
Error (2690): No font in show
Error (2697): No font in show
Error (2699): No font in show
Error (2706): No font in show
Error (2711): No font in show/space
Error (2719): No font in show
Error (2739): No font in show/space
Error (2750): No font in show
Error (2765): No font in show
Error (2780): No font in show/space
Error (2791): No font in show
Error (2810): No font in show
Error (2813): No font in show
Error (2838): No font in show/space
Error (2857): No font in show
Error (2870): No font in show/space
Error (2887): No font in show/space
Error (2899): No font in show/space
Error (2916): No font in show/space
Error (2920): No font in show
Error (2927): No font in show
Error (2935): No font in show
Error (2954): No font in show
Error (2970): No font in show
Error (3011): No font in show/space
Error: Unknown font tag 'TT2'
Error (3073): No font in show
Error: Unknown font tag 'TT2'
Error (3165): No font in show
Error (3174): No font in show
Error (3176): No font in show
Error (3179): No font in show
Error (3191): No font in show
Error (3209): No font in show
Error (3232): No font in show/space
Error (3249): No font in show
Error (3253): No font in show
Error (3255): No font in show
Error (3265): No font in show
Error (3279): No font in show/space
Error (3287): No font in show/space
Error (3301): No font in show/space
Error (3304): No font in show
Error: Unknown font tag 'TT7'
Error (3331): No font in show/space
Error: Unknown font tag 'TT2'
Error (3352): No font in show
Error (3376): No font in show
Error (3383): No font in show
Error (3425): No font in show
Error (3443): No font in show
Error (3450): No font in show
Error (3461): No font in show
Error (3479): No font in show
Error (3482): No font in show
Error: No font in show

And with different Chinese PDF , the result maybe quite different:

with utf-8 encode pdf  file there's no error reported,but the gernerated text
file has only on single unreadable char (see the attached files)

$ pdftotext -raw CrystalReport-utf8-OO.pdf

$ pdftotext -raw CrystalReport-gsPDF-win.pdf

both this file will has the same result.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the Poppler-bugs mailing list