[poppler] lack of CJK feature

Albert Astals Cid aacid at kde.org
Mon Dec 10 14:27:47 PST 2007


A Dilluns 10 Desembre 2007, Koji Otani va escriure:
> Hi All.
>
> I'm Koji Otani, posted patches about CJK (see Bug #11413) before.
> I found more problem about CJK. and registered patch as Bug #13582.
>
> Adobe Japan1 6 character set includes characters in outside of UNICODE
> BMP.
> But poppler cannot display these and some characters with TrueType
> font.
> current poppler has following problems about that.
> (1) CMap data is old.
>  Current data (poppler-data-0.1.1.tar.gz ) has only Adobe Japan 4
>  data.
>  This should be update with newer one. (GhostScript 8.60 has  already
>  new  CMap data)

I've updated cmap on git, will release poppler-data 0.2 on wednesday.

> (2) poppler  doesn't lookup format12 cmap table of TrueType font.
>   Only format12 cmap table supports codes outside of UNICODE BMP.
> (3) poppler lookups only UCS2 CMaps when making unicodeToGID map
>  UCS2 CMap supports only codes in inside of UNICODE BMP.
> (4) missing  handling  CID conflict in CMap .
> CMap maps multiple unicode to a same CID.
> So, a CID can map multiple unicode.
> Currently poppler use only the first one.
> If that code is not exist in the cmap of TT Font.
> It is not displayed.
>
> I proposed patch solved (2), (3), (4).
>
> Please check it if you are interestead in this.
> I would appreciate it if you accept this patch.

I've added the patch to master git branch, that is, what will become poppler 
0.7 I've not added it to the 0.6.x branch because it's a bit "big" for it.

Thanks a lot for the patches :-)

Albert

>
> Reagrads,
>
> Koji Otani
> _______________________________________________
> poppler mailing list
> poppler at lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/poppler




More information about the poppler mailing list