[Uim] traditional Chinese py input?

Jon Babcock jon at kanji.com
Sun Jan 18 22:41:11 EET 2004


[UTF-8 encoded]

David Oftedal wrote:

> If you have some way of automating the process, you could just add them 
> [all the kanji in the latest Unicode] anyway though. For future reference.

I did. But I have a problem.

Even after renaming my file, pyunihan.scm, as PY.scm, the IM doesn't 
find all the new kanji. (My pyunihan.scm file contains about 25,349 kanji.)

Two examples:

(1)

The first line of the original *PY.scm* file has 7 kanji.
((("a")) ("阿" "啊" "呵" "腌" "嗄" "锕" "吖"))

The first line of my *pyunihan.scm* also has 7 kanji. It doesn't have " 
呵", but it has "錒".
((("a"))("吖" "啊" "嗄" "腌" "錒" "锕" "阿"))

The IM finds 8 kanji, the additional "錒" kanji AND the "呵" which my 
list lacks. How can it find a kanji which is not in pyunihan.scm renamed 
as PY.scm?

(2)

The last line of the original *PY.scm* file has 19 kanji.
((("z" "u" "o")) ("作" "做" "坐" "座" "左" "昨" "琢" "佐" "凿" "撮" "柞"
"嘬" "怍" "胙" "唑" "笮" "阼" "祚" "酢"))

The last line of my *pyunihan.scm* has 68 kanji. Under Gnome on Debian
sid, there is no glyph in the fonts for the first 24 of these, but the
remaining 44 kanji display okay (under mozilla-firebird and 
mozilla-thunderbird for example).

((("z" "u" "o"))("㑅" "㘀" "㘴" "㛗" "㝾" "㣱" "㤰" "㭮" "㶡" "㸲" "䇥"
"䋏" "䎰" "䔘" "䜊" "䝫" "䞢" "䞰" "䟄" "䟶" "䦈" "䩦" "䬤" "䯿" "乍" "
佐" "作" "侳" "做" "凿" "咗" "唑" "唨" "嘬" "坐" "夎" "岝" "岞" "左" "座
" "怍" "捽" "撮" "昨" "柞" "椊" "毑" "琢" "砟" "祚" "秨" "稓" "笮" "筰"
"糳" "繓" "胙" "苲" "莋" "葃" "葄" "袏" "迮" "酢" "鈼" "鑿" "阼" "飵"))

With pyunihan.scm renamed as PY.scm, the IM still finds only the
original 19 of these, and none of the additional 49 kanji.

The final lines of my pyunihan.scm file are:

(generic-register-im
  'py "zh_CN" "UTF8" py-init-handler)

I also tried "UTF-8".

What is the role of "zh_CN" here? Is that somehow controlling which 
kanji are available as candidates for the IM?

Is the total *number* of kanji per pinyin syllable important?

Please note that I have also added some pinyin syllables to 
pyunihan.scm, e.g. ((("e" "n" "g")). The IM fails to find the single 
kanji "鞥" that is included under 'eng'. When I input the 'en' it finds 
the kanji for 'en' 恩摁 etc., but when I input the 'g', it displays 恩g 
on the screen.

Any help greatly appreciated.

Thanks.

Jon


-- 
Jon Babcock <jonatkanjidotcom>





More information about the uim mailing list