[Libreoffice-bugs] [Bug 134742] New: [CJK Issue, Enhancement] Distinguishing both Korean and Japanese font from all CJK[Chinese-Japanese-Korean] fonts such as Noto CJK font series and Source Han Sans series, etc.

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Sun Jul 12 06:13:42 UTC 2020


https://bugs.documentfoundation.org/show_bug.cgi?id=134742

            Bug ID: 134742
           Summary: [CJK Issue, Enhancement] Distinguishing both Korean
                    and Japanese font from all
                    CJK[Chinese-Japanese-Korean] fonts such as Noto CJK
                    font series and Source Han Sans series, etc.
           Product: LibreOffice
           Version: unspecified
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: enhancement
          Priority: medium
         Component: LibreOffice
          Assignee: libreoffice-bugs at lists.freedesktop.org
          Reporter: sungdh86 at gmail.com

Description:
[CJK Issue, Enhancement] Distinguishing both Korean and Japanese font from all
CJK[Chinese-Japanese-Korean] fonts such as Noto CJK font series and Source Han
Sans series, etc.

Distinguishing both Korean and Japanese fonts from all
CJK[Chinese-Japanese-Korean] fonts such as Noto CJK font series and Source Han
Sans series, etc.

I'm TDF Korean Contributor and can speak Korean(My mother tongue), Japanese
(Little bit).

When I use Noto CJK Korean fonts such as, Noto Sans CJK KR & Noto Sans CJK KR
on LibreOffice, These fonts shows “简繁” at font selection on LibreOffice.
Also, It is same situation to use Noto CJK Japanese fonts such as Noto Sans CJK
JP & Noto Serif CJK JP on LibreOffice. These fonts shows “简繁” at font selection
on LibreOffice.

I checked these fonts issues using debugger, I found the
attemptToDisambiguateHan function bug.

below is the function link from LibreOffice core's master repository.
UScriptCode attemptToDisambiguateHan(UScriptCode eScript, OutputDevice const
&rDevice)
https://git.libreoffice.org/core/+/refs/heads/master/svtools/source/misc/sampletext.cxx#1231

I found that code’s problem.
In CJK(Chinese, Japanese, Korean) use Ideographs(briefly, Chinese Characters).
Noto Sans CJK, Noto Serif CJK and Source Han Sans and Source Han Serif fonts
support all Ideograph characters.

That code’s problem is don’t distinguish CJK fonts includes CJK Ideographs
ranges.
ps. Trivia, CJK cultural sphere, They use for Ideographs(briefly, Chinese
Characters, 漢字) writing systems. 
漢字/汉字 Hànzì is for Chinese. 한자 Hanja is for Korean. かんじ Kanji is for Japanese. 

Example1) Korean Hanja(Korean name for Chinese Character)
List of Hanja for Use in Personal Names (인명용 한자표/人名用漢字表) by the Supreme Court
of ROK (Republic of Korea). 
(It includes  한문 교육용 기초 한자 (漢文敎育用基礎漢字/Hanmun Gyoyug-yong Gicho Hanja) by
Ministry of Education of ROK (Republic of Korea). 
http://unicode.org/L2/L2020/20082-update-korean.pdf

Example2) Japanese Kanji(Japanese name for Chinese Character)
List of Jōyō Kanji (常用漢字表)  
https://www.bunka.go.jp/kokugo_nihongo/sisaku/joho/joho/kijun/naikaku/pdf/joyokanjihyo_20101130.pdf

So, I temporarily submitted to hard coded my source code for only Noto CJK
fonts. ( Such as Noto Sans {HK, JP, KR, SC, TC} fonts.)
However, I don’t resolve the Source Han fonts. (Because, Source Han font’s
Korean font shows Korean word ‘본고딕’[reads ‘bon go dik’ in Korean].)
As a result, That source code is only supported Noto CJK Fonts, such as  Noto
Sans CJK, Not Serif CJK fonts.
Below is the my submitted and merged commit, "Hardcode script for "Noto" CJK
fonts & add USCRIPT_JAMO” Link
https://gerrit.libreoffice.org/c/core/+/97344

I changed only the Noto CJK fonts. (It don’t support Source Han Sans & Source
Han Serif fonts. Both Noto CJK Fonts and Source Han Fonts 
Before -> After
Noto Sans CJK HK 简繁 -> Noto Sans CJK HK 繁
Noto Sans CJK JP 简繁 -> Noto Sans CJK JP 日本語
Noto Sans CJK KR 简繁 -> Noto Sans CJK KR 한글
Noto Sans CJK SC 简繁 -> Noto Sans CJK SC 简
Noto Sans CJK TC 简繁 -> Noto Sans CJK TC 繁
Noto Sans Mono CJK HK 简繁 -> Noto Sans Mono CJK HK 繁
Noto Sans Mono CJK JP 简繁 -> Noto Sans Mono CJK JP 日本語
Noto Sans Mono CJK KR 简繁 -> Noto Sans Mono CJK KR 한글
Noto Sans Mono CJK SC 简繁 -> Noto Sans Mono CJK SC 简
Noto Sans Mono CJK TC 简繁 -> Noto Sans Mono CJK TC 繁
Noto Serif CJK JP 简繁 -> Noto Serif CJK JP 日本語
Noto Serif CJK KR 简繁 -> Noto Serif CJK KR 한글
Noto Serif CJK SC 简繁 -> Noto Serif CJK SC 简
Noto Serif CJK TC 简繁 -> Noto Serif CJK TC 繁



My goal is to properly show the CJK font expression on LibreOffice.

So, In my opinion, It seems necessary to discuss the display implementation of
the font selection box on LibreOffice with developers from CJK Users and
Developers such as, Korea, China, Taiwan, Hong Kong, Macau and Japan.

-------------
Related datas. 

Below are Noto CJK fonts and Adobe's Source Han font series articles &
repositories.
Noto Sans CJK Font from Google blog.
(English) "Noto: A CJK Font That is Complete, Beautiful and Right for Your
Language and Region":
https://developers.googleblog.com/2014/07/noto-cjk-font-that-is-complete.html
(Korean, 한국어) "구글의 새로운 Pan-CJK (汎韓中日) 글꼴을 소개합니다":
https://developers-kr.googleblog.com/2014/07/cjkfont.html
(Japanese, 日本語) "オープンソースの美しい Noto フォントファミリーに日本語、中国語、韓国語が加わりました。" :
https://developers-jp.googleblog.com/2014/07/noto.html
(Chinese, 中文) "Google中日韩字体Noto Sans CJK让你的阅读体验更佳" :
https://china.googleblog.com/2014/07/googlenoto-sans-cjk.html

Noto Serif CJK Font from Google blog.
(English) "Noto Serif CJK is here!" :
https://developers.googleblog.com/2017/04/noto-serif-cjk-is-here.html
(Korean, 한국어) "새로운 폰트, Noto Serif CJK를 지금 바로 확인해 보세요!":
https://developers-kr.googleblog.com/2017/04/noto-serif-cjk-is-here_4.html
(Japanese, 日本語) "Noto Serif CJK が登場!":
https://developers-jp.googleblog.com/2017/04/noto-serif-cjk-is-here.html
(Chinese, 中文) "Noto Serif CJK 来了!" :
https://china.googleblog.com/2017/04/noto-serif-cjk.html

Noto CJK font repository: https://github.com/googlefonts/noto-cjk
Below is introducing Adobe's new Source Han Sans font 
https://blog.typekit.com/2014/07/15/introducing-source-han-sans/
Below is Adobe's new Source Han Serif font series link:
https://source.typekit.com/source-han-serif/

Source Han Serif supports four different East Asian languages — Simplified
Chinese, Traditional Chinese, Japanese, and Korean — and the 65,535 glyphs in
each of its seven weights are designed to work together with a consistent
design that emphasizes shared elements between the languages while honoring the
diversity of each. Also included is a rich set of Western glyphs supporting the
Latin, Greek, and Cyrillic scripts, which were derived from Source Serif.

Steps to Reproduce:
1.Select CJK fonts (includes All CJK Ideographs fonts such as Noto CJK & Source
Han fonts on LibreOffice Suite
2. shows font selections language text.


Actual Results:
Noto Sans CJK HK 简繁 
Noto Sans CJK JP 简繁 
Noto Sans CJK KR 简繁 
Noto Sans CJK SC 简繁 
Noto Sans CJK TC 简繁
Noto Sans Mono CJK HK 简繁
Noto Sans Mono CJK JP 简繁
Noto Sans Mono CJK KR 简繁
Noto Sans Mono CJK SC 简繁
Noto Sans Mono CJK TC 简繁
Noto Serif CJK JP 简繁
Noto Serif CJK KR 简繁
Noto Serif CJK SC 简繁 
Noto Serif CJK TC 简繁 

Kaiti SC 简
Kaiti TC 简
Songti SC 简
Songti TC 简
Source Han Sans HC 简繁 
Source Han Sans HW HC 简繁 
Source Han Sans HW SC 简繁 
Source Han Sans HW TC 简繁 
Source Han Sans SC 简繁 
Source Han Sans TC 简繁 
본고딕
본고딕 HW 


Expected Results:
Noto Sans CJK HK 繁
Noto Sans CJK JP 日本語
Noto Sans CJK KR 한글
Noto Sans CJK SC 简
Noto Sans CJK TC 繁
Noto Sans Mono CJK HK 繁
Noto Sans Mono CJK JP 日本語
Noto Sans Mono CJK KR 한글
Noto Sans Mono CJK SC 简
Noto Sans Mono CJK TC 繁
Noto Serif CJK JP 日本語
Noto Serif CJK KR 한글
Noto Serif CJK SC 简
Noto Serif CJK TC 繁

Kaiti SC 简
Kaiti TC 繁
Songti SC 简
Songti TC 繁
Source Han Sans HC 繁
Source Han Sans HW HC 繁
Source Han Sans HW SC 简
Source Han Sans HW TC 繁
Source Han Sans SC 简
Source Han Sans TC 繁
본고딕 한글
본고딕 HW 한글


Reproducible: Always


User Profile Reset: No



Additional Info:
UNICODE HAN DATABASE (UNIHAN) 
https://www.unicode.org/reports/tr38/

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20200712/c548399b/attachment-0001.htm>


More information about the Libreoffice-bugs mailing list