[Libreoffice-commits] core.git: i18npool/source

László Németh nemeth at numbertext.org
Thu Mar 29 14:45:58 UTC 2018


 i18npool/source/collator/data/hu_charset.txt |   14 ++++++++++++++
 1 file changed, 14 insertions(+)

New commits:
commit 7b1eb6313c0d2621c364df1724c69d28f8267841
Author: László Németh <nemeth at numbertext.org>
Date:   Tue Mar 27 22:42:18 2018 +0200

    tdf#116666 fix Hungarian sorting
    
    Apply the following extra requirements of Hungarian orthography
    for sorting words and sentences:
    
    – expand simplified double consonants;
    
    – ignore spaces and hyphens;
    
    – prefer lower case homonyms.
    
    Note: automatic sorting is better this way, but it's still not error-free.
    
    Important advantage, that now it's *possible* to fix all errors
    in a semi-automatic way, using soft hyphens. Inserting them in bad
    or ambiguous character positions will fix all automatic sortings later.
    
    Example with consonant "gy", simplified form of long "gygy" is the
    ambiguous "ggy" (it can be "g" and "gy", too, as in "meggyőz"):
    
    = Bad =   = Now =   = Good (corrected, "|" signs soft hyphen) =
    megbíz    megbíz    megbíz
    meggyíz   megzavar  meg|győz
    meggyőz   megye     megzavar
    megzavar  meggyíz   megye
    megye     meggyőz   meggyíz
    megyünk   megyünk   megyünk
    
    Change-Id: Ia84f264ad9ea4cdebe5f3ea22212a9594b4fe44d
    Reviewed-on: https://gerrit.libreoffice.org/51973
    Tested-by: Jenkins <ci at libreoffice.org>
    Reviewed-by: László Németh <nemeth at numbertext.org>

diff --git a/i18npool/source/collator/data/hu_charset.txt b/i18npool/source/collator/data/hu_charset.txt
index a63d4bab9c45..2b7288e7d8a6 100644
--- a/i18npool/source/collator/data/hu_charset.txt
+++ b/i18npool/source/collator/data/hu_charset.txt
@@ -19,6 +19,11 @@
 #
 # Collation of Hungarian letters
 #
+
+[caseLevel on]
+[caseFirst lower]
+[alternate shifted]
+
 & c < cs <<< Cs <<< CS
 & d < dz <<< DZ < dzs <<< Dzs <<< DZS
 & g < gy <<< Gy <<< GY
@@ -30,3 +35,12 @@
 & u < ü <<< Ü <<< ű <<< Ű
 & z < zs <<< Zs <<< ZS
 
+& cs <<< ccs / cs <<< Ccs / cs <<< CCs / cs <<< CCS / cs
+& dz <<< ddz / dz <<< Ddz / dz <<< DDz / dz <<< DDZ / dz
+& dzs <<< ddzs / dzs <<< Ddzs / dzs <<< DDzs / dzs <<< DDZs / dzs <<< DDZS / dzs
+& gy <<< ggy / gy <<< Ggy / gy <<< GGy / gy <<< GGY / gy
+& ly <<< lly / ly <<< Lly / ly <<< LLy / ly <<< LLY / ly
+& ny <<< nny / ny <<< Nny / ny <<< NNy / ny <<< NNY / ny
+& sz <<< ssz / sz <<< Ssz / sz <<< SSz / sz <<< SSZ / sz
+& ty <<< tty / ty <<< Tty / ty <<< TTy / ty <<< TTY / ty
+& zs <<< zzs / zs <<< Zzs / zs <<< ZZs / zs <<< ZZS / zs


More information about the Libreoffice-commits mailing list