<html>
    <head>
      <base href="https://bugs.documentfoundation.org/">
    </head>
    <body>
      <p>
        <div>
            <b><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - RTL: Automatic language detection based on keyboard layout"
   href="https://bugs.documentfoundation.org/show_bug.cgi?id=113298#c3">Comment # 3</a>
              on <a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - RTL: Automatic language detection based on keyboard layout"
   href="https://bugs.documentfoundation.org/show_bug.cgi?id=113298">bug 113298</a>
              from <span class="vcard"><a class="email" href="mailto:caolanm@redhat.com" title="Caolán McNamara <caolanm@redhat.com>"> <span class="fn">Caolán McNamara</span></a>
</span></b>
        <pre>I imagine using libexttextcat would just introduce a pile of "my language was
guessed wrong" bugs, especially for short runs of text, which won't be long
enough for libexttextcat's statistical guessing to get it right.

Going by Unicode character range folds this bunch of languages
<a href="https://en.wikipedia.org/wiki/Arabic_script#Languages_currently_written_with_the_Arabic_alphabet">https://en.wikipedia.org/wiki/Arabic_script#Languages_currently_written_with_the_Arabic_alphabet</a>
into Arabic, and lumps Yiddish and Hebrew together under the Hebrew script. That
is maybe an acceptable loss, and probably happens on Windows already.
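
As a rough illustration of that folding, here is a minimal Python sketch. The
block boundaries are the Unicode Hebrew and Arabic blocks. The function name
guess_language, the table BLOCK_DEFAULTS and the language tags picked per block
are hypothetical, made up for this example, and are not anything LibreOffice
does today.

```python
# Hypothetical sketch: fold text to one default language per Unicode block.
# The block ranges come from the Unicode standard; the language tag chosen
# for each block is an assumption for illustration only.
BLOCK_DEFAULTS = [
    (range(0x0590, 0x0600), "he"),  # Hebrew block, also used for Yiddish
    (range(0x0600, 0x0700), "ar"),  # Arabic block, also Persian, Urdu, ...
]


def guess_language(text, fallback="und"):
    """Return the default language of the first character in a known block."""
    for ch in text:
        for block, lang in BLOCK_DEFAULTS:
            if ord(ch) in block:
                return lang
    return fallback
```

With this, Yiddish text comes back as "he" and Persian or Urdu text comes back
as "ar", which is exactly the loss described above.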

There are some hints in <a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - System input language is always ignored on Linux"
   href="show_bug.cgi?id=108151">bug 108151</a> about fields available in the GTK
integration with the IBus input method that might be useful for picking an
acceptable value to set for the language.</pre>
        </div>
      </p>


    </body>
</html>