[UTF-8] Aspell and UTF-8/Unicode

Tue, 17 Feb 2004 17:03:11 -0500 (EST)

On Tue, 17 Feb 2004, Noah Levitt wrote:

> On Tue, Feb 17, 2004 at  0:03:01 -0500, Kevin Atkinson wrote:
> > 
> > The CVS version of Aspell can now check documents in UTF-8.  
> 
> That's wonderful! 
> 
> > the encoding 
> > is not set based on the current locale.
> 
> Hmm, why not?

That was supposed to say "...is NOW set..."

> > I am considering a "dual-script" mode where Aspell can use a separate
> > dictionary depending on which script it detects the current word in.  The
> > two dictionaries can have nothing in common, e.g. an English one and a
> > Russian one.  This will NOT support two languages that use
> > the same script, as that is a lot more complicated.  For example, if the
> > word is misspelled, which dictionary should it use for the suggestions?
> 
> I probably don't understand all the issues, but a simpler
> approach springs to mind: why not simply use the union of
> both dictionaries, and treat it as a single dictionary? That
> way it could handle two or more languages written with the
> same script. If a misspelled word is lexically similar to a
> word in the English dictionary and a word in the Spanish
> dictionary, it makes sense to me to offer both as
> suggestions. And I think it would work fine for languages
> using different scripts as well.

Because the suggestion strategy used depends on the language, so a merged
dictionary would not have a single strategy to apply.
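
To make the "dual-script" idea concrete, here is a rough sketch in Python.
It is not Aspell code and does not use Aspell's API.  The names
detect_script, check_word and DICTIONARIES are made up for illustration,
the word lists are toy data, and taking the first word of the Unicode
character name as the script is only an approximation that happens to work
for Latin and Cyrillic letters.

```python
import unicodedata

def detect_script(word):
    """Return the single script used by the letters of `word`, or None.

    Assumption: the first token of the Unicode character name
    ("LATIN", "CYRILLIC", ...) is a good enough script label here.
    """
    scripts = set()
    for ch in word:
        if not ch.isalpha():
            continue  # ignore digits, apostrophes, hyphens, ...
        name = unicodedata.name(ch, "")
        if name:
            scripts.add(name.split(" ", 1)[0])
    # Mixed-script words and words with no letters give no answer.
    if len(scripts) == 1:
        return scripts.pop()
    return None

# Hypothetical toy word lists, one per script, with nothing in common.
DICTIONARIES = {
    "LATIN": {"hello", "world"},
    "CYRILLIC": {"привет", "мир"},
}

def check_word(word):
    """Return True/False if a dictionary applies, None if none does."""
    dictionary = DICTIONARIES.get(detect_script(word))
    if dictionary is None:
        return None
    return word.lower() in dictionary
```

With this, check_word("hello") and check_word("мир") are each looked up in
their own list, and a misspelling such as "helo" is only ever compared
against the Latin-script list, so the choice of dictionary for suggestions
is unambiguous.  Two languages sharing one script would both map to the same
key, which is exactly the case the dual-script mode leaves out.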

-- 
http://kevin.atkinson.dhs.org