[utf-8] Aspell and UTF-8 Update

Kevin Atkinson kevina@gnu.org
Thu, 19 Feb 2004 06:51:41 -0500 (EST)


Aspell is capable of accepting all input and printing all output in
UTF-8.  Please update the "Bad Software" page.

-- 
http://kevin.atkinson.dhs.org

---------- Forwarded message ----------
Date: Thu, 19 Feb 2004 06:41:33 -0500 (EST)
From: Kevin Atkinson <kevina@gnu.org>
To: aspell-devel@gnu.org
Subject: New Aspell Snapshot: 0.51-20040219


I have just released another pre-0.51 Aspell snapshot, version
0.51-20040219.  This snapshot has lots of small changes.  Some of the more
significant ones:

   * Speedups to the suggestion code when affix compression is used.

   * Added support for accepting all input and printing all output in
     UTF-8 or some other encoding different from the one Aspell uses.
     Aspell can now support any language that has no more than 220 distinct
     characters, including different capitalizations and accents, _even
     if_ there is not an existing 8-bit encoding that supports the
     language.
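
The character limit above can be illustrated with a small sketch.  This is
not Aspell's code or file format; the function names and the choice to
start codes at 1 are assumptions made only for this example.  It shows the
general idea: if a language needs no more than 220 distinct characters,
each one can be given a private one-byte code, while all input and output
stay in UTF-8.

```python
# Illustrative sketch only: this is not Aspell's actual code or file format.
MAX_CHARS = 220  # limit quoted in the announcement


def build_charset(words, limit=MAX_CHARS):
    """Map each distinct character in `words` to a one-byte code."""
    chars = sorted({ch for word in words for ch in word})
    if len(chars) > limit:
        raise ValueError(
            f"{len(chars)} distinct characters exceed the limit of {limit}"
        )
    # Codes start at 1 so byte 0 stays unused (an arbitrary choice here).
    return {ch: code for code, ch in enumerate(chars, start=1)}


def encode(word, table):
    """Convert a Unicode word into the private 8-bit form."""
    return bytes(table[ch] for ch in word)


def decode(data, table):
    """Convert the private 8-bit form back into a Unicode string."""
    reverse = {code: ch for ch, code in table.items()}
    return "".join(reverse[b] for b in data)


words = ["naïve", "Zürich", "smörgåsbord"]
table = build_charset(words)
packed = encode("Zürich", table)  # one byte per character
assert decode(packed, table) == "Zürich"
```

A word list with more than 220 distinct characters makes build_charset
raise ValueError in this sketch, which mirrors the limit stated above.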

Other changes from Aspell 0.50:

   * Added support for loadable filters thanks to Christoph Hintermüller

   * Enhanced TeX filter to support recognizing accent commands, such as
     the German umlauts, and to treat words with hyphenation characters
     in them as one word, also thanks to Christoph Hintermüller

   * Added gettext support thanks to Sergey Poznyakoff

   * Reworked how the dictionary is stored to take up less space (around
     80% for the English language) and be faster in some cases.

   * Reworked the build system so that a single Makefile is used for
     most of the code.

   * Support for Affix Compression.  Affix compression stores the root
     word and then a list of prefixes and suffixes that the word can
     take, and thus saves a lot of space.  The codebase comes from
     MySpell found in OpenOffice.  It uses the same affix file
     OpenOffice (and Mozilla) use.  However, affix compression is
     currently incompatible with sounds-like lookup, which means that
     the suggestion quality will suffer.

   * Added support for MySpell Replacement Tables for better suggestions
     when phonet information is not available.

   * The manual has been converted to Texinfo format thanks to the work
     of Chris Martin.
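
As a rough illustration of the affix compression item above, the sketch
below stores one root word plus a set of flags and expands them into the
full word forms.  The rule table, the flag letters and the expand function
are hypothetical and much simpler than a real MySpell affix file; they
only show why storing "root + flags" takes less space than storing every
form.

```python
# Hypothetical, simplified rules; real MySpell affix files are richer.
# Each flag maps to rules of the form (kind, strip, add):
# remove `strip` from the root, then attach `add`.
RULES = {
    "A": [("prefix", "", "re")],
    "D": [("suffix", "", "ed")],
    "G": [("suffix", "e", "ing")],
}


def expand(root, flags):
    """Return every word form generated from `root` by the given flags."""
    words = {root}
    for flag in flags:
        for kind, strip, add in RULES[flag]:
            if kind == "prefix" and root.startswith(strip):
                words.add(add + root[len(strip):])
            elif kind == "suffix" and root.endswith(strip):
                words.add(root[:len(root) - len(strip)] + add)
    return sorted(words)


# One dictionary entry ("paint" with flags A and D) stands for three words.
assert expand("paint", "AD") == ["paint", "painted", "repaint"]
assert expand("bake", "G") == ["bake", "baking"]
```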

-- 
http://kevin.atkinson.dhs.org