[utf-8] Aspell and UTF-8 Update
Kevin Atkinson
kevina@gnu.org
Thu, 19 Feb 2004 06:51:41 -0500 (EST)
Aspell is capable of accepting all input and printing all output in
UTF-8. Please update the =22Bad Software=22 page.
-- =
http://kevin.atkinson.dhs.org
---------- Forwarded message ----------
Date: Thu, 19 Feb 2004 06:41:33 -0500 (EST)
From: Kevin Atkinson <kevina=40gnu.org>
To: aspell-devel=40gnu.org
Subject: New Aspell Snapshot: 0.51-20040219
I have just released another pre Aspell 0.51 snapshot, version
0.51-20040219. This snapshot has lots of small changes. Some of the mor=
e
significant ones:
* Speed ups to suggestion code when affix compression is used.
* Added support for accepting all input and printing all output in
UTF-8 or some other encoding different from the one Aspell uses.
Aspell can now support any language that no more than 220 distinct
characters, including different capitalizations and accents, _even
if_ there is not an existing 8-bit encoding that supports the
language.
Other changes from Aspell 0.50:
* Added support for loadable filters thanks to Christoph Hinterm=FClle=
r
* Enhanced TEX filter to support recognizing accent commands, such as
the German umlaute, and to treat words with hyphenation characters
in them as one word, also thanks to Christoph Hinterm=FCller
* Added gettext support thanks to Sergey Poznyakoff
* Reworked how the dictionary is stored to take up less space (around
80% for the English language) and be faster in some cases.
* Reworked the build system so that a single Makefile is used for
most of the code.
* Support for Affix Compression. Affix compression stores the root
word and then a list of prefixes and suffixes that the word can
take, and thus saves a lot of space. The codebase comes from
MySpell found in OpenOffice. It uses the same affix file
OpenOffice (and Mozilla) use. However, affix compression is
currently incompatible with sounds like look up which means that
the suggestion quality will suffer.
* Added support for MySpell Replacement Tables for better suggestions
when phonet information is not available.
* Manual has has been converted to texinfo format thanks to the work
of Chris Martin.
-- =
http://kevin.atkinson.dhs.org