[Libreoffice-bugs] [Bug 129492] Similarity search not clear

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Thu Dec 19 17:30:52 UTC 2019


https://bugs.documentfoundation.org/show_bug.cgi?id=129492

--- Comment #1 from Eike Rathke <erack at redhat.com> ---
Maybe this can shed some light:

The algorithm used is a Weighted Levenshtein Distance (WLD) that also supports
the wildcards ? and *.

In the strict mathematical definition of WLD, a match may differ by EITHER at
most X replacements OR at most Y characters shorter OR at most Z characters
longer. A mix of operations is allowed, but every operation draws from one
shared 100% pool.

The relaxed mode (Combined in the UI, SplitCount internally) allows at most X
replacements AND/OR Y characters shorter AND/OR Z characters longer. Only
insertions and deletions share one pool from which they draw; replacements use
a second, independent pool. This is closer to what a user who is not familiar
with WLD expects.
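
A rough sketch of the difference between the two modes, in Python. This is not
the code from levdis.hxx. It assumes that each operation consumes 1/limit of
its pool, and the names pool_share, fits_strict and fits_relaxed are made up
for illustration. It only checks whether a given number of operations fits the
budget; it does not compute the distance itself.

```python
from fractions import Fraction


def pool_share(used, limit):
    """Share of a 100% pool consumed by `used` operations (assumed: 1/limit each)."""
    if used == 0:
        return Fraction(0)
    if limit == 0:
        # Operation is not allowed at all: any use overdraws the pool.
        return Fraction(2)
    return Fraction(used, limit)


def fits_strict(repl, shorter, longer, max_repl, max_shorter, max_longer):
    """Strict WLD: replacements, deletions and insertions share ONE pool."""
    total = (pool_share(repl, max_repl)
             + pool_share(shorter, max_shorter)
             + pool_share(longer, max_longer))
    return total <= 1


def fits_relaxed(repl, shorter, longer, max_repl, max_shorter, max_longer):
    """Relaxed mode: replacements have their own pool, independent of
    the pool shared by insertions and deletions."""
    repl_pool = pool_share(repl, max_repl)
    length_pool = (pool_share(shorter, max_shorter)
                   + pool_share(longer, max_longer))
    return repl_pool <= 1 and length_pool <= 1


if __name__ == "__main__":
    limits = (2, 2, 2)  # X replacements, Y shorter, Z longer
    # 2 replacements plus 1 character shorter
    print("strict: ", fits_strict(2, 1, 0, *limits))
    print("relaxed:", fits_relaxed(2, 1, 0, *limits))
```

With X = Y = Z = 2, two replacements plus one character shorter use 150% of the
single strict pool, but only 100% of the replacement pool and 50% of the
insert/delete pool in relaxed mode.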

More details and an example can be found in the comments at
https://opengrok.libreoffice.org/xref/core/i18npool/source/search/levdis.hxx?r=ee8f0a10#26
