Two-steps system in searching similar words for fast and reliable automatic concatenation of Russian sub-word units

Free access

In this paper we describe and investigate a two-step system that filters out inappropriate words when searching the lexicon for similar words during automatic concatenation of Russian sub-word units. The system computes the Levenshtein distance in the first step and a similarity coefficient, given by a relevance function, in the second step. We also compare the performance of the Wagner-Fischer algorithm with that of the suggested algorithm.
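The two-step idea can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the relevance function, so `difflib.SequenceMatcher.ratio()` from the standard library is used here purely as a stand-in, and the names `levenshtein`, `relevance`, `find_similar` and the thresholds `max_distance` and `min_relevance` are hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher


def levenshtein(a: str, b: str) -> int:
    """Edit distance via the Wagner-Fischer dynamic programme (two rows only)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def relevance(a: str, b: str) -> float:
    """Stand-in similarity coefficient in [0, 1].

    Assumption: the paper's relevance function is not given in the abstract,
    so difflib's ratio is used only to make the sketch runnable.
    """
    return SequenceMatcher(None, a, b).ratio()


def find_similar(query, lexicon, max_distance=2, min_relevance=0.7):
    """Return (word, score) pairs that survive both steps, best score first."""
    # Step 1: discard lexicon entries that are too far away in edit distance.
    candidates = [w for w in lexicon if levenshtein(query, w) <= max_distance]
    # Step 2: score the survivors and drop those below the relevance threshold.
    scored = [(w, relevance(query, w)) for w in candidates]
    scored = [(w, s) for w, s in scored if s >= min_relevance]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    lexicon = ["молока", "молоток", "корова"]
    for word, score in find_similar("молоко", lexicon):
        print(word, round(score, 3))
```

In this sketch the first step acts as a coarse filter and the second step ranks what remains; both thresholds would need tuning on a real lexicon.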

Fuzzy search algorithm, Levenshtein distance, relevance function

Short URL: https://sciup.org/148177134

IDR: 148177134

Article