Frequency dictionary construction based on the source text using lemmatization

Authors: Kovalev Igor Vladimirovich, Seredin Alexander Igorevich, Karaseva Margarita Vladimirovna, Zelenkov Pavel Viktorovich, Khrapunova Valeriya Vladimirovna

Journal: Сибирский аэрокосмический журнал (Siberian Aerospace Journal) @vestnik-sibsau

Section: Mathematics, mechanics, computer science

Article in issue: 4 (50), 2013.

Free access

The paper addresses reducing the complexity of studying an information-vocabulary basis by reducing the size of the frequency dictionary on which the basis is built. It considers constructing a frequency dictionary from the source text using lemmatization, for the subsequent formation of the information-vocabulary basis. An algorithm for this construction is presented, together with a modification that checks the terms of the generated frequency dictionary against a specialized dictionary.
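As an illustration only, the general idea can be sketched in Python: tokenize the source text, map each word form to its lemma, count the lemmas, and optionally keep only lemmas found in a specialized dictionary. This is not the authors' algorithm, which is not given on this page. The names `LEMMAS`, `lemmatize`, and `build_frequency_dictionary` are hypothetical, and the tiny lookup table stands in for a real morphological analyzer.

```python
import re
from collections import Counter

# Hypothetical toy lemma table (assumption): a real system would use a
# morphological analyzer instead of a hand-written mapping.
LEMMAS = {
    "dictionaries": "dictionary",
    "texts": "text",
    "words": "word",
    "counts": "count",
}


def lemmatize(token):
    """Return the lemma for a token, or the token itself if unknown."""
    return LEMMAS.get(token, token)


def build_frequency_dictionary(text, specialized_terms=None):
    """Count lemma occurrences in text.

    If specialized_terms (a set of lemmas) is given, only lemmas present
    in that set are kept, which shrinks the resulting dictionary.
    """
    # Split into lowercase alphabetic tokens (Unicode letters only).
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    freq = Counter(lemmatize(t) for t in tokens)
    if specialized_terms is not None:
        freq = Counter(
            {lemma: n for lemma, n in freq.items() if lemma in specialized_terms}
        )
    return freq


sample = "Dictionaries count words. A dictionary counts a word in texts."
full = build_frequency_dictionary(sample)
filtered = build_frequency_dictionary(sample, {"dictionary", "text"})
```

Because word forms such as "dictionaries" and "dictionary" collapse into one entry, the lemmatized dictionary has fewer entries than a raw word-form count, and filtering against a specialized term list reduces it further.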

Frequency dictionary, information-vocabulary basis, lemmatization

Short URL: https://sciup.org/148177155

IDR: 148177155

Research article