Frequency dictionary construction based on the source text using lemmatization
Автор: Kovalev Igor Vladimirovich, Seredin Alexander Igorevich, Karaseva Margarita Vladimirovna, Zelenkov Pavel Viktorovich, Khrapunova Valeriya Vladimirovna
Журнал: Сибирский аэрокосмический журнал @vestnik-sibsau
Рубрика: Математика, механика, информатика
Статья в выпуске: 4 (50), 2013 года.
Бесплатный доступ
The issue of reducing the complexity of the information-vocabulary basis study by decreasing the amount of the frequency dictionary (on which base the basis is constructed), is considered. The frequency dictionary construction based on the source text using lemmatization for the subsequent formation of information-vocabulary basis is considered. The algorithm for frequency dictionary construction based on the source text using lemmatization is presented, as well as the modification of this algorithm with checking the terms of the generated frequency dictionary by a specialized dictionary.
Frequency dictionary, information-vocabulary basis, lemmatization
Короткий адрес: https://sciup.org/148177155
IDR: 148177155