Development of a text-mining program for analysis of documentation on the disposal of radioactive wasteproblem
Автор: Nuzhny A. S., Sorokin D. I.
Журнал: Труды Московского физико-технического института @trudy-mipt
Рубрика: Информатика и управление
Статья в выпуске: 1 (45) т.12, 2020 года.
Бесплатный доступ
The program of contextual and thematic analysis of documents is presented. The program processes documents in PDF format, builds a reverted index of corpora, and other service information, which allows the user to search for fragments of text that meets the entered query or selected topic. In case of the topic search, the program searches for texts similar to the training examples. The thematic analysis of the text corpora allows user to detect the presence or absence of those or other typical topics, assesses the completeness of the information provided.
Natural language processing, semantic analysis, contextual search, machine learning
Короткий адрес: https://sciup.org/142223093
IDR: 142223093