Development of a text-mining program for analysis of documentation on the disposal of radioactive wasteproblem

Бесплатный доступ

The program of contextual and thematic analysis of documents is presented. The program processes documents in PDF format, builds a reverted index of corpora, and other service information, which allows the user to search for fragments of text that meets the entered query or selected topic. In case of the topic search, the program searches for texts similar to the training examples. The thematic analysis of the text corpora allows user to detect the presence or absence of those or other typical topics, assesses the completeness of the information provided.

Natural language processing, semantic analysis, contextual search, machine learning

Короткий адрес: https://sciup.org/142223093

IDR: 142223093

Статья научная