Automatic summarization model oriented toward automatic translation (based on the knowledge base)
Автор: Osminin Pavel G.
Журнал: Вестник Южно-Уральского государственного университета. Серия: Лингвистика @vestnik-susu-linguistics
Рубрика: Зеленые страницы
Статья в выпуске: 2 т.11, 2014 года.
Бесплатный доступ
The present paper is concerned with a model of automatic summarization for scientific and technical texts, oriented toward automatic translation. The model consists of three main components: a keyword extractor, a knowledge base and a summarization algorithm. The summary text is generated in the form excluding linguistic phenomena that can cause problems during automatic translation (the syntactic complexity of the sentence is controlled and its length is limited, ellipsis and long subordinate clauses are not allowed). Rules for summary generation define the grammar of summary sentences. The summarization algorithm consists of four top level procedures - preprocessing and analysis of the article text, summary content selection and summary text generation.
Automatic summarization, automatic translation, information extraction, knowledge base
Короткий адрес: https://sciup.org/147153904
IDR: 147153904