An integrated approach to the analysis of argumentative relationships in scientific communication texts

Authors: Sidorova E.A., Akhmadeeva I.R., Zagorulko Yu.A., Kononenko I.S., Sery A.S., Chagina P.M., Shestakov V.K.

Journal: Ontology of Designing @ontology-of-designing

Section: Ontology engineering

Published in issue: 4 (50), vol. 13, 2023.

Free access

This paper addresses the automatic analysis of argumentation in scientific communication texts. Argumentation is understood as an ordered set of arguments used to support a certain thesis. An argument includes at least one premise and one conclusion, connected by an argumentative relation. The purpose of the work is an experimental study of neural network approaches to detecting and extracting argumentative relations between statements located close together in the text. The study was conducted on a corpus of texts with argumentative markup created using a previously developed web platform. The corpus included scientific news texts, analytical articles from the Habr website, scientific articles, and reviews. Datasets for machine learning were built from these texts. To improve the quality of neural network model training, the datasets were augmented with new data produced by automatic paraphrasing and double translation. Two approaches to training models were considered: the first labels indicators in the texts, and the second pre-trains a language model on the task of predicting indicators. To evaluate model performance, an approach was proposed based on estimates of agreement between experts, which are typically used to compare manually created markups of texts. A comparison of agreement coefficients between experts and trained models showed that the model with labeled indicators almost reached the quality threshold for extracting argumentative relations. Model errors were analyzed manually by visualizing the obtained results. The novelty of the work thus lies in applying an integrated approach to creating datasets, training models, and evaluating the results of automatic extraction of argumentative relations.
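The evaluation idea described in the abstract, treating a trained model as one more annotator and comparing its agreement with an expert against the agreement between experts, can be illustrated with a small sketch. The abstract does not name the agreement coefficient used, so Cohen's kappa is assumed here purely for illustration; the function name `cohen_kappa`, the label set, and the toy label sequences are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same items.

    Assumption: the paper's actual agreement coefficient is not stated
    in the abstract; kappa is used here only as a common example.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: share of items on which both annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the annotators' marginal label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    categories = set(freq_a) | set(freq_b)
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label everywhere.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for pairs of neighboring statements:
# "support", "attack", or "none" (no argumentative relation).
expert_1 = ["support", "none", "support", "attack", "none", "none", "support", "none"]
expert_2 = ["support", "none", "none", "attack", "none", "none", "support", "support"]
model = ["support", "none", "support", "none", "none", "none", "support", "none"]

# Expert-to-expert agreement serves as the quality threshold;
# expert-to-model agreement is then compared against it.
threshold = cohen_kappa(expert_1, expert_2)
model_agreement = cohen_kappa(expert_1, model)
print(f"expert vs expert: {threshold:.3f}")
print(f"expert vs model:  {model_agreement:.3f}")
```

Under this reading, a model whose agreement with an expert approaches the expert-to-expert value is performing close to the level at which human annotators themselves agree, which is the sense in which a quality threshold is "almost reached".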


Keywords: argumentation, automatic analysis, text markup, argumentative relations, argumentation indicator, markup consistency, dataset

Short URL: https://sciup.org/170201899

IDR: 170201899   |   DOI: 10.18287/2223-9537-2023-13-4-562-579

Research article