Unsupervised graph extraction for improvement of multi-domain task-oriented dialogue modelling

Автор: Yusupov I.F., Trofimova M.V., Burtsev M.S.

Журнал: Труды Московского физико-технического института @trudy-mipt

Рубрика: Информатика и управление

Статья в выпуске: 3 (47) т.12, 2020 года.

Бесплатный доступ

This paper proposes a dialogue graph extraction method based on a textual corpus. The dialogue graph displays the main conversational topics for each turn of the dialogue and defines transitions between them. A node in this graph is represents the main conversational topic for the dialogue turn and the edge stands for a possible transition to one of the adjacent topics. The proposed method of dialogue graph extraction is based on a set of clustering algorithms accompanied by a set of heuristics. It is applied to variable size corpora from different domains. We provide visualizations of resulting dialogue graphs for a number of datasets and visualizations aid understanding of the underlying latent dialogue structure in each corpus. As a proof of the concept, we integrate the graph features by a sequence-to-sequence model and improve the current baseline on MultiWOZ 2 response generation task by more than 15%.

Еще

Dialogue systems, dialogue structure, visualization, dialogue graph

Короткий адрес: https://sciup.org/142230088

IDR: 142230088

Статья научная