A dialogue agent with a trainable dialogue manager

Authors: Yusupov I.F., Kuratov Yu.M.

Journal: Proceedings of the Moscow Institute of Physics and Technology (Труды Московского физико-технического института)

Issue: No. 4 (48), Vol. 12, 2020.

Free access

This paper presents bot#1337, a dialogue agent built for the NIPS Conversational Intelligence Challenge 2017 (ConvAI), a competition for developing dialogue agents. The agent can hold a conversation with a human about a given text. It conducts the dialogue using skills for topic detection, summarization, question answering, question generation, and open-domain chit-chat. The system was trained to select the appropriate skill for producing each response. The bot was built with open-source tools and open data; it does not depend on external services and can run in an isolated environment; and its dialogue manager is trainable. The last property lets the developer focus on building skills rather than on specifying a finite-state machine for the agent. Bot#1337 won the competition with an average dialogue quality score of 2.78, as assigned by human evaluators. The source code and trained models of the bot are available on GitHub.
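
The idea of a trainable dialogue manager that picks a skill for each response can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the skill functions are placeholders, the `SkillSelector` class, the `TRAIN` examples, and the bag-of-words perceptron are hypothetical and are not the classifier or the data used by bot#1337.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple


# Hypothetical placeholder skills; real skills would wrap trained models.
def summarize(text: str, utterance: str) -> str:
    return "Summary: " + text.split(".")[0].strip() + "."


def answer_question(text: str, utterance: str) -> str:
    return "Answer (placeholder) based on: " + text[:40]


def chitchat(text: str, utterance: str) -> str:
    return "Interesting! Tell me more."


SKILLS: Dict[str, Callable[[str, str], str]] = {
    "summarization": summarize,
    "question_answering": answer_question,
    "chitchat": chitchat,
}


def features(utterance: str) -> List[str]:
    # Lowercased whitespace tokens; "?" is split off so it becomes a feature.
    return utterance.lower().replace("?", " ?").split()


class SkillSelector:
    """Multiclass perceptron that maps a user utterance to a skill name."""

    def __init__(self, labels: List[str]) -> None:
        self.labels = list(labels)
        self.weights = {label: defaultdict(float) for label in self.labels}

    def score(self, label: str, tokens: List[str]) -> float:
        return sum(self.weights[label][t] for t in tokens)

    def predict(self, utterance: str) -> str:
        tokens = features(utterance)
        return max(self.labels, key=lambda label: self.score(label, tokens))

    def fit(self, examples: List[Tuple[str, str]], epochs: int = 10) -> None:
        # Standard perceptron update: reward the gold skill, penalize the guess.
        for _ in range(epochs):
            for utterance, gold in examples:
                guess = self.predict(utterance)
                if guess != gold:
                    for t in features(utterance):
                        self.weights[gold][t] += 1.0
                        self.weights[guess][t] -= 1.0


# Toy labeled utterances (invented for this sketch).
TRAIN: List[Tuple[str, str]] = [
    ("Can you summarize the text", "summarization"),
    ("Give me a short summary", "summarization"),
    ("Who wrote this?", "question_answering"),
    ("When did it happen?", "question_answering"),
    ("Hello, how are you", "chitchat"),
    ("I like music", "chitchat"),
]


def respond(selector: SkillSelector, text: str, utterance: str) -> str:
    # The manager only chooses a skill; the chosen skill produces the reply.
    skill = selector.predict(utterance)
    return SKILLS[skill](text, utterance)


selector = SkillSelector(list(SKILLS))
selector.fit(TRAIN)
print(respond(selector, "Cats sleep a lot. They also purr.",
              "Can you summarize the text"))
```

In this sketch, adding a skill means registering one more function in `SKILLS` and supplying labeled utterances for it; no hand-written state transitions are involved, which is the property the abstract attributes to a trainable dialogue manager.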

Keywords: dialogue systems, natural language processing, dialogue manager

Short URL: https://sciup.org/142230090

IDR: 142230090

Research article