Robust and reliable techniques for speech-based emotion recognition
Автор: Brester C. Yu., Semenkina O.E., Sidorov M. Yu.
Журнал: Сибирский аэрокосмический журнал @vestnik-sibsau
Рубрика: Математика, механика, информатика
Статья в выпуске: 1 т.16, 2015 года.
Бесплатный доступ
One of the crucial challenges related to the spacecraft control is the monitoring of the mental state of crew members as well as operators of the flight control centre. In most cases, visual information is not sufficient, because spacemen are trained to cope with feelings and not to express emotions explicitly. In order to identify the genuine mental state of a crew member, it is reasonable to engage the acoustic characteristics obtained from speech signals presenting voice commands during the spacecraft control and interpersonal communication. Human emotion recognition implies flexible algorithmic techniques satisfying the requirements of reliability and fast operation in real time. In this paper we consider the heuristic feature selection procedure based on the self-adaptive multi-objective genetic algorithm that allows the number of acoustic characteristics involved in the recognition process to be reduced. The effectiveness of this approach and its robustness property are revealed in experiments with various classification models. The usage of this procedure leads to a reduction of the feature space dimension by a factor of two (from 384 to approximately 180 attributes), which means decreasing the time resources spent by the recognition algorithm. Moreover, it is proposed to implement some algorithmic schemes based on collective decision making by the set of classifiers (Multilayer Perceptron, Support Vector Machine, Linear Logistic Regression) that permits the improvement of the recognition quality (by up to 10% relative improvement). The developed algorithmic schemes provide a guaranteed level of effectiveness and might be used as a reliable alternative to the random choice of a classification model. Due to the robustness property the heuristic feature selection procedure is successfully applied on the data pre-processing stage, and then the approaches realizing the collective decision making schemes are used.
Emotion recognition, adaptive multi-objective genetic algorithm, classifier, collective decision making
Короткий адрес: https://sciup.org/148177407
IDR: 148177407
Список литературы Robust and reliable techniques for speech-based emotion recognition
- Sidorov M., Ultes S., Schmitt A. Emotions are a personal thing: Towards speaker-adaptive emotion recognition//ICASSP. 2014. P. 4803-4807
- Eyben F., Wöllmer M., Schuller B. Opensmile: the munich versatile and fast opensource audio feature extractor//Proceedings of the Intern. Conf. on Multimedia. ACM. 2010. P. 1459-1462
- Boersma P. Praat, a system for doing phonetics by computer//Glot international. 2002. № 5(9/10). P. 341-345
- Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm/M. Sidorov //LREC. 2014. P. 3481-3485
- Self-adaptive multi-objective genetic algorithms for feature selection/C. Brester //Proceedings of Intern. Conf. on Engineering and Applied Sciences Optimization (OPT-i’14). 2014. P. 1838-1846
- Kohavi R., John G. H. Wrappers for feature subset selection//Artificial Intelligence. 1997. 97. P. 273-324
- Venkatadri M., Srinivasa Rao K. A multiobjective genetic algorithm for feature selection in data mining//International J. of Computer Science and Information Technologies. 2010. Vol. 1, no. 5. P. 443-448
- Brester C., Sidorov M., Semenkin E. Acoustic Emotion Recognition: Two Ways of Features Selection Based on Self-Adaptive Multi-Objective Genetic Algorithm//Proceedings of the Intern. Conf. on Informatics in Control, Automation and Robotics (ICINCO). 2014. P. 851-855
- Zitzler E., Thiele L. Multiobjective evolutionary algorithms: A comparative case study and the strength pareto approach//Evolutionary Computation, IEEE Transactions on. 1999. Vol. 3, no. 4. P. 257-271
- Sergienko R., Semenkin E. Competitive Cooperation for Strategy Adaptation in Coevolutionary Genetic Algorithm for Constrained Optimization//IEEE World Congress on Computational Intelligence (WCCI'2010). Barcelona, 2010. P. 1626-1631
- Daridi F., Kharma N., Salik J. Parameterless genetic algorithms: review and innovation//IEEE Canadian Review. 2004. № 47. P. 19-23
- A database of german emotional speech/F. Burkhardt //In Interspeech. 2005. P. 1517-1520
- Grimm M., Kroschel K., Narayanan S. The vera am mittag german audio-visual emotional speech database//In Multimedia and Expo, IEEE Intern. Conf. on, IEEE. 2008. P. 865-868
- Constructing a spoken dialogue corpus for studying paralinguistic information in expressive conversation and analyzing its statistical/acoustic characteristics/H. Mori //Speech Communication. 2011. 53 р
- The WEKA Data Mining Software: An Update, SIGKDD Explorations/M. Hall . 2009. Vol. 11, iss. 1
- Goutte C., Gaussier E. A probabilistic interpretation of precision, recall and F-score, with implication for evaluation//ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research. 2005. P. 345-359
- Попов Е. А., Семенкина М. Е., Липинский Л. В. Принятие решений коллективом интеллектуальных информационных технологий//Вестник СибГАУ. 2012. № 5 (45). C. 95-99