Multi-objective genetic algorithms as an effective tool for feature selection in the speech-based emotion recognition problem
Автор: Brester Ch. Yu., Semenkina O.E., Sidorov M. Yu.
Журнал: Сибирский аэрокосмический журнал @vestnik-sibsau
Рубрика: Математика, механика, информатика
Статья в выпуске: 1 т.17, 2016 года.
Бесплатный доступ
Feature selection is a quite important step in data analysis. Extracting relevant attributes may not only decrease the dimensionality of the dataset and, consequently, reduce time costs spent on the next stages, but also contribute to the quality of the final solution. In this paper we demonstrate some positive effects of the usage of a heuristic feature selection scheme which is based on a two-criterion optimization model. The approach proposed is applied to the speech-based emotion recognition problem, which is currently one of the most important issues in human-machine interactions. A number of high-dimensional multilingual (English, German, Japanese) databases are involved to investigate the effectiveness of the technique presented. Three different multi-objective genetic algorithms and their cooperative modifications are applied as optimizers in combination with classification models such as a Multilayer Perceptron, a Support Vector Machine and Logistic Regression. In most cases we may observe not only a dimensionality reduction, but also an improvement in the recognition quality. To avoid choosing the most effective multi-objective genetic algorithm and the best classifier, we suggest applying a heterogeneous genetic algorithm based on several heuristics and an ensemble of diverse classification models.
Feature selection, multi-objective genetic algorithm, island model, speech-based emotion recognition
Короткий адрес: https://sciup.org/148177547
IDR: 148177547
Список литературы Multi-objective genetic algorithms as an effective tool for feature selection in the speech-based emotion recognition problem
- Brester C., Semenkin E., Sidorov M., Kovalev I., Zelenkov P. Evolutionary feature selection for emotion recognition in multilingual speech analysis. Proceedings of the IEEE Congress on Evolutionary Computation (CEC2015), Sendai, Japan, 2015, p. 2406-2411.
- Sidorov M., Brester Ch., Schmitt A. Contemporary stochastic feature selection algorithms for speech-based emotion recognition. Proceedings of INTERSPEECH 2015, Dresden, Germany, in press.
- Kohavi R., John G. H. Wrappers for feature subset selection. Artificial Intelligence, 97, 1997, p. 273-324.
- Venkatadri M., Srinivasa Rao K. A multiobjective genetic algorithm for feature selection in data mining. International Journal of Computer Science and Information Technologies, vol. 1, no. 5, 2010, p. 443-448.
- Deb K., Pratap A., Agarwal S., Meyarivan T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation 6 (2), 2002, p. 182-197.
- Wang R. Preference-inspired co-evolutionary algorithms. A thesis submitted in partial fulfillment for the degree of the Doctor of Philosophy, University of Sheffield, 2013, p. 231.
- Zitzler E., Laumanns M., Thiele L. SPEA2: Improving the Strength Pareto Evolutionary Algorithm for Multiobjective Optimization. Evolutionary Methods for Design Optimisation and Control with Application to Industrial Problems EUROGEN 2001, 3242 (103), 2002, p. 95-100.
- Whitley D., Rana S., and Heckendorn R. Island model genetic algorithms and linearly separable problems. Proceedings of AISB Workshop on Evolutionary Computation, Manchester, UK. Springer, Vol. 1305 of LNCS, 1997, P. 109-125.
- Brester Ch., Semenkin E. Cooperative Multi-objective Genetic Algorithm with Parallel Implementation. Advances in Swarm and Computational Intelligence, Lecture Notes in Computer Science 9140, 2015, p. 471-478.
- Boersma P. Praat, a system for doing phonetics by computer. Glot international, Vol. 5, No. 9/10, 2002, P. 341-345.
- Eyben F., Wöllmer M., and Schuller B. Opensmile: the munich versatile and fast opensource audio feature extractor. Proceedings of the international conference on Multimedia, 2010. ACM, P. 1459-1462.
- Burkhardt F., Paeschke A., Rolfes M., Sendlmeier W. F., and Weiss B. A database of German emotional speech. In Interspeech, 2005, P. 1517-1520.
- Haq S., Jackson P. Machine Audition: Principles, Algorithms and Systems. Chapter Multimodal Emotion Recognition, IGI Global, Hershey PA, Aug. 2010, P. 398-423.
- Mori H., Satake T., Nakamura M., and Kasuya H. Constructing a spoken dialogue corpus for studying paralinguistic information in expressive conversation and analyzing its statistical/acoustic characteristics. Speech Communication, 53, 2011, P. 36-50.
- Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I. H. The WEKA Data Mining Software: An Update. SIGKDD Explorations, Vol. 11, Iss. 1, 2009, P. 10-18.
- Goutte C., Gaussier E. A probabilistic interpretation of precision, recall and F-score, with implication for evaluation. ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval
- Research, 2005, P. 345-359.