Emotion recognition and speaker identification from speech
Автор: Sidorov Maxim Yuryevich, Zablotskiy Sergey Genadyevich, Minker Wolfgang, Semenkin Evgeny Stanislavovich
Журнал: Сибирский аэрокосмический журнал @vestnik-sibsau
Рубрика: 2-я международная конференция по математическим моделям и их применению
Статья в выпуске: 4 (50), 2013 года.
Бесплатный доступ
The performance of spoken dialogue systems (SDS) is not perfect yet, especially for some languages. Emotion recognition from speech (ER) is a technique which can improve the SDS behavior by finding critical points in the human-machine interaction and changing a dialogue strategy. Inclusion of the speaker specific information, by conducting the speaker identification procedure (SI) at the set up of ER task could also be used in order to improve the dialogue quality. Choosing of both appropriate speech signal features and machine learning algorithms for the ER and SI remain a complex and challenging problem. More than 50 machine learning algorithms were applied in the study for ER and SI tasks, using 9 multi-language corpora (Russian, English, German, and Japanese) of both acted and non-acted emotional utterance recordings. The study provides the results of evaluation as well as their analysis and future directions.
Emotion recognition from speech, speaker identification from speech, machine learning algorithms, speaker adaptive emotion recognition from speech
Короткий адрес: https://sciup.org/148177133
IDR: 148177133