A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf
.
Speech-based Emotion Recognition and Speaker Identification: Static vs. Dynamic Mode of Speech Representation
2016
Journal of Siberian Federal University Mathematics & Physics
In this paper we present the performance of different machine learning algorithms for the problems of speech-based Emotion Recognition (ER) and Speaker Identification (SI) in static and dynamic modes of speech signal representation. We have used a multi-corporal, multi-language approach in the study. 3 databases for the problem of SI and 4 databases for the ER task of 3 different languages (German, English and Japanese) have been used in our study to evaluate the models. More than 45 machine
doi:10.17516/1997-1397-2016-9-4-518-523
fatcat:mj7jny64v5fzjcsvl7ec3dhn3m