Vocal Emotion Recognition with Log-Gabor Filters

Yu Gu, Eric Postma, Hai-Xiang Lin
2015 Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge - AVEC '15  
Vocal emotion recognition aims to identify the emotional states of speakers by analyzing their speech signal. This paper builds on the work of Ezzat, Bouvrie and Poggio [5] by performing a spectro-temporal analysis of affective vocalizations by decomposing the associated spectrogram with 2D Gabor filters. Based on the previous studies of the emotion expression in voices and the turn out display in spectrogram, we assumed that each vocal emotion has a unique spectro-temporal signature in terms
more » ... orientated energy bands which can be detected by properly tuned Gabor filters. We compared the emotion-recognition performances of tuned log-Gabor filters with standard acoustic features. The experimental results show that applying pairs of log-Gabor filters to extract features from the spectrogram yields a performance that matches the performance of an approach based on traditional acoustic features. Their combined emotion recognition performance outperforms stateof-the-art vocal emotion recognition algorithms. This leads us to conclude that tuned log-Gabor filters support the automatic recognition of emotions from speech and may be beneficial to other speech-related tasks.
doi:10.1145/2808196.2811635 dblp:conf/mm/GuPL15 fatcat:rb6y3aejdzf4tmg7g3mdn2fgvq