A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
GAN-Based Data Generation for Speech Emotion Recognition
2020
Interspeech 2020
In this work, we propose a GAN-based method to generate synthetic data for speech emotion recognition. Specifically, we investigate the usage of GANs for capturing the data manifold when the data is eyes-off, i.e., where we can train networks using the data but cannot copy it from the clients. We propose a CNN-based GAN with spectral normalization on both the generator and discriminator, both of which are pre-trained on large unlabeled speech corpora. We show that our method provides better
doi:10.21437/interspeech.2020-2898
dblp:conf/interspeech/EskimezDGK20
fatcat:nsmqvjiwenewzdj6kwsavn5eo4