Multilingual Data Selection for Low Resource Speech Recognition
2016
Interspeech 2016
Feature representations extracted from deep neural network-based multilingual frontends provide significant improvements to speech recognition systems in low resource settings. To effectively train these frontends, we introduce a data selection technique that discovers language groups from an available set of training languages. This data selection method reduces the required amount of training data and training time by approximately 40%, with minimal performance degradation. We present speech
doi:10.21437/interspeech.2016-598
dblp:conf/interspeech/ThomasACKR16
fatcat:rnumqswls5ek3mfbrscx5omjwy