Auditory processing-based features for improving speech recognition in adverse acoustic conditions

Hari Krishna Maganti, Marco Matassoni
2014 EURASIP Journal on Audio, Speech, and Music Processing  
The paper describes an auditory processing-based feature extraction strategy for robust speech recognition in environments, where conventional automatic speech recognition (ASR) approaches are not successful. It incorporates a combination of gammatone filtering, modulation spectrum and non-linearity for feature extraction in the recognition chain to improve robustness, more specifically the ASR in adverse acoustic conditions. The experimental results with standard Aurora-4 large vocabulary
more » ... rge vocabulary evaluation task revealed that the proposed features provide reliable and considerable improvement in terms of robustness in different noise conditions and are comparable to those of standard feature extraction techniques.
doi:10.1186/1687-4722-2014-21 fatcat:l23imkaodbbo7j7esliy3ln4ja