A perceptual masking approach for noise robust speech recognition

Hari Krishna Maganti, Marco Matassoni
2012 EURASIP Journal on Audio, Speech, and Music Processing  
This article describes a modified technique for enhancing noisy speech to improve automatic speech recognition (ASR) performance. The proposed approach improves the widely used spectral subtraction which inherently suffers from the associated musical noise effects. Through a psychoacoustic masking and critical band variance normalization technique, the artifacts produced by spectral subtraction are minimized for improving the ASR accuracy. The popular advanced ETSI-2 front end is tested for
more » ... arison purposes. The performed speech recognition evaluations on the noisy standard AURORA-2 tasks show enhanced performance for all noise conditions.
doi:10.1186/1687-4722-2012-29 fatcat:azwumrymgfcsbcnuhp3cdfhyhi