Robust Speech Feature Extraction by Growth Transformation in Reproducing Kernel Hilbert Space

Shantanu Chakrabartty, Yunbin Deng, Gert Cauwenberghs
2007 IEEE Transactions on Audio, Speech, and Language Processing  
The performance of speech recognition systems depends on consistent quality of the speech features across variable environmental conditions encountered during training and evaluation. This paper presents a kernel-based nonlinear predictive coding procedure that yields speech features which are robust to nonstationary noise contaminating the speech signal. Features maximally insensitive to additive noise are obtained by growth transformation of regression functions that span a reproducing kernel
more » ... reproducing kernel Hilbert space (RKHS). The features are normalized by construction and extract information pertaining to higher-order statistical correlations in the speech signal. Experiments with the TI-DIGIT database demonstrate consistent robustness to noise of varying statistics, yielding significant improvements in digit recognition accuracy over identical models trained using Mel-scale cepstral features and evaluated at noise levels between 0 and 30-dB signal-to-noise ratio.
doi:10.1109/tasl.2007.899285 fatcat:76e4ivmhqjhphn4dqspswdpkne