Quantization of Speech Features: Source Coding [chapter]

Stephen So, Kuldip K. Paliwal
2008 Advances in Pattern Recognition  
In this chapter, we describe various schemes for quantizing speech features to be used in distributed speech recognition (DSR) systems. We have analyzed the statistical properties of MFCCs that are most relevant to quantization, namely the correlation and probability density function shape, in order to determine the type of quantization scheme that would be most suitable for quantizing them efficiently. We also determine empirically the relationship between mean squared error and recognition
more » ... uracy in order to verify that quantization schemes, which minimize mean squared error, are also guaranteed to improve the recognition performance. Furthermore, we highlight the importance of noise robustness in DSR and describe the use of a perceptually weighted distance measure to enhance spectral peaks in vector quantization. Finally, we present some experimental results on the quantization schemes in a DSR framework and compare their relative recognition performances.
doi:10.1007/978-1-84800-143-5_7 fatcat:xvf3qefxhfcv5mahty7zt7k64a