Maximum mutual information based acoustic-features representation of phonological features for speech recognition

M. Kamal Omar, Mark Hasegawa-Johnson
2002 IEEE International Conference on Acoustics Speech and Signal Processing  
This paper addresses the problem of finding a subset of the acoustic feature space that best represents a set of phonological features. A maximum mutual information approach is presented for selecting acoustic features to be combined together to represent the distinctions coded by a set of correlated phonological features. Each set of phonological features is chosen on the basis of acoustic phonetic similarity, so the sets can be considered approximately independent. This means that the output
more » ... ns that the output of recognizers that recognize these sets independently using the acoustic representation achieved by an algorithm presented in this paper can be combined together to increase efficiency and robustness of speech recognition systems. The mutual information between the phonological feature sets and their achieved acoustic representation is increased by up to 220% over the best single-type acoustic representation in the feature space of the same length.
doi:10.1109/icassp.2002.5743659 dblp:conf/icassp/OmarH02 fatcat:g2dqvqmiwzg37bxiymejnrew4q