A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure

Cees H. Taal, Richard C. Hendriks, Richard Heusdens
2012 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
A speech pre-processing algorithm is presented to improve the speech intelligibility in noise for the near-end listener. The algorithm improves the intelligibility by optimally redistributing the speech energy over time and frequency for a perceptual distortion measure, which is based on a spectro-temporal auditory model. In contrast to spectral-only models, short-time information is taken into account. As a consequence, the algorithm is more sensitive to transient regions, which will therefore
more » ... hich will therefore receive more amplification compared to stationary vowels. It is known from literature that changing the vowel-transient energy ratio is beneficial for improving speechintelligibility in noise. Objective intelligibility prediction results show that the proposed method has higher speech intelligibility in noise compared to two other reference methods, without modifying the global speech energy.
doi:10.1109/icassp.2012.6288810 dblp:conf/icassp/TaalHH12 fatcat:o6drb3glq5be5fast4urbbtmbq