STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement

Martin Krawczyk, Timo Gerkmann
2014 IEEE/ACM Transactions on Audio, Speech, and Language Processing
The enhancement of speech corrupted by noise is commonly performed in the short-time discrete Fourier transform domain. When only a single microphone signal is available, typically only the spectral amplitude is modified. However, it has recently been shown that an improved spectral phase can also be utilized for speech enhancement, e.g., for phase-sensitive amplitude estimation. In this paper we therefore present a method to reconstruct the spectral phase of voiced speech from ... the fundamental frequency and the noisy observation. The importance of the spectral phase is highlighted, and we elaborate on why noise reduction can be achieved by modifying the spectral phase. We show that, when the noisy phase is enhanced using the proposed phase reconstruction, instrumental measures predict an increase in speech quality over a range of signal-to-noise ratios, even without explicit amplitude enhancement.
doi:10.1109/taslp.2014.2354236 fatcat:iomthvmotrcdzezln6s3w6saye