Detecting pitch of singing voice in polyphonic audio

Yipeng Li, DeLiang Wang
Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.  
We propose a robust algorithm to detect the pitch of singing voice in polyphonic audio. A new channel/peak selection scheme is introduced to exploit the salience of singing voice and the beating phenomenon in high frequency channels. An HMM model is employed to integrate the periodicity information across frequency channels and time frames. Quantitative evaluation shows that the new system performs significantly better than existing algorithms for predominant pitch detection in polyphonic audio.
doi:10.1109/icassp.2005.1415635 dblp:conf/icassp/LiW05 fatcat:ho53pugm2jhjbdotqclt36sqwm