A mixture maximization approach to multipitch tracking with factorial hidden Markov models

M. Wohlmayr, M. Stark, F. Pernkopf
2010 2010 IEEE International Conference on Acoustics, Speech and Signal Processing  
We present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. Furthermore, the mixture maximization (MIXMAX) interaction model is employed to yield a probabilistic representation for the mixture of both speakers. Finally, a factorial hidden Markov model is applied for
more » ... . We demonstrate experimental results on two databases, and show the excellent performance of the proposed method in comparison to a well known multipitch tracking algorithm based on correlogram features. Index Terms-Multipitch tracking, factorial hidden Markov model, mixture maximization, Gaussian mixture model.
doi:10.1109/icassp.2010.5495048 dblp:conf/icassp/WohlmayrSP10 fatcat:k62ussrcqbatrfg3fhoysckdze