Music Identification with Weighted Finite-State Transducers

Eugene Weinstein, Pedro Moreno
2007 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07  
Music identification is the process of matching an audio stream to a particular song. Previous work has relied on hashing, where an exact or almost-exact match between local features of the test and reference recordings is required. In this work we present a new approach to music identification based on finite-state transducers and Gaussian mixture models. We apply an unsupervised training process to learn an inventory of music phone units similar to phonemes in speech. We also learn a unique
more » ... quence of music units characterizing each song. We further propose a novel application of transducers for recognition of music phone sequences. Preliminary experiments demonstrate an identification accuracy of 99.5% on a database of over 15,000 songs running faster than real time.
doi:10.1109/icassp.2007.366329 dblp:conf/icassp/WeinsteinM07 fatcat:tngqcbpyljbv3bzxwyk4f4aige