Statistical and Neural Classifiers: Application for Singer and Music Discrimination in Polyphonic Music Context [chapter]

Hassan Ezzaidi, Mohammed Bahoura
2010 Lecture Notes in Computer Science  
The problem of identifying sections of singer voice and instruments is investigated in this paper. Three classification techniques: Linde-Buzo-Gray algorithm (LBG), Gaussian Mixture Models (GMM) and Muli-Layer feed-forwards Perception (MLP). All techniques are based on Mel frequency Cepstres Coefficients (MFCC), which commonly used in the speech and speaker recognition domains, are presented and compared in this paper. All the proposed approaches, yield a decision at every 125 ms only.
more » ... rly, a large experimental data is extracted from the music genre database RWC including various style (68 pieces, 25 subcategories). The recognition scores is evaluated on data used in the training session and others never seen by proposed systems. The best results are obtained with the GMM (94% with train data and 80.5% with test data).
doi:10.1007/978-3-642-13681-8_16 fatcat:fxu5l7io6zdldhxbsq26a3deke