Offline speaker segmentation using genetic algorithms and mutual information

S. Salcedo-Sanz, A. Gallardo-Antolin, J.M. Leiva-Murillo, C. Bousono-Calzon
2006 IEEE Transactions on Evolutionary Computation  
We present an evolutionary approach to speaker segmentation, an activity that is especially important prior to speaker recognition and audio content analysis tasks. Our approach consists of a genetic algorithm (GA), which encodes possible segmentations of an audio record, and a measure of mutual information between the audio data and possible segmentations, which is used as fitness function for the GA. We introduce a compact encoding of the problem into the GA which reduces the length of the GA
more » ... he length of the GA individuals and improves the GA convergence properties. Our algorithm has been tested on the segmentation of real audio data, and its performance has been compared with several existing algorithms for speaker segmentation, obtaining very good results in all test problems. Index Terms-Genetic algorithms (GAs), mutual information, speaker segmentation, unsupervised learning.
doi:10.1109/tevc.2005.857079 fatcat:2s66drnjhzbczc23glsipmqygm