A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
An adaptive initialization method for speaker Diarization based on prosodic features
2010
2010 IEEE International Conference on Acoustics, Speech and Signal Processing
The following article presents a novel, adaptive initialization scheme that can be applied to most state-of-the-art Speaker Diarization algorithms, i.e. algorithms that use agglomerative hierarchical clustering with Bayesian Information Criterion (BIC) and Gaussian Mixture Models (GMMs) of framebased cepstral features (MFCCs). The initialization method is a combination of the recently proposed "adaptive seconds per Gaussian" (ASPG) method and a new pre-clustering and number of initial clusters
doi:10.1109/icassp.2010.5495102
dblp:conf/icassp/ImsengF10
fatcat:gcqf4pxw5bgivpwuxvzpt5l2oa