A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge
2018
Interspeech 2018
This paper focuses on the estimation of the number of speakers for diarization in the context of the DIHARD Challenge at In-terSpeech 2018. This evaluation seeks the improvement of the diarization task in challenging corpora (Youtube videos, meetings, court audios, etc), containing an undetermined number of speakers with different relevance in terms of speech contributions. Our proposal for the challenge is a system based on the ivector PLDA paradigm: Given some initial segmentation of the
doi:10.21437/interspeech.2018-1841
dblp:conf/interspeech/VinalsGOML18
fatcat:465yndbim5ekhgpkuujfmxqnwq