A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is
End-to-end speaker diarization for an unknown number of speakers is addressed in this paper. Recently proposed end-toend speaker diarization outperformed conventional clusteringbased speaker diarization, but it has one drawback: it is less flexible in terms of the number of speakers. This paper proposes a method for encoder-decoder based attractor calculation (EDA), which first generates a flexible number of attractors from a speech embedding sequence. Then, the generated multiple attractorsdoi:10.21437/interspeech.2020-1022 dblp:conf/interspeech/HoriguchiF0XN20 fatcat:rvnoymqvy5cplcw5jifrivdsfa