A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
2020
Interspeech 2020
End-to-end speaker diarization for an unknown number of speakers is addressed in this paper. Recently proposed end-toend speaker diarization outperformed conventional clusteringbased speaker diarization, but it has one drawback: it is less flexible in terms of the number of speakers. This paper proposes a method for encoder-decoder based attractor calculation (EDA), which first generates a flexible number of attractors from a speech embedding sequence. Then, the generated multiple attractors
doi:10.21437/interspeech.2020-1022
dblp:conf/interspeech/HoriguchiF0XN20
fatcat:rvnoymqvy5cplcw5jifrivdsfa