ODESSA at Albayzin Speaker Diarization Challenge 2018
This paper describes the ODESSA submissions to the Albayzin Speaker Diarization Challenge 2018. The challenge addresses the diarization of TV shows. This work explores three different techniques to represent speech segments, namely binary key, x-vector and triplet-loss based embeddings. While training-free methods such as the binary key technique can be applied easily to a scenario where training data is limited, the training of robust neural-embedding extractors is considerably more
doi:10.21437/iberspeech.2018-43 dblp:conf/iberspeech/PatinoDYBBE18 fatcat:gcazgp22u5go3kmoplz3lcsiju