A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Segment-Level Effects of Gender, Nationality and Emotion Information on Text-Independent Speaker Verification
2020
Interspeech 2020
Speaker embeddings extracted from neural network (NN) achieve excellent performance on general speaker verification (SV) missions. Most current SV systems use only speaker labels. Therefore, the interaction between different types of domain information decrease the prediction accuracy of SV. To overcome this weakness and improve SV performance, four effective SV systems were proposed by using gender, nationality, and emotion information to add more constraints in the NN training stage. More
doi:10.21437/interspeech.2020-1700
dblp:conf/interspeech/LiAWD20
fatcat:jawyn4pbkjevnmouqvs4qex62e