SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification [article]

Wentao Zhu, Tianlong Kong, Shun Lu, Jixiang Li, Dawei Zhang, Feng Deng, Xiaorui Wang, Sen Yang, Ji Liu
2021 arXiv pre-print
... energy (MHE), SpeechNAS automatically discovers five network architectures, from SpeechNAS-1 to SpeechNAS-5, of various numbers of parameters and GFLOPs on the large-scale text-independent speaker recognition ... Recently, x-vector has been a successful and popular approach for speaker verification, which employs a time delay neural network (TDNN) and statistics pooling to extract speaker characterizing embedding ... We also employ an advanced large margin based loss to train the candidate architectures for the large scale speaker verification. ...
arXiv:2109.08839v1 fatcat:zli7ayzra5dz7o3yx5bn63nvxe