Pindrop Labs' Submission to the First Multi-Target Speaker Detection and Identification Challenge

Elie Khoury, Khaled Lakhdhar, Andrew Vaughan, Ganesh Sivaraman, Parav Nagarsheth
<span title="2019-09-15">2019</span> <i title="ISCA"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/trpytsxgozamtbp7emuvz2ypra" style="color: black;">Interspeech 2019</a> </i> &nbsp;
This paper summarizes Pindrop Labs' submission to the multitarget speaker detection and identification challenge evaluation (MCE 2018). The MCE challenge is geared towards detecting blacklisted speakers (fraudsters) in the context of call centers. Particularly, it aims to answer the following two questions: Is the speaker of the test utterance on the blacklist? If so, which speaker is it among the blacklisted speakers? While one single system can answer both questions, this work looks at them
more &raquo; ... two separate tasks: blacklist detection and closed-set identification. The former is addressed using four different systems including probabilistic linear discriminant analysis (PLDA), two deep neural network (DNN) based systems, and a simple system based on cosine similarity and logistic regression. The latter is addressed by combining PLDA and neural network based systems. The proposed system was the best performing system at the challenge on both tasks, reducing the blacklist detection error (Top-S EER) by 31.9% and the identification error (Top-1 EER) by 46.4% over the MCE baseline on the evaluation data.
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.21437/interspeech.2019-3179">doi:10.21437/interspeech.2019-3179</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/interspeech/KhouryLVSN19.html">dblp:conf/interspeech/KhouryLVSN19</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ogri7kw43zc7dgq3s7n5jn43ta">fatcat:ogri7kw43zc7dgq3s7n5jn43ta</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211209191629/https://www.isca-speech.org/archive/pdfs/interspeech_2019/khoury19_interspeech.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/35/17/351776bc712a12a19197d62c7e744dc1303216d8.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.21437/interspeech.2019-3179"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>