A partial least squares framework for speaker recognition

Balaji Vasan Srinivasan, Dmitry N. Zotkin, Ramani Duraiswami
<span title="">2011</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</a> </i> &nbsp;
Modern approaches to speaker recognition (verification) operate in a space of "supervectors" created by concatenating the mean vectors of a Gaussian mixture model (GMM) adapted from a universal background model (UBM). Several approaches have been proposed in this space to model inter-class separability and nuisance-attribute variability. We develop a method for modeling the variability associated with each class (speaker) using partial least squares (PLS), a latent variable modeling technique that isolates the most informative subspace for each speaker. The method is tested on NIST SRE 2008 data and provides promising results. It is shown to be robust to noise and to learn the subspace corresponding to a speaker efficiently from training data consisting of multiple utterances.
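To make the idea of a per-speaker latent subspace concrete, the following is a minimal sketch of single-response PLS (PLS1) with NIPALS-style deflation, written in plain Python. It is not the authors' implementation. The one-vs-all labelling (+1 for the target speaker, -1 for impostors), the synthetic 12-dimensional "supervectors", the noise level, and every name in the code (`fit_pls1`, `score`, `fake_supervector`) are assumptions made for illustration only; real supervectors come from UBM-adapted GMM means and are far larger.

```python
import math
import random


def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def center(rows):
    """Subtract the column means; return (centered rows, column means)."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    return [[v - m for v, m in zip(row, means)] for row in rows], means


def fit_pls1(X, y, n_components):
    """Fit PLS1 (single response) by successive deflation of X and y."""
    Xc, x_mean = center(X)
    y_mean = sum(y) / len(y)
    yc = [v - y_mean for v in y]
    components = []  # one (weights, X loadings, y loading) triple per latent variable
    for _ in range(n_components):
        # Weight vector: direction in X with maximal covariance with y.
        w = [dot(col, yc) for col in zip(*Xc)]
        norm = math.sqrt(dot(w, w))
        if norm < 1e-12:  # nothing left to explain
            break
        w = [v / norm for v in w]
        t = [dot(row, w) for row in Xc]  # latent scores of the training rows
        tt = dot(t, t)
        if tt < 1e-12:
            break
        p = [dot(col, t) / tt for col in zip(*Xc)]  # X loadings
        q = dot(yc, t) / tt  # y loading
        # Deflate: remove what this latent variable already explains.
        Xc = [[v - ti * pj for v, pj in zip(row, p)] for row, ti in zip(Xc, t)]
        yc = [v - q * ti for v, ti in zip(yc, t)]
        components.append((w, p, q))
    return x_mean, y_mean, components


def score(model, x):
    """Predict the response for one vector using the same deflation steps."""
    x_mean, y_mean, components = model
    r = [v - m for v, m in zip(x, x_mean)]
    out = y_mean
    for w, p, q in components:
        t = dot(r, w)
        out += q * t
        r = [v - t * pj for v, pj in zip(r, p)]
    return out


# Synthetic stand-in data (an assumption, not NIST SRE data).
rng = random.Random(0)
DIM = 12
speaker_offset = [2.0, 2.0, 2.0] + [0.0] * (DIM - 3)


def fake_supervector(shift):
    """Hypothetical supervector: a mean shift plus Gaussian noise."""
    return [s + rng.gauss(0.0, 0.3) for s in shift]


target = [fake_supervector(speaker_offset) for _ in range(10)]
impostors = [fake_supervector([0.0] * DIM) for _ in range(30)]
labels = [1.0] * len(target) + [-1.0] * len(impostors)

model = fit_pls1(target + impostors, labels, n_components=2)
target_score = score(model, speaker_offset)
impostor_score = score(model, [0.0] * DIM)
print("target-like score:", target_score)
print("impostor-like score:", impostor_score)
```

In this toy setting a test vector resembling the target speaker should score higher than one resembling the impostor pool; a verification decision would then compare the score against a threshold, which the sketch leaves out.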
DOI: https://doi.org/10.1109/icassp.2011.5947548 (doi:10.1109/icassp.2011.5947548)
DBLP: https://dblp.org/rec/conf/icassp/SrinivasanZD11.html (dblp:conf/icassp/SrinivasanZD11)
Fatcat: https://fatcat.wiki/release/dodfty7wjfhv7b4kspkpwkwl4e (fatcat:dodfty7wjfhv7b4kspkpwkwl4e)
