Fast robust inverse transform speaker adapted training using diagonal transformations

H. Jin, S. Matsoukas, R. Schwartz, F. Kubala
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)  
We present a new method of Speaker Adapted Training (SAT) that is more robust, faster, and results in lower error rate than the previous methods. The method, called Inverse Transform SAT (IT-SAT) is based on removing the differences between speakers before training, rather than modeling the differences during training. We develop several methods to avoid the problems associated with inverting the transformation. In one method, we interpolate the transformation matrix with an identity or
more » ... transformation. We also apply constraints to the matrix to avoid estimation problems. Finally, we show that the resulting method is much faster, requires much less disk space, and results in higher accuracy than the original SAT method.
doi:10.1109/icassp.1998.675382 dblp:conf/icassp/JinMSK98 fatcat:fy65bqrjmrdapjakivgcw2nfs4