Voice Conversion Using Exclusively Unaligned Training Data

David Sündermann, Antonio Bonafonte, Harald Höge, Hermann Ney
2004 Revista de Procesamiento de Lenguaje Natural (SEPLN)  
Although all conventional voice conversion approaches require equivalent training utterances of source and target speaker, several recently proposed applications call for breaking this demand. In this paper, we present an algorithm which finds corresponding time frames within unaligned training data. The performance of this algorithm is tested by means of a voice conversion framework based on linear transformation of the spectral envelope. Experimental results are reported on a Spanish ... der corpus utilizing several objective error measures.
dblp:journals/pdln/SundermannBHN04 fatcat:twzxwiu5rzbxxb2bbujquvd64m