A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Speaker-adaptive-trainable Boltzmann machine and its application to non-parallel voice conversion
2017
EURASIP Journal on Audio, Speech, and Music Processing
In this paper, we present a voice conversion (VC) method that does not use any parallel data while training the model. Voice conversion is a technique where only speaker-specific information in the source speech is converted while keeping the phonological information unchanged. Most of the existing VC methods rely on parallel data-pairs of speech data from the source and target speakers uttering the same sentences. However, the use of parallel data in training causes several problems: (1) the
doi:10.1186/s13636-017-0112-6
fatcat:2x4w7mc7rva6fadhixs4o66cgy