Simple and Effective Unsupervised Speech Synthesis [article]

Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James Glass
2022 arXiv   pre-print
We introduce the first unsupervised speech synthesis system based on a simple, yet effective recipe. The framework leverages recent work in unsupervised speech recognition as well as existing neural-based speech synthesis. Using only unlabeled speech audio and unlabeled text as well as a lexicon, our method enables speech synthesis without the need for a human-labeled corpus. Experiments demonstrate the unsupervised system can synthesize speech similar to a supervised counterpart in terms of
more » ... uralness and intelligibility measured by human evaluation.
arXiv:2204.02524v3 fatcat:l22ns5752vcmve5izkyyxw3qyi