A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices
[article]
2016
arXiv
pre-print
Recent developments in speech synthesis have produced systems capable of outcome intelligible speech, but now researchers strive to create models that more accurately mimic human voices. One such development is the incorporation of multiple linguistic styles in various languages and accents. HMM-based Speech Synthesis is of great interest to many researchers, due to its ability to produce sophisticated features with small footprint. Despite such progress, its quality has not yet reached the
arXiv:1602.02656v1
fatcat:nshkdywklfhbpcrssyqz6qyuq4