Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis
2020
Interspeech 2020
Most current end-to-end speech synthesis systems assume that the input text is in a single language. However, code-switching, in which speakers switch between languages within the same utterance, occurs frequently in everyday speech, and building a large mixed-language speech database is difficult and uneconomical. In this paper, both a windowing technique and style token modeling are designed for code-switching end-to-end speech synthesis. To improve the consistency of speaking style in
doi:10.21437/interspeech.2020-1754
dblp:conf/interspeech/FuTWYQW20
fatcat:axfpfvlqe5e6fmfmscxzaso274