Speaking Speed Control of End-to-End Speech Synthesis Using Sentence-Level Conditioning
2020
Interspeech 2020
This paper proposes a controllable end-to-end text-to-speech (TTS) system, speed-controllable TTS (SCTTS), that controls the speaking speed of synthesized speech by taking a sentence-level speaking-rate value as an additional input. The speaking-rate value is defined as the ratio of the number of input phonemes to the length of the input speech. Furthermore, the proposed SCTTS system can control the speaking speed while retaining other speech attributes,
doi:10.21437/interspeech.2020-1361
dblp:conf/interspeech/BaeBJLLC20
fatcat:f3b5t57bkzg7bewt7fellleyda
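
The speaking-rate value described in the abstract (number of input phonemes divided by the length of the input speech) can be illustrated with a minimal Python sketch. The function name `speaking_rate`, the choice of seconds as the unit of length, and the example numbers are assumptions made for illustration; the abstract does not state the unit or how the value is fed to the model.

```python
def speaking_rate(num_phonemes: int, duration_seconds: float) -> float:
    """Return a sentence-level speaking rate in phonemes per second.

    Hypothetical helper: the abstract defines the value only as the ratio of
    the number of input phonemes to the length of the input speech. Measuring
    that length in seconds is an assumption of this sketch.
    """
    if num_phonemes < 0:
        raise ValueError("num_phonemes must be non-negative")
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return num_phonemes / duration_seconds


# Illustrative numbers only: a sentence of 42 phonemes spoken in 3.0 seconds.
rate = speaking_rate(42, 3.0)
print(rate)
```

Under these assumptions, a larger value corresponds to faster speech, since more phonemes are spoken in the same amount of time.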