A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2008; you can also visit the original URL.
The file type is
In this paper, we present an approach to fundamental frequency contour modeling of English for speech synthesis, based on a statistical learning technique called Additive Models that was successfully applied to the modeling of Japanese F0 contour previously. In an attempt to model English F0 contour, we defined a threelayer additive model consisting of an intonational phrase component, a word-level component representing lexical stress types, and a pitch-accent component related to accenteddoi:10.1109/icassp.2005.1415104 dblp:conf/icassp/Sakai05 fatcat:crgyurc7bnccbk5iuf5v4lm47u