A method for estimating prosodic symbol from text for Japanese text-to-speech synthesis

K. Magata, T. Hamagami, M. Komura
Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96  
This report describes a method for estimating the separation degree at the bunsetsu boundary (SD) for Japanese text-to-speech synthesis. Our method gives us the prosodic symbol without using complicated linguistic analysis. First we classify bunsetsus according to the nal morpheme. Each classied bunsetsu has a temporary separation degree in advance. We call this \the estimated separation degree" (ESD). ESD is derived from the SD's statistical tendency regarding each bunsetsu. The SD is decided
more » ... y rules that correct the ESD as an initial degree. Correction rules are constructed by comparing the ESD, and the SD is observed from natural speech to cancel the frequently occurring mismatches. An absolute evaluation test of ve grades was performed upon 300 sentences with prosodic symbols given by our method. As a result, the ratio of \Natural" and \Somewhat unnatural but tolerable" exceeded 2/3. The proportion of \Serious error" was less than 10%, thus giving us satisfactory results.
doi:10.1109/icslp.1996.607869 fatcat:zjzopzxcujbpfgxw5y3kvnend4