Duration modeling in a restricted-domain female-voice synthesis in Spanish using neural networks

R. Cordoba, J.M. Montero, J. Gutierrez-Arriola, J.M. Pardo
2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)  
The objective of this paper is the accurate prediction of segmental duration in a Spanish text-to-speech system. Many parameters affect duration, but not all of them are always relevant. We present a complete environment in which to decide which parameters are more relevant and the best way to code them. This work is the continuation of [1], where all efforts were dedicated to an unrestricted-domain database for a male voice. In this case, we consider a female voice in a restricted-domain environment. The restricted domain offers several advantages for modeling: the variation across the different patterns is reduced, so most of the decisions we have made about the parameters are now based on more significant results. The conclusions we present therefore show clearly which parameters are best. The system is based on a fully configurable neural network.
doi:10.1109/icassp.2001.941034 dblp:conf/icassp/CordobaMGP01 fatcat:nsm2hsmv5feztddl4ub2linkui