Optimizing segment label boundaries for statistical speech synthesis

Alan W. Black, John Kominek
2009 IEEE International Conference on Acoustics, Speech and Signal Processing
This paper introduces a new optimization technique that moves segment labels (phone and subphonetic) to improve statistical parametric speech synthesis models. The choice of objective measure is investigated thoroughly, and listening tests show that the technique significantly improves the quality of the generated speech, by an amount equivalent to a threefold increase in database size.
doi:10.1109/icassp.2009.4960451 dblp:conf/icassp/BlackK09 fatcat:pqmchvvorfczffzklbel4e5nze