A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Modeling Polyphone Context Withweighted Finite-State Transducers
2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings
As coarticulation effects are prevalent in all speech, a phone must be modeled in its context to achieve optimal performance in large vocabulary continuous speech recognition systems. Schuster and Hori [7] proposed a technique for modeling polyphone context with weighted finite-state transducers whereby all valid three-state sequences of Gaussian mixture models are enumerated, and thereafter the possible connections between these three-state sequences are determined. Hence, the explicit
doi:10.1109/icassp.2006.1659972
dblp:conf/icassp/StoimenovM06
fatcat:x3ikdojazjaf5hcoalf6aui56u