A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2013; you can also visit the original URL.
The file type is application/pdf
.
Acoustic Modeling Using Deep Belief Networks
2012
IEEE Transactions on Audio, Speech, and Language Processing
Gaussian mixture models are currently the dominant technique for modeling the emission distribution of hidden Markov models for speech recognition. We show that better phone recognition on the TIMIT dataset can be achieved by replacing Gaussian mixture models by deep neural networks that contain many layers of features and a very large number of parameters. These networks are first pre-trained as a multilayer generative model of a window of spectral feature vectors without making use of any
doi:10.1109/tasl.2011.2109382
fatcat:bgco7ai6cbhgtd7h2bae6mlpra