A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
A New Training Pipeline for an Improved Neural Transducer
2020
Interspeech 2020
The RNN transducer is a promising end-to-end model candidate. We compare the original training criterion with the full marginalization over all alignments, to the commonly used maximum approximation, which simplifies, improves and speeds up our training. We also generalize from the original neural network model and study more powerful models, made possible due to the maximum approximation. We further generalize the output label topology to cover RNN-T, RNA and CTC. We perform several studies
doi:10.21437/interspeech.2020-1855
dblp:conf/interspeech/ZeyerMSN20
fatcat:4nl7d2slsngebgolal6yreycie