Dynamic adjustment of language models for automatic speech recognition using word similarity

Anna Currey, Irina Illina, Dominique Fohr
2016 2016 IEEE Spoken Language Technology Workshop (SLT)  
Out-of-vocabulary (OOV) words can pose a particular problem for automatic speech recognition (ASR) of broadcast news. The language models (LMs) of ASR systems are typically trained on static corpora, whereas new words (particularly new proper nouns) are continually introduced in the media. Additionally, such OOVs are often content-rich proper nouns that are vital to understanding the topic. In this work, we explore methods for dynamically adding OOVs to language models by adapting the n-gram
more » ... guage model used in our ASR system. We propose two strategies: the first relies on finding in-vocabulary (IV) words similar to the OOVs, where word embeddings are used to define similarity. Our second strategy leverages a small contemporary corpus to estimate OOV probabilities. The models we propose yield improvements in perplexity over the baseline; in addition, the corpus-based approach leads to a significant decrease in proper noun error rate over the baseline in recognition experiments.
doi:10.1109/slt.2016.7846299 dblp:conf/slt/CurreyIF16 fatcat:dsvzlwkn6zh5pj3giuxmpf7m24