A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Improving spoken document retrieval by unsupervised language model adaptation using utterance-based web search
2014
Interspeech 2014
unpublished
Information retrieval systems facilitate the search for annotated audiovisual documents from different corpora. One of the main problems is to determine domain-specific vocabulary like names, brands, technical terms etc. by using general language models (LM) especially in broadcast news. Our approach consists of two steps to overcome the out-of-vocabulary (OOV) problem to improve the spoken document retrieval performance. Therefore, we first separate the resulting transcript of a speech
doi:10.21437/interspeech.2014-350
fatcat:rxnqxgtqpreqjlsj5pto6lhwee