Cross-Language Spoken Document Retrieval on the TREC SDR Collection [chapter]

N. Bertoldi, M. Federico
2003 Lecture Notes in Computer Science  
This paper presents preliminary experiments on crosslanguage spoken document retrieval (SDR) carried out on a benchmark assembled at ITC-irst. The benchmark is based on resources used in the last two spoken document retrieval tracks at the TREC conference, which are available on the Internet. They include automatic transcripts of American English broadcast news, short topics written in English, and relevance assessments. The extension from monolingual to cross-language SDR was obtained by
more » ... ating all topics into five European languages: Dutch, French, German, Italian, and Spanish. In this paper preliminary experiments on the last four languages are presented. Translations of the topics will be used to run a pilot track in CLEF 2003.
doi:10.1007/978-3-540-45237-9_41 fatcat:5auavwilavaz7naoclcxyjty54