Dynamic Lexica for Query Translation [chapter]

Jussi Karlgren, Magnus Sahlgren, Timo Järvinen, Rickard Cöster
2005 Lecture Notes in Computer Science  
This experiment tests a simple, scalable, and effective approach to building a domain-specific translation lexicon using distributional statistics over parallellized bilingual corpora. A bilingual lexicon is extracted from aligned Swedish-French data, used to translate CLEF topics from Swedish to French, which resulting French queries are then in turn used to retrieve documents from the French language CLEF collection. The results give 34 of fifty queries on or above median for the "precision
more » ... 1000 documents" recall oriented score; with many of the errors possible to handle by the use of string-matching and cognate search. We conclude that the approach presented here is a simple and efficient component in an automatic query translation system.
doi:10.1007/11519645_15 fatcat:wbi5ojg5djgalnbx6aipsw242m