Improving the Multilingual User Experience of Wikipedia Using Cross-Language Name Search

Raghavendra Udupa, Mitesh M. Khapra
2010 North American Chapter of the Association for Computational Linguistics  
Although Wikipedia has emerged as a powerful collaborative Encyclopedia on the Web, it is only partially multilingual as most of the content is in English and a small number of other languages. In real-life scenarios, non-English users in general and ESL/EFL 1 users in particular, have a need to search for relevant English Wikipedia articles as no relevant articles are available in their language. The multilingual experience of such users can be significantly improved if they could express
more » ... information need in their native language while searching for English Wikipedia articles. In this paper, we propose a novel crosslanguage name search algorithm and employ it for searching English Wikipedia articles in a diverse set of languages including Hebrew, Hindi, Russian, Kannada, Bangla and Tamil. Our empirical study shows that the multilingual experience of users is significantly improved by our approach.
dblp:conf/naacl/UdupaK10 fatcat:e3ramlhj2zdvhhp3nwthoab3zi