Multilingual Information Retrieval with Asian Languages

Jacques Savoy
2004 Open Research Areas in Information Retrieval
There has been increasing interest in the Chinese, Japanese and Korean languages on the Web. The first objective of this paper is to compare the retrieval performance of nine vector-space and two probabilistic models when carrying out a monolingual search in these three Asian languages. Based on the latest NTCIR-3 test collection, our second goal is to analyze the relative merit of using various automated tools to translate English-language topics into Chinese, Japanese or Korean, and submitting a search based on texts written in these languages. Moreover, we will show how to improve bilingual searches by using both a combined translation strategy and a data fusion approach. Finally, we will address the underlying problems of multilingual searches when an English topic is used to search documents written in the English, Chinese and Japanese languages.
dblp:conf/riao/Savoy04 fatcat:jcarltsqvffodneyarscnjuu2a