Different Retrieval Models and Hybrid Term Indexing

Robert W. P. Luk
2002 NTCIR Conference on Evaluation of Information Access Technologies  
Retrieval effectiveness depends on both the retrieval model and how terms are extracted and indexed. For Chinese, Japanese and Korea text, there are no spaces to delimit words. Indexing using hybrid terms (i.e. words and bigrams) was not very effective in NTCIR-II open evaluation. In this evaluation, we found that using the 2-Poisson model with hybrid term indexing can be effective in retrieval. With our pseudo-relevance feedback, the performance can be enhanced to a level that is comparable to
more » ... the best performance in the formal runs. Therefore, we found that hybrid term indexing is promising when the 2-Poisson model is used.
dblp:conf/ntcir/Luk02 fatcat:vloreemywjahtdl55j2x3asqle