Two learning schemes in information retrieval

C. T. Yu, H. Mizuno
1988 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '88  
Two methods are given to improve weighting schemes by using relevance information of a set of queries. The first method is to estimate parameter values of two independence models in information retrieval ---the binary independence model and the non-binary independence model. The parameters estimated here are used to calculate optimal weights for terms in a different set of queries. Performance of this estimation is compared to the inverse document frequency method, the cosine measure, and the
more » ... measure, and the statistical similarity measure. The second method is to learn optimal weights of the non-binary independence model adaptively by a Ieaming formula. Experiments are performed on three different document collections CISI, MEDLARS, and CRN4NUL for both methods, and results are reported. Both methods show improvements compared to the existing weighting schemes. Experimental resuIts show that the second method gives slightly better performance than the lirst one, and has simpler implementation.
doi:10.1145/62437.62451 dblp:conf/sigir/YuM88 fatcat:svb6m4gwcfaifagac2qlwjwttm