A probabilistic solution to the selection and fusion problem in distributed information retrieval

Christoph Baumgarten
1999 Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '99  
A model for optimal information retrieval over a distributed document collection is described and experimentally evaluated. The fusion of retrieval results corresponding to document subcollections is performed according to the Probability Ranking Principle. Part of the model is a selection criterion for e ectively limiting the ranking process to a subset of subcollections. 1 This problem has been identi ed by V oorhees et al. VGJL94 a s the collection fusion problem".
doi:10.1145/312624.312685 dblp:conf/sigir/Baumgarten99 fatcat:n4lp5mymz5ft7gqsji6bgbuhy4