Ranking Documents Through Stochastic Sampling on Bayesian Network-based Models

Xing Tan, Jimmy Xiangji Huang, Aijun An
2016 Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - SIGIR '16  
Using approximate inference techniques, we investigate in this paper the applicability of Bayesian Networks to the problem of ranking a large set of documents. Topology of the network is a bipartite. Network parameters (conditional probability distributions) are determined through an adoption of the weighting scheme tf -idf . Rank of a document with respect to a given query is defined as the corresponding posterior probability, which is estimated through performing Rejection Sampling.
more » ... al results suggest that performance of the model is at least comparable to the baseline ones such as BM 25. The framework of this model potentially offers new and novel ways in weighting documents. Integrating the model with other ranking algorithms, meanwhile, is expected to bring in performance improvement in document ranking.
doi:10.1145/2911451.2914750 dblp:conf/sigir/TanHA16 fatcat:gl2umv2kjnhb3jv2fyoymstkvm