Improved Query Topic Models via Pseudo-Relevant Pólya Document Models [article]

Ronan Cummins
2016 arXiv   pre-print
Query-expansion via pseudo-relevance feedback is a popular method of overcoming the problem of vocabulary mismatch and of increasing average retrieval effectiveness. In this paper, we develop a new method that estimates a query topic model from a set of pseudo-relevant documents using a new language modelling framework. We assume that documents are generated via a mixture of multivariate Polya distributions, and we show that by identifying the topical terms in each document, we can
more » ... select terms that are likely to belong to the query topic model. The results of experiments on several TREC collections show that the new approach compares favourably to current state-of-the-art expansion methods.
arXiv:1602.01665v1 fatcat:susa6czxcfak5mozdbi7oqjbh4