Topic based language models for ad hoc information retrieval

L. Azzopardi, M. Girolami, C.J. van Rijsbergen
2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541)  
We propose a topic based approach lo language modelling for ad-hoc Information Retrieval (IR). Many smoothed estimators used for the multinomial query model in IR rely upon the estimated background collection probabilities. In this paper, we propose a topic based language modelling approach, that uses a more informative prior based on the topical content of a document. In our experiments, the proposed model provides comparable IR performance to the standard models, but when combined in a two
more » ... ge language model, it outperforms all other estimated models.
doi:10.1109/ijcnn.2004.1381205 fatcat:ilbq6nszpzen5dce5sekbx5qie