Document quality models for web ad hoc retrieval

Yun Zhou, W. Bruce Croft
2005 Proceedings of the 14th ACM international conference on Information and knowledge management - CIKM '05  
The quality of document content, which is an issue that is usually ignored for the traditional ad hoc retrieval task, is a critical issue for Web search. Web pages have a huge variation in quality relative to, for example, newswire articles. To address this problem, we propose a document quality language model approach that is incorporated into the basic query likelihood retrieval model in the form of a prior probability. Our results demonstrate that, on average, the new model is significantly
more » ... etter than the baseline (query likelihood model) in terms of MRR and precision at the top ranks. We also give a detailed query analysis which provides some interesting insights on the limitations of the quality model and the relationship between document quality and relevance.
doi:10.1145/1099554.1099652 dblp:conf/cikm/ZhouC05 fatcat:nf5kgo5yxzguzfcmdiejdwosha