PageRank with Text Similarity and Video Near-Duplicate Constraints for News Story Re-ranking [chapter]

Xiaomeng Wu, Ichiro Ide, Shin'ichi Satoh
2010 Lecture Notes in Computer Science  
Pseudo-relevance feedback is a popular and widely accepted query reformulation strategy for document retrieval and re-ranking. However, problems arise in this task when assumed-to-be relevant documents are actually irrelevant which causes a drift in the focus of the reformulated query. This paper focuses on news story retrieval and re-ranking, and offers a new perspective through the exploration of the pair-wise constraints derived from video near-duplicates for constraint-driven reranking. We
more » ... ropose a novel application of PageRank, which is a pseudorelevance feedback algorithm, and use the constraints built on top of text to improve the relevance quality. Real-time experiments were conducted using a large-scale broadcast video database that contains more than 34,000 news stories.
doi:10.1007/978-3-642-11301-7_53 fatcat:zjdy5rrqpvfzleatin6jmxde5e