Using Clustering and Blade Clusters in the Terabyte Task

Giuseppe Attardi, Andrea Esuli, Chirag Patel
2004 Text Retrieval Conference  
Web search engines exploit conjunctive queries and special ranking criteria which differ from the disjunctive queries typically used for ad-hoc retrieval. We wanted to asses the effectiveness of those techniques in the TeraByte task, in particular scoring criteria like: link popularity, proximity boosting, home page score, descriptions and anchor text. Since conjunctive queries sometimes produce low recall, we tested a new approach to query expansion, which extracts additional query terms from
more » ... clustering of the snippets from the first query. The technique proved effective, almost doubling the Mean Average Precision. However, the improvement was just enough to compensate for the drop that was introduced, contrary to our expectations, by the proximity boost.
dblp:conf/trec/AttardiEP04 fatcat:mj7vdbewczer7lkl44mrxy7ufy