A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Reexamining the cluster hypothesis
1996
Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '96
We present Scatter/Gather, a cluster-based document browsing method, as an alternative to ranked titles for the organization and viewing of retrieval results. We systematically evaluate Scatter/Gather in this context and find significant improvements over similarity search ranking alone. This result provides evidence validating the cluster hypothesis which states that relevant documents tend to be more similar to each other than to non-relevant documents. We describe a system employing
doi:10.1145/243199.243216
dblp:conf/sigir/HearstP96
fatcat:r3xc77iwdjgq7krnpwwojuqaly