A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2004; you can also visit the original URL.
The file type is application/pdf
.
Combining link and content analysis to estimate semantic similarity
2004
Alternate track papers & posters of the 13th international conference on World Wide Web - WWW Alt. '04
Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic associations between pages therefore crucially affects the performance of any search tool. Here I begin to quantitatively analyze the relationship between content, link, and semantic similarity measures across a massive number of Web page pairs. Maps of semantic similarity across textual and link similarity highlight the
doi:10.1145/1010432.1010586
fatcat:okxs36gcovabbmcz4d35vytnpa