On the Stability of Web Crawling and Web Search [chapter]

Reid Anderson, Christian Borgs, Jennifer Chayes, John Hopcroft, Vahab Mirrokni, Shang-Hua Teng
2008 Lecture Notes in Computer Science  
In this paper, we analyze a graph-theoretic property motivated by web crawling. We introduce the notion of a stable core: the set of web pages that are usually contained in the crawling buffer when the buffer size is smaller than the total number of web pages. We analyze the size of the core in a random graph model based on the bounded Pareto power-law distribution. We prove that a core of significant size exists for a large range of parameters 2 < α < 3 of the power law.
doi:10.1007/978-3-540-92182-0_60 fatcat:sc3jfu2prnboxc3ukbtsfw7i44