Searching the World Wide Web: Challenges and Partial Solutions [chapter]

Ricardo A. Baeza-Yates
1998 Lecture Notes in Computer Science  
In this article we analyze the problem of searching the WWW, giving some insight and models to understand its complexity. Then we survey the two main current techniques used to search the WWW. Finally, we present recent results that can help to partially solve the challenges posed. ¡ Distributed data: due to the intrinsic nature of the Web, data spans over many computers and platforms. These computers are interconnected with no predefined topology and with very different bandwiths. ¡ High
more » ... tage of volatile data: due to Internet dynamics, new computers and data can be added or removed easily. We also have relocation problems when domain or file names change.
doi:10.1007/3-540-49795-1_4 fatcat:dft4bf3lxvfvhhl7suqetnkuve